
One of the most important concepts in simulation-based assessment is achieving reliability, and specifically interrater reliability. While I have discussed previously on this blog that every simulation is assessment, in this article I am speaking of the type of simulation assessment that requires one or more raters to record data associated with the performance, usually by completing an assessment tool.
Interrater reliability, simply put, means that if we have multiple raters watching a simulation and using a scoring rubric or tool, they will produce similar scores. Achieving interrater reliability is important for several reasons, including that we usually rely on more than one rater to evaluate simulations over time. Other times we are engaged in research or other high-stakes assessment and want to be certain that we are reaching correct conclusions.
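To make the idea concrete, here is a minimal sketch in Python (using made-up scores, not data from any real assessment) of quantifying agreement between two raters with Cohen's kappa, one commonly used interrater reliability statistic:

```python
# A minimal sketch of checking agreement between two raters using
# Cohen's kappa. The rater scores below are made-up illustrative data.

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items."""
    assert len(rater_a) == len(rater_b)
    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)

    # Observed agreement: proportion of items scored identically.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Expected chance agreement, from each rater's marginal rates.
    expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)

# Hypothetical pass/fail scores on ten checklist items.
rater_1 = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
rater_2 = [1, 0, 0, 1, 0, 1, 1, 1, 1, 1]
print(f"kappa = {cohens_kappa(rater_1, rater_2):.2f}")  # kappa = 0.52
```

A kappa near 1 indicates strong agreement beyond chance; values much lower suggest that the raters, the tool, or both need the kinds of attention described in the tips below.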
Improving assessment capabilities for simulation requires a significant amount of effort. The amount of time and effort that goes into the assessment process should be directly proportional to the stakes of the assessment.
In this article I offer five tips to consider for improving interrater reliability when conducting simulation-based assessment.
1 – Train Your Raters
The most basic and most often overlooked aspect of achieving interrater reliability is the training of the raters. The raters need to be trained on the process, the assessment tools, and each item of the assessment on which they are rendering an opinion. It is tempting to think of subject matter experts as knowledgeable enough to fill out simple assessments; however, you will find with detailed testing that the scoring of an item is often truly in the eye of the beholder. Even simple items like “asked medical history” may be difficult to score reliably if they are not defined prior to the assessment activity. Other factors that require rater calibration/training can also affect the assessment, such as the limitations of the simulation, how something is being simulated, and overall familiarity with the technology that may be used to collect the data.
2 – Modify Your Assessment Tool
Modifications to the assessment tool can enhance interrater reliability. Sometimes the change is as extreme as removing an assessment item because you discover that you are unable to achieve reliability despite iterative attempts at improvement. Other, less drastic changes can come in the form of clarifying the text directives associated with the item. Sometimes removing qualitative wording such as “appropriately” or “correctly” can help to improve reliability, as can adding descriptors of expected behavior or behaviorally anchored statements to items; a brief example follows. However, these modifications and qualifying statements should also be addressed in the training of the raters as described above.
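As an illustration (the item wording below is hypothetical, not drawn from any validated tool), here is how a vague item might be restructured with behavioral anchors:

```python
# Hypothetical example of tightening an assessment item. The vague wording
# leaves "appropriately" in the eye of the beholder; the anchored version
# tells every rater exactly what behavior earns each score.

vague_item = "Performed patient handoff appropriately"

anchored_item = {
    "item": "Performed structured patient handoff",
    "anchors": {
        0: "Did not state patient name or working diagnosis",
        1: "Stated name and diagnosis but omitted pending tasks",
        2: "Stated name, diagnosis, and pending tasks, and invited questions",
    },
}
```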

3 – Make Things Assessable (Scenario Design)
An often-overlooked factor that can help to improve interrater reliability is making modifications to the simulation scenario to allow things to be more “assessable”. We make a sizable number of decisions when creating simulation-based scenarios for educational purposes. There are other decisions and functions that can be designed into the scenario to allow assessments to be more accurate and reliable. For example, if we want to know whether someone correctly interpreted wheezing in the lung sounds of the simulator, we can introduce design elements into the scenario that help us gather this information accurately and thus increase interrater reliability. We could embed a person in the scenario to play the role of another healthcare provider who simply asks the participant what they heard. Alternatively, we could have the participant fill out a questionnaire at the end of the scenario, or even complete an assessment form regarding the simulation encounter. Lastly, we could embed the assessment tool into the debriefing process and simply ask participants during the debriefing what they heard when they auscultated the lungs. There is no single correct way to do this; I am articulating different solutions to the same problem, and the right one depends on the context of your scenario design.
4 – Assessment Tool Technology
Gathering assessment data electronically can help significantly. Compared to a paper-and-pencil collection scheme, technology-enhanced or “smart” scoring systems can assist in several ways. For example, if there are many items on a paper scoring tool, the page can become unwieldy to monitor. Electronic systems can continuously update and filter out data that does not need to be displayed at a given point in time as the simulation assessment unfolds. Simply having previously evaluated items disappear off the screen can reduce the clutter associated with scoring tools. A brief sketch of the idea follows.
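As a simple illustration (a hypothetical sketch, not the interface of any particular vendor's system), a “smart” checklist might display only the items that are still unscored and observable in the current phase of the scenario:

```python
# Hypothetical sketch of "smart" checklist filtering: only items that are
# still unscored and relevant to the current scenario phase stay on screen.

from dataclasses import dataclass
from typing import Optional

@dataclass
class ChecklistItem:
    text: str
    phase: str                   # scenario phase where the behavior is observable
    score: Optional[int] = None  # None until the rater scores the item

def visible_items(items, current_phase):
    """Hide scored items and items from other phases to reduce clutter."""
    return [i for i in items if i.score is None and i.phase == current_phase]

checklist = [
    ChecklistItem("Asked medical history", phase="history"),
    ChecklistItem("Auscultated lungs", phase="exam"),
    ChecklistItem("Identified wheezing", phase="exam"),
]

checklist[1].score = 1  # the rater scores "Auscultated lungs"...
for item in visible_items(checklist, current_phase="exam"):
    print(item.text)    # ...leaving only "Identified wheezing" displayed
```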
5 – Consider Video Scoring
For high-stakes assessment and research purposes it is often wise to consider video scoring. By high stakes I mean pass/fail criteria associated with advancement in a program, heavy weighting of a grade, or licensure or practice decisions. The ability to add multiple camera angles, as well as the functionality to rewind and play back things that occurred during the simulation, is valuable in improving the scoring accuracy of the collected data, which subsequently improves interrater reliability. Video scoring requires considerable time and effort, and thus should be reserved for the times when it is necessary.
I hope that you found these tips useful. Assessment during simulations can be an important part of improving the quality and safety of patient care!
If you found this post useful, please consider subscribing to this blog!
Thanks and until next time! Happy Simulating.