Head CT or MRI Scan Results for Acute Ischemic Stroke or Hemorrhagic Stroke Patients who Received Head CT or MRI Scan Interpretation within 45 minutes of ED Arrival

CBE ID

0661

1.5 Project

Endorsement Status

E&M Committee Rationale/Justification

Explore, with the developer’s technical experts, and facilities why the measure has leveled out in performance ratings. Have the measure submitted for maintenance review in three years.

1.0 New or Maintenance

Maintenance

Previous Endorsement Cycle

Fall 2023

Is Under Review

Next Maintenance Cycle

Fall 2026

1.6 Measure Description

This measure calculates the percentage of acute ischemic stroke or hemorrhagic stroke patients who arrive at the emergency department (ED) within two hours of the onset of symptoms and have a head computed tomography (CT) or magnetic resonance imaging (MRI) scan interpreted within 45 minutes of ED arrival. The measure is calculated using chart abstracted data, on a rolling, quarterly basis and is publicly reported, in aggregate, for one calendar year. The measure has been publicly reported, annually, by CMS as a component of its Hospital Outpatient Quality Reporting (OQR) Program since 2012.

Measure Specs

General Information

1.7 Measure Type

Process

1.7 Composite Measure

1.3 Electronic Clinical Quality Measure (eCQM)

1.8 Level of Analysis

Facility

1.9 Care Setting

Emergency Department

Hospital: Outpatient

1.10 Measure Rationale

Not applicable; this measure is not a paired or grouped measure.

1.11 Measure Webpage

https://qualitynet.cms.gov/files/6491ba2304f753001cd0591c?filename=OQR_v17.0_Sp…

1.20 Types of Data Sources

Claims Data

Electronic Health Records

Paper Patient Medical Records

1.25 Data Source Details

This measure is derived from medical record abstraction (paper or electronic). This is not an eMeasure. Administrative claims are listed as a data source as the measure is calculated based on four consecutive quarters of hospital outpatient claims data.

An electronic data collection tool is made available from vendors or facilities can download the free CMS Abstraction & Reporting Tool (CART). Paper tools for manual abstraction, which are posted on www.QualityNet.org, are also available for the CART tool. These tools are posted on www.QualityNet.org

Measure Calculation

1.13a Attach Data Dictionary

AppendixA.zip

1.16 Type of Score

Rate/proportion

1.17 Measure Score Interpretation

Better quality = Higher score

1.18 Calculation of Measure Score

This measure calculates the percentage of acute ischemic stroke or hemorrhagic stroke patient encounters where the arrival time to the ED is within two hours of the last known well/onset of symptoms and have a head CT or MRI interpreted within 45 minutes of ED arrival. The measure is calculated based on four consecutive quarters of hospital outpatient encounter claims data, as follows:

Check E/M Code; if on Table 1.0 proceed
Calculate Patient Age (Outpatient Encounter Date - minus Birthdate)
Check Patient Age; if >= 18, proceed
Check ICD-10-CM Principal Diagnosis Code; if on Table 8.0, proceed
Check Discharge Code; exclude any patients with code 6, 7, or 8
Check for a Head CT or MRI Scan Order; if “Yes,” proceed
Check Last Known Well documented; if “Yes,” proceed
Check Date Last Known Well; if a Unable to Determine (UTD) value, proceed
Check Time Last Known Well; if a UTD value, proceed
Check Arrival Time; if a UTD value, proceed
Calculate measurement value (Outpatient encounter date and arrival time minus Date Last Known Well and Time last known well (in minutes)
Check Last Known Well Minutes measurement value; if >= 0 min and <= 120 min, record as the denominator and proceed
Check Head CT or MRI Scan Interpretation Date; if a Unable to Determine (UTD) value, proceed
Check Head CT or MRI Scan Interpretation Time; if a Unable to Determine (UTD) value, proceed
Calculate Head CT/CTA or MRI/ MRA measurement value Head Ct or MRI scan Interpretation Date and Head CT or MRI Scan Interpretation Time minus Outpatient Encounter Date and Arrival Time (in minutes))

16.Check Head CT, CTA or MRA/MRI scan Minutes measurement value; if >= 0 min and <= 45 min, record as the numerator

17. Aggregate denominator and numerator counts by Medicare provider number Measure = numerator counts / denominator counts [The value should be recorded as a percentage]

1.19 Measure Stratification Details

Not Applicable; this measure is not stratified.

1.26 Minimum Sample Size

Eleven is the minimum number of cases required for public reporting.

Importance

Evidence

2.1 Attach Logic Model

0661_OP-23_Logic Modelv3.pdf

2.2 Evidence of Measure Importance

Powers et al. (2019) and the AHA/ASA Clinical Guidelines Writing Group published updated clinical guideline recommendations for the management of acute ischemic stroke which supports the measures intent. Several strategies in the guide have demonstrated improvement in door-to-imaging times (e.g., Emergency Medical Services activation, assessment, and management of patients). Other strategies, such as telemedicine and teleradiology, can improve access to care. Non-contrast CT and MRI remain effective in excluding intracerebral hemorrhage before intravenous alteplase administration, which aligns with OP-23. To identify patients who may benefit from mechanical thrombectomy between 6 and 24 hours after last know well time, the guidelines also recommend computed tomography angiography or magnetic resonance (MR) angiography with diffusion-weighted magnetic resonance imaging with or without MR perfusion. (Citation: Powers, W. J., Rabinstein, A. A., Ackerson, T., Adeoye, O. M., Bambakidis, N. C., Becker, K., Biller, J., Brown, M., Demaerschalk, B. M., Hoh, B., Jauch, E. C., Kidwell, C. S., Leslie-Mazwi, T. M., Ovbiagele, B., Scott, P. A., Sheth, K. N., Southerland, A. M., Summer, D., & Tirschwell, D. L. (2019). Guidelines for the early management of patients with acute ischemic stroke: 2019 update to the 2018 guidelines for the early management of acute ischemic stroke: A guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke, 50(12), e344–e418. http://doi.org/doi: 10.1161/STR.0000000000000211)

The updated guideline include the following recommendations:

Recommendation 1: All patients with suspected acute stroke should receive emergency brain imaging evaluation on first arrival to a hospital before initiating any specific therapy to treat AIS.

Recommendation 2: Systems should be established so that brain imaging studies can be performed as quickly as possible in patients who may be candidates for IV fibrinolysis or mechanical thrombectomy or both.

The benefit of IV alteplase is time dependent, with earlier treatment within the therapeutic window leading to bigger proportional benefits. A brain imaging study to exclude ICH is recommended as part of the initial evaluation of patients who are potentially eligible for these therapies. With respect to endovascular treatment, a pooled analysis of 5 randomized trials comparing EVT with medical therapy alone in which the majority of the patients were treated within 6 hours found that the odds of improved disability outcomes at 90 days (as measured by the mRS score distribution) declined with longer time from symptom onset to arterial puncture.42 The 6- to 16- and 6- to 24-hour treatment windows trials, which used advanced imaging to identify a relatively uniform patient group, showed limited variability of treatment effect with time in these highly selected patients. The absence of detailed screening logs in these trials limits estimations of the true impact of time in this population. To ensure that the highest proportion of eligible patients presenting in the 6- to 24-hour window have access to mechanical thrombectomy, evaluation and treatment should be as rapid as possible. Reducing the time interval from ED presentation to initial brain imaging can help to reduce the time to treatment initiation. Studies have shown that median or mean door-to-imaging times of ≤20 minutes can be achieved in a variety of different hospital settings.

Recommendation 3: Noncontrast CT (NCCT) is effective to exclude ICH before IV alteplase administration.

Recommendation 4: Magnetic resonance (MR) imaging (MRI) is effective to exclude ICH before IV alteplase administration.

Recommendation 5: (new recommendation) CTA with CTP or MR angiography (MRA) with diffusion-weighted magnetic resonance imaging (DW-MRI) with or without MR perfusion is recommended for certain patients. In many patients, the diagnosis of ischemic stroke can be made accurately on the basis of the clinical presentation and either a negative NCCT or one showing early ischemic changes, which can be detected in the majority of patients with careful attention. NCCT scanning of patients with acute stroke is effective for the rapid detection of acute ICH. NCCT was the only neuroimaging modality used in the National Institute of Neurological Disorders and Stroke (NINDS) rt-PA (Recombinant Tissue-Type Plasminogen Activator) trials and in ECASS (European Cooperative Acute Stroke Study) III and is therefore sufficient neuroimaging for decisions about IV alteplase in most patients. Immediate CT scanning provides high value for patients with acute stroke. MRI was as accurate as NCCT in detecting hyperacute intraparenchymal hemorrhage in patients presenting with stroke symptoms within 6 hours of onset when gradient echo sequences were used. In patients who awake with stroke or have unclear time of onset >4.5 hours from baseline or last known well, MRI to identify diffusion-positive fluid-attenuated inversion recovery (FLAIR)–negative lesions can be useful for selecting those who can benefit from IV alteplase administration within 4.5 hours of stroke symptom recognition. CTA with CTP or MRA with DW-MRI with or without MR perfusion is useful for selecting candidates for mechanical thrombectomy between 6 and 24 hours after last known well.

Waqas et al. (2019) reviewed clinical practice guidelines and literature and recommended that emergency departments (1) develop specific protocols to triage patients based on whether a patient is admitted to the ED via an emergency medical services (EMS) transport, ED walk-in, or in-hospital stroke; (2) initiate imaging orders including non-contrast brain computed tomography (CT) scans, CT angiograms, CT perfusion imaging, and/or magnetic resonance imaging; (3) interpret scans within 20 minutes of presentation (based on the Stroke Process Time Metrics recommended by the Society of Neurointerventional Surgery); and (4) coordinate care transitions with ED facilities or an appropriate stroke center (Waqas 2019). Overall, this article reinforces the intent of OP-23 to provide timely stroke diagnosis and recommends strategies hospitals can take, such as developing context-specific protocols and coordinating care within the ED, to reduce the time from door to imaging results interpretation. (Citation: Waqas, M., Vakharia, K., Munich, S., Morrison, J., Mokin, M., Levy, E., & Siddiqui, A. (2019). Emergency Room Triage of Acute Ischemic Stroke. Neurosurgery, 85(suppl_1).S38-S46. https://doi.org/10.1093/neuros/nyz067)

Performance Gap

Table 1. Performance Scores by Decile

Performance Gap
	Overall	Minimum	Decile_1	Decile_2	Decile_3	Decile_4	Decile_5	Decile_6	Decile_7	Decile_8	Decile_9	Decile_10	Maximum
Mean Performance Score	74.10	5.56	36.70	56.54	64.80	71.41	76.00	79.66	83.10	87.06	91.89	97.96	100
N of Entities	1431	1	161	130	139	144	149	142	151	137	141	137	78
N of Persons / Encounters / Episodes	28174	18	2866	2350	2663	2798	3165	2985	2972	3227	2490	2658	1237

Scientific Acceptability

Testing Data

5.1.1 Data Used for Testing

This measure was tested using patient record data abstracted from paper record, claims, and electronic health records stored in the Clinical Data Warehouse (CDW) and the Clinical Data Abstraction Center (CDAC). Data was obtained for 01-01-2018 through 12-31-2021 exclusive of January 1 through June 30, 2020 arrival date times. There are no differences in data for different aspects of testing.

5.1.2 Differences in Data

Not applicable. There are no differences in data for different aspects of testing.

5.1.3 Characteristics of Measured Entities

Data for both Clinical Data Abstraction Center (CDAC) and Clinical Data Warehouse (CDW) was obtained for 01-01-2018 through 12-31-2021 exclusive of January 1 through June 30, 2020 arrival date times due to COVID-19 considerations. The CDAC data contained 2,654 patients in 968 facilities and CDW contained 213,527 patients in 3,881 facilities. The data presented below in table 2 represents additional characteristics of the data used for testing.

Table 2. Characteristics of Facilities Meeting Minimum Case Count

Characteristics     CDAC     CDW
Date Collected     2018-01-01 to 2021-12-31     2018-01-01 to 2021-12-31
Sampled Population     2,654     213,527
Number of Facilities    968      3,881
Denominator Cases     1,650     139,865
Numerator Cases      1,195     104,023
Level of Analysis     Facility Level    Facility Level

5.1.4 Characteristics of Units of the Eligible Population

The data presented below in table 3 represents characteristics of patients included in the testing analysis. There are no differences in data for different aspects of testing. The majority of patients were white, non-Hispanic from ages 60-79 who suffered from Ischemic stroke. There was a fairly even split between male and female patients.

Table 3. Patient Characteristics among Facilities Meeting Minimum Case Count

Groups    Number of patients (CDW)    Performance Rates (CDW)    Number of patients (CDAC)    Performance Rates (CDAC)
Sex   -     -    -    -
Female     105286     73.41%     1313     70.98%
Male     108194     75.33%     1339     73.84%
Unknown Sex     47     66.67%     2    100%

Age      -    -     -      -
18-39      8885      63.48%     133     57.89%
40-59      52041     71.82%     640     68.95%
60-79     103634     75.19%     1262      73.51%
80 and Older     48967     77.66%    619      77.69%

Race     -     -     -     -
Asian     4704     73.22%     45     77.27%
Black or African American 26352     72.58%     359     71.62%
Unknown or Other    14339     71.15%    171     75.45%
White     168132     74.95%     2079     72.22%

Ethnicity      -    -     -     -
Hispanic/Latino    15977      68.56%     203    68.50%
Not Hispanic/Latino    197550     74.81%     2451     72.75%

Diagnosis      -     -     -     -
Hemorrhagic stroke    47385     66.38%     593    61.20%
Ischemic stroke    166142    76.93%     2061     76.19%

Reliability

5.2.1 Level(s) of Reliability Testing Conducted

Accountable entity level (i.e., measure score) (e.g., signal-to-noise analysis)

5.2.2 Method(s) of Reliability Testing

Reliability was calculated in accordance with the signal-to-noise method discussed in The Reliability of Provider Profiling: A Tutorial (2009). This approach calculates the ability of the measure to distinguish between facility performance. We calculated the signal-to-noise ratio for each facility meeting the minimum case count of 11, established by the measure calculation contractor during the data collection period, with higher scores indicating greater reliability. The reliability score is estimated using a beta-binomial model, which is appropriate for the reliability testing of pass/fail measures. The reliability score for each facility is a function of the facility’s sample size and score on the measure, and the variance across facilities.

Adams JL. The reliability of provider profiling: a tutorial. Santa Monica, CA: RAND Corporation. 2009. Retrieved from http://www.rand.org/pubs/technical_reports/TR653.

5.2.3 Reliability Testing Results

Table 4 displays the distribution of signal to noise scores from 2021. Higher scores denote greater reliability. Reliability scores ranged from 0.43 to 1.00 and mean reliability score was 0.68.

Table 4. Results of Reliability Testing Based on Signal-to-noise analysis
Year: 2021

Number of Facilities : 1431

Mean: 0.68

Standard Deviation: 0.15

Min: 0.43

5th Percentile:0.45

10th Percentile: 0.48

25th Percentile: 0.56

50th Percentile: 0.67

75th Percentile:0.77

90th Percentile:0.87

95th Percentile:1.00

Max:1.00

5.2.4 Interpretation of Reliability Results

While there is no universal standard cut off for signal to noise, a reliability of 0.70 is considered the acceptable threshold for reliability. Our results for 2021 of a median reliability score of 0.67 and mean reliability score of 0.68 approach the 0.7 cut off indicating moderate reliability. Our results also align with the Draft Acceptable Reliability Thresholds suggested by the National Quality Forum (NQF) Scientific Methods Panel (SMP) in 2021 which propose the threshold of 0.6 ≥ 0.9 for adequate reliability. Our results indicate that the measure is able to identify true differences in performance between individual facilities.

Table 2. Accountable Entity Level Reliability Testing Results by Denominator, Target Population Size

Accountable Entity-Level Reliability Testing Results
	Overall	Minimum	Decile_1	Decile_2	Decile_3	Decile_4	Decile_5	Decile_6	Decile_7	Decile_8	Decile_9	Decile_10	Maximum
Reliability	0.68	0.43	0.46	0.52	0.57	0.61	0.65	0.69	0.72	0.77	0.83	0.96	1.00
Mean Performance Score	1431	15	144	153	134	143	146	143	138	144	143	143	78
N of Entities	28174	165	1682	2180	2006	2404	2688	2885	3177	3576	4272	3304	1237

Validity

5.3.1 Level(s) of Validity Testing Conducted

Person or encounter level (i.e., data element) (e.g., sensitivity and specificity)

Accountable entity level (i.e., measure score) (e.g., criterion validity)

5.3.3 Method(s) of Validity Testing

Data Element Validity.

We assessed the data element validity of the measure by calculating a rate of agreement between facility abstraction (sourced from the CDW) and auditor (CDAC) abstraction for each of the data elements used to calculate the measure. The analysis used data element values for 1548 denominator cases abstracted by CDAC, which were previously abstracted by facilities. We then used Gwet’s AC-1 statistic to account for chance agreement. A Gwet’s AC-1 statistic less than 0.5 indicates a fair agreement, 0.5-0.8 indicates a medium effect size, and greater than or equal to 0.8 indicates a large effect size.

Hypothesis-driven validity.

We assessed the validity of the measure through literature informed hypothesis testing. Based on our reviews of literature,^1,2 we anticipated that female patients would have a longer arrival to CT interpretation time than male patients. In addition to the t-statistic to detect statistical differences, we calculated Cohen’s D to show whether a difference is meaningful in practice or not.

Sex and Race‐Ethnic Disparities in Door‐to‐CT Time in Acute Ischemic Stroke: The Florida Stroke Registry. Sai P. Polineni MPH, Enmanuel J. Perez MD, PhD, Kefeng Wang MS, Carolina M. Gutierrez PhD, Jeffrey Walker MBA‐HCM, Dianne Foster RN, BSN, MBA, Chuanhui Dong PhD, Negar Asdaghi MD, Jose G. Romano MD, Ralph L. Sacco MD, MS, Tatjana Rundek MD, PhD trundek@med.miami.edu, and for the Florida Stroke Registry
Predictors of Time From Hospital Arrival to Initial Brain-Imaging Among Suspected Stroke Patients. Kathryn M. Rose, PhD, Wayne D. Rosamond, PhD, Sara L. Huston, PhD, Carol V. Murphy, RN, MPH, and Charles H. Tegeler, MD

5.3.4 Validity Testing Results

As demonstrated in table 5, percent agreement ranged from 85% - 100%. Head CT/MRI Scan Interpretation Time had a percent agreement and Gwet’s AC1 score at 85% and 0.83 respectively. Head CT/MRI Scan Order, Last Known Well, Principal ICD code, E/M Code, Date Last Known Well (LKW), and Head CT/MRI Scan Interpretation Date had complete agreement (100%) and Gwet’s AC1 scores of 1.

Table 5. Data Element Validity for Categorical Variables, Non-categorical Variables, and Constructed Outcomes

Variable     n     Percent Agreement     Gwet’s AC1
Discharge Code     1548     98%     0.98
Head CT/MRI Scan Order      1548    100%     1.00
Last Known Well     1548     100%     1.00
Principal ICD code     1548     100%     1.00
E/M Code     1548     100%     1.00
Arrival time     1548     99%     0.99
Date Last Known Well (LKW)     1548     100%     1.00
Time LKW     1548     93%     0.93
Head CT/MRI Scan Interpretation Date    1548     100%     1.00
Head CT/MRI Scan Interpretation Time    1548     85%     0.83
Numerator     1548     97%     0.97
Denominator     1548     100%     1.00

Hypothesis-driven validity

Table 6 shows that in 2021, the mean difference between females and males was 2.83 with a t-score of 2.47, p-value of 0.01 and Cohen’s d of 0.06.

Table 6. Empirical Validity Analysis of Differences between Males and Females

Year: 2021

Category: Patient Sex

Value: Female vs. Male

Mean Difference: 2.83

Confidence Interval Lower Limit:0.58

Confidence Interval Upper Limit : 5.07

t: 2.47

p : 0.01

Cohen’s d: 0.06

5.3.5 Interpretation of Validity Results

Data Element Validity.

Results demonstrated that the agreement between the data source and the gold standard is high, and the measure score correctly reflects the quality of care provided by identifying differences in quality. We used Gwet’s AC1 statistic to account for agreement by chance, a more robust measure of concordance than overall agreement.

Hypothesis-driven validity.

For 2021, there was a difference between females and males and that difference was statistically significant but based on the Cohen’s d of 0.06, the effect size of that difference is moderate. The groups differ by 0.06 standard deviations. From these results, we conclude that the differences by sex between ED arrival and Head CT/MRI scan are statistically significant. This conclusion aligns with the literature which indicates stroke signs are not always identified as quickly as in women as they are in men.

5.3.2 Type of Accountable Entity Level Validity Testing Conducted (derived)

Empirical validity testing at the accountable entity-level (e.g., criterion validity, construct validity, known groups analysis)

Use & Usability

Use
Usability

Use

6.1.3 Current Use(s)

Public Reporting

Payment Program

Regulatory and Accreditation Programs

Quality Improvement with Benchmarking (external benchmarking to multiple organizations)

6.1.4 Program Details

Name of the program and sponsor

The CMS Hospital Outpatient Quality Reporting Program

Geographic area and percentage of accountable entities and patients included

National

Applicable level of analysis and care setting

The publicly reported values (on Hospital Compare) are calculated for all facilities participating in the Hospital OQR Program in the United States that meet minimum case count requirements. Facilities eligible to report this measure are subject to the Outpatient Prospective Payment System (OPPS) guidelines

Usability

6.2.1 Actions of Measured Entities to Improve Performance

In order to improve performance on this measure, measured entities must educate their providers around following guidelines for diagnosing and treating an acute ischemic stroke. These actions do not cause undue burden to the measure entities.

6.2.2 Feedback on Measure Performance

Feedback received from stakeholders (via the ServiceNow tool) is used to revise the measure specifications. Following receipt of a suggestion to adjust the specifications, a literature review is performed to determine if the proposed change aligns with the empirical evidence base for the measure; feedback from the expert work group is obtained to evaluate the change to the specifications. To date, we have received no significant concerns raised by stakeholders about the measure specifications through ServiceNow. In addition, stakeholders may submit comments on the measure through the Outpatient Prospective Payment System (OPPS) annual rule-making process. No comments were received for this measure during the most recent OPPS rule-making cycle.

6.2.3 Consideration of Measure Feedback

To date, we have received no significant feedback about the measure specifications.

6.2.4 Progress on Improvement

Summary statistics of performance scores during the January 1, 2018 through December 31, 2021 data collection periods are provided in the Gap section. In 2015, the average hospital score was 71.28% among 1276 hospitals. In 2016, there was an average change in hospital scores of 1.43%, the average hospital score was 73.27% among 1401 hospitals. In 2017, there was an average change in hospital scores of 1.64%, the average hospital score was 74.33% among 1507 hospitals. In 2018, there was an average change in hospital scores of 0.26%, the average hospital score was 73.21% among 1607 hospitals. In 2019, there was an average change in hospital scores of 0.28%, the average hospital score was 73.73% among 1592 hospitals. In 2020, there was an average change in hospital scores of 0.54%, the average hospital score was 75.89% among 502 hospitals. In 2021, there was an average change in hospital scores of 1.42%, the average hospital score was 71.53% among 1492 hospitals.

Performance scores have remained stable over the years showing continued room for improvement. As noted in prior submissions, the number of patients receiving high-quality healthcare as performance on the measure improves is larger than the number of cases captured by the measure because a hospital can choose to only report a sample cases.

6.2.5 Unexpected Findings

We did not identify any unintended consequences during measure testing. Similarly, no evidence of unintended consequences to individuals or populations has been reported by external stakeholders since its implementation. We will continue to monitor the potential for unintended consequences through an annual review of the literature as well as an ongoing review of stakeholder comments and inquiries. The risk in advancing measures that address timeliness is that there may be a decrease in testing performance to avoid measurement, however this is not likely due to the need to assess diagnostic results to ensure a proper diagnosis.

Comments

Public Comments

Staff Preliminary Assessment

CBE #0661 Staff Assessment

Importance

Importance Rating

Met

Importance

Strengths:

Updated clinical guidelines cited support the measure concept, including recommendations that suspected stroke patients receive brain imaging studies as soon as possible after arriving at the ED, that non-contrast CT and MRI are both effective at ruling out ICH before treatment, and that certain patients benefit from MRA/CTA (Powers et al. 2019), and a systematic review of guidelines makes similar recommendations (Waqas et al. 2019). The success of treatment with IV alteplase is time dependent, and developers cite a pooled analysis of 5 RCTs showing that treatment with EVT within 6 hours of stroke onset found odds of improved disability outcomes at 90 days (reference not provided).
Mean hospital scores are stable in the 71-75% range from 2015 to 2020, showing room for improvement. Performance scores ranged from a minimum of 5.56% to maximum of 100%. Developer notes that performance scores are not limited to facilities present each year, indicating the limited number of facilities in 2020 skewed performance scores higher.
Meaningfulness to patients was demonstrated by citing one study using patient survey and matched controls, which showed that treatment with tPA within the therapeutic time window was associated with better physical function, communication, cognitive ability, depressive symptomatology, and quality of life/participation compared with control, and fewer SNF stays, ED visits, and readmissions (Lang et al., 2014).

Limitations:

Limitation of the pooled analysis is that 6-16 and 6-24 hour window trials showed limited variability of treatment effect with time, which the developers interpreted as evidence for the importance of rapid imaging (reference not provided).
Sample in Lang et al. was relatively small (tPA (n = 78); control (n = 156))

Rationale:

The 2019 clinical guidelines and a systematic review of 5 randomized-controlled trials presented support the measure concept, emphasizing the importance of rapid brain imaging in suspected stroke patients visiting the ED, the availability of effective imaging tests, and the benefits of early treatment. Meaningfulness to patients was demonstrated in a small sample study using patient surveys, which showed improved function and quality of life, and reduced utilization among patients who received early tPA treatment. Performance scores show room for improvement and substantial variability.

Feasibility Acceptance

Scientific Acceptability

Scientific Acceptability Reliability Rating

Not met but addressable

Scientific Acceptability Reliability

Strengths:

The measure is well defined and specified.
Accountable entity-level (i.e., measure score) reliability testing was estimated using signal-to-noise analysis on a 2021 dataset consisting of 28,174 persons across 1,431 facilities meeting the minimum count of 11 cases. A decile table of reliability by population size is provided. The median reliability 0.68. The mean of the 3rd decile is 0.57, and the mean of the 4th decile is 0.61 which indicates that 65-70% of the entities have a reliability >0.6.

Limitations:

Approximately 30-35% of entities have reliability less than 0.6, likely facilities with a low denominator size. Consider mitigation for entities with low case counts.

Rationale:

Measure score reliability testing (accountable entity-level) performed. However, reliability <0.6 for 30-35% of entities. Some possible mitigation strategies to improve these estimates could be to

Empirical approaches outlined in the report, MAP 2019 Recommendations from the Rural Health Technical Expert Panel Final Report, https://www.qualityforum.org/WorkArea/linkit.aspx?LinkIdentifier=id&ItemID=89673
Consider a higher minimum case volume.
Extend the time frame.
Focus on applying mitigation at the lower volume providers.

Scientific Acceptability Validity Rating

Not met but addressable

Scientific Acceptability Validity

Strengths:

Data element validity testing was previously performed for claims, EHR, and paper records.
Data element validity was assessed by calculating rate of agreement (% agreement and Gwet AC-1) in all data elements used to calculate the measure between facility (CDW) abstracted and auditor abstracted (CDAC, the gold standard) data, using a sample of 1548 denominator cases from CDAC, 2021 data. Minimum agreement was 85% / .83 (head CT/MRI interpretation time); all other elements had 93% / .93 – 100% / 1.0 agreement (large effect).
Developer conducted accountable entity-level (measure score) validity testing, where the developer hypothesized that female patients would have a longer arrival to CT interpretation time compared to male patients. Mean difference showed longer time for female patients, Cohen’s D of 0.06 (moderate effect size), which the developer claims aligns with literature showing stroke signs are not identified as quickly in women.
No risk adjustment since the measure is a process measure.

Limitations:

Hypothesis testing confirms the expected difference in performance between men and women (worse for women), which aligns with expectation based on slower identification of strokes in women, but does not address the rationale for differences at the entity level. Meaning, is the difference a clinical practice concern or is the difference due to underlying patient characteristics.

Rationale:

Data element validity testing demonstrates high agreement between facility data (CDW) and auditor data (CDAC, the gold standard). Hypothesis testing confirms the expected difference in performance between men and women (worse for women), which aligns with expectation based on slower identification of strokes in women, but does not address why there are differences at the entity level. The committee may consider asking the developer to speak to this further.

Equity

Use and Usability

Use and Usability Rating

Not met but addressable

Use and Usability

Strengths:

Currently in use in the CMS Hospital Outpatient Quality Reporting Program.
Measured entities are expected to educate providers on guidelines for diagnosing and treating ischemic stroke.
Feedback is collected through ServiceNow and also through the annual rulemaking process; if warranted, a literature review is performed to evaluate whether the proposed specification change aligns with the evidence base; developers report that no significant concerns were received from stakeholders (to date) or through public comment (over the most recent OPPS rulemaking cycle)
Performance scores continue to show room for improvement.
Developer reports no unexpected findings or unintended consequences.

Limitations:

No mention of feedback reports or similar mechanism for informing providers of their performance.
While performance scores show room for improvement, they have been stable from 2015 to 2021 (range: 71.28% - 75.89%). In addition, the number of hospitals reporting each year changes considerably (range: 502 - 1607), making it difficult to interpret changes in the rate (e.g., the highest rate was reported in the year with the fewest reporting hospitals). Finally, developers note that the number of patients receiving high quality care is larger than the number of cases captured since a hospital can choose to report only a sample, but developers do not indicate how many hospitals are reporting samples or their sample sizes.

Rationale:

This measure is currently in use in the CMS Hospital Outpatient Quality Reporting Program. To improve quality, hospitals are expected to educate providers on guidelines for diagnosing and treating ischemic stroke; they do not mention potential QI mechanisms such as providing performance reports to providers.

This measure continues to show room for improvement; however, the rate has remained largely stable from 2015-2021 and possible reasons for the lack of improvement are not articulated. Developers report no unexpected findings.

Breadcrumb

Head CT or MRI Scan Results for Acute Ischemic Stroke or Hemorrhagic Stroke Patients who Received Head CT or MRI Scan Interpretation within 45 minutes of ED Arrival

Table 6. Empirical Validity Analysis of Differences between Males and Females

CBE #0661 Staff Assessment

Head CT or MRI within 45 ED Arrival for Ischemic Stroke

I look forward to a discussion of Use and Usability.

Stable performance and not improving.

Measure seems important, will like more information

CBE #0661

CBE #0661

Head CT or MRI w/in 45" ED Arrival for Ischemic Stroke

NA

NA

This measure impacts quality…

This is important measure…

CT MRI Results45 - CBE ID 0661

measure seems useful but no improvement

Head CT or MRI Scan Results

Important Measure

Curious case of stagnant performance

Comments on 0661

Door to Imaging Times

A few questions for developer

Head Scan/CT Stroke