30-Day Risk Standardized All-Cause Emergency Department Visit Following an Inpatient Psychiatric Facility Discharge

CBE ID

4190

1.5 Project

Cost, Resource Use, and Efficiency

Endorsement Status

Not Endorsed

E&M Committee Rationale/Justification

Due to No Consensus

1.0 New or Maintenance

New

Previous Endorsement Cycle

Fall 2023

Is Under Review

1.6 Measure Description

The 30-Day Risk Standardized All-Cause Emergency Department Visit Following an Inpatient Psychiatric Facility (IPF) Discharge (IPF ED Visit) measure assesses the proportion of patients ages 18 and older with an emergency department (ED) visit, including observation stays, for any cause, within 30 days of discharge from an IPF, without subsequent admission. The IPF ED Visit measure is an outcome-based measure.

Measure Specs

General Information

1.7 Measure Type

Outcome

1.7 Composite Measure

1.3 Electronic Clinical Quality Measure (eCQM)

1.8 Level of Analysis

Facility

1.9 Care Setting

Behavioral Health: Inpatient (e.g., Inpatient Psychiatric Facility)

1.10 Measure Rationale

The 30-Day Risk Standardized All-Cause Emergency Department Visit Following an Inpatient Psychiatric Facility Discharge (IPF ED Visit) is a new measure that has not been endorsed, and therefore does not have an associated web page at this time. It has been well established that the first three months following an inpatient psychiatric facility (IPF) discharge, particularly the first month, is a period of high risk for readmissions, rehospitalization, and suicidality (Mutschler et al. 2019). The rationale for the IPF ED Visit measure is to encourage IPFs to proactively focus on discharge planning and community reintegration during patients’ IPF stays. The measure is complementary to the 30-Day All-Cause Unplanned Readmission Following Psychiatric Hospitalization in an IPF (IPF Readmission) measure (NQF #2860) and Follow-Up After Psychiatric Hospitalization (FAPH) measure. The proposed measure complements the IPF Readmission measure by providing information on emergency department (ED) visits, including observation, without readmission, thereby more comprehensively assessing IPF discharge planning quality and promoting care coordination post-discharge.

Relevant literature:

Mutschler, C., S. Lichtenstein, S.A. Kidd, and L. Davidson. “Transition Experiences Following Psychiatric Hospitalization: A Systematic Review of the Literature.” Community Mental Health Journal, vol. 55, no. 8, 2019, pp. 1255–1274. doi.org/10.1007/s10597-019-00413-9

1.11 Measure Webpage

https://www.cms.gov/

1.20 Types of Data Sources

Claims Data

1.25 Data Source Details

Medicare beneficiary and coverage files. Provides information on patient demographic, enrollment, and vital status to identify the measure population and certain risk factors.
Medicare fee-for-service (FFS) Part A records. Contains final action claims submitted by acute care and critical access hospitals, IPFs, home health agencies, and skilled nursing facilities to identify the measure population, readmissions, and certain risk factors.
Medicare FFS Part B records. Contains final action claims submitted by physicians, physician assistants, clinical social workers, nurse practitioners, and other outpatient providers to identify certain risk factors. For this measure, claims for services such as laboratory tests, medical supplies, or other ambulatory services were not used. This ensures that diagnoses result from an encounter with a provider trained to establish diagnoses and not from a claim for a diagnostic test.

Measure Calculation

1.13a Attach Data Dictionary

IPFEDVisit_codebook.zip

1.16 Type of Score

Rate/proportion

1.17 Measure Score Interpretation

Better quality = Lower score

1.18 Calculation of Measure Score

Denominator. The denominator is the measure population which consists of patients with eligible index admissions to IPFs during the measurement period. Index admissions are defined at admissions to IPFs for patients with the following characteristics:

Age 18 or older at admission,
Discharged alive,
Enrolled in Medicare FFS Parts A and B during the 12 months prior to, the month of, and at least three months after the month of discharge,
Discharged from IPF with a psychiatric or substance use disorder principal diagnosis.

Denominator Exclusions. Denominator exclusions include:

Patients discharged AMA from the IPF index admission,
Patients transferred from the IPF to another care facility such as an acute care hospital, skilled nursing facility, long-term acute care facility, or residential program,
Patients with unreliable demographic and vital status data defined as the following:
- Age greater than 115 years,
- Missing gender data,
- Discharge status of “dead” but with subsequent admissions,
- Death date prior to admission date,
- Death date within the admission and discharge dates but the discharge status was not “dead.”

Discharged AMA is calculated by index admissions where there is an indicator in the claims data (discharge status code ‘07’) that patients left AMA are excluded because the facility may have limited opportunity to complete treatment and prepare for discharge. Unreliable data is determined by index admissions with unreliable demographic and death information are excluded from the denominator. Unreliable demographic information is defined as age greater than 115 years or missing gender. Unreliable death information is defined as:

An admission with a discharge status of “dead” (discharge status code ‘20’) but the person has subsequent admissions;
The death date is prior to the admission date; or
The death date is within the admission and discharge dates for an admission but the discharge status is not “dead” (discharge status code is not ‘20’).

Transfers/Interrupted Stays are determined as index admissions that result in a transfer or interrupted stay are excluded because transfers and interrupted stays cannot always be distinguished from true readmissions in the claims data. This exclusion is defined as an index admission with a readmission on Days 0, 1, or 2 post-discharge.

Numerator. The numerator is comprised of patients 18 and older with an emergency department (ED) visit, including observation stays, for any cause, within 30 days of discharge from an inpatient psychiatric facility (IPF), without subsequent admission. An ED visit is defined as any ED visit or observation stay that does not result in an admission or transfer and occurs within 30 days after the discharge date from an eligible index admission to an IPF during the measurement period.

Measure Period. The measure period includes adult IPF admission with admission and discharges between June 1, 2019, and July 31, 2021, were used to calculate the IPF ED Visit measure performance rate. We excluded cases during the first two quarters of 2020 due to the COVID-19 public health emergency, being discharged alive with a psychiatric principal discharge diagnosis, and being enrolled in FFS.

Measure Calculation. The key algorithm steps for measure calculation are as follows:

Identify all IPF admissions in the performance period.
Apply inclusion and exclusion criteria to identify index admissions.
Identify ED visits within 30 days of discharge from each IPF index admission.
Identify risk factors in the 12 months before index admission and during the index admission.
Run a hierarchical logistic regression to compute the risk-standardized readmission rate for each IPF.
- Hierarchical logistic regression is used to model the log-odds of the ED visit. The two-level specification enables reliable estimates for small-volume hospitals, while accepting a certain amount of shrinkage toward the mean. The model includes risk factors as fixed effects and a hospital-specific intercept as a random effect. The estimate of hospital-specific intercept reflects the quality of care received at an IPF after adjusting for case mix.
- A standardized risk ratio (SRR), which is the predicted number of readmissions over the expected number of readmissions, is calculated for each IPF. The predicted number of ED visits is based on the IPF’s performance and observed case mix; it is calculated by taking the mean of the estimated probabilities of an ED visit for the index admissions at the IPF, based on the IPF-specific intercept and all other risk factors. The expected number of ED visits is based on the national observed ED visit rate in the measure cohort and the IPF’s observed case mix; it is calculated by taking the mean of the estimated probabilities of an ED visit for the index admissions contributing to the IPF, based on the average intercept across all IPFs nationwide and all other risk factors.
- The risk-standardized ED visit rate is calculated by multiplying the SRR by the overall national ED visit rate. An SRR greater than the national average indicates worse quality of care compared with the national average. An SRR less than the national average indicates better quality of care. The confidence interval of the SRR is calculated by bootstrapping to take into account the uncertainty of the estimate.

1.19 Measure Stratification Details

Not applicable; this measure is not risk-stratified. Please refer to the Risk-factor coeff. all risk-fct tab and Candidate risk factor freq tab in the Risk-model specifications.xlsx file for the final risk-variables selected for the risk-adjustment model andthe candidate risk-variables that were considered for inclusion into the model. Please see the attached Excel file for the codebook as well.

1.26 Minimum Sample Size

The minimum sample size for reporting this measure is IPFs with at least 25 discharges during the measurement period.

Importance

Evidence
Measure Impact

Evidence

2.1 Attach Logic Model

Exhibit 1-CBE Submission.docx

2.2 Evidence of Measure Importance

Studies suggest readmissions and rehospitalizations, particularly within 30 days of discharge from an initial psychiatric hospitalization, are an indicator of poor service delivery and treatment across the psychiatric care continuum (Durbin et al. 2007; Feigenbaum et al. 2012). The first three months following IPF discharge, particularly the first month, are a period of high risk for the patient, particularly for suicidality, such as suicidal ideation and self-harm (Mutschler et al. 2019; Chung et al. 2017). In an assessment of 2008 Medicare claims data, Blair et al. (2019) found ED visits are common 30 days before and after an IPF stay; specifically, 29 percent of beneficiaries had an ED visit 30 days following an IPF discharge. Additionally, Venkatesh et al. (2020) found that Medicare beneficiaries disproportionately visit the ED compared to non-Medicare beneficiaries. One reason for an ED visit following an IPF discharge could be a lack of post-discharge follow-up. Blair et al. (2019) found that less than 40 percent of Medicare IPF patients access a mental health provider 30 days before and after an IPF stay.

Follow-up care post-IPF discharge occurs infrequently. Brown and Bell (2022), in their analysis of 2018 National Mental Health Services Survey data and IPF Quality Reporting program data, found only 28 percent of Medicare beneficiaries discharged from an IPF received follow-up care within seven days. Efforts to improve the timeliness of mental health care (Pearlmutter et al. 2017), focus on intervention strategies to “improve patient safety, quality of care, [and] well-being” (Lorine et al. 2015), and increasing access to affordable, preventative care can reduce reliance on ED visits for psychiatric care (Bruckner et al. 2019). Brown & Bell (2022) cite that improving follow-up care and discharge planning can increase continuity of care for Medicare beneficiaries and thus increase patient engagement following discharge from an IPF. Mutschler et al. (2019), in their systematic review, identified multiple studies that found that patients receive the necessary care while hospitalized, but are often left on their own to find post-discharge supports and services in their local community. They identified barriers to post-IPF discharge care, such as stigma and poverty, and actions that can be taken by IPFs to reduce the likelihood of ED visits and readmissions following IPF discharge such as aiding the transition to outpatient support, focusing on patient self-efficacy, and maximizing social and peer support.

There is limited literature on ED visits without resultant readmission following IPF discharge. However, researchers have examined hospital readmission rates following an IPF discharge. Because an ED visit is often the initial point of contact for a patient seeking care, and unplanned readmissions may result from an ED visit, the research on reducing hospital readmissions following IPF discharge can be informative. Callaly et al. (2010) found that IPF readmissions were 10 times less likely if post-discharge follow-up occurred within seven days of discharge as compared to eight days or more. If the initial ED visit can be avoided, IPF readmission rates may also go down.

Evidence suggests that discharge planning focused on continuity of care (e.g., care coordination) increases the likelihood of successful community reintegration for patients after an IPF discharge, resulting in reduced ED visits (Chung et al. 2017). Care coordination with outpatient services is an important predictor of acute hospital readmission, again often stemming from ED visits. IPFs can play a role in care coordination by arranging follow-up appointments for patients, ensuring medications are available at discharge, assisting patients with accessing medications from external providers, and engaging the patients’ social support system. Hamilton et al. (2015) found that patients who missed their first post-IPF discharge follow-up appointment had a 140 percent increased risk of readmission. Similarly, Abernathy et al. (2016) found that patients with poor post-discharge compliance were 1.7 times more likely to be readmitted following IPF discharge.

The rationale for this measure is to encourage IPFs to proactively focus on discharge planning and community reintegration, at the time of the patients IPF stay. The IPF ED Visit measure is intended to compliment the IPF Readmission measure and the FAPH measure. The IPF ED Visit measure gives IPF providers additional data points on how their patients fare after discharge. This measure will additionally complement existing measures by quantifying follow-up, ED visits, and IPF readmissions, which will assist IPFs in discharge planning, care coordination, and timely follow-up. The IPF ED Visit measure is intended to incentivize providers to conduct discharge planning in ways that ensure patients have an opportunity and ability to adhere to post-discharge instructions associated with medication use and follow-up with community-based providers.

Relevant literature:

Abernathy, K., Zhang, J., Mauldin, P. et al. (2016). Acute care utilization in patients with concurrent mental health and complex chronic medical conditions. Journal of Primary Care & Community Health, 7(4), 226–233.
Blair, R., Brown, J. D., Barry, X., & Schmitt, A. (2019). Transitions in Care and Service Use among Medicare Beneficiaries in Inpatient Psychiatric Facilities Issue Brief. Washington, DC, US Department of Health and Human Services, Office of the Assistant Secretary for Planning and Evaluation.
Brown, J. D. & Bell, N. (2022). Factors Associated with the Receipt of Follow-Up Care Among Medicare Beneficiaries Discharged from Inpatient Psychiatric Facilities. The Journal of Behavioral Health Services & Research. https://doi.org/10.1007/s11414-022-09810-7
Bruckner, T. A., Singh, P., Chakravarthy, B., Snowden, L., & Yoon, J. (2019). Psychiatric emergency department visits after regional expansion of community health centers. Psychiatric Services, 70(10), 901-906.
Callaly, T., Hyland, M., Trauer, T., Dodd, S., & Berk, M. (2010). Readmission to an acute psychiatric unit within 28 days of discharge: Identifying those at risk. Australian Health Review, 34(3), 282–285
Chung, D.T., Ryan, C.J., Hadzi-Pavlovic, D., Singh, S.P., Stanton, C., & Large, M.M. (2017). Suicide rates after discharge from psychiatric facilities: A systematic review and meta-analysis. JAMA Psychiatry, 74(7), 694–702. https://doi.org/10.1001/jamapsychiatry.2017.1044
Durbin, J., Lin, E., Layne, C., et al. (2007). Is readmission a valid indicator of the quality of inpatient psychiatric care? Journal of Behavioral Health Services Research, 34, 137–150. doi:10.1007/s11414- 007-9055-5
Feigenbaum, P., Neuwirth, E., Trowbridge, L., Teplitsky, S., Barnes, C. A., Fireman, E., ... & Bellows, J. (2012). Factors contributing to all-cause 30-day readmissions: a structured case series across 18 hospitals. Medical care, 599-605.
Hamilton, J.E., Rhoades, H., Galvez, J. et al. (2015). Factors differentially associated with early readmission at a university teaching psychiatric hospital. Journal of Evaluation in Clinical Practice, 21(4), 572–578.
Lorine, K., Goenjian, H., Kim, S., Steinberg, A. M., Schmidt, K., & Goenjian, A. K. (2015). Risk factors associated with psychiatric readmission. The Journal of Nervous and Mental disease, 203(6), 425- 430.
Mutschler, C., S. Lichtenstein, S.A. Kidd, and L. Davidson. “Transition Experiences Following Psychiatric Hospitalization: A Systematic Review of the Literature.” Community Mental Health Journal, vol. 55, no. 8, 2019, pp. 1255–1274. doi.org/10.1007/s10597-019-00413-9
Pearlmutter, M.D., Dwyer, K.H., Burke, L.G., Rathlev, N., & Maranda, L. Volturo, G. (2017). Analysis of emergency department length of stay for mental health patients at ten Massachusetts emergency departments. Annals of Emergency Medicine, 70(2),193-202.
Venkatesh AK, Mei H, Shuling L, D'Onofrio G, Rothenberg C, Lin Z, Krumholz HM. Cross-sectional Analysis of Emergency Department and Acute Care Utilization Among Medicare Beneficiaries. Acad Emerg Med. 2020 Jul;27(7):570-579. doi: 10.1111/acem.13971. Epub 2020 May 20. PMID: 32302034.

Measure Impact

2.3 Anticipated Impact

Measure Impact on Care and Health Outcomes. Improvements to IPF care, follow-up, and reduced IPF readmission rates and ED utilization are imperative to improving patient health outcomes, including a reduction of suicidality (Mutschler at al. 2019). Moreover, the proposed measure could facilitate and improve continuity of care following an IPF discharge, including data to improve post-discharge follow-up to minimize ED uptake for psychiatric care. Additionally, the proposed measure may have positive implications on improving continuity to community resources following an IPF discharge. As seen across the literature, the 30 days following an IPF discharge are a key touchpoint ensuring successful facilitation to community reintegration.

Measure Impact on Healthcare Costs. Psychiatric-related care provided in the ED is costly. Using the Nationwide Emergency Department Sample (2017), Karaca & Moore (2020) found the average ED visit cost for psychiatric and substance use diagnoses (PSUD) was $520 per visit across 10.7 million PSUD ED visits. In 2017, PSUD visits cost more than $5.6 billion in ED service delivery, representing more than 7% of the annual ED service delivery costs (Karaca & Moore 2020). The costliest PSUDs are alcohol-related, anxiety-related, depressive diagnoses, suicidality, schizophrenia spectrum disorders and other psychotic disorders (Karaca & Moore 2020). Preventative strategies that focus on reducing risk factors associated with psychiatric care admissions (Lorine et al. 2015), and improving continuity of care upon discharge can reduce healthcare costs. Reducing ED visits for psychiatric care can also reduce hospital costs and improve healthcare outcomes (Oblath et al. 2022). Addressing gaps between an IPF discharge and an ED visit has the potential to reduce ED costs. Likewise, by adopting more preventative care strategies on the front end of an IPF stay, it could be anticipated there would be reduced costs associated with readmission to an IPF.

We developed a cell-based model to estimate the costs associated with ED visits within 30 days of IPF discharge (Exhibits 2 & 3 attached below). Estimates rely on data from two sources: empirical data from measure testing activities on the number of Medicare patients with ED visits within 30 days of IPF discharge (n = 37,054 of 178,143 IPF discharges; 20.8 percent) and empirical cost data from the Health Care Utilization Project. Exhibit 2 provides current cost estimates resulting from ED visits within 30 days of IPF discharge among Medicare patients. Current average hospital costs total $38 million and average hospitals charges total $278 million.

Relevant literature:

Karaca, Z., & Moore, B. J. (2020). Costs of Emergency Department Visits for Mental and Substance Use Disorders in the United States, 2017. In Healthcare Cost and Utilization Project (HCUP) Statistical Briefs. Agency for Healthcare Research and Quality (US).
Lorine, K., Goenjian, H., Kim, S., Steinberg, A. M., Schmidt, K., & Goenjian, A. K. (2015). Risk factors associated with psychiatric readmission. The Journal of Nervous and Mental disease, 203(6), 425- 430.
Mutschler, C., S. Lichtenstein, S.A. Kidd, and L. Davidson. “Transition Experiences Following Psychiatric Hospitalization: A Systematic Review of the Literature.” Community Mental Health Journal, vol. 55, no. 8, 2019, pp. 1255–1274. doi.org/10.1007/s10597-019-00413-9
Oblath, R., Herrera, C. N., Were, L. P. O., Syeda, H. S., Duncan, A., Ferguson, T., Kalesan, B., Perez, D. C., Taglieri, J., Borba, C. P. C., & Henderson, D. C. (2023). Long-Term Trends in Psychiatric Emergency Services Delivered by the Boston Emergency Services Team. Community mental health journal, 59(2), 370–380. https://doi.org/10.1007/s10597-022-01015-8

Exhibit 2. Average hospital cost and charges based on current IPF ED Visit measure rate among Medicare beneficiaries

Current population and estimates Total

Estimated number of Medicare beneficiaries discharged from IPFs in 2020^a178,143

Proportion with an ED visit within 30-days of discharge from IPF^b20.8%

Total patients with an ED visit within 30 days of IPF discharge 37,054

Average hospital cost per ED visit (Medicare), 2020^c$1,031

Average hospital charges per ED visit (Medicare), 2020^d$7,491

Total average hospital costs resulting from ED visits by patients discharged from IPFs (in $) $38,202,410

Total average hospital charges resulting from ED visits by patients discharged from IPFs (in $) $277,569,596

^a Medicare beneficiaries discharged from IPFs between June 1, 2019 and July 31, 2021, excluding the first two quarters of 2020 due to COVID-19, that met denominator criteria for the measure totaled 282,060. We divided this number by 19 months to determine monthly discharges and then multiplied by 12 to generate an annual estimate of 178,143.

^b 30-Day Risk Standardized All-Cause Emergency Department Visit Following an Inpatient Psychiatric Facility Discharge performance rate based on analysis of Medicare claims date for adult IPF discharged between June 1, 2019 and July 31, 20221 (n=282,060).

^c HCUPNet data (2020), Treat-and-release ED visits, Average hospital cost per ED visit for cases with Medicare as expected payment source: $1,031. Costs reflect the actual expenses incurred in the production of hospital ser vices, such as wages, supplies, and utility costs. ‘Treat -and-release’ includes ED visits that did not result in admission to the same hospital and ED visits transferred to another hospital or ED.

^d HCUPNet data (2020), Treat-and-release ED visits, Average hospital charges per Ed Visit for cases with Medicare as expected payment source: $7,491. Charges represent what the hospital billed for a case. ‘Treat-and-release’ includes ED visits that did not result in admission to the same hospital and ED visits transferred to another hospital or ED.

As seen in Exhibit 3, if the rate of ED visits following IPF discharge declined by 5 percentage points, the average hospital costs and average hospital charges would drop by about $9 million and $67 million, respectively.

Exhibit 3. Average hospital cost and charges with reduction in IPF ED Visit measure rate by 5 percentage points among Medicare beneficiaries

Current population and estimates Scenario: Reduce ED visits by 5 percentage points

Total Difference

Estimated number of Medicare beneficiaries discharged from IPFs in 2020^a178,143 -

Proportion with an ED visit within 30-days of discharge from IPF^b15.8% -5.0%

Total patients with an ED visit within 30 days of IPF discharge 28,147 -8,907

Average hospital cost per ED visit (Medicare), 2020^c$1,031 -

Average hospital charges per ED visit (Medicare), 2020^d$7,491 -

Total average hospital costs resulting from ED visits by patients

discharged from IPFs (in $) $29,019,138 -$9,183,272

Total average hospital charges resulting from ED visits by

patients discharged from IPFs (in $) $210,846,136 -$66,723,461

2.5 Health Care Quality Landscape

The IPF Readmission and FAPH measures address IPF quality of care, yet they are not sufficient to address the need to reduce ED visits for acute behavioral health encounters following discharge from the IPF. The IPF Readmission is a facility-level measure that estimates unplanned, 30-day risk-standardized readmission rate for adult Medicare FFS patients discharged from an inpatient psychiatric facility with a principal discharge diagnosis of a psychiatric disorder, dementia, or Alzheimer's. FAPH assesses the percentage of IPF hospitalizations for mental health or substance use disorders that were followed by an outpatient mental health or substance user encounter within 7- and 30-days following discharge.

The IPF ED Visit measure assesses the proportion of patients ages 18 years and older with an ED visit, including observation stays, for any cause, within 30 days of discharge from an IPF, without subsequent admission. The IPF ED Visit measure is intended to compliment and harmonize with the measures mentioned above, to the extent possible (exclusions, age parameters, etc.). The IPF ED Visit measure complements the IPF Readmission by assessing the proportion of patients who may seek ED care without a readmission to an IPF.

The IPF ED Visit measure gives IPF providers additional data points on how their patients fare after discharge, as some patients may seek treatment in an ED instead of an IPF. This measure will additionally complement existing measures by quantifying follow-up, ED visits, and IPF readmissions, which will incentivize IPFs in high quality discharge planning, care coordination, and timely follow-up. The IPF ED Visit measure is intended to incentivize providers to conduct discharge planning in ways that ensure patients have an opportunity and ability to adhere to post-discharge instructions associated with medication use and follow-up with community-based providers.

2.6 Meaningfulness to Target Population

In the spring of 2023, we convened a behavioral health technical expert panel (TEP) (n = 7) that guided our work on this measure, which included three patient caregivers. TEP members were asked if they strongly agree, agree, disagree, or strongly disagree with the following statement, “The IPF ED Visit measure provides information that is easy to understand.” Two of the three patient caregivers that responded to the question agreed with the statement. One of these two patient caregivers agreed that providers can use facility-level scores on the IPF ED Visit measure to help make decisions about inpatient care and workflow. A 30-day public comment period was held for the IPF ED Visit measure. No other responses were received from patients or caregivers.

Equity

Equity

Equity

3.1 Contributions Toward Advancing Health Equity

As previously indicated, we added each SDOH variable on its own to the risk model adjusted by the clinical risk factors, and nearly all SDOH variables had much weaker associations with the outcome (odds ratios closer to 1.0). These variables include dual eligibility status, percentage of patients who do not speak English or speak limited English, rural-urban commuting area (RUCA), which is used as a proxy for the geographical location, and Agency for Healthcare Research and Quality (AHRQ) SES Index.

In the univariate analyses, 14 out of 17 SDOH risk factors had a statistically significant association with the outcome. After controlling for the clinical risk factors, only 11 SDOH risk-factors remained statistically significant (p < = 0.05). At the patient-level, non-dually eligible patients have lower odds of having an ED visit as compared to the patients with dual eligibility status (0.901[95 percent CI: 0.882-0.921]); Black (non-Hispanic) patients had higher odds of having an ED visit (1.140[95 percent CI: 1.107-1.175]) compared to the rest of the patients. At the area-level, neighborhoods where more residents do not speak English or speak limited English, neighborhoods with higher percentage of Black residents, and more urban areas had higher ED visit rates. When controlling for the clinical factors, dual eligibility, race, neighborhoods with higher percentages of Black residents and residents who either do not speak English or speak limited English, more urban areas, and areas with higher patient-to-hospital bed ratios (at the area level) had a statistically significant association with higher ED visit rates. This measure may further advance health equity by increasing data points that further explain disparities in behavioral health care for individuals who are dually eligible, non-white, individuals with limited English language proficiency, and individuals who live in more urban communities.

Scientific Acceptability

Testing Data

5.1.1 Data Used for Testing

The IPF ED Visit measure was tested using Medicare beneficiary and coverage files, Medicare FFS Part A and B records, data from the American Community Survey, Dartmouth Atlas of Health Care (which includes data elements from the Centers for Medicare & Medicaid Services (CMS), United States Census Bureau, the American Hospital Association, the American Medical Association, and the National Center for Health Statistics), and Health Resources and Services Administration HPSA files. Both administrative and claims data were used for testing.

For measure calculation, the following Medicare files are required:

Medicare beneficiary and coverage files. Provides information on patient demographic, enrollment, and vital status to identify the measure population and certain risk factors.
Medicare fee-for-service (FFS) Part A records. Contains final action claims submitted by acute care and critical access hospitals, IPFs, home health agencies, and skilled nursing facilities to identify the measure population, readmissions, and certain risk factors.
Medicare FFS Part B records. Contains final action claims submitted by physicians, physician assistants, clinical social workers, nurse practitioners, and other outpatient providers to identify certain risk factors. For this measure, claims for services such as laboratory tests, medical supplies, or other ambulatory services were not used. This ensures that diagnoses result from an encounter with a provider trained to establish diagnoses and not from a claim for a diagnostic test.

Index admissions and ED visits are identified in the Medicare Part A data. Comorbid conditions for risk adjustment are identified in the Medicare Part A and Part B data in the 12 months prior to and including the index admission. Demographic and FFS enrollment information is identified in the Medicare beneficiary and coverage files. Adult IPF admission with admission and discharges between June 1, 2019, and July 31, 2021, were used to calculate the IPF ED Visit measure performance rate. We excluded cases during the first two quarters of 2020 due to the COVID-19 public health emergency.

5.1.2 Differences in Data

We calculated performance rates and conducted reliability and validity analyses for the 1,483 IPFs with 25 or more discharges during the measurement period (June 1, 2019, through July 31, 2021). We aligned with previous testing of the existing and conceptually related IPF Readmission measure to identify potential social determinants of health (SDOH) that may put patients at a higher risk of ED visits and tested their effect on the measured outcome in the risk-adjustment model. We included two patient-level variables (Dual Medicare-Medicaid status and race/ethnicity) and eighteen neighborhood-level SDOH variables in the analyses. Please refer to the Data sources for SDOH tab in Risk-model specifications.xlsx file, attached below.

Using the datasets listed above (i.e., Medicare beneficiary and coverage files, Medicare FFS Part A and B records) as well as the datasets provided in Data sources for comorbidities tab in Risk-model specifications.xlsx file, we constructed a set of variables capturing distinct aspects of patient- and area-level SDOH characteristics to use in the risk-adjustment model.

The only patient-level SDOH variable considered for inclusion in the final risk-adjustment model was Medicare-Medicaid dual enrollment, as an indicator of poverty. While we assessed the effect of race on the probability of the ED visit, we did not consider risk-adjusting for race or ethnicity, per the recommendation of the National Quality Forum (NQF) Risk-Adjustment Expert Panel and Disparities Standing Committee (National Quality Forum, 2014).

In the absence of patient-level data on beneficiary socioeconomic status (SES) characteristics, area-based variables offer the potential to capture characteristics of patient environments and exposure to social and economic conditions (Krieger, 2003). Studies that used both levels of factors showed similar results and found that area- and individual- factors independently and jointly affect some health outcomes (Roux, 2001). The area-level variables capture demographic and SES characteristics of patient neighborhoods, the local healthcare system, adjusted mortality rates of Medicare beneficiaries, and urbanization level. We were not able to create variables for patient housing stability, marital status, or availability of social support because these data are not collected for all Medicare beneficiaries.

We merged area-level SDOH data to patient-level data using the nine-digit zip-code variable. Recent studies show that block group-, census tract-, and zip-code level indicators detect expected gradients of the SES in health outcomes similarly (Berkowitz et al., 2014). In the patient-level data file use to test the IPF ED Visit measure, 133 IPF index admissions did not have any zip code; 98.4 percent (n = 277,534) had a nine-digit zip code, and 1.6 percent (n = 4,393) had a five-digit zip code only. For the beneficiaries with unavailable nine-digit zip codes, we computed the area-level characteristics at the five-digit zip code.

Relevant literature:

Berkowitz, S. A., Traore, C. Y., Singer, D. E., & Atlas, S. J. (2014). Evaluating Area-Based Socioeconomic Status Indicators for Monitoring Disparities within Health Care Systems: Results from a Primary Care Network. Health Services Research, 2, 398–417. https://doi.org/10.1111/1475-6773.12229
Krieger N, Chen JT, Waterman PD, Rehkopf DH, Subramanian SV. Painting a truer picture of US socioeconomic and racial/ethnic health inequalities: the Public Health Disparities Geocoding Project. Am J Public Health. 2005;95(2):312-323
National Quality Forum (2014). Risk Adjustment for Socioeconomic Status or Other Sociodemographic Factors. Retrieved from http://www.qualityforum.org/Publications/2014/08/RA_SES_Technical_Report.aspx
Roux, A. V. D. (2001). Investigating neighborhood and area effects on health. American Journal of Public Health, 91(11), 1783–1789. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1446876/pdf/0911783.pdf

5.1.3 Characteristics of Measured Entities

The measure was tested at the hospital level (IPF). Testing used claims data for patients discharged from any of the 1,617 IPFs participating in the Inpatient Psychiatric Facility Quality Reporting Program. Some analyses were limited to IPFs with 25 or more discharges during the measurement period (n = 1,483).

5.1.4 Characteristics of Units of the Eligible Population

We tested the measure using data from 194,531 patients across 282,060 index admissions across 1,617 IPFs. The patient characteristics are described in Exhibit 4.

Exhibit 4. Patient characteristics

Patient characteristics n %

Total patients: 194,531 100.0

Age (mean = 58.6, SD = 17.9)

18–64 110,617 56.9

65+ 83,914 43.1

18–34 23,258 12.0

35–64 87,359 44.9

65+ 83,914 43.1

Race/ethnicity

Black 29,024 14.9

White 149,989 77.1

Hispanic 5,933 3.1

Other 5,569 2.9

Unknown/missing 4,016 2.1

Sex

Female 99,951 51.4

Male 94,580 48.6

Dual Medicare-Medicaid status

Yes 104,188 53.6

No 90,343 46.4

Data source: Medicare claims date for adult IPF admissions with admission and discharge between June 1, 2019, and July 31, 2021, excluding the first two quarters of 2020 due to the COVID-19 public health emergency.

Reliability

5.2.1 Level(s) of Reliability Testing Conducted

Accountable entity level (i.e., measure score) (e.g., signal-to-noise analysis)

5.2.2 Method(s) of Reliability Testing

The IPF ED Visit measure rates are computed using a hierarchical logistic regression. This approach adjusts the IPF ED Visit rate results for smaller facilities by smoothing their rates closer to the mean. Thus, signal-to-noise reliability analysis is less suitable for this measure because the hierarchical logistic regression adjustment removes IPF-level variation in the risk-standardized rates and could make the results more reliable. We assessed measure score reliability using a bootstrapped test-retest approach at the facility-level. A test-retest approach examines the agreement between repeated measure scores at the same IPF during the same period taking many different random sample from the test population. A total of 1,483 IPFs with at least 25 discharges during the measurement period were included in the reliability analysis. We assessed reliability using the intra-class correlation coefficient (ICC), a reliability coefficient that reflects both correlation and agreement between measurements (Ranganathan et al., 2017).

We used a bootstrap approach to draw 1,000 pairs of samples from the original measure cohort with replacement (stratified sampling by IPF), maintaining the sample size within each IPF. We estimated the ICC in the bootstrap sampling for each pair of the bootstrap samples. With the 1,000 ICC estimates, we determined the distribution of estimated ICCs and calculated the mean, median, and percentile distribution of the ICC. The randomly sampled sets of admissions from a given hospital are assumed to reflect an independent set of re-measurement of readmission rates for the hospital. Adequate reliability is assumed if the risk-standardized measure rates calculated from the random datasets for the same IPF are similar. Higher ICC values indicate stronger agreement between measure scores in the samples and better measure reliability. The bootstrapping approach has advantages over the split-half method because it avoids biased sampling, maintains the original sample size, and allows estimation of ICC confidence.

Relevant literature:

Ranganathan P, Pramesh CS, Aggarwal R. Common pitfalls in statistical analysis: Measures of agreement. Perspect Clin Res. 2017 Oct-Dec;8(4):187-191.

5.2.3 Reliability Testing Results

The mean ICC obtained from the bootstrapping approach is 0.690 (95% confidence interval [CI] 0.550, 0.831) and the median ICC is 0.698 (ICC interquartile range: 0.690 – 0.705). Table 2 (below) provides detailed reliability testing results across the 1,483 IPFs with 25 or more discharges during the measurement period.

5.2.4 Interpretation of Reliability Results

The ICC obtained from the bootstrapping approach, comparing 1,000 pairs of samples of the original measurement cohort, which were sampled with replacement yielding an identical sample size as the original measurement cohort, is 0.690 (ICC range: 0.683 – 0.756). These results suggest the high reliability of the measure.

Table 2. Accountable Entity Level Reliability Testing Results by Denominator, Target Population Size

Accountable Entity-Level Reliability Testing Results
	Overall	Minimum	Decile_1	Decile_2	Decile_3	Decile_4	Decile_5	Decile_6	Decile_7	Decile_8	Decile_9	Decile_10	Maximum
Reliability	0.690	0.00	0.682550979	0.687673994	0.691466134	0.694924154	0.697561482	0.700271986	0.70340118	0.707332578	0.712465142	0.756010232	0.756
Mean Performance Score	1,483
N of Entities

Validity

5.3.1 Level(s) of Validity Testing Conducted

Accountable entity level (i.e., measure score) (e.g., criterion validity)

5.3.3 Method(s) of Validity Testing

Empirical validity testing was conducted for this measure. We assessed the relationship between the IPF ED Visit measure rate and the IPF Readmission measure rate. We used Spearman rank-order correlation to assess cross-measure validity to account for a possible non-linear relationship between the measure scores. We hypothesized IPFs with a high proportion of patients readmitted following an IPF hospitalization would also have a high proportion of patients with ED visits, without readmission, following an IPF hospitalization.

We also conducted hypothesis-driven validity testing to determine if performance rates varied among subgroups of patients in ways that were consistent with the empirical literature. Our literature-based hypotheses are as follows:

Sex. We hypothesized that male patients will have higher ED visit rates than female patients.
Race/ethnicity. We hypothesized that white, non-Hispanic patients will have lower ED visit rates than non-white, Hispanic patients.
Dual eligibility status. We hypothesized that patients who are dually eligible for Medicare and Medicaid will have higher ED visit rates than patients who are not dually eligible.
IPF length of stay (LOS). We hypothesized that patients with a longer LOS will have higher ED visit rates than patients with a shorter LOS.

Independent samples t-tests were used to compare mean group differences in ED visit scores based on sex, race, Medicare-Medicaid dual enrollment status, and IPF length of stay (LOS). With large sample sizes, small differences that are statistically significant may not always be practical or clinically meaningful. Therefore, we computed Cohen's d effect size (the difference in mean scores divided by the pooled standard deviation). Following Cohen’s (1988) definitions, we defined effect size values as small (0.2), medium (0.5), or large (0.8). In our hypothesis-driven validity testing, we used unadjusted performance rates, as risk-standardization reduces the variability in the distribution of measure scores and could obscure differences between subgroups of patients.

Relevant literature:

Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Lawrence Earlbaum Associates.
Larkin, G. L., Claassen, C. A., Emond, J. A., Pelletier, A. J., & Camargo, C. A. (2005). Trends in US emergency department visits for mental health conditions, 1992 to 2001. Psychiatric services, 56(6), 671-677.
Peters ZJ, Santo L, Davis D, DeFrances CJ. Emergency department visits related to mental health disorders among adults, by race and Hispanic ethnicity: United States, 2018–2020. National Health Statistics Reports; no 181. Hyattsville, MD: National Center for Health Statistics. 2023.
Strashny A, Cairns C, Ashman JJ. Emergency department visits with suicidal ideation: United States, 2016–2020. NCHS Data Brief, no 463. Hyattsville, MD: National Center for Health Statistics. 2023. DOI: https://dx.doi.org/10.15620/cdc:125704.
Stroever S, Brett C, Michael K, Petrini J. Emergency department utilization for mental health conditions before and after the COVID-19 outbreak. Am J Emerg Med. 2021 Sep;47:164-168.

5.3.4 Validity Testing Results

For the characteristics listed above in our validity testing, the statistical results are as follows:

Sex. There is a 2.6 percent difference in the mean IPF ED Visit rate between male and female beneficiaries (Cohen’s D = 0.06).
Race. The difference in the IPF ED Visit rates between Black and White beneficiaries is 3.7 percentage points (Cohen’s D = 0.09).
Dual eligibility status. There is a difference of 5.5 percentage points between the mean IPF ED Visit rate among beneficiaries with dual eligibility for Medicare-Medicaid and Medicare-only (non-dual) status (Cohen’s D = 0.16).
IPF LOS. There is an 8.9 percentage point difference in the mean IPF ED Visit rate for beneficiaries with long LOS (4th quartile) versus short LOS (1st quartile) (Cohen’s D = 0.22).

5.3.5 Interpretation of Validity Results

There is a moderate, yet meaningful, positive relationship between the facility rates on the IPF ED Visit measure and the IPF Readmission measure (Spearman ranked correlation ρ = 0.42). A positive relationship between the measure rates indicates that an increase in ED visit rates is associated with the increase in the unplanned 30-day IPF readmission. We observed small effect sizes for the differences in the observed ED visit measure rates by patient subgroups:

Sex. There is a 2.6 percent difference in the mean IPF ED Visit rate between male and female beneficiaries (Cohen’s D = 0.06). There is a 51.7 percent chance that a male beneficiary will have higher probability of an ED visit compared to a female beneficiary.
Race. The difference in the IPF ED Visit rates between Black and White beneficiaries is 3.7 percentage points (Cohen’s D = 0.09). There is a 52.5 percent chance that a Black patient will have higher probability of an ED visit relative to an average White patient.
Dual eligibility status. There is a difference of 5.5 percentage points between the mean IPF ED Visit rate among beneficiaries with dual eligibility for Medicare-Medicaid and Medicare-only (non-dual) status (Cohen’s D = 0.16). There is a 54.5 percent chance that a patient with dual eligibility status will have higher probability of an ED visit as compared to a non-dual patient.
IPF LOS. There is an 8.9 percentage point difference in the mean IPF ED Visit rate for beneficiaries with long LOS (4th quartile) versus short LOS (1st quartile) (Cohen’s D = 0.22). There is a 56.2 percent chance that a patient with longer LOS will have higher probability of an ED visit compared to a beneficiary with a short LOS.

Observed differences were small, as described above. However, they were in the expected directions and further support the ability of the measure to discriminate between the groups of beneficiaries known to have different ED visit measure rates.

5.3.2 Type of Accountable Entity Level Validity Testing Conducted (derived)

Empirical validity testing at the accountable entity-level (e.g., criterion validity, construct validity, known groups analysis)

Risk Adjustment

5.4.1 Methods Used to Address Risk Factors

Statistical risk adjustment model with risk factors

5.4.2 Conceptual Model Rationale

Patient health characteristics (i.e., principal diagnosis, comorbidities, age, gender, prior history of AMA, violence, and suicidality), patient- and area-level social determinants of health (SDOH) (i.e. disability status, dual eligibility status, race, and ethnicity, socioeconomic status, urbanization/rurality, housing stability, social support, adjusted mortality rate), and access to care/characteristics of local healthcare systems impact the structure (i.e. funding, staffing, clinical expertise, services, location) of an IPF, the quality of care received at an IPF, and therefore impact the risk of an ED visit 30 days following discharge from an IPF.

To identify candidate clinical and SDOH variables for risk adjustment we reviewed existing literature on risk factors for ED visit following psychiatric discharges, reviewed risk variables used in other ED visit measures, and performed bootstrap selection for candidate risk variables. For performance assessment, we only controlled for patient factors that were present prior to the start of care. The risk factors for health status at IPF admission included in the risk model are principal diagnosis of the IPF index admission, comorbidities, demographics of age and gender, and prior history of being discharged AMA, aggressive behavior, or suicidality. We used broad AHRQ CCS categories for the principal diagnoses in risk adjustment. However, while we collapsed unique principal discharge diagnosis ICD-10-CM codes into broader categories, we carefully reviewed crosswalks to ensure optimal capture of differences in readmission rates. This resulted in the development of subcategories for schizophrenia/psychosis and bipolar/depressive disorders and the further collapsing of developmental/childhood disorders and other psychiatric disorders.

For comorbidities, we used the CMS complication or comorbidity (CMS CC) categories to form clusters on comorbidities, but reviewed crosswalks to optimize the predictive performance of each cluster in capturing ICD-10-CM codes with similar associations with ED visits. This resulted in modification of the ICD-10-CM to CC crosswalk, mostly in following assignments in the comparable CCS category or collapsing certain CC categories based on similar ED visit rates. We obtained information on comorbidities from the secondary diagnosis of the index admission, after careful review and exclusion of conditions that may represent hospital-acquired complications rather than preexisting comorbidities, principal or secondary diagnoses of hospital admissions during the 12-month look-back period, or presence of at least two outpatient encounter claims with principal or secondary diagnoses of the same CC.

We also identified other variables in the literature that are relevant for the inpatient psychiatric population. These included history of discharge AMA, suicidality, electroconvulsive therapy/transcranial magnetic stimulation (ECT/TMS), or aggression; admission source (as proxy for involuntary admission); and count of psychiatric comorbidities. The key SDOH constructs that may affect the risk of ED visits for psychiatric patients include risk factors such as income/poverty, disability, race/ethnicity and language barriers, access to care, education, housing stability, and social support. As shown in the conceptual model, the impact of SDOH factors on ED visits can be direct or indirect through their effect on health status, the facility selected to obtain care, and the quality of the specific treatments and care received. Additionally, health status can influence SDOH factors. The mechanisms for the effect of sociodemographic factors on health are complex, interrelated, and may result from a lifelong, cumulative effect of social status on health. External factors related to local health-care markets and IPF structure can also affect patient’s access to services prior to admission. Quality of IPF care can directly affect ED visits after an IPF discharge. Risk models typically do not control for differences in such external factors.

5.4.2a Attach Conceptual Model

Exhibit 8-CBE Submission.zip

5.4.3 Variable Distribution Across Measured Entities

We observed substantial variation in the distribution of most of the risk variables identified in the conceptual model. Please refer to the Descriptive Statistics tab in the Risk-model specifications.xlsx file, attached below.

5.4.4 Risk/Case-Mix Adjustment Modeling and/or Stratification Results

We derived a parsimonious risk adjustment model by using logistic regression with a stepwise backward elimination process, which was repeated in 1,000 bootstrap samples from the entire population via random selection with replacement. This approach allows the use of the entire dataset for model development and a nearly unbiased estimate of predictive accuracy with relatively low variance compared with other validation approaches, such as data splitting and cross-validation. We retained candidate variables demonstrating a positive association with ED visits at p-value <0.15 in at least 70 percent of samples. The p-value cut-off of 0.15 was chosen to approximately mimic variable selection based on the Akaike Information Criterion (AIC). To select a candidate risk factor based on AIC, its chi-squared (χ²) value has to exceed twice its degrees of freedom (df). When considering a predictor with 1 df, such as gender or diagnosis code, this implies χ² >2 with p < 0.157.

The final risk model includes 49 risk-factors (excluding individual categories for categorical variables) capturing patients demographics, principal discharge diagnosis for the index admission, psychiatric and non-psychiatric comorbidities, and risk-factors identified during literature review (that is, suicide attempt/self-harm, aggression, and discharge AMA). Four potential comorbidity risk factors were dropped from the final risk model. We assessed model discrimination using the C-statistic, which reflects how accurately the model distinguishes between an index admission that does or does not have an ED visit. The C-statistic for the final risk-adjustment model was 0.670. A C-statistic of 0.5 represents random prediction and a C-statistic of 1.0 represents perfect prediction.

The variables in the final risk-adjustment model are presented in Risk-factor coeff. all risk-fct tab in the Risk-model specifications.xlsx file. The Candidate risk factor freq tab in the Risk-model specifications.xlsx file lists the frequencies and ED visit rates of all candidate clinical risk variables and details the output of the selection process, including the percent of bootstrap samples in which a variable had a p-value of <0.15. As the tab shows, most of the clinical risk factors were retained during the bootstrap selection process. The variables that did not demonstrate a positive association with ED visits at p-value <0.15 in at least 70 percent of samples were diabetes acute complications, diabetes chronic complications, hematological disorder, and coagulation defects.

In our testing, most of the SDOH variables were not retained in the final model during bootstrap selection process. Therefore, we added all candidate SDOH variables to the risk-adjustment model after selecting clinical risk factors to compare model discrimination for the model with clinical factors only and the model with clinical and SDOH factors. We assessed the impact on the model performance compared to the clinical risk factor only model in terms of predictive ability, c-statistic, distribution of residuals, model chi square, and distributions of RSRRs. Considering the contribution of the SDOH variables on risk model performance, we evaluated the SDOH variables based on their feasibility for use in a national CMS measure.

5.4.4a Attach Risk/Case-mix Adjustment Modeling and/or Stratification Specifications

Copy of Risk-model specifications.xlsx

5.4.5 Calibration and Discrimination

We assessed model calibration (i.e., whether a model accurately predicts probability of an ED visit) via a Hosmer-Lemeshow test (Exhibit 13), and the risk-decile calibration plot (Exhibit 14). Hosmer-Lemeshow test divides the patients into deciles (i.e., ten groups with equal number of patients) based on the expected risk for an ED visit, from lowest to highest risk. The range of expected risks of having an ED visit within each decile is determined by the patients in that decile. The difference between the observed and expected ED visit for each decile is summarized by the Pearson chi-square statistic. The statistics are then summed over the ten deciles and are compared to the chi-square distribution. In decile assessment, we should see similar numbers of observations in each decile group and increasing observed rates when we move from low to high deciles.

The risk-decile calibration plot with observed outcomes versus expected probabilities of readmission was computed to localize possible deviations across risk strata. In the risk-decile calibration plot (Exhibit 14), the diagonal line is the line of perfect calibration. In a well-calibrated model, all markers representing deciles should be close to the diagonal line. In this graph, the markers appear close to the diagonal line, which indicates a close agreement between the observed and expected probabilities of the ED visit.

We assessed model discrimination using the C-statistic, which reflects how accurately the model distinguishes between an index admission that does or does not have an ED visit. The C-statistic for the final risk-adjustment model was 0.667. A C-statistic of 0.5 represents random prediction, and a C-statistic of 1.0 represents perfect prediction.

Risk adjustment model performance parameters showed excellent calibration with no indication of over-fitting. The mean observed ED visit rates range from 45.1 percent in the highest decile to 9.7 percent in the lowest decile, an absolute difference of 35.3 percentage points, suggesting good discrimination. The ratio of observed to predicted IPF readmission rates is close to 1.0 for each decile, suggesting adequate calibration of the model. The Hosmer-Lemeshow statistic was 131.29 (df = 8; p < 0.001). Given the sensitivity of the Hosmer-Lemeshow statistic to sample size, calibration was reassessed using 20 random samples of 5,000 patients taken from the sample. A total of 18 of the 20 randomly selected samples of 5,000 patients showed non-significant H-L statistics, supporting the evidence that the model is correctly specified and fits the data well.

The C-statistic of 0.667 suggests moderate predictive discrimination, expressed as the model’s ability to distinguish between index admissions that are and are not followed by a readmission. Statistical findings of excellent calibration are confirmed when comparing observed to predicted probabilities by risk deciles (Exhibit 14). The results are in-line with the NQF-endorsed measures capturing post-discharge care services (that is, ED use and hospital readmissions) developed for other settings, such as Short-Stay Residents who have had an Outpatient Emergency Department Visit (0.610); Short-Stay Residents who were Re-hospitalized after a Nursing Home Admission (0.614); Hospital 30-Day Heart Failure Readmission measure (0.601); Hospital 30-Day Pneumonia Readmission Measure (0.630); Hospital 30-Day Acute Myocardial Infarction Readmission Measure (0.630); and Hospital-Wide Readmission Measure (0.64 to 0.71) (Centers for Medicare and Medicaid Services. (2020). Nursing Home Compare Quality Measure Technical Specifications, Appendices. Abt Associates. https://www.cms.gov/Medicare/Provider-Enrollment-and-Certification/CertificationandComplianc/Downloads/APPENDIX-New-Claims-based-Measures-Technical-Specifications-January-2020.pdf).

5.4.5a Attach Calibration and Discrimination Testing Results

Exhibits 13 and 14-CBE Submission.pdf

5.4.6 Interpretation of Risk/Case-mix Factor Findings

See above

5.4.7 Final Approach to Address Risk Factors

Statistical risk adjustment model with risk factors

Specify number of risk factors

We selected a total of 53 candidate risk factors based on the conceptual risk model (excluding the SDOH). Of these 53 risk factors, 49 appeared in the final risk adjustment model. Please refer to the Risk-model specifications.xlsx file, attached below.

Comments

Public Comments

Measure 4190 Should Not Be Endorsed

The "30-Day Risk Standardized All-Cause Emergency Department Visit Following Inpatient Psychiatric Facility Discharge" measure should not be endorsed. It is not a valid measure of the quality or efficiency of care provided by inpatient psychiatric hospitals (IPFs) due to problems with the ways the numerator and denominator are defined and problems with the risk adjustment methodology. Public reporting of the results could mislead patients and families about where they should receive treatment for mental health problems, and use of the measure to modify hospital payments could worsen disparities in access and outcomes for patients.

Problems with the Numerator

The stated goal of the measure is to “encourage IPFs to proactively focus on discharge planning and community reintegration during patients’ IPF stays.” The definition of the measure is based on a presumption that an emergency department visit following discharge from the inpatient psychiatric hospital reflects a failure by the IPF to deliver high-quality care. However, the numerator includes emergency department visits that are unrelated to the care delivered by the IPF. At the same time, it excludes many ED visits that are related to the IPF’s care.

Inclusion of Visits Unrelated to Mental Health Problems. The numerator includes any visit to an Emergency Department within 30 days following discharge from the IPF, regardless of whether the ED visit had anything to do with the problem treated during the IPF stay or other mental health problems. Individuals visit emergency departments for a wide variety of reasons, including accidental injuries, new acute illnesses, and exacerbations of chronic medical conditions. Including such visits overestimates the true rate of mental health problems following discharge.
Exclusion of ED Visits That Result in Hospital Admissions. The numerator excludes emergency department visits that result in a hospital admission. As a result, if a patient comes to an ED for a mental health problem and is admitted to the hospital, that visit will not be counted in the measure. This will underestimate the true rate of mental health problems following discharge.

Inclusion of Unrelated Visits

It does not appear that the measure developers made any effort to assess what proportion of ED visits after an IPF discharge were actually related to mental health conditions or how that proportion varied by hospital. According to a 2019 ASPE study, the majority of Medicare IPF patients received outpatient health care services that were unrelated to mental health in the 30 days before their IPF admission, so it would not be surprising to find that a high proportion of ED visits after discharge are for things other than mental health problems, and it would also not be surprising to find that there are differences in the number and types of those problems among the patients treated by different IPFs. (Blair R, et al. Transitions in Care and Service Use Among Medicare Beneficiaries in Inpatient Psychiatric Facilities. HHS Office of the Assistant Secretary for Planning and Evaluation, April 2019.)

The inclusion of so many visits unrelated to mental health issues creates two different problems.

First, it dilutes the effect of differences between IPFs that are related to mental health care. For example, if the rate of ED visits that are actually related to patients’ mental health problems is 50% higher at IPF #1 as at IPF #2, but only half of the visits are related to mental health issues, then the overall visit rate will only be 25% higher at IPF #1 than at IPF #2 (0.5 + 0.75)/(0.5 + 0.5) = 1.25.
Second, differences between hospitals in the rate of visits that are unrelated to mental health issues may cause IPFs to appear to have higher or lower quality care than they actually do. For example, a patient with mental illness may come to the ED for an exacerbation of a chronic condition such as heart failure or COPD, not because of issues related to their mental illness. Rates of ED visits for chronic disease exacerbations will be higher in communities where chronic disease management services are less available, and patients who have lower incomes or lack of family support may have greater difficulty accessing the services that do exist. In these types of communities and for these types of patients, there will likely be more post-discharge ED visits for chronic disease exacerbations that are unrelated to outpatient mental health care. The risk adjustment methodology only controls for the presence of a chronic condition; it does not control for differences in the care patients receive for the chronic condition. Care for the chronic condition is not the responsibility of the IPF that delivered treatment for mental health problems, but under this measure, a higher rate of ED visits for problems related to chronic conditions will be interpreted as poor quality care for the patient’s mental health needs. Similarly, a patient may come to the ED for an acute condition that is unrelated to mental health (e.g., a viral infection). An IPF in a community experiencing high rates of such illnesses could have higher rates of post-discharge ED visits, and this measure would make it appear that the IPF delivers lower quality psychiatric care. The risk adjustment model used in the proposed measure has variables indicating the presence of chronic conditions, but there are no variables to indicate the presence of new acute conditions.

Rather than attempting to correct for these unrelated problems through a risk adjustment model, it would be better to simply limit the numerator to ED visits that are related to mental health problems. The Measure Information Form indicates that public comments on the draft measure recommended removing the “all-cause” definition used for the numerator. This is very feasible to do:

For example, a study of the impact on psychiatric care resulting from improving access to community health centers measured changes in the number of psychiatric ED visits, which were defined as ED visits with a diagnosis code corresponding to mental health disorders. (Bruckner, TA, et al. “Psychiatric Emergency Department Visits After Regional Expansion of Community Health Centers,” Psychiatric Services 70(10): 901-906.)
An analogous approach is used in the “Follow-Up After Psychiatric Hospitalization (FAPH)” measure that is already part of the measures CMS calculates for IPFs. That measure only includes outpatient mental health care encounters after discharge in its numerator, not all outpatient visits, and outpatient mental health care encounters are defined as “outpatient visits, intensive outpatient encounters, or partial hospitalizations provided by a mental health provider for which mental health or SUD diagnoses are mentioned anywhere on the follow-up visit claim.” A similar definition could be used to define ED visits related to mental health issues.

Exclusion of Related Visits

The fact that the measure excludes patients who are admitted to the hospital from the ED creates additional problems. If the patients who are discharged from some IPFs visit emergency departments that are less likely to admit patients to the hospital, those IPFs could have higher ED visit rates, even though the higher rates are due to differences in the way emergency departments operate, not due to differences in either the care delivered by the IPF or the mental health care the patients were receiving in the community after discharge.

It does not appear that the measure developers examined IPF discharges to assess the magnitude of the variation in the proportion of ED visits resulting in hospital admissions. A study of ED visits following hospital discharges for heart and lung conditions found that higher rates of total ED visits were associated with lower rates of hospital readmissions (Venkatesh AK, et al. “Association Between Postdischarge Emergency Department Visitation and Readmission Rates,” Journal of Hospital Medicine 13(9): 589-594). If there is a similar relationship for patients discharged from IPFs, then excluding the subset of ED visits that result in hospital admissions (as this measure proposes to do) would result in erroneous estimates of the differences in rates across hospitals.

This problem could easily be avoided by expanding the numerator to include all mental health-related ED visits, including those that result in a hospital admission.

Failure to Exclude Patients Who Die After Discharge from the Denominator

The denominator excludes patients who died during the IPF stay but it does not exclude patients who died during the 30 days after discharge. As a result, all else being equal, an IPF with a higher post-discharge mortality rate would have a lower ED visit rate and under this measure, the IPF would appear to be delivering higher-quality care. The Measure Information Form describes steps taken to identify patients who were recorded as being dead at discharge but who were actually alive, however, it does not address the issue of determining whether patients died after discharge but before the 30-day measurement window ended.

Problems with the Risk Adjustment Methodology

The measure does not report the actual rate of post-discharge ED visits; it reports a ratio of the “predicted” number of visits to the “expected” number of visits. Both of these parameters are calculated using a hierarchical logistic regression model that is supposed to adjust for differences in the characteristics of patients that affect the rate of ED visits. However, there are serious flaws with the model that is used:

Failure to adjust for availability of community mental health care. The effectiveness of an IPF’s discharge planning and community reintegration activities is inherently constrained by the availability of mental health services in the communities where the patients discharged from the IPF will be living. As a result, an IPF will likely have a higher rate of ED visits if more of its patients live in communities that have limited community mental health services. There are no variables in the risk adjustment methodology to adjust for this.
Failure to adjust for whether the patient was discharged to another facility. An ASPE study found that only 70% of IPF patients were discharged to their home or self-care; most of the others were discharged to Skilled Nursing Facilities, intermediate care facilities (ICFs), or inpatient settings (Blair R, et al. Transitions in Care and Service Use Among Medicare Beneficiaries in Inpatient Psychiatric Facilities. HHS Office of the Assistant Secretary for Planning and Evaluation, April 2019). It seems likely that rates of ED visits will differ for patients in these different post-discharge settings, but there is no adjustment for this in the risk adjustment methodology. There is no indication that the measure developers examined either the magnitude of these differences or the variation in the discharge locations across different IPFs in order to determine whether the measure should be stratified or adjusted based on discharge location so that it more accurately and fairly compares IPFs.
Failure to adjust for propensity of patients to use the ED for chronic conditions. Patients with severe or poorly managed chronic conditions are more likely to have ED visits for exacerbations of those conditions than other patients with the same chronic conditions. Because the numerator of the measure includes ED visits made for any reason, not just visits that are related to mental health conditions, these chronic disease-related visits will increase the rate of ED visits. Although some types of chronic conditions are included as variables in the risk adjustment model, the presence of a condition does not indicate how severe it is or how well managed it is. No variables are included in the risk adjustment model to adjust for this (e.g., the rate at which a patient visited the ED for chronic disease exacerbations before the IPF admission), so an IPF that has more patients with poorly managed chronic conditions could inappropriately appear to be delivering lower-quality mental health care.
Failure to include variables affecting patient access to other types of health services. Patients who have low incomes, patients who live in rural areas, and patients who live in areas with shortages of healthcare professionals will have greater difficulties accessing medical services and thereby be more likely to visit EDs for diagnosis and treatment of acute conditions. According to the Measure Information Form, variables for Medicaid enrollment (assessed through dual Medicare-Medicaid status), rurality, and health professional availability were tested in the modeling and found to have a large and statistically significant impact on the rate of ED visits. However, these variables were not included in the final model used for the measure, and there is no explanation as to why, other than a statement that “risk models typically do not control for differences in such external factors.” The Measure Information Form specifically notes that non-dually eligible patients had lower odds (0.901) of an ED visit than dually eligible patients, but it does not explain why dual eligibility was not included in the final model nor does it examine the implications of failing to do so. While this may have been done in an effort to prevent “hiding” income-related disparities in treatment by IPFs, it means that an IPF can be penalized for problems that low-income patients face in accessing medical care and other services for conditions that are unrelated to their mental health problems.

Inadequate Evaluation of Risk Adjustment Model Performance

Because of the many factors that could affect the rate of post-discharge ED visits, a thorough analysis is needed to determine how well the regression model in this measure adjusts for variations in outcomes and factors that are beyond the control of an IPF. The simplistic analysis provided in the Measure Information Form is not adequate to justify use of this measure.

The developers report that the c-statistic for the risk adjustment model was 0.667, which they describe as “moderate predictive discrimination.” What the statistic means is that there is only a 66.7% chance that a patient who has a post-discharge ED visit will be classified as higher risk than a patient who does not experience a visit, i.e., a patient will only be classified correctly 2/3 of the time. In comparison, a coin flip would have a 50% chance of predicting a visit accurately. Although one should not expect the model to predict 100% of visits if it is believed that hospitals differ in how well they are carrying out discharge planning and facilitating post-discharge transitions, it seems unlikely that a model with a c-statistic of .67 will accurately identify patients who are likely to make ED visits that are unrelated to the IPF’s care. The fact that this statistic is similar to the statistics reported for other measures that use similarly problematic numerators does not mean the model is good. There is nothing in the Measure Information Form indicating whether alternative risk adjustment models were tested, or whether the model discrimination was evaluated using only ED visits for non-mental health problems, which is where discrimination is most important.

The Measure Information Form states that the risk-decile calibration plot “indicates a close agreement,” even though there is 13% underprediction in the lowest decile, 2-3% underprediction in the highest two deciles, and 4% over-prediction in two of the middle-range deciles. These types of prediction errors could lead to erroneous classifications of hospitals with the types of patients with these risk characteristics. The Hosmer-Lemeshow statistic was reportedly statistically significant, which indicates poor calibration. There is no information indicating that any efforts were made to adjust the model or to test alternative models to achieve better calibration, nor were there any efforts to use alternatives to the Hosmer-Lemeshow statistic to assess calibration. Instead, the measure developers stated that they recalculated the Homer-Lemeshow statistic for 20 smaller samples of cases and found that the statistic was not significant in most of these cases. However, they do not show the actual results of these analyses. They state that the rationale for doing these additional estimates was that the Homer-Lemeshow statistic is more likely to be significant with larger numbers of cases, but the converse of this is also true – it is less likely to be significant with smaller samples. Failure to find statistically significant results in artificially-restricted samples does not mean that there is no calibration problem.

Inadequate Assessment of Measure Reliability or Case Threshold

The Intraclass Correlation Coefficient (ICC) is used to assess measure reliability, and the mean ICC is reported as 0.690. Although this is described as “high” reliability, it is actually moderate at best. Moreover, a high ICC value can be misleading, because if the characteristics of patients differ from facility to facility and the risk adjustment model fails to adequately adjust for this, the ICC value will be higher. This is because more of the variation between patients will be inappropriately attributed to differences in the facilities. Because of the problems with the numerator and risk adjustment model in this measure, it is hard to know how to interpret the ICC.

Moreover, if a measure is going to be used to classify individual IPFs, what matters is the reliability of the measure for individual IPFs, not the mean reliability for all IPFs. The Measure Information Form contains no data on how reliability varies based on the number of patients treated by an IPF. Ordinarily, one would expect higher reliability for facilities with more patients, so the fact that the reliability rates in Table 2 of the Measure Information Form are almost identical across deciles suggests that the deciles are not defined based on IPF size. (It is not clear how the deciles are defined in the table; the Battelle Measure Submission Form is problematic because it requires submission of reliability measures by decile, but it does not specify what variable should be used to define the deciles.)

The Measure Information Form states that only IPFs with 25 or more discharges are included in the measure, but it provides no information as to how this threshold was chosen.

If the measure is going to be used to classify hospitals, an additional way to assess reliability is the extent to which the classifications of IPF ED visit rates differ from year to year. If random variation in patients and weaknesses in the risk adjustment model cause hospitals to change classifications from one year to the next (e.g., being classified as “worse than expected” in some years but not others), the measure will be of little value in guiding patients in their choice of facilities or determining whether there has actually been improvement (or deterioration) in performance. There is no indication in the Measure Information Form as to whether year-to-year reliability was assessed.

The measure developers do not report having made any effort to examine individual IPFs that would be classified as having high or low ED visit rates to ensure those classifications are not based on artifacts of the measure definition or the risk adjustment methodology.

Lack of Business Case for Using the Measure and Undesirable Effects of Using It

Most of the studies cited as demonstrating the importance of this measure do not actually discuss ED visits, so there is little evidence to support the presumption that the rate of ED visits after discharge is a valid measure of the quality of care provided by an IPF. There is also no information presented to justify the assumption in Exhibit 3 that ED visits can be reduced by 5%, so the calculation in that exhibit is not a valid assessment of the potential impact of the measure on healthcare costs.

The Measure Information Form provides no information on how many IPFs would be classified as having ED visit rates that are higher or lower than expected. Most measures that are similar to this have such high uncertainty in individual hospital scores that most hospitals can only be classified as “no different than expected.” If the classifications derived from this measure also change from year to year because of the low reliability of the measure, there would be little benefit to using it.

In addition, since the weaknesses in the measure methodology could cause some hospitals to be inappropriately labeled as “worse than expected” because of the kinds of patients they treat, use of the measure would create an undesirable incentive for IPFs to avoid treating patients with characteristics that are likely to result in higher numbers of ED visits, such as patients with multiple comorbidities and patients who do not have good access to primary care and chronic disease management services.

Organization

President and CEO, Center for Healthcare Quality and Payment Reform

Staff Preliminary Assessment

CBE# 4190 Staff Assessment

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Scientific Acceptability Reliability Rating

Not met but addressable

Scientific Acceptability Reliability

Strengths:

Measure is well defined and specified.
An average entity level signal-to-noise reliability was estimated with an ICC using a bootstrap method with a 2019-2021 dataset of 1,483 entities that met the criteria of at least 25 discharges. The median ICC from the bootstrap is 0.698, indicating that more than 50% of the enitites have a reliability >0.6.

Limitations:

The reported deciles seem to be deciles of the results of the bootstrap iterations, not entity-level reliabilities. Without estimates of reliability by entity it is only possible to conclude that less than 50% of the enitites have a reliability <0.6, although that number may be much lower.

Rationale:

Nearly 50% of the entities could have a reliability <0.6. Applying a method such as the Spearman-Brown formula (based on denominator size) would provide estimates of reliability by denominator (population) decile. In addition, if to support any low volume providers, the developer may consider some possible mitigation strategies to improve these estimates, such as:

Empirical approaches outlined in the report, MAP 2019 Recommendations from the Rural Health Technical Expert Panel Final Report, https://www.qualityforum.org/WorkArea/linkit.aspx?LinkIdentifier=id&ItemID=89673
Consider a higher minimum case volume.
Extend the time frame.
Focus on applying mitigation at the lower volume providers.

Scientific Acceptability Validity Rating

Not met but addressable

Scientific Acceptability Validity

Strengths:

Submission includes references to published literature (N=12) demonstrating an association between the measure focus and during-discharge planning and post-discharge timely access to care
For risk adjustment, the submission uses a robust approach to risk factor variable selection. Discrimination statistic (C-statistic) of 0.67 is in line with models for similar measures. Submission uses a robust method for assessing calibration, including random sampling to account for large sample sizes that generally limit the usefulness of the Hosmer-Lemeshow test.

Limitations:

For face validity testing, which is acceptbale for new measures, the submission references a TEP (N=7) but does not report results.
The developer aslo peformed empirical validity testing, in which performance rates are reported by sex, race, dual eligibility status, and IPF LOS. The effects were modest but in the anticipated direction, although no rationale was provided to support the hypothesis and rationale to explain why the modest results. The devleoper also reports results on an association between the measure of interest and IPF 30-day readmission rate. The effects (spearman rank correlation p=0.42) are in the antcipated direction, but again, no rationale provided.
For risk adjustment, for a few persons (<2%) the data needed to link area-level risk factors (9 digit zip code) is missing.
- Risk adjustment: Some social risk factors are at the area-level rather than the person-level due to the lack of available data

Rationale:

The committee may consider asking the developer for a rationale for the antcipated associations that explains the reasons the direction and magnitude of the association, and an explanation for how the known interventions could plausibly account for the between entity variation.

Use and Usability

Summary

Committee Independent Review

Overall summary

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Use and Usability

Summary

Agree with Staff Comments

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Use and Usability

Summary

A

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Use and Usability

Summary

NA

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Use and Usability

Summary

Measure Should Not Be Endorsed

Importance

Closing Care Gaps

Feasibility Assessment

Scientific Acceptability

Scientific Acceptability Reliability Rating

Not Met

Scientific Acceptability Reliability

This is not a reliable measure of the quality or efficiency of care provided by inpatient psychiatric hospitals (IPFs).

The Measure Information Form states that only IPFs with 25 or more discharges are included in the measure, but it provides no information as to how this threshold was chosen.

Scientific Acceptability Validity Rating

Not Met

Scientific Acceptability Validity

This is not a valid measure of the quality or efficiency of care provided by inpatient psychiatric hospitals (IPFs) due to problems with the ways the numerator and denominator are defined and problems with the risk adjustment methodology.

Problems with the Numerator

Inclusion of Visits Unrelated to Mental Health Problems. The numerator includes any visit to an Emergency Department within 30 days following discharge from the IPF, regardless of whether the ED visit had anything to do with the problem treated during the IPF stay or other mental health problems. Individuals visit emergency departments for a wide variety of reasons, including accidental injuries, new acute illnesses, and exacerbations of chronic medical conditions. Including such visits overestimates the true rate of mental health problems following discharge.
Exclusion of ED Visits That Result in Hospital Admissions. The numerator excludes emergency department visits that result in a hospital admission. As a result, if a patient comes to an ED for a mental health problem and is admitted to the hospital, that visit will not be counted in the measure. This will underestimate the true rate of mental health problems following discharge.

Inclusion of Unrelated Visits

The inclusion of so many visits unrelated to mental health issues creates two different problems.

First, it dilutes the effect of differences between IPFs that are related to mental health care. For example, if the rate of ED visits that are actually related to patients’ mental health problems is 50% higher at IPF #1 as at IPF #2, but only half of the visits are related to mental health issues, then the overall visit rate will only be 25% higher at IPF #1 than at IPF #2 (0.5 + 0.75)/(0.5 + 0.5) = 1.25.
Second, differences between hospitals in the rate of visits that are unrelated to mental health issues may cause IPFs to appear to have higher or lower quality care than they actually do. For example, a patient with mental illness may come to the ED for an exacerbation of a chronic condition such as heart failure or COPD, not because of issues related to their mental illness. Rates of ED visits for chronic disease exacerbations will be higher in communities where chronic disease management services are less available, and patients who have lower incomes or lack of family support may have greater difficulty accessing the services that do exist. In these types of communities and for these types of patients, there will likely be more post-discharge ED visits for chronic disease exacerbations that are unrelated to outpatient mental health care. The risk adjustment methodology only controls for the presence of a chronic condition; it does not control for differences in the care patients receive for the chronic condition. Care for the chronic condition is not the responsibility of the IPF that delivered treatment for mental health problems, but under this measure, a higher rate of ED visits for problems related to chronic conditions will be interpreted as poor quality care for the patient’s mental health needs. Similarly, a patient may come to the ED for an acute condition that is unrelated to mental health (e.g., a viral infection). An IPF in a community experiencing high rates of such illnesses could have higher rates of post-discharge ED visits, and this measure would make it appear that the IPF delivers lower quality psychiatric care. The risk adjustment model used in the proposed measure has variables indicating the presence of chronic conditions, but there are no variables to indicate the presence of new acute conditions.

For example, a study of the impact on psychiatric care resulting from improving access to community health centers measured changes in the number of psychiatric ED visits, which were defined as ED visits with a diagnosis code corresponding to mental health disorders. (Bruckner, TA, et al. “Psychiatric Emergency Department Visits After Regional Expansion of Community Health Centers,” Psychiatric Services 70(10): 901-906.)
An analogous approach is used in the “Follow-Up After Psychiatric Hospitalization (FAPH)” measure that is already part of the measures CMS calculates for IPFs. That measure only includes outpatient mental health care encounters after discharge in its numerator, not all outpatient visits, and outpatient mental health care encounters are defined as “outpatient visits, intensive outpatient encounters, or partial hospitalizations provided by a mental health provider for which mental health or SUD diagnoses are mentioned anywhere on the follow-up visit claim.” A similar definition could be used to define ED visits related to mental health issues.

Exclusion of Related Visits

This problem could easily be avoided by expanding the numerator to include all mental health-related ED visits, including those that result in a hospital admission.

Failure to Exclude Patients Who Die After Discharge from the Denominator

Problems with the Risk Adjustment Methodology

Inadequate Evaluation of Risk Adjustment Model Performance

The Measure Information Form states that the risk-decile calibration plot “indicates a close agreement,” even though there is 13% underprediction in the lowest decile, 2-3% underprediction in the highest two deciles, and 4% over-prediction in two of the middle-range deciles. These types of prediction errors could lead to erroneous classifications of hospitals with the types of patients with these risk characteristics. The Hosmer-Lemeshow statistic was reportedly statistically significant, which indicates poor calibration. There is no information indicating that any efforts were made to adjust the model or to test alternative models to achieve better calibration, nor were there any efforts to use alternatives to the Hosmer-Lemeshow statistic to assess calibration. Instead, the measure developers stated that they recalculated the Hosmer-Lemeshow statistic for 20 smaller samples of cases and found that the statistic was not significant in most of these cases. However, they do not show the actual results of these analyses. They state that the rationale for doing these additional estimates was that the Hosmer-Lemeshow statistic is more likely to be significant with larger numbers of cases, but the converse of this is also true – it is less likely to be significant with smaller samples. Failure to find statistically significant results in artificially-restricted samples does not mean that there is no calibration problem.

Breadcrumb

30-Day Risk Standardized All-Cause Emergency Department Visit Following an Inpatient Psychiatric Facility Discharge

Measure 4190 Should Not Be Endorsed

CBE# 4190 Staff Assessment

Overall summary

Agree with Staff Comments

A

NA

Measure Should Not Be Endorsed

This measure needs improved…

Viability

too many unanswered questions to endorse

Review

Would not support endorsement

Not met

Unfortunately, not met

Do not recommend this measure at this time.

30-Day Risk Standardized All-Cause ED Visit following IPF

Overall Summary for Measure # 4190

This measure needs more development and analysis to refine it

The measure needs refinement and not yet ready for endorsement.

no additional comments

Concern regarding validity of data

Agree with majority of staff comments