Rate of Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection

MUC ID

MUC2025-042

Steward Organization Group

Brigham and Women's Hospital

Committee

PRMR Clinician Committee

Considered CMS Programs

Merit-Based Incentive Payment System (MIPS)

Description

This electronic Clinical Quality Measure (eCQM) reports the percentage of female patients aged 40 to 75 years with at least one abnormal screening (BI-RADS 0) or screening-to-diagnostic (BI-RADS 4, 5) mammogram during the measurement period (i.e., calendar year) who received timely diagnostic resolution defined as either follow-up imaging with negative/benign/probably benign results or a breast biopsy within 60 days after their index (i.e., first) abnormal screening mammogram.

Negative/benign/probably benign follow-up imaging was defined as diagnostic mammography, breast ultrasound or magnetic resonance imaging (MRI) with BI-RADS ratings of 1, 2, or 3. Relevant diagnostic breast biopsy procedures were defined as core needle biopsy, fine needle aspiration, and surgical excision.

Breast Imaging – Reporting and Data System (BI-RADS) ratings: 0-incomplete, 1-negative, 2-benign, 3-probably benign, 4-suspicious, 5-highly suggestive of malignancy.

Overview

Measure Overview

Rationale (Excerpt from Submission)

Breast cancer is the second most common cause of cancer deaths among women in the United States. In 2025, around 42,170 women will die from breast cancer, and an estimated 316,950 new cases of invasive breast cancer will be diagnosed.^[1]

Breast cancer survival is dependent upon cancer stage at diagnosis. Approximately 99% of women diagnosed with early stage breast cancer live for 5 years or more.^[2] However, this applies to only about 32% of those diagnosed at the most advanced stage.

Noninvasive mammographic screening is the primary screening modality used to detect breast cancer. Delays in diagnostic follow-up after abnormal mammographic screening results increase the risk of diagnosing cancer at a more advanced stage.^[3]

National screening guidelines recommend that women with abnormal screening mammogram results (BI-RADS 0, 4, or 5) undergo additional follow-up imaging via diagnostic mammography, magnetic resonance imaging (MRI), and/or ultrasound.^[4]^,^[5]^,^[6]^,^[7] While it is recommended that patients with a benign follow-up imaging result return to routine screening, those with abnormal results (BI-RADS 4 or 5) should have diagnostic samples extracted (e.g., via core needle biopsy, fine needle aspiration, or surgical excision) from a suspicious area to evaluate for cancer.^[4]

Expert-based quality measure programs support the need to establish a reasonable timeframe that encompasses this multi-step process. According to the Centers for Disease Control and Prevention (CDC) National Breast and Cervical Cancer Early Detection Program (NBCCEDP), breast cancer screening to diagnostic resolution should occur within 60 days.^[8] It is also expected that over 90% of women complete diagnostic resolution after an abnormal screening mammogram.^[8]^,^[9] Published literature shows that long wait times to diagnostic evaluation are associated with increased tumor size and lymph node metastases in patients with delays exceeding 12 weeks.^[10]^,^[11]^,^[12] In particular, invasive triple negative breast cancers have been shown to double in size in <60 days.^[13]

Differences in diagnostic follow-up rates after abnormal screening mammograms are reported in the literature. A 2021 systematic review reported rates of failure to follow-up on abnormal screening mammograms ranging from 7.2-33%.^[14] A 2024 study on the American College of Radiology’s National Mammography Database (NMD) observed that only 66.4% of 2.9 million abnormal screening mammograms (BI-RADS 0) documented from 2008-2021 had diagnostic follow-up. In this cohort, women with no family history of breast cancer had lower follow-up rates, and Black and Native American women had lower overall follow-up rates and lower biopsy rates. Rural and community hospital-affiliated facilities had longer median times to biopsy.^[15]

The variability in follow-up rates in the NMD and existing literature implies the existence of barriers limiting mammography facilities from carrying out complete diagnostic resolution in a timely manner for all patients. This electronic clinical quality measure (eCQM) can be used to address quality assessment gaps by monitoring timeliness and completeness of care in medical facilities looking to improve the breast cancer screening and diagnostic process.

[1] Key Statistics for Breast Cancer. American Cancer Society. Updated May 5, 2025. Accessed September 12, 2025. https://www.cancer.org/cancer/types/breast-cancer/about/how-common-is-b….

[2] Cancer Statistics Working Group. U.S. Cancer Statistics Data Visualizations Tool, based on 2021 submission data (1999–2020): U.S. Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Updated June 2024. Accessed July 2024. www.cdc.gov/cancer/dataviz.

[3] McCarthy AM, Kim JJ, Beaber EF, et al. Follow-Up of Abnormal Breast and Colorectal Cancer Screening by Race/Ethnicity. Am J Prev Med. 2016; 51(4):507-512. doi:10.1016/j.amepre.2016.03.017. PMID: 27132628.

[4] Sickles E, D’Orsi CJ. ACR BI-RADS follow-up and outcome monitoring. In: D’Orsi CJ, ed. ACR BI-RADS atlas, breast imaging reporting and data system. Reston, VA: American College of Radiology Reston; 2013:5-67. https://www.acr.org/-/media/ACR/Files/RADS/BI-RADS/BIRADSFAQ.pdf.

[5] Monticciolo DL, Malak SF, Friedewald SM, et al. Breast Cancer Screening Recommendations Inclusive of All Women at Average Risk: Update from the ACR and Society of Breast Imaging. J Am Coll Radiol. 2021; 18(9):1280-1288. doi:10.1016/j.jacr.2021.04.021. PMID: 34154984.

[6] US Preventive Services Task Force, Nicholson WK, Silverstein M, et al. Screening for Breast Cancer: US Preventive Services Task Force Recommendation Statement [published correction appears in JAMA. 2024 Sep 30. doi: 10.1001/jama.2024.19851]. JAMA. 2024; 331(22):1918-1930. doi:10.1001/jama.2024.5534. PMID: 38687503.

[7] Esserman LJ, Joe BN, et al. Diagnostic evaluation of suspected breast cancer. UpToDate. Updated October 31, 2023. Accessed October 31, 2024. https://www.uptodate.com/contents/diagnostic-evaluation-of-suspected-br…-cancer?search=birads&source=search_result&selectedTitle=2%7E13&usage_type=default&display_rank=2#H24.

[8] DeGroff A, Royalty JE, Howe W, et al. When performance management works: a study of the National Breast and Cervical Cancer Early Detection Program. Cancer. 2014; 120 Suppl 16(Suppl 16):2566-2574. doi:10.1002/cncr.28817. PMID: 25099899.

[9] Miller JW, Hanson V, Johnson GD, Royalty JE, Richardson LC. From cancer screening to treatment: service delivery and referral in the National Breast and Cervical Cancer Early Detection Program. Cancer. 2014; 120 Suppl 16(0 16):2549-2556. doi:10.1002/cncr.28823. PMID: 25099897.

[10] Olivotto IA, Gomi A, Bancej C, et al. Influence of delay to diagnosis on prognostic indicators of screen-detected breast carcinoma. Cancer. 2002; 94(8):2143-2150. doi:10.1002/cncr.10453. PMID: 12001110.

[11] Ganry O, Peng J, Dubreuil A. Influence of abnormal screens on delays and prognostic indicators of screen-detected breast carcinoma. J Med Screen. 2004; 11(1):28-31. doi:10.1177/096914130301100107. PMID: 15006111.

[12] Doubeni CA, Gabler NB, Wheeler CM, et al. Timely follow-up of positive cancer screening results: A systematic review and recommendations from the PROSPR Consortium. CA Cancer J Clin. 2018; 68(3):199-216. doi:10.3322/caac.21452. PMID: 29603147.

[13] Nakashima K, Uematsu T, Takahashi K, Nishimura S, Tadokoro Y, Hayashi T, Sugino T. Does breast cancer growth rate really depend on tumor subtype? Measurement of tumor doubling time using serial ultrasonography between diagnosis and surgery. Breast Cancer. 2019 Mar; 26(2):206-214. doi: 10.1007/s12282-018-0914-0. Epub 2018 Sep 26. PMID: 30259332.

[14] Reece JC, Neal EFG, Nguyen P, McIntosh JG, Emery JD. Delayed or failure to follow-up abnormal breast cancer screening mammograms in primary care: a systematic review. BMC Cancer. 2021; 21(1):373. Published 2021 Apr 7. doi:10.1186/s12885-021-08100-3. PMID: 33827476.

[15] Oluyemi ET, Grimm LJ, Goldman L, et al. Rate and Timeliness of Diagnostic Evaluation and Biopsy After Recall From Screening Mammography in the National Mammography Database. J Am Coll Radiol. 2024; 21(3):427-438. doi:10.1016/j.jacr.2023.09.002. PMID: 37722468.

CMS Provided Program Rationale

CMS is considering adding this measure to the MIPS quality measure set as a new measure for future performance years. MIPS does not have any related measures that examine timely follow-up for abnormal screening mammograms; therefore, the quality of patient care benefits from the promotion of early detection of breast cancer through this measure. This measure is fully tested and developed at both the facility and clinician level. This process measure represents a gap in MIPS and CMS priority areas for diagnostic radiology, which has limited measures and digital measurement overall. Additionally, the measure may be considered for potential inclusion in the diagnostic radiology MIPS Value Pathway (MVP).

Measure Background

New measure; never reviewed by a Measure Applications Partnership (MAP) Workgroup or PRMR committee; never used in a Medicare program.

Measure Type

Process

Measure is a composite

Measure is digital and/or an eCQM

Digital

Measure has multiple scores

Measure is a paired or group measure

No pairing or grouping

CBE Endorsement Status

Endorsed

CBE Endorsement History

Endorsed with conditions during the Spring 2024 cycle. When the measure returns for maintenance (3 years), the measure developer should have:

Conducted additional validity testing (data element in additional EHR); and
Continued to monitor (e.g., qualitative assessments, empirical analyses) for unintended consequences (e.g., reduced access to mammography) during implementation.

Is measure currently used in CMS programs?

Does Measure Address a Statutorily Required Topic Area?

Substantive Changes from Prior Version

N/A

Measure Specification

Numerator

Patients in the denominator population who received timely diagnostic resolution defined as negative/benign/probably benign follow-up imaging (BI-RADS 1, 2, 3) or breast biopsy within 60 days after the date of their index (i.e., first) abnormal screening (BI-RADS 0) or screening-to-diagnostic (BI-RADS 4, 5) mammogram.

Extract the date of the first abnormal screening (BI-RADS 0) or screening-to-diagnostic (BI-RADS 4, 5) mammogram in the measurement period (i.e., calendar year) for each patient to define the index screening mammograms and index dates (i.e., start of the follow-up period) [value sets: “Screening Mammogram (Grouping)” OID 2.16.840.1.113762.1.4.1206.61; BIRADSCategories04And5 OID 2.16.840.1.113762.1.4.1206.67].

If documented, extract the first follow-up imaging (i.e., diagnostic mammogram, ultrasound, or MRI) with negative/benign/probably benign (BI-RADS 1, 2, 3) ratings within 60 days after the date of the index abnormal screening mammogram for each patient [value sets: “Diagnostic Mammography” OID 2.16.840.1.113762.1.4.1206.65; “Ultrasound of the Breast” OID 2.16.840.1.113883.3.3157.1902; “MRI of the Breast” OID 2.16.840.1.113883.3.3157.1903; BIRADSCategories12And3 OID 2.16.840.1.113762.1.4.1206.68].

If documented, extract the first breast biopsy procedure (i.e., core needle biopsy, fine needle aspiration, or surgical excision) within 60 days after the date of the index abnormal screening mammogram for each patient [value set: “Breast Cancer Biopsy and Surgical Excision” OID 2.16.840.1.113762.1.4.1206.66].

Patients that received negative/benign/probably benign follow-up imaging or breast biopsy within 60 days are included in the numerator population.
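The numerator logic above can be sketched in a few lines of Python. This is an illustrative sketch only, not the measure's published eCQM logic: the `FollowUpEvent` record and its `kind` labels are hypothetical, value-set lookups are assumed to have already classified each event, and the window is assumed to cover days 1 through 60 after the index date (whether a same-day event counts is not stated here).

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional

FOLLOW_UP_WINDOW = timedelta(days=60)
BENIGN_BIRADS = {1, 2, 3}  # negative, benign, probably benign


@dataclass
class FollowUpEvent:
    """Hypothetical record for one follow-up event after the index mammogram."""
    event_date: date
    kind: str                     # "imaging" or "biopsy" (illustrative labels)
    birads: Optional[int] = None  # only meaningful for imaging events


def has_timely_resolution(index_date: date, events: list[FollowUpEvent]) -> bool:
    """Return True if any qualifying event falls within the 60-day window."""
    for ev in events:
        elapsed = ev.event_date - index_date
        # Assumption: the event must occur after the index date, up to day 60.
        if not (timedelta(0) < elapsed <= FOLLOW_UP_WINDOW):
            continue
        if ev.kind == "biopsy":
            return True
        if ev.kind == "imaging" and ev.birads in BENIGN_BIRADS:
            return True
    return False
```

Under these assumptions, follow-up imaging rated BI-RADS 4 inside the window does not by itself count toward the numerator; a biopsy within the window would.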

Numerator Exclusions

Not applicable

Numerator Exceptions

Not applicable

Denominator

Female patients aged 40 to 75 years with an abnormal screening (BI-RADS 0) or screening-to-diagnostic (BI-RADS 4, 5) mammogram during the measurement period (i.e., calendar year). Only the first abnormal screening or screening-to-diagnostic mammogram (i.e., index screening test) is included in the measure calculation.

Extract all abnormal screening mammograms (BI-RADS 0) and screening-to-diagnostic mammograms (BI-RADS 4, 5) during the measurement period (i.e., calendar year) [value sets: “Screening Mammogram (Grouping)” OID 2.16.840.1.113762.1.4.1206.61; BIRADSCategories04And5 OID 2.16.840.1.113762.1.4.1206.67].

Retain abnormal screening and screening-to-diagnostic mammograms where the patient was aged between 40 and 75 years on the date of the mammogram [value set "Birth Date" OID 2.16.840.1.113883.3.560.100.4].

Retain abnormal screening and screening-to-diagnostic mammograms where the patient was female [value set "ONC Administrative Sex" OID 2.16.840.1.113762.1.4.1].

Patients with at least one abnormal screening or screening-to-diagnostic mammogram are included in the denominator population.
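As a rough illustration of the denominator steps above, the following Python sketch picks a patient's index mammogram. It is a simplified sketch under stated assumptions, not the specified eCQM logic: the `Mammogram` record, the `"female"` string, and the function names are hypothetical, the age range is assumed to be inclusive at both ends, and screening exams are assumed to be already identified through the value sets.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

ABNORMAL_BIRADS = {0, 4, 5}  # abnormal screening or screening-to-diagnostic


@dataclass
class Mammogram:
    """Hypothetical record for one screening mammogram."""
    exam_date: date
    birads: int


def age_on(birth_date: date, on: date) -> int:
    """Age in completed years on a given date."""
    years = on.year - birth_date.year
    if (on.month, on.day) < (birth_date.month, birth_date.day):
        years -= 1
    return years


def index_mammogram(sex: str, birth_date: date,
                    screens: list[Mammogram], year: int) -> Optional[Mammogram]:
    """Return the first eligible abnormal screen in the year, or None."""
    if sex != "female":
        return None
    eligible = [
        m for m in screens
        if m.exam_date.year == year
        and m.birads in ABNORMAL_BIRADS
        and 40 <= age_on(birth_date, m.exam_date) <= 75  # assumed inclusive
    ]
    # Only the earliest abnormal screen defines the index date.
    return min(eligible, key=lambda m: m.exam_date) if eligible else None
```

A patient for whom `index_mammogram` returns a record belongs to the denominator; later abnormal screens in the same year do not create additional denominator entries.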

Denominator Exceptions

Not applicable

Denominator Exclusions

Not applicable

Level of Analysis

Clinician: Individual

Facility

Types of Data Sources

Electronic Health Records

Care Setting

Ambulatory Care: Office

Hospital: Outpatient

Risk Adjustment

No risk adjustment necessary

Meaningfulness

Importance

Type of Evidence

Clinical Guidelines or USPSTF (U.S. Preventive Services Task Force) Guidelines

Empirical data

Peer-Reviewed Original Research

Peer-Reviewed Systematic Review

Importance Evaluation

As outlined in the literature cited for the measure rationale, early detection of breast cancer through routine mammographic screening has significantly reduced mortality and treatment costs. Studies show that most breast imaging facilities do not meet benchmarks for timely follow-up imaging and biopsy, but participation in quality measurement programs improves performance, especially in underperforming facilities. This eCQM has been developed to help facilities routinely assess and improve the timeliness of diagnostic resolution after abnormal mammograms, supporting better outcomes and more equitable care.

During CBE endorsement review in 2024, the committee found the evidence supporting the importance of this measure to be sufficient.

Importance Rating

Met

Conformance

Conformance Evaluation

This new measure is intended to calculate the rate of timely diagnostic resolution in facilities that perform mammographic screening after an abnormal screening mammogram to detect breast cancers. The measure numerator, denominator, and exclusions for the measure scores are defined and support the intent of the measure. The measure aligns with MIPS objectives to 1) improve beneficiary health through prevention; 2) educate, engage, and empower patients as members of their care team; and 3) provide accurate, timely, and actionable performance data to clinicians, patients, and other stakeholders.

Conformance Rating

Met

Feasibility

eCQM Feasibility Testing or Analysis Conducted

Yes, eCQM testing was performed

Feasibility Evaluation

All data elements are in defined fields in electronic sources and align with USCDI/USCDI+ Quality standards, making the measure highly feasible. The measure was tested in three EHRs.

The feasibility scorecard addresses the following domains:

Data availability: Data element exists in a structured format in this EHR.
Data accuracy: Information is from authoritative sources and/or is highly likely to be correct.
Data standards: Data element is coded in a nationally accepted terminology standard or can be mapped to that terminology standard.
Workflow: The data element is routinely collected during clinical care and requires no, or limited, additional data entry from a clinician or other provider, and no EHR interface changes.

Feasibility testing identified two data elements that required additional review within the Oracle Health or Allscripts testing sites. For the BI-RADS result data elements, the feasibility plan indicated that search terms will be specified for use in EHRs that do not capture BI-RADS in structured fields. The developer built and validated the string search algorithm in Health Systems 1 and 2.

During CBE endorsement review in 2024, the committee found the feasibility of this measure to be sufficiently demonstrated.

Feasibility Rating

Met

Validity

Validity Testing Method(s)

Face validity, patient-encounter level testing

Testing level(s)

Facility

Was this measure tested in the same target population as the CMS program?

Yes

Validity Evaluation

A technical expert panel (TEP) consisting of seven members, representing the patient experience and expertise in medicine, measure development, quality and safety of care, cancer screening, health services research, and EHRs, reviewed the measure. The majority of TEP members agreed that the measure can be used to distinguish good from poor quality care at the hospital (i.e., the facility) level.

The developer conducted chart reviews in two health systems to validate the accuracy of eCQM automated patient allocations. Using stratified random samples, reviewers compared manual chart assessments—considered the gold standard—to eCQM results. Percentage agreements ranged from 97% to 99%, and Positive Predictive Values (PPVs) ranged from 99% to 100%, confirming the measure’s strong validity. Health System 3 is currently undergoing similar validation.
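For readers less familiar with the two statistics reported above, the sketch below shows how percentage agreement and positive predictive value are conventionally computed from paired eCQM and chart-review classifications. The function name and the paired-boolean input format are assumptions for illustration; the developer's actual sampling and scoring procedure is not described in this document.

```python
def agreement_and_ppv(pairs: list[tuple[bool, bool]]) -> tuple[float, float]:
    """Compute percent agreement and PPV.

    Each pair is (ecqm_flag, chart_flag), where chart review is treated
    as the gold standard.
    """
    if not pairs:
        raise ValueError("at least one reviewed record is required")
    true_pos = sum(1 for ecqm, chart in pairs if ecqm and chart)
    false_pos = sum(1 for ecqm, chart in pairs if ecqm and not chart)
    agree = sum(1 for ecqm, chart in pairs if ecqm == chart)

    percent_agreement = agree / len(pairs)
    flagged = true_pos + false_pos
    # PPV is undefined when the eCQM flags no records at all.
    ppv = true_pos / flagged if flagged else float("nan")
    return percent_agreement, ppv
```

With a made-up sample of eight concordant positives, one eCQM-only positive, and one concordant negative, agreement would be 9 of 10 and PPV 8 of 9.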

During CBE endorsement review in 2024, the committee found the validity of this measure to be sufficiently demonstrated.

Considerations for the committee: The submitted face validity and patient-/encounter-level testing met the requirements for CBE endorsement. Committee members are encouraged to consider if this testing provides appropriate evidence of the measure's suitability for inclusion in MIPS.

Threats to validity

Threats to validity were not identified in the submission materials. Empiric validity testing was not completed for this measure.

Validity Rating

Met

Reliability

Reliability testing method(s)

Signal-to-Noise and Random Split-Half Correlation

Testing level

Facility (Facility Group), Individual Clinician

Reliability Evaluation

Signal-to-noise reliability at the group level was calculated across six facility groups in one hospital system. The minimum reliability for the most recent year (2023) at the facility group level is 0.989 and 100% of the six facility groups have a reliability greater than 0.6. The developer calculated a Spearman’s rank correlation between two randomly split halves of the data and reported a correlation of 0.94.
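A random split-half analysis of the kind described above might look like the following sketch. It is a generic illustration using only the Python standard library, not the developer's code: the input format (a mapping from entity to a list of 0/1 patient outcomes), the function names, and the fixed random seed are all assumptions, and each entity is assumed to have at least two patients.

```python
import random
from statistics import mean


def average_ranks(values: list[float]) -> list[float]:
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x: list[float], y: list[float]) -> float:
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x: list[float], y: list[float]) -> float:
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def split_half_scores(outcomes_by_entity: dict[str, list[int]], seed: int = 0):
    """Randomly split each entity's patients in two and score each half."""
    rng = random.Random(seed)
    half_a, half_b = [], []
    for outcomes in outcomes_by_entity.values():
        shuffled = outcomes[:]
        rng.shuffle(shuffled)
        mid = len(shuffled) // 2  # assumes at least two patients per entity
        half_a.append(mean(shuffled[:mid]))
        half_b.append(mean(shuffled[mid:]))
    return half_a, half_b
```

Calling `spearman(*split_half_scores(data))` on real patient-level outcomes would give a single split-half correlation; with only a handful of entities, as in the six facility groups here, such an estimate can be sensitive to the particular random split.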

In the original MERIT submission, ICC was calculated as the percentage of variation in facility-level scores attributable to facility-level signal variation, with 95% confidence intervals for each split sample. Battelle noted during review that this was not the type of ICC that measures correlation between the two split samples. These initial ICC values were very low: 0.019 for the test sample and 0.084 for the validation sample in 2020. These results conflicted with the signal-to-noise and Spearman rank results.

In response to this issue noted during PA collaboration, the developer revised testing methods during the MERIT submission window to align with recommendations and calculated a different type of ICC to assess the correlation between the two split samples. The ICC calculation now aligns with the intended approach and is consistent with the Signal-to-Noise Ratio (SNR) and Spearman correlation results. These results are shown in Table 1, provided by the developer below.

At the individual clinician level, the median SNR was 0.962 (95% CI: 0.917, 0.956) for the 99 clinicians at Health System 1. The minimum SNR was 0.142 and the maximum SNR was 0.989. The SNRs were >0.900 from 2020 to 2023 with relatively narrow 95% confidence intervals, indicating that a high proportion of overall variability is explained by the differences between measured entities (i.e., individual clinicians). The Spearman’s rank correlation coefficient for 2023 at the individual level was 0.79 (95% CI: 0.66, 0.87). Although substantially lower than at the facility group level, the overall clinician Spearman’s rank correlation coefficient still indicated a strong positive correlation between the test and validation samples.

During CBE endorsement review in 2024, the committee found the reliability of this measure to be sufficiently demonstrated.

Table 1. Intraclass Correlation Coefficients (ICC), Overall and by Year from 2018 to 2023 for Six Facility Groups in Health System 1

Measurement Year	Test-Validation Correlation	95% CI
Overall	0.996	(0.980, 0.999)
2018	0.929	(0.697, 0.987)
2019	0.835	(0.445, 0.970)
2020	0.904	(0.616, 0.982)
2021	0.942	(0.743, 0.989)

Additional reliability analyses

No additional reliability analyses were performed.

Reliability Rating

Met

Usability

Usability considered in application:

Yes, the submission materials briefly discuss the measure’s usability within relevant programs.

Usability Evaluation

This measure has usability in MIPS. It has been successfully tested at the facility level and shown to be feasible for integration into EHR systems. Additionally, feedback from a patient representative on the TEP confirmed that reporting eCQM diagnostic rates would be meaningful and could positively influence patient decision-making. However, this measure is currently specified in Fast Healthcare Interoperability Resources (FHIR), which may result in implementation barriers within MIPS if program updates are delayed in the future.

During CBE endorsement review in 2024, the committee found the use/usability of this measure to be sufficiently demonstrated.

Usability Rating

Met

Appropriateness of Scale

Overview

Similar or Related Measures in Selected CMS Programs

Breast Cancer Screening Measure in MIPS

Breast Cancer Screening Recall Rates in MIPS

Evaluation of measure balance, burden and value across target populations and measured entities

While there are similar measures within MIPS, this measure assesses the full screening process from an inconclusive/abnormal screening mammogram through to diagnostic resolution, which offers benefit to the program population. The developer notes in submission materials that this eCQM emphasizes timeliness of diagnostic resolution after an abnormal screening mammogram to detect breast cancers and uses a patient-based—rather than episode-based—approach to measurement.

The measure is patient based and complements two related measures that are already in use in CMS programs: the Breast Cancer Screening measure (CMIT ID: 00093) and the Breast Cancer Screening Recall Rates measure (CMIT ID: 01648).

Additionally, this new measure does compete with one Healthcare Effectiveness Data and Information Set (HEDIS) measure: Follow-Up after Abnormal Breast Cancer Assessment. This competing measure reports on the percentage of mammograms with a BI-RADS of 0 that received follow-up diagnostic imaging within 90 days or mammograms with a BI-RADS of 4 or 5 that received follow-up breast biopsy within 90 days. However, the current competing measure does not quantify the percentage of patients who have timely diagnostic resolution from a screening mammogram to breast biopsy. Each of the related and competing clinical quality measures quantifies specific aspects of the multi-step breast cancer screening process; however, none of the measures assess the full screening process from an inconclusive/abnormal screening mammogram through to diagnostic resolution as the new measure does.

Regarding balance of this measure’s performance, burden and benefit across populations, the developer’s literature review and analysis do not indicate a potential for differential benefit or harm to specific subgroups of participating entities or their patient populations.

Considerations for the committee: Based on clinical and professional experience, the committee should consider the distribution of benefits and risks/burdens of the measure within the proposed program population.

Time to Value Realization

Overview

Plan for near & long term impacts after implementation

None specified

Evaluation of potential measure implementation impacts over time

The developer briefly mentions long- and near-term impacts of the measure as an eCQM in a patient population. There may be need for further examination of near- and long-term impacts of this measure after implementation across clinician and patient populations.

Considerations for the committee:

What are the potential near- and long-term impacts of this measure on measured entities, proposed CMS program, and patient populations?
Will benefits and burdens associated with this measure be realized within an appropriate implementation time frame?
How will this measure mature through revisions in the future if added to the MIPS measure set?

Public Comments

Timely follow up is critical

Timely follow-up after an abnormal screening mammogram is critical for early breast cancer detection, and it is especially important in minority populations because delays worsen outcomes and widen existing health disparities. An abnormal mammogram does not mean cancer, but it requires prompt diagnostic follow-up (additional imaging or biopsy). When breast cancer is found early, it is more treatable, less likely to have spread, and associated with much higher survival rates. Research consistently shows that racial and ethnic minority women, including Black, Hispanic/Latina, Native American, and some Asian populations, are less likely to receive timely follow-up after abnormal mammograms. Contributing factors include limited access to specialty care, lack of insurance or underinsurance, transportation difficulties, and language barriers, among others.

Organization

HCD International

Timely Follow Up on Abnormal Mammograms

Rationale: This measure reports the percentage of female patients aged 40 to 75 years with at least one abnormal screening mammogram who received timely diagnostic resolution within 60 days after their abnormal screening mammogram. Breast cancer is the second most common cause of cancer deaths among women in the United States. In 2025, around 42,170 women will die from breast cancer, and an estimated 316,950 new cases of invasive breast cancer will be diagnosed. Breast cancer survival is dependent upon cancer stage at diagnosis. Approximately 99% of women diagnosed with early-stage breast cancer live for 5 years or more. However, survival is only about 32% for those diagnosed at the most advanced stage. CMS is considering adding this measure as a new measure to the clinician payment program for future performance years. Currently, it does not have any related measures that examine timely follow-up for abnormal screening mammograms. This measure fills an important gap.

Organization

Patients For Patient Safety US

Good intent but concerns about feasibility, accountability, etc.

The AAFP supports the goal of improving timely follow-up after abnormal screening mammograms, recognizing its importance for patient outcomes and breast cancer detection. However, the implementation of this measure in Medicare payment programs, especially MIPS, raises significant concerns regarding feasibility, accountability, equity, and unintended consequences. We strongly encourage CMS to consider the challenges outlined below and work to resolve these issues.

1. Meaningfulness

Intent and Patient Benefit: The measure is meaningful in its aim to ensure patients receive timely follow-up imaging or biopsy after abnormal screening mammograms, which can improve patient care, outcomes, and reduce mortality.

2. Appropriateness of Scale

Timeline: Having a 60-day completion period for follow-up is appropriate and can help limit harm by providing definitive results sooner.
Attribution and Accountability: There are unresolved questions about attribution—whether responsibility lies with radiologists, primary care physicians, or others. The measure risks being applied at the wrong accountability level, penalizing individual practices for system-level issues such as scheduling, network access, and payer authorization.
- For example, there are many places a patient can walk-in and get a mammogram without an order. We support this to increase access to important, life-saving screenings. However, the results in these walk-in situations are not always communicated to the patient’s PCP (if they have one).
Equity Considerations: Access to higher-level imaging and diagnostic procedures may be limited in rural and low-income populations, impacting timely follow-up. Social risk and access factors must be considered so practices are not penalized for delays outside their control.

3. Feasibility and Implementation

Data Extraction Challenges: Mammogram results may not be readily extractable as structured data in EMRs, making implementation and tracking difficult. Reports often appear as images rather than data, complicating documentation and workflow.
Documentation Burden: The measure risks duplicative effort and workflow disruption for practices, especially when external imaging reports must be obtained and documented.

4. Time to Value Realization

Positive Impacts: The measure has potential for near- and long-term positive impacts on patient outcomes, but further discussion is needed on how benefits and harms may change over time as the measure matures.

5. Recommendations for CMS

Clarify Attribution: We encourage CMS to provide clear guidance on attribution for follow-up responsibility.
Reduce Burden: Efforts should be made to minimize documentation and workflow burden, including improving EHR data extraction capabilities.
Improve Access: We encourage CMS to consider social risk and access factors and provide support for practices serving underserved populations.
Consider Modifications: The measure should grant “credit” to clinicians for documented outreach and navigation efforts and it should align with payer prior authorization timelines to avoid penalizing physicians for delays outside their control.
Monitor and Update: Ongoing assessment of the measure’s impact and feasibility is needed, with updates as necessary to maintain clinical relevance and fairness.

Conclusion

We appreciate the opportunity to provide the important family physician perspective on this and other measures under consideration. The AAFP supports the concept of timely follow-up on abnormal screening mammograms but recommends against implementing the current measure in MIPS until concerns about feasibility, accountability, equity, and documentation burden are adequately addressed. We encourage CMS to carefully consider our insight and suggestions outlined above.

Organization

American Academy of Family Physicians (AAFP)

ECRI's Support of the CMS MUC - Safety & Diagnostic Excellence

ECRI, a global nonprofit advancing evidence-based healthcare, has submitted the attached comments on the MUC with an emphasis on measures most relevant to patient safety and diagnostic excellence.

ECRI supports the CMS Measures Under Consideration (MUC) List and its role in advancing meaningful, high-value measurement. Of particular significance are the measures focused on chronic disease management and diagnostic safety. Strengthening measurements in these domains supports more efficient, timely, and coordinated care across the healthcare system to better serve patients.

The attached comments from ECRI include recommendations on the importance of patient-reported outcomes, minimizing unnecessary reporting burdens, and feedback in support of the following measures:

Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection (MUC2025-042)
Timely Follow-up on Positive Stool-based Tests for Colorectal Cancer Detection (MUC2025-043)
Adult Community-Onset (CO) Sepsis Standardized Mortality Ratio (MUC2025-045)
Hospital Sepsis Program Core Elements Score (MUC2025-047)
Hospital 30-Day, All-Cause, Risk-Standardized Readmission Rate Following Sepsis Hospitalization (MUC2025-055)
Hospital Harm - Postoperative Venous Thromboembolism (MUC2025-067)

ECRI-comments-on-CMS-MUC---PDF--Jan.-6--2026-.pdf

Organization

ECRI

Advocate Health comments on MUC2025-042

Advocate Health appreciates the opportunity to provide feedback on MUC2025-042, the Rate of Timely Follow Up of Abnormal Mammogram Screening measure. We ask for clarification that the measure focuses only on the first abnormal screening, and not on any subsequent abnormal screenings, to account for patients who receive, for example, 6-month follow-ups for continued screening.

Organization

Advocate Health

Public Comment in Support of MUC2025-042

Please see attachment. Thank you for the opportunity to submit a comment.

CMS-Public-Comment_Breast-Cancer-Follow-Up-MUC.pdf

Organization

HealthyWomen

Rate of Timely Follow-up on Abnormal Screening Mammograms for Br

The American Medical Association (AMA) supports the intent of this measure; however, we have many concerns with considering this measure for inclusion in the Merit-based Incentive Payment System (MIPS). The preliminary assessment bases its evaluation on the measure’s endorsement; however, it was endorsed at the facility and integrated delivery system level using hospital outpatient and integrated delivery system data. On review of the additional testing provided at the individual clinician and group level, we do not believe that this measure should be considered appropriate for inclusion in MIPS for the following reasons:

The performance scores provided across the three health systems and at all levels (clinician, group, and health system) are generally above 90% across several years. The current MIPS benchmarking approach will not allow this high rate of performance to be spread across the 10 deciles, and we believe that this measure could quickly become topped out. While no information was provided on the characteristics of the health systems (e.g., state or region, rural vs. urban, academic medical center vs. community hospital), we are also concerned that any scores that are below 90% may reflect staffing challenges or decreased access to imaging facilities in a region rather than true performance. It would be beneficial to investigate why performance, which was generally high even during the COVID-19 public health emergency, decreased in a subsequent year, in order to understand whether factors outside the control of the clinician, group, and/or health system are involved.
The electronic clinical quality measure (eCQM) specification requires integration of the Breast Imaging – Reporting and Data System (BI-RADS) into electronic health record (EHR) systems; however, the feasibility assessment and data element testing demonstrated that the BI-RADS data are not consistently captured in discrete fields. In fact, one of the testing sites used a simple string search to capture the unstructured data, which is inconsistent with current MIPS requirements that there be end-to-end reporting from EHRs without manual manipulation. Additional testing must be completed to demonstrate the agreement rates for each data element in a manner that is consistent with the current eCQM requirements (i.e., not including data from the string search unless it is integrated into discrete fields).
The reliability results were acceptable at the health system and facility group levels. While the median signal-to-noise ratio (SNR) at the individual clinician level was over 0.9, the minimum SNR was 0.142. All results were based on a minimum sample size of 40 patients; however, MIPS benchmarks are determined based on minimum sample sizes of 20 patients. We are extremely concerned that the reliability scores will be significantly lower when the 20-patient minimum is applied.

Given these concerns, the AMA does not support inclusion of this measure in MIPS.

Organization

American Medical Association
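
Editorial illustration: the AMA comment's concern about the 20-patient minimum can be sketched with a common signal-to-noise formulation of reliability, in which reliability is the between-clinician variance divided by the sum of that variance and the within-clinician sampling variance. This is a minimal sketch under stated assumptions, not the developer's testing method: the function name, the binomial approximation for within-clinician variance, and the numeric inputs are all hypothetical and are not taken from the testing report.

```python
def snr_reliability(between_var: float, rate: float, n: int) -> float:
    """Signal-to-noise reliability for one clinician with n eligible patients.

    Assumption: within-clinician (noise) variance is approximated by the
    binomial sampling variance rate * (1 - rate) / n.
    """
    within_var = rate * (1.0 - rate) / n
    return between_var / (between_var + within_var)


# Hypothetical inputs, chosen only for illustration.
BETWEEN_VAR = 0.002   # assumed variance of true rates across clinicians
RATE = 0.90           # assumed performance rate for this clinician

reliability_at_40 = snr_reliability(BETWEEN_VAR, RATE, n=40)
reliability_at_20 = snr_reliability(BETWEEN_VAR, RATE, n=20)
```

Under this formulation, halving the sample size doubles the noise term, so reliability at 20 patients is necessarily lower than at 40 for the same clinician. How much lower depends on the actual between-clinician variance, which this sketch does not estimate.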

Program Recommendation Concerns

The American College of Radiology (ACR)—a professional association representing more than 40,000 physicians practicing diagnostic radiology, interventional radiology, radiation oncology, and nuclear medicine, as well as medical physicists— appreciates the opportunity to comment on the Centers for Medicare & Medicaid Services (CMS) 2025 Measures Under Consideration (MUC) List. Of the 24 measures included, only one pertains to radiology, MUC2025-042: Rate of Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection.

This electronic clinical quality measure (eCQM) was designed and tested for facility-level implementation, and it was endorsed by the Partnership for Quality Measurement (PQM) for use in programs such as CMS’s Hospital Outpatient Quality Reporting Program. However, in the MUC List, it is being considered for inclusion in the Merit-based Incentive Payment System (MIPS), which is a clinician-level program—a significant shift from its original intended use. (Partnership for Quality Measurement, 2024)

During the PQM endorsement process, ACR supported implementation at the facility level based on testing that demonstrated scientific acceptability for that setting. At the time, the measure had not been tested for clinician or group-level reporting.

While we appreciate CMS’s efforts to improve timely follow-up care, we have significant concerns about applying this measure at the clinician level. Our main concerns include:

High Stakes for Radiologists: If finalized, this would be the only breast imaging measure in MIPS, making accuracy and feasibility critical. Any errors in data capture or reporting could disproportionately affect radiologists’ MIPS scores, payment adjustments, and public reporting.
Technical Feasibility: Imaging reports (including BI-RADS results) are often stored in unstructured electronic health record (EHR) fields, requiring complex extraction methods like string searches or custom interfaces. Limited interoperability between imaging and EHR systems, vendor variability, and the need for custom IT solutions increase workload and risk of inaccurate reporting. These challenges may result in negative scoring and discourage participation.
Accountability for System-Level Factors: Radiologists may be penalized for delays outside their control, such as scheduling bottlenecks, patient navigation issues, and patient-driven delays. (Neiman Health Policy Institute, 2023; RSNA, 2023) These issues are compounded by workforce shortages and rising imaging volumes. While most facilities meet the 60-day benchmark for follow-up, some do not, and there is significant variability across facilities and patient subgroups. (Oluyemi et al., 2023) These findings show that systemic factors—unrelated to individual radiologist performance—can affect timely follow-up.
Resource Burden: Practices, especially small or rural ones, will need dedicated IT resources to extract BI-RADS data and maintain reporting systems. This will increase the cost and complexity of providing breast imaging services, putting smaller and rural practices at a significant disadvantage compared to large health systems. Additionally, the measure is currently specified in Fast Healthcare Interoperability Resources (FHIR), which may result in implementation barriers within MIPS if program updates are delayed.

ACR appreciates that the developer has since retested the measure. The group- and clinician-level testing demonstrates reliability, validity, and technical feasibility at both levels. However, clinician-level test results were derived from a single facility/group TIN, and group-level results were from only six groups within a single hospital system. Because clinician-level testing included only NPIs under a single TIN and lacked detail on the practice’s characteristics, we are hesitant to assume the reliability results will generalize to other practices. This limited sample may reflect uniform workflows and IT infrastructure, rather than robustness and reproducibility across diverse practices. Additional multi-TIN testing is needed to confirm reliability for MIPS adoption. Further, feasibility testing was conducted in controlled environments with developer-provided tools (such as the BI-RADS string search algorithm), and the report does not confirm universal access to these tools or vendor integration. Real-world implementation could therefore vary substantially. As a result, practices without robust IT resources, especially small or rural practices, may face significant burdens, leading to imbalances in reporting and scoring. The testing also did not address delays outside clinicians’ control, and using the same target population for both facility and clinician testing raises concerns about the measure’s suitability. We are therefore very concerned that the current evidence does not support including this measure in MIPS.

Given these considerations, ACR strongly urges CMS to delay introducing this measure for clinician-level reporting in MIPS until the challenges described above are addressed. While testing confirms scientific acceptability, unresolved issues—such as access to extraction tools, interoperability limitations, and systemic delays—pose risks to fairness and accuracy. The ACR greatly appreciates this opportunity to provide this feedback to ensure the successful implementation of this measure. To avoid unintended consequences, we recommend that CMS either limit initial implementation to facility-level reporting or ensure robust resources, guidance, and safeguards are in place before considering clinician-level adoption.

References

Oluyemi, E. T., Grimm, L. J., Goldman, L., Burleson, J., Simanowith, M., Yao, K., & Rosenberg, R. D. (2023). Rate and timeliness of diagnostic evaluation and biopsy after recall from screening mammography in the National Mammography Database. Journal of the American College of Radiology, 20(5), 555–564. https://doi.org/10.1016/j.jacr.2023.01.012
Partnership for Quality Measurement. (2024). Measure endorsement summary: Rate of timely follow-up on abnormal screening mammograms for breast cancer detection. Retrieved from https://p4qm.org/measures
Neiman Health Policy Institute. (2023). Radiologist workforce projections and imaging demand. Retrieved from https://www.neimanhpi.org
Radiological Society of North America (RSNA). (2023). Technologist shortage and its impact on imaging services. Retrieved from https://www.rsna.org

Organization

American College of Radiology

Merck Comments Re MUC2025-042

MUC2025-042: Rate of Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection (MIPS)

Merck supports the proposed “Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection” measure (MUC2025-042), recognizing its potential to enhance the quality of care in breast cancer. Breast cancer was the second most common cause of cancer deaths among women in the U.S. in 2023.^[1] Research has highlighted the importance of timely follow-up care for breast cancer after receiving abnormal screening mammograms, as delays can lead to further progression of the disease and worsening outcomes.^[2] Five-year survival for breast cancer is about 99.6% when diagnosed at a localized stage but drops to roughly 32.9% for metastatic disease.^[3]

The measure specifications align with clinical practice guidelines, which emphasize the importance of timely follow-up care and appropriate diagnostics that can lead to earlier treatment opportunities.^[4] However, gaps in timely follow-up on abnormal screening mammograms are significant; one systematic review found 7.2%-33% did not have any follow-up within 3 months and 27.3%-71.6% within 6 months.^[5] The wide variability in the lack of follow-up after abnormal screening may indicate barriers limiting mammography facilities from carrying out complete diagnostic resolution in a timely manner for all patients. This quality measure can be used to address quality gaps by monitoring timeliness and completeness of care to improve breast cancer screening and diagnostic processes.

Overall, Merck encourages CMS to implement the “Timely Follow-up on Abnormal Screening Mammograms for Breast Cancer Detection” measure in the MIPS program. We are encouraged that CMS is continuing to prioritize care improvements for patients at risk of, or living with, cancer.

References

[1] U.S. Cancer Statistics Working Group. United States Cancer Cases and Death Statistics At a Glance. U.S. Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. Published June 2025. Accessed December 16, 2025. https://gis.cdc.gov/Cancer/USCS/.

[2] Reece JC, Neal EF, Nguyen P, et al. Delayed or failure to follow-up abnormal breast cancer screening mammograms in primary care: a systematic review. BMC Cancer. 2021 Apr 7;21(1):373.

[3] Centers for Disease Control and Prevention. U.S. Cancer Statistics Female Breast Cancer Stat Bite. U.S. Department of Health and Human Services. Published June 2, 2025. Accessed December 18, 2025. https://www.cdc.gov/united-states-cancer-statistics/publications/breast….

[4] Monticciolo DL, Malak SF, Friedewald SM, et al. Breast cancer screening recommendations inclusive of all women at average risk: update from the ACR and Society of Breast Imaging. J Am Coll Radiol. 2021 Sep 1;18(9):1280-8.

[5] Reece JC, Neal EF, Nguyen P, et al. Delayed or failure to follow-up abnormal breast cancer screening mammograms in primary care: a systematic review. BMC Cancer. 2021 Apr 7;21(1):373.

Merck-Comments-2025-CMS-Measures-Under-Consideration-1-6-2026-FINAL.pdf

Organization

Merck

Comment re: MUC2025-042

We recommend credit for documented outreach and navigation efforts and alignment with payer prior authorization timelines to avoid penalizing providers for delays outside their control.

Organization

Aspirus Health

Response to comment from Aspirus Health

This eCQM focuses on timely diagnostic resolution after an abnormal screening mammogram. It is a process measure intended to drive early detection of breast cancer; therefore, the follow-up diagnostic tests must be completed for the patient to be included in the numerator. Several quality improvement initiatives, such as patient outreach and navigation, have been shown to significantly improve uptake and timeliness of follow-up, and can be used to improve performance rates on this eCQM. The 60-day timeframe allows sufficient time for prior authorization, which is often not required for medically necessary follow-up diagnostic testing.

Organization

Brigham and Women's Hospital (Measure Developer/Steward)
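
Editorial illustration: the numerator rule the developer describes (follow-up must actually be completed within 60 days of the index abnormal screening) can be sketched as follows. This is a simplified sketch, not the eCQM specification: the `Patient` record and its field names are hypothetical, patients are assumed to be already limited to women with an abnormal screening in the measurement period, and treating a same-day resolution as timely is an assumption.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional

FOLLOW_UP_WINDOW_DAYS = 60  # window stated in the measure description


@dataclass
class Patient:
    age: int
    index_abnormal_date: Optional[date]  # first abnormal screening mammogram
    resolution_date: Optional[date]      # BI-RADS 1-3 follow-up imaging or biopsy


def in_denominator(p: Patient) -> bool:
    # Aged 40 to 75 with an index abnormal screening mammogram.
    return p.index_abnormal_date is not None and 40 <= p.age <= 75


def in_numerator(p: Patient) -> bool:
    # Credit requires a completed diagnostic resolution inside the window;
    # documented outreach alone does not count.
    if not in_denominator(p) or p.resolution_date is None:
        return False
    days = (p.resolution_date - p.index_abnormal_date).days
    return 0 <= days <= FOLLOW_UP_WINDOW_DAYS


def performance_rate(patients: list[Patient]) -> Optional[float]:
    denominator = [p for p in patients if in_denominator(p)]
    if not denominator:
        return None  # no eligible patients, so no rate to report
    return sum(in_numerator(p) for p in denominator) / len(denominator)
```

In this sketch a patient whose follow-up is scheduled but completed on day 61, or never completed, remains in the denominator and lowers the rate, which is the behavior several commenters ask CMS to reconsider.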

MUC2025-042 measure

BCBSA supports this measure.

Organization

BCBSA
