Show simple item record

dc.contributor.authorAndriola, Corinne
dc.contributor.authorEllis, Randall P
dc.contributor.authorSiracuse, Jeffrey J
dc.contributor.authorHoagland, Alex
dc.contributor.authorKuo, Tzu-Chun
dc.contributor.authorHsu, Heather E
dc.contributor.authorWalkey, Allan J
dc.contributor.authorLasser, Karen E
dc.contributor.authorAsh, Arlene S.
dc.date.accessioned2024-06-13T14:59:23Z
dc.date.available2024-06-13T14:59:23Z
dc.date.issued2024-04-05
dc.identifier.citationAndriola C, Ellis RP, Siracuse JJ, Hoagland A, Kuo TC, Hsu HE, Walkey A, Lasser KE, Ash AS. A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas. JAMA Health Forum. 2024 Apr 5;5(4):e240625. doi: 10.1001/jamahealthforum.2024.0625. PMID: 38639980; PMCID: PMC11065160.en_US
dc.identifier.eissn2689-0186
dc.identifier.doi10.1001/jamahealthforum.2024.0625en_US
dc.identifier.pmid38639980
dc.identifier.urihttp://hdl.handle.net/20.500.14038/53470
dc.description.abstractImportance: Models predicting health care spending and other outcomes from administrative records are widely used to manage and pay for health care, despite well-documented deficiencies. New methods are needed that can incorporate more than 70 000 diagnoses without creating undesirable coding incentives. Objective: To develop a machine learning (ML) algorithm, building on Diagnostic Item (DXI) categories and Diagnostic Cost Group (DCG) methods, that automates development of clinically credible and transparent predictive models for policymakers and clinicians. Design, setting, and participants: DXIs were organized into disease hierarchies and assigned an Appropriateness to Include (ATI) score to reflect vagueness and gameability concerns. A novel automated DCG algorithm iteratively assigned DXIs in 1 or more disease hierarchies to DCGs, identifying sets of DXIs with the largest regression coefficient as dominant; presence of a previously identified dominating DXI removed lower-ranked ones before the next iteration. The Merative MarketScan Commercial Claims and Encounters Database for commercial health insurance enrollees 64 years and younger was used. Data from January 2016 through December 2018 were randomly split 90% to 10% for model development and validation, respectively. Deidentified claims and enrollment data were delivered by Merative the following November in each calendar year and analyzed from November 2020 to January 2024. Main outcome and measures: Concurrent top-coded total health care cost. Model performance was assessed using validation sample weighted least-squares regression, mean absolute errors, and mean errors for rare and common diagnoses. Results: This study included 35 245 586 commercial health insurance enrollees 64 years and younger (65 901 460 person-years) and relied on 19 clinicians who provided reviews in the base model. The algorithm implemented 218 clinician-specified hierarchies compared with the US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model's 64 hierarchies. The base model that dropped vague and gameable DXIs reduced the number of parameters by 80% (1624 of 3150), achieved an R2 of 0.535, and kept mean predicted spending within 12% ($3843 of $31 313) of actual spending for the 3% of people with rare diseases. In contrast, the HHS HCC model had an R2 of 0.428 and underpaid this group by 33% ($10 354 of $31 313). Conclusions and relevance: In this study, by automating DXI clustering within clinically specified hierarchies, this algorithm built clinically interpretable risk models in large datasets while addressing diagnostic vagueness and gameability concerns.en_US
dc.language.isoenen_US
dc.relation.ispartofJAMA Health Forumen_US
dc.relation.urlhttps://doi.org/10.1001/jamahealthforum.2024.0625en_US
dc.rightsThis is an open access article distributed under the terms of the CC-BY License. © 2024 Andriola C et al. JAMA Health Forum.en_US
dc.rights.urihttps://jamanetwork.com/pages/cc-by-license-permissions
dc.subjectmachine learningen_US
dc.subjectalgorithmsen_US
dc.subjectrisk modelsen_US
dc.subjecthealth care costsen_US
dc.titleA Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulasen_US
dc.typeJournal Articleen_US
dc.source.journaltitleJAMA health forum
dc.source.volume5
dc.source.issue4
dc.source.beginpagee240625
dc.source.endpage
dc.source.countryUnited States
dc.identifier.journalJAMA health forum
refterms.dateFOA2024-06-13T14:59:25Z
dc.contributor.departmentBiostatistics and Health Services Researchen_US
dc.contributor.departmentMedicineen_US
dc.contributor.departmentPopulation and Quantitative Health Sciencesen_US
dc.contributor.departmentUMass Chan Analyticsen_US


Files in this item

Thumbnail
Name:
andriola_2024_oi_240013_171277 ...
Size:
1.116Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record