We are upgrading the repository! A content freeze is in effect until December 6, 2024. New submissions or changes to existing items will not be allowed during this period. All content already published will remain publicly available for searching and downloading. Updates will be posted in the Website Upgrade 2024 FAQ in the sidebar Help menu. Reach out to escholarship@umassmed.edu with any questions.

Show simple item record

dc.contributor.authorInekwe, Trusting
dc.contributor.authorMkandawire, Winnie
dc.contributor.authorWee, Brian
dc.contributor.authorAgu, Emmanuel
dc.contributor.authorColubri, Andres
dc.date.accessioned2024-10-11T18:45:56Z
dc.date.available2024-10-11T18:45:56Z
dc.date.issued2024-08-05
dc.identifier.citationT. Inekwe, W. Mkandawire, B. Wee, E. Agu and A. Colubri, "Biomarker Trajectory Prediction and Causal Analysis of the Impact of the Covid-19 Pandemic on CVD Patients using Machine Learning," 2024 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), Wilmington, DE, USA, 2024, pp. 1-12, doi: 10.1109/CHASE60773.2024.00011.en_US
dc.identifier.doi10.1109/CHASE60773.2024.00011en_US
dc.identifier.eid2-s2.0-85201196524
dc.identifier.scopusidSCOPUS_ID:85201196524
dc.identifier.urihttp://hdl.handle.net/20.500.14038/53857
dc.descriptionEmmanuel Agu of Worcester Polytechnic Institute is an affiliated faculty member of the Center for Accelerating Practices to End Suicide (CAPES) headquartered at UMass Chan Medical School.en_US
dc.description.abstractBackground: The COVID-19 pandemic disrupted healthcare services, increasing the susceptibility of high-risk patients including those with cardiovascular Diseases (CVDs), to adverse outcomes. Biomarkers provide insights into patients' underlying health status. However, few studies have investigated the effects of the COVID-19 pandemic on CVD biomarker trajectories using predictive modeling and causal analyses frameworks. Prior research explored the impacts of the COVID-19 pandemic on CVD severity and prognosis but did not investigate biomarker trajectories using Machine Learning (ML), which can discover complex multivariate relationships in multi-modal data. Objective: This study aimed to compare six ML regression models to select the best performing models for predicting biomarker trajectories in CVD patients using retrospective data. Subsequently, these models were used to assess the COVID-19 pandemic's impact on CVD patients and for causal analyses Approach: Using ML regression and causal inference, this study investigated the pandemic's impact on biomarker values of 80,917 CVD patients and 77,332 non-CVD controls, treated at two hospitals in Central Massachusetts between May 2018 and December 2021. ML regression algorithms, including Neural Networks (NN), Decision Trees (DT), Random Forests (RF), XGBoost, CATBoost and ADABoost, were trained and compared. Important CVD biomarkers (HbA1c, LDL cholesterol, BMI, and BP) were predicted as outcome variables with patients' risk factors (age, race, gender, socioeconomic status) as input variables. Shapley feature importance analyses identified the most predictive features, which were then utilized in Causal Analysis. A Difference-in-Differences (DID) approach within a Double/Debiased Machine Learning (DML) method isolated the pandemic's impact on biomarkers, while minimizing the effects of confounding factors. Results: CATBoost and XGBoost were the most predictive ML models for LDL cholesterol and HbA1c, yielding R 2 values of 0.13 and 0.10, respectively. RF outperformed other models for BMI and BP, achieving R 2 values of 0.192 and 0.071. The small R 2 values were due to the prevalence of categorical features in the data with substantial variation in biomarker values. Feature importance analysis determined age, socioeconomic status, and race/ethnicity to be important drivers of biomarker changes, highlighting the role of social determinants of health. DML with DID analysis revealed a statistically significant increase (p-value <0.05) in BMI and systolic BP values for CVD patients during the COVID-19 pandemic compared to the control group, their HbA1c and LDL cholesterol values actually improved during the pandemic, suggesting differential effects of the pandemic on key CVD biomarkers. Conclusion: Our proposed ML biomarker prediction models can facilitate personalized interventions and advance risk assessment for CVD patients. The predictive importance of factors such as age, socioeconomic status, and race highlights the need to address health disparities.en_US
dc.relation.ispartof2024 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE)en_US
dc.relation.urlhttps://doi.org/10.1109/CHASE60773.2024.00011en_US
dc.subjectbiomarkersen_US
dc.subjectCardiovascular Diseasesen_US
dc.subjectcausal inferenceen_US
dc.subjectCOVID-19en_US
dc.subjectDifference-in-Differencesen_US
dc.subjectDouble/Debiased machine learningen_US
dc.subjectmachine learningen_US
dc.subjectpredictive modelingen_US
dc.subjectregressionen_US
dc.subjectSHAP valueen_US
dc.titleBiomarker Trajectory Prediction and Causal Analysis of the Impact of the Covid-19 Pandemic on CVD Patients using Machine Learningen_US
dc.typeConference Paperen_US
dc.source.journaltitleProceedings - 2024 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2024
dc.source.beginpage1
dc.source.endpage12
dc.identifier.journalProceedings - 2024 IEEE/ACM Conference on Connected Health: Applications, Systems and Engineering Technologies, CHASE 2024
dc.contributor.departmentCenter for Accelerating Practices to End Suicide (CAPES)en_US
dc.contributor.departmentGenomics and Computational Biologyen_US
dc.contributor.departmentMorningside Graduate School of Biomedical Sciencesen_US
dc.contributor.studentWinnie Mkandawire


This item appears in the following Collection(s)

Show simple item record