Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) [preprint]
Authors
Jones, SaraBradwell, Katie R
Chan, Lauren E
Olson-Chen, Courtney
Tarleton, Jessica
Wilkins, Kenneth J
Qin, Qiuyuan
Faherty, Emily Groene
Lau, Yan Kwan
Xie, Catherine
Kao, Yu-Han
Liebman, Michael N
Mariona, Federico
Challa, Anup
Li, Li
Ratcliffe, Sarah J
McMurry, Julie A
Haendel, Melissa A
Patel, Rena C
Hill, Elaine L
UMass Chan Affiliations
Center for Clinical and Translational ScienceDocument Type
PreprintPublication Date
2022-08-06
Metadata
Show full item recordAbstract
Objective: To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). Materials and methods: We developed a comprehensive approach, named H ierarchy and rule-based pregnancy episode I nference integrated with P regnancy P rogression S ignatures (HIPPS) and applied it to EHR data in the N3C from 1 January 2018 to 7 April 2022. HIPPS combines: 1) an extension of a previously published pregnancy episode algorithm, 2) a novel algorithm to detect gestational aging-specific signatures of a progressing pregnancy for further episode support, and 3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated three types of pregnancy cohorts based on the level of precision for gestational aging and pregnancy outcomes for comparison of COVID-19 and other characteristics. Results: We identified 628,165 pregnant persons with 816,471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, spontaneous abortions), and 23.3% had unknown outcomes. We were able to estimate start dates within one week of precision for 431,173 (52.8%) episodes. 66,019 (8.1%) episodes had incident COVID-19 during pregnancy. Across varying COVID-19 cohorts, patient characteristics were generally similar though pregnancy outcomes differed. Discussion: HIPPS provides support for pregnancy-related variables based on EHR data for researchers to define pregnancy cohorts. Our approach performed well based on clinician validation. Conclusion: We have developed a novel and robust approach for inferring pregnancy episodes and gestational aging that addresses data inconsistency and missingness in EHR data.Source
Jones S, Bradwell KR, Chan LE, Olson-Chen C, Tarleton J, Wilkins KJ, Qin Q, Faherty EG, Lau YK, Xie C, Kao YH, Liebman MN, Mariona F, Challa A, Li L, Ratcliffe SJ, McMurry JA, Haendel MA, Patel RC, Hill EL. Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C). medRxiv [Preprint]. 2022 Aug 6:2022.08.04.22278439. doi: 10.1101/2022.08.04.22278439. PMID: 35982668; PMCID: PMC9387155.DOI
10.1101/2022.08.04.22278439Permanent Link to this Item
http://hdl.handle.net/20.500.14038/51163PubMed ID
35982668Notes
This article is a preprint. Preprints are preliminary reports of work that have not been certified by peer review.Funding and Acknowledgements
The UMass Center for Clinical and Translational Science (UMCCTS), UL1TR001453, helped fund this study.Rights
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.Distribution License
http://creativecommons.org/licenses/by/4.0/ae974a485f413a2113503eed53cd6c53
10.1101/2022.08.04.22278439
Scopus Count
The following license files are associated with this item:
- Creative Commons
Except where otherwise noted, this item's license is described as The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.