Show simple item record

dc.contributor.advisorJennifer F. Tseng
dc.contributor.authorArous, Edward J.
dc.contributor.authorSmith, Jillian K.
dc.contributor.authorNg, Sing Chau
dc.contributor.authorTseng, Jennifer F.
dc.contributor.authorMcDade, Theodore P.
dc.date2022-08-11T08:10:54.000
dc.date.accessioned2022-08-23T17:24:22Z
dc.date.available2022-08-23T17:24:22Z
dc.date.issued2011-05-02
dc.date.submitted2011-05-16
dc.identifier.doi10.13028/pe16-bd02
dc.identifier.urihttp://hdl.handle.net/20.500.14038/49176
dc.description<p>Medical student Edward Arous participated in this study as part of the Senior Scholars research program at the University of Massachusetts Medical School.</p>
dc.description.abstractBackground: The importance of an electronic medical record has been highlighted for both clinical care and research. In the current era, data warehouses and repositories have been established to serve the dual function of patient care and investigation. Purpose: The aim of this study was to compare a newly developed institutional clinical data warehouse, linked with the hospital information system (HIS), to a prospectively-maintained departmental database. Methods: A novel HIS-linked institutional clinical data warehouse was queried for 9 primary and secondary ICD-9-CM discharge diagnosis codes for pancreatic cancer. The database captured inpatient and outpatient clinical and billing information from a pool of over 2 million patients evaluated at an academic medical institution and its affiliates since 1995. A cohort was identified; following Institutional Review Board approval, demographic and clinical data was obtained. This data was compared to a manually-entered and prospectively-maintained surgical oncology database of the same institution, tracking 394 patients since 1999. Duplicated patients, and those unique to either dataset, were flagged. Patients with diagnosis dates prior to 1999 were excluded to allow comparison over the same time period. For validation purposes, a 10% random sample of remaining patients unique to each dataset underwent manual review of medical records including clinic notes, admission/discharge notes, diagnostic imaging, and pathology reports. Results: 1107 patients were identified from the HIS-linked dataset with pancreatic neoplasm-associated diagnosis codes dating from 1999 to 2009. Of these, 254 (22.9%) were captured in both datasets, while 853 (77.1%) were only in the HIS-linked dataset. Manual review of the 10% subset of the HIS-only group demonstrated that 55.6% of patients were without identifiable pancreatic pathology, suggesting miscoding, while 31.7% had diagnoses consistent with pancreatic neoplasm, and 12.7% had pseudocyst or pancreatitis. Of the 394 patients tracked by surgical oncology, 254 (64.5%) were captured in both datasets, while 140 (35.5%) had not been captured in the HIS-linked dataset. Manual review of the 10% subset of the non-captured patients demonstrated 93.3% with pancreatic neoplasm and 6.7% with pseudocyst or pancreatitis. Lastly, a review of the 10% subset of the 254 patient overlap demonstrated that 87.5% of patients were with pancreatic neoplasm, 8.3% with pseudocyst or pancreatitis, and 4.2% without pancreatic pathology. Conclusions: While technological advances provide a powerful means to automate institutional-level cohort identification and data collection, a high degree of misclassification may be present if queries are based solely on ICD-9-CM discharge codes. For that reason, careful validation and data cleaning are critical steps prior to research use. These results also suggest cautious interpretation of national-level administrative data utilizing ICD-9-CM diagnosis codes. Our findings suggest that the current state-of-the-art data warehouses continue to require clinical correlation and validation through traditional retrospective mechanisms.
dc.language.isoen_US
dc.rightsCopyright is held by the author(s), with all rights reserved.
dc.subjectElectronic Health Records
dc.subjectMedical Records Systems
dc.subjectComputerized
dc.subjectBiomedical Research
dc.subjectPancreatic Neoplasms
dc.subjectData Collection
dc.subjectInternational Classification of Diseases
dc.subjectHealth Information Technology
dc.subjectHealth Services Research
dc.subjectSurgery
dc.titleTales from the EMR: Does a 21st-Century Data Warehouse Facilitate Clinical Research for Pancreatic Cancer?
dc.typePoster
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=1113&amp;context=ssp&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/ssp/114
dc.identifier.contextkey2015770
refterms.dateFOA2022-08-28T02:54:39Z
html.description.abstract<p><strong>Background:</strong> The importance of an electronic medical record has been highlighted for both clinical care and research. In the current era, data warehouses and repositories have been established to serve the dual function of patient care and investigation.</p> <p><strong>Purpose: </strong>The aim of this study was to compare a newly developed institutional clinical data warehouse, linked with the hospital information system (HIS), to a prospectively-maintained departmental database.</p> <p><strong>Methods: </strong>A novel HIS-linked institutional clinical data warehouse was queried for 9 primary and secondary <em>ICD-9-CM</em> discharge diagnosis codes for pancreatic cancer. The database captured inpatient and outpatient clinical and billing information from a pool of over 2 million patients evaluated at an academic medical institution and its affiliates since 1995. A cohort was identified; following Institutional Review Board approval, demographic and clinical data was obtained. This data was compared to a manually-entered and prospectively-maintained surgical oncology database of the same institution, tracking 394 patients since 1999. Duplicated patients, and those unique to either dataset, were flagged. Patients with diagnosis dates prior to 1999 were excluded to allow comparison over the same time period. For validation purposes, a 10% random sample of remaining patients unique to each dataset underwent manual review of medical records including clinic notes, admission/discharge notes, diagnostic imaging, and pathology reports.</p> <p><strong>Results: </strong>1107 patients were identified from the HIS-linked dataset with pancreatic neoplasm-associated diagnosis codes dating from 1999 to 2009. Of these, 254 (22.9%) were captured in both datasets, while 853 (77.1%) were only in the HIS-linked dataset. Manual review of the 10% subset of the HIS-only group demonstrated that 55.6% of patients were without identifiable pancreatic pathology, suggesting miscoding, while 31.7% had diagnoses consistent with pancreatic neoplasm, and 12.7% had pseudocyst or pancreatitis. Of the 394 patients tracked by surgical oncology, 254 (64.5%) were captured in both datasets, while 140 (35.5%) had not been captured in the HIS-linked dataset. Manual review of the 10% subset of the non-captured patients demonstrated 93.3% with pancreatic neoplasm and 6.7% with pseudocyst or pancreatitis. Lastly, a review of the 10% subset of the 254 patient overlap demonstrated that 87.5% of patients were with pancreatic neoplasm, 8.3% with pseudocyst or pancreatitis, and 4.2% without pancreatic pathology.</p> <p><strong>Conclusions: </strong>While technological advances provide a powerful means to automate institutional-level cohort identification and data collection, a high degree of misclassification may be present if queries are based solely on <em>ICD-9-CM </em>discharge codes. For that reason, careful validation and data cleaning are critical steps prior to research use. These results also suggest cautious interpretation of national-level administrative data utilizing <em>ICD-9-CM </em>diagnosis codes. Our findings suggest that the current state-of-the-art data warehouses continue to require clinical correlation and validation through traditional retrospective mechanisms.</p>
dc.identifier.submissionpathssp/114
dc.contributor.departmentSenior Scholars Program
dc.contributor.departmentDepartment of Surgery, Surgical Outcomes Analysis and Research


Files in this item

Thumbnail
Name:
Edward_Arous_Senior_Scholars_P ...
Size:
1005.Kb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record