Annotating the human genome with Disease Ontology
dc.contributor.author | Osborne, John D. | |
dc.contributor.author | Flatow, Jared M. | |
dc.contributor.author | Holko, Michelle | |
dc.contributor.author | Lin, Simon M. | |
dc.contributor.author | Kibbe, Warren A. | |
dc.contributor.author | Zhu, Lihua Julie | |
dc.contributor.author | Danila, Maria I. | |
dc.contributor.author | Feng, Gang | |
dc.contributor.author | Chisholm, Rex L. | |
dc.date | 2022-08-11T08:10:16.000 | |
dc.date.accessioned | 2022-08-23T17:01:52Z | |
dc.date.available | 2022-08-23T17:01:52Z | |
dc.date.issued | 2009-07-25 | |
dc.date.submitted | 2011-04-19 | |
dc.identifier.citation | BMC Genomics. 2009 Jul 7;10 Suppl 1:S6. <a href="http://dx.doi.org/10.1186/1471-2164-10-S1-S6">Link to article on publisher's site</a> | |
dc.identifier.issn | 1471-2164 (Linking) | |
dc.identifier.doi | 10.1186/1471-2164-10-S1-S6 | |
dc.identifier.pmid | 19594883 | |
dc.identifier.uri | http://hdl.handle.net/20.500.14038/44102 | |
dc.description.abstract | BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. RESULTS: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. CONCLUSION: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. | |
dc.language.iso | en_US | |
dc.relation | <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&list_uids=19594883&dopt=Abstract">Link to Article in PubMed</a> | |
dc.subject | Computational Biology | |
dc.subject | *Databases, Genetic | |
dc.subject | *Genome, Human | |
dc.subject | Humans | |
dc.subject | *Software | |
dc.subject | *Unified Medical Language System | |
dc.subject | Genetics and Genomics | |
dc.title | Annotating the human genome with Disease Ontology | |
dc.type | Journal Article | |
dc.source.journaltitle | BMC genomics | |
dc.source.volume | 10 Suppl 1 | |
dc.identifier.legacyfulltext | https://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=1076&context=pgfe_pp&unstamped=1 | |
dc.identifier.legacycoverpage | https://escholarship.umassmed.edu/pgfe_pp/76 | |
dc.identifier.contextkey | 1946731 | |
refterms.dateFOA | 2022-08-23T17:01:53Z | |
html.description.abstract | <p>BACKGROUND: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases.</p> <p>RESULTS: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations.</p> <p>CONCLUSION: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome.</p> | |
dc.identifier.submissionpath | pgfe_pp/76 | |
dc.contributor.department | Program in Molecular Medicine | |
dc.contributor.department | Program in Gene Function and Expression | |
dc.source.pages | S6 |