Show simple item record

dc.contributor.authorLibbrecht, Maxwell Wing
dc.contributor.authorRodriguez, Oscar
dc.contributor.authorWeng, Zhiping
dc.contributor.authorHoffman, Michael
dc.contributor.authorBilmes, Jeffrey A.
dc.contributor.authorNoble, William Stafford
dc.date2022-08-11T08:08:23.000
dc.date.accessioned2022-08-23T15:53:05Z
dc.date.available2022-08-23T15:53:05Z
dc.date.issued2018-04-26
dc.date.submitted2018-06-07
dc.identifier.citation<p>bioRxiv 086025; doi: https://doi.org/10.1101/086025. <a href="https://doi.org/10.1101/086025" target="_blank">Link to preprint on bioRxiv service.</a></p>
dc.identifier.doi10.1101/086025
dc.identifier.urihttp://hdl.handle.net/20.500.14038/29282
dc.description.abstractSemi-automated genome annotation methods such as Segway enable understanding of chromatin activity. Here we present chromatin state annotations of 164 human cell types using 1,615 genomics data sets. To produce these annotations, we developed a fully-automated annotation strategy in which we train separate unsupervised annotation models on each cell type and use a machine learning classifier to automate the state interpretation step. Using these annotations, we developed a measure of the functional importance of each genomic position called the "functionality score," which allows us to aggregate information across cell types into a multi-cell type view. This score provides a measure of importance directly attributable to a specific activity in a specific set of cell types. In contrast to evolutionary conservation, this measure is not biased to detect only elements shared with related species. Using the functionality score, we combined all our annotations into a single cell type-agnostic encyclopedia that catalogs all human functional regulatory elements, enabling easy and intuitive interpretation of the effect of genome variants on phenotype, such as in disease-associated, evolutionarily conserved or positively selected loci. These resources, including cell type-specific annotations, enyclopedia, and a visualization server, are available at http://noble.gs.washington.edu/proj/encyclopedia.
dc.language.isoen_US
dc.relation<p>Now published in Genome Biol. 2019 Aug 28;20(1):180. doi: <a href="http://dx.doi.org/10.1186/s13059-019-1784-2" target="_blank" title="View published article">10.1186/s13059-019-1784-2</a>. </p>
dc.rightsThe copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY 4.0 International license.
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDNA
dc.subjectSemi-automated genome annotation
dc.subjectchromatin
dc.subjectencyclopedia
dc.subjectgenomics
dc.subjectCells
dc.subjectComputational Biology
dc.subjectGenetic Phenomena
dc.subjectGenomics
dc.titleA unified encyclopedia of human functional DNA elements through fully automated annotation of 164 human cell types [preprint]
dc.typePreprint
dc.source.journaltitlebioRxiv
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=2516&amp;context=faculty_pubs&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/faculty_pubs/1510
dc.identifier.contextkey12271396
refterms.dateFOA2022-08-23T15:53:05Z
html.description.abstract<p>Semi-automated genome annotation methods such as Segway enable understanding of chromatin activity. Here we present chromatin state annotations of 164 human cell types using 1,615 genomics data sets. To produce these annotations, we developed a fully-automated annotation strategy in which we train separate unsupervised annotation models on each cell type and use a machine learning classifier to automate the state interpretation step. Using these annotations, we developed a measure of the functional importance of each genomic position called the "functionality score," which allows us to aggregate information across cell types into a multi-cell type view. This score provides a measure of importance directly attributable to a specific activity in a specific set of cell types. In contrast to evolutionary conservation, this measure is not biased to detect only elements shared with related species. Using the functionality score, we combined all our annotations into a single cell type-agnostic encyclopedia that catalogs all human functional regulatory elements, enabling easy and intuitive interpretation of the effect of genome variants on phenotype, such as in disease-associated, evolutionarily conserved or positively selected loci. These resources, including cell type-specific annotations, enyclopedia, and a visualization server, are available at http://noble.gs.washington.edu/proj/encyclopedia.</p>
dc.identifier.submissionpathfaculty_pubs/1510
dc.contributor.departmentProgram in Bioinformatics and Integrative Biology


Files in this item

Thumbnail
Name:
086025.full.pdf
Size:
3.186Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY 4.0 International license.
Except where otherwise noted, this item's license is described as The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY 4.0 International license.