Show simple item record

dc.contributor.authorMoore, Jill E.
dc.contributor.authorZhang, Xiao-Ou
dc.contributor.authorElhajjajy, Shaimae I
dc.contributor.authorFan, Kaili
dc.contributor.authorPratt, Henry E
dc.contributor.authorReese, Fairlie
dc.contributor.authorMortazavi, Ali
dc.contributor.authorWeng, Zhiping
dc.date2022-08-11T08:08:00.000
dc.date.accessioned2022-08-23T15:38:34Z
dc.date.available2022-08-23T15:38:34Z
dc.date.issued2021-12-23
dc.date.submitted2022-07-05
dc.identifier.citation<p>Moore JE, Zhang XO, Elhajjajy SI, Fan K, Pratt HE, Reese F, Mortazavi A, Weng Z. Integration of high-resolution promoter profiling assays reveals novel, cell type-specific transcription start sites across 115 human cell and tissue types. Genome Res. 2022 Feb;32(2):389-402. doi: 10.1101/gr.275723.121. Epub 2021 Dec 23. PMID: 34949670; PMCID: PMC8805725. <a href="https://doi.org/10.1101/gr.275723.121">Link to article on publisher's site</a></p>
dc.identifier.issn1088-9051 (Linking)
dc.identifier.doi10.1101/gr.275723.121
dc.identifier.pmid34949670
dc.identifier.urihttp://hdl.handle.net/20.500.14038/25963
dc.description.abstractAccurate transcription start site (TSS) annotations are essential for understanding transcriptional regulation and its role in human disease. Gene collections such as GENCODE contain annotations for tens of thousands of TSSs, but not all of these annotations are experimentally validated nor do they contain information on cell type-specific usage. Therefore, we sought to generate a collection of experimentally validated TSSs by integrating RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression (RAMPAGE) data from 115 cell and tissue types, which resulted in a collection of approximately 50 thousand representative RAMPAGE peaks. These peaks are primarily proximal to GENCODE-annotated TSSs and are concordant with other transcription assays. Because RAMPAGE uses paired-end reads, we were then able to connect peaks to transcripts by analyzing the genomic positions of the 3' ends of read mates. Using this paired-end information, we classified the vast majority (37 thousand) of our RAMPAGE peaks as verified TSSs, updating TSS annotations for 20% of GENCODE genes. We also found that these updated TSS annotations are supported by epigenomic and other transcriptomic data sets. To show the utility of this RAMPAGE rPeak collection, we intersected it with the NHGRI/EBI genome-wide association study (GWAS) catalog and identified new candidate GWAS genes. Overall, our work shows the importance of integrating experimental data to further refine TSS annotations and provides a valuable resource for the biological community.
dc.language.isoen_US
dc.relation<p><a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&list_uids=34949670&dopt=Abstract">Link to Article in PubMed</a></p>
dc.rights© 2022 Moore et al.; Published by Cold Spring Harbor Laboratory Press. This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subjectBioinformatics
dc.subjectComputational Biology
dc.subjectGenetics and Genomics
dc.subjectIntegrative Biology
dc.subjectSystems Biology
dc.titleIntegration of high-resolution promoter profiling assays reveals novel, cell type-specific transcription start sites across 115 human cell and tissue types
dc.typeJournal Article
dc.source.journaltitleGenome research
dc.source.volume32
dc.source.issue2
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=1185&amp;context=bioinformatics_pubs&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/bioinformatics_pubs/174
dc.identifier.contextkey30064404
refterms.dateFOA2022-08-23T15:38:34Z
html.description.abstract<p>Accurate transcription start site (TSS) annotations are essential for understanding transcriptional regulation and its role in human disease. Gene collections such as GENCODE contain annotations for tens of thousands of TSSs, but not all of these annotations are experimentally validated nor do they contain information on cell type-specific usage. Therefore, we sought to generate a collection of experimentally validated TSSs by integrating RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression (RAMPAGE) data from 115 cell and tissue types, which resulted in a collection of approximately 50 thousand representative RAMPAGE peaks. These peaks are primarily proximal to GENCODE-annotated TSSs and are concordant with other transcription assays. Because RAMPAGE uses paired-end reads, we were then able to connect peaks to transcripts by analyzing the genomic positions of the 3' ends of read mates. Using this paired-end information, we classified the vast majority (37 thousand) of our RAMPAGE peaks as verified TSSs, updating TSS annotations for 20% of GENCODE genes. We also found that these updated TSS annotations are supported by epigenomic and other transcriptomic data sets. To show the utility of this RAMPAGE rPeak collection, we intersected it with the NHGRI/EBI genome-wide association study (GWAS) catalog and identified new candidate GWAS genes. Overall, our work shows the importance of integrating experimental data to further refine TSS annotations and provides a valuable resource for the biological community.</p>
dc.identifier.submissionpathbioinformatics_pubs/174
dc.contributor.departmentGraduate School of Biomedical Sciences
dc.contributor.departmentProgram in Bioinformatics and Integrative Biology
dc.source.pages389-402


Files in this item

Thumbnail
Name:
Genome_Res._2022_Moore_389_402.pdf
Size:
1.873Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

© 2022 Moore et al.; Published by Cold Spring Harbor Laboratory Press. This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
Except where otherwise noted, this item's license is described as © 2022 Moore et al.; Published by Cold Spring Harbor Laboratory Press. This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.