Show simple item record

dc.contributor.authorCenik, Can
dc.contributor.authorChua, Hon Nian
dc.contributor.authorSingh, Guramrit
dc.contributor.authorAkef, Abdalla
dc.contributor.authorSnyder, Michael P.
dc.contributor.authorPalazzo, Alexander F.
dc.contributor.authorMoore, Melissa J.
dc.contributor.authorRoth, Frederick P.
dc.date2022-08-11T08:09:46.000
dc.date.accessioned2022-08-23T16:43:03Z
dc.date.available2022-08-23T16:43:03Z
dc.date.issued2017-03-01
dc.date.submitted2017-05-09
dc.identifier.citationRNA. 2017 Mar;23(3):270-283. doi: 10.1261/rna.059105.116. Epub 2016 Dec 19. <a href="https://doi.org/10.1261/rna.059105.116">Link to article on publisher's site</a>
dc.identifier.issn1355-8382 (Linking)
dc.identifier.doi10.1261/rna.059105.116
dc.identifier.pmid27994090
dc.identifier.urihttp://hdl.handle.net/20.500.14038/40214
dc.description.abstractIntrons are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with > 80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5' proximal-intron-minus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising approximately 20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC.
dc.language.isoen_US
dc.relation<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&list_uids=27994090&dopt=Abstract">Link to Article in PubMed</a>
dc.relation.urlhttps://www.ncbi.nlm.nih.gov/pubmed/?term=27994090
dc.rights© 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society. Freely available online through the RNA Open Access option.
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject5′-UTR introns
dc.subjectN1-methyladenosine
dc.subjectexon junction complex
dc.subjectrandom forest
dc.subjectBiochemistry, Biophysics, and Structural Biology
dc.subjectBioinformatics
dc.subjectGenetics and Genomics
dc.titleA common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification
dc.typeJournal Article
dc.source.journaltitleRNA (New York, N.Y.)
dc.source.volume23
dc.source.issue3
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=4014&amp;context=oapubs&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/oapubs/3009
dc.identifier.contextkey10138278
refterms.dateFOA2022-08-23T16:43:03Z
html.description.abstract<p>Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with > 80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5' proximal-intron-minus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising approximately 20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC.</p>
dc.identifier.submissionpathoapubs/3009
dc.contributor.departmentRNA Therapeutics Institute
dc.contributor.departmentDepartment of Biochemistry and Molecular Pharmacology
dc.source.pages270-283


Files in this item

Thumbnail
Name:
RNA_2017_Cenik_270_83.pdf
Size:
1.061Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

© 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society. Freely available online through the RNA Open Access option.
Except where otherwise noted, this item's license is described as © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society. Freely available online through the RNA Open Access option.