Application of a Naïve Bayes Classifier to Assign Polyadenylation Sites from 3' End Deep Sequencing Data: A Dissertation
AuthorsSheppard, Sarah E.
Faculty AdvisorNathan Lawson, PhD
UMass Chan AffiliationsMolecular, Cell and Cancer Biology Department
Document TypeDoctoral Dissertation
RNA 3' Polyadenylation Signals
High-Throughput Nucleotide Sequencing
RNA 3' Polyadenylation Signals
High-Throughput Nucleotide Sequencing
MetadataShow full item record
AbstractCleavage and polyadenylation of a precursor mRNA is important for transcription termination, mRNA stability, and regulation of gene expression. This process is directed by a multitude of protein factors and cis elements in the pre-mRNA sequence surrounding the cleavage and polyadenylation site. Importantly, the location of the cleavage and polyadenylation site helps define the 3’ untranslated region of a transcript, which is important for regulation by microRNAs and RNA binding proteins. Additionally, these sites have generally been poorly annotated. To identify 3’ ends, many techniques utilize an oligo-dT primer to construct deep sequencing libraries. However, this approach can lead to identification of artifactual polyadenylation sites due to internal priming in homopolymeric stretches of adenines. Previously, simple heuristic filters relying on the number of adenines in the genomic sequence downstream of a putative polyadenylation site have been used to remove these sites of internal priming. However, these simple filters may not remove all sites of internal priming and may also exclude true polyadenylation sites. Therefore, I developed a naïve Bayes classifier to identify putative sites from oligo-dT primed 3’ end deep sequencing as true or false/internally primed. Notably, this algorithm uses a combination of sequence elements to distinguish between true and false sites. Finally, the resulting algorithm is highly accurate in multiple model systems and facilitates identification of novel polyadenylation sites.
Permanent Link to this Itemhttp://hdl.handle.net/20.500.14038/32005
RightsCopyright is held by the author, with all rights reserved.
Showing items related by title, author, creator and subject.
Role of the Cytoplasmic Polyadenylation Element Binding Proteins in Neuron: A DissertationOruganty, Aparna (2013-02-26)Genome regulation is an extremely complex phenomenon. There are various mechanisms in place to ensure smooth performance of the organism. Post-transcriptional regulation of gene expression is one such mechanism. Many proteins bind to mRNAs and regulate their translation. In this thesis, I have focused on the Cytoplasmic Polyadenylation Element Binding family of proteins (CPEB1-4); a group of sequence specific RNA binding proteins important for cell cycle progression, senescence, neuronal function and plasticity. CPEB protein binds mRNAs containing a short Cytoplasmic Polyadenylation Element (CPE) in 3’ untranslated Region (UTR) and regulates the polyadenylation of these mRNAs and thereby controls translation. In Chapter II, I have presented my work on the regulation of mitochondrial function by CPEB. CPEB knockout mice have brain specific defects in mitochondrial function owing to a reduction in Electron transport chain complex I component protein NDUFV2. CPEB controls the translation of this NDUFV2 mRNA and thus affects mitochondrial function. A consequence of this reduced bioenergetics is reduced growth and branching of neurons, again emphasizing the importance of this pathway. Chapter III focuses on the role of CPEB4 in neuronal survival and protection against apoptosis. CPEB4 shuttles between nucleus and cytoplasm and becomes nuclear in response to stimulation with ionotropic glutamate receptors, focal ischemia in vivo and when cultured neurons are deprived of oxygen and glucose; nuclear CPEB4 affords protection against apoptosis in ischemia model. The underlying cause for nuclear translocation is reduction in Endoplasmic Reticulum calcium levels. These studies give an insight into the function and dynamics of these two RNA binding proteins and provide a better understanding of cellular biology.
Pre-messenger RNA cleavage factor I (CFIm): potential role in alternative polyadenylation during spermatogenesisSartini, Becky L.; Wang, Hang; Wang, Wei; Millette, Clarke F.; Kilpatrick, Daniel L. (2007-11-23)A hallmark of male germ cell gene expression is the generation by alternative polyadenylation of cell-specific mRNAs, many of which utilize noncanonical A(A/U)UAAA-independent polyadenylation signals. Cleavage factor I (CFIm), a component of the pre-mRNA cleavage and polyadenylation protein complex, can direct A(A/U)UAAA-independent polyadenylation site selection of somatic cell mRNAs. Here we report that the CFIm subunits NUDT21/CPSF5 and CPSF6 are highly enriched in mouse male germ cells relative to somatic cells. Both subunits are expressed from spermatogenic cell mRNAs that are shorter than the corresponding somatic transcripts. Complementary DNA sequencing and Northern blotting revealed that the shorter Nudt21 and Cpsf6 mRNAs are generated by alternative polyadenylation in male germ cells using proximal poly(A) signals. Both sets of transcripts contain CFIm binding sites within their 3'-untranslated regions, suggesting autoregulation of CFIm subunit formation in male germ cells. CFIm subunit mRNA and protein levels exhibit distinct developmental variation during spermatogenesis, indicating stage-dependent translational and/or posttranslational regulation. CFIm binding sites were identified near the 3' ends of numerous male germ cell transcripts utilizing A(A/U)UAAA-independent sites. Together these findings suggest that CFIm complexes participate in alternative polyadenylation directed by noncanonical poly(A) signals during spermatogenesis.
Regulated Pumilio-2 binding controls RINGO/Spy mRNA translation and CPEB activationPadmanabhan, Kiran; Richter, Joel D. (2006-01-19)CPEB is a sequence-specific RNA-binding protein that controls the polyadenylation-induced translation of mos and cyclin B1 mRNAs in maturing Xenopus oocytes. CPEB activity requires not only the phosphorylation of S174, but also the synthesis of a heretofore-unknown upstream effector molecule. We show that the synthesis of RINGO/Spy, an atypical activator of cyclin-dependent kinases (cdks), is necessary for CPEB-directed polyadenylation. Deletion analysis and mRNA reporter assays show that a cis element in the RINGO/Spy 3'UTR is necessary for translational repression in immature (G2-arrested) oocytes. The repression is mediated by 3'UTR Pumilio-Binding Elements (PBEs), and by its binding protein Pumilio 2 (Pum2). Pum2 also interacts with the Xenopus homolog of human Deleted for Azoospermia-like (DAZL) and the embryonic poly(A)-binding protein (ePAB). Following the induction of maturation, Pum2 dissociates not only from RINGO/Spy mRNA, but from XDAZL and ePAB as well; as a consequence, RINGO/Spy mRNA is translated. These results demonstrate that a reversible Pum2 interaction controls RINGO/Spy mRNA translation and, as a result, CPEB-mediated cytoplasmic polyadenylation.