Show simple item record

dc.contributor.authorGupta, Ankit
dc.contributor.authorChristensen, Ryan G.
dc.contributor.authorBell, Heather A.
dc.contributor.authorGoodwin, Mathew
dc.contributor.authorPatel, Ronak Y.
dc.contributor.authorPandey, Manishi
dc.contributor.authorEnuameh, Metewo Selase
dc.contributor.authorRayla, Amy L.
dc.contributor.authorZhu, Cong
dc.contributor.authorThibodeau-Beganny, Stacey
dc.contributor.authorBrodsky, Michael H.
dc.contributor.authorJoung, J. Keith
dc.contributor.authorWolfe, Scot A.
dc.contributor.authorStormo, Gary D.
dc.date2022-08-11T08:08:55.000
dc.date.accessioned2022-08-23T16:12:22Z
dc.date.available2022-08-23T16:12:22Z
dc.date.issued2014-04-01
dc.date.submitted2015-08-31
dc.identifier.citationNucleic Acids Res. 2014 Apr;42(8):4800-12. doi: 10.1093/nar/gku132. Epub 2014 Feb 12. <a href="http://dx.doi.org/10.1093/nar/gku132">Link to article on publisher's site</a>
dc.identifier.issn0305-1048 (Linking)
dc.identifier.doi10.1093/nar/gku132
dc.identifier.pmid24523353
dc.identifier.urihttp://hdl.handle.net/20.500.14038/33379
dc.description.abstractCys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.
dc.language.isoen_US
dc.relation<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&list_uids=24523353&dopt=Abstract">Link to Article in PubMed</a>
dc.rightsCopyright © The Author(s) 2014. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (<a href="http://creativecommons.org/licenses/by-nc/3.0/">http://creativecommons.org/licenses/by-nc/3.0/</a>), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
dc.subjectArtificial Intelligence; Binding Sites; DNA; DNA-Binding Proteins; Models, Biological; Nucleotide Motifs; *Regulatory Elements, Transcriptional; Transcription Factors; *Zinc Fingers
dc.subjectBiochemistry
dc.subjectComputational Biology
dc.titleAn improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
dc.typeJournal Article
dc.source.journaltitleNucleic acids research
dc.source.volume42
dc.source.issue8
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=2925&amp;context=gsbs_sp&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/gsbs_sp/1904
dc.identifier.contextkey7536278
refterms.dateFOA2022-08-23T16:12:22Z
html.description.abstract<p>Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.</p>
dc.identifier.submissionpathgsbs_sp/1904
dc.contributor.departmentProgram in Molecular Medicine
dc.contributor.departmentDepartment of Biochemistry and Molecular Pharmacology
dc.contributor.departmentProgram in Gene Function and Expression
dc.source.pages4800-12
dc.contributor.studentAnkit Gupta


Files in this item

Thumbnail
Name:
Nucl._Acids_Res._2014_Gupta_48 ...
Size:
7.115Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record