A machine learning approach for the prediction of protein surface loop flexibility
| dc.contributor.author | Hwang, Howook | |
| dc.contributor.author | Vreven, Thom | |
| dc.contributor.author | Whitfield, Troy W. | |
| dc.contributor.author | Wiehe, Kevin | |
| dc.contributor.author | Weng, Zhiping | |
| dc.date | 2022-08-11T08:07:59.000 | |
| dc.date.accessioned | 2022-08-23T15:38:11Z | |
| dc.date.available | 2022-08-23T15:38:11Z | |
| dc.date.issued | 2011-08-01 | |
| dc.date.submitted | 2013-02-22 | |
| dc.identifier.citation | <p>Proteins. 2011 Aug;79(8):2467-74. doi: 10.1002/prot.23070. Epub 2011 Jun 1. <a href="http://dx.doi.org/10.1002/prot.23070">Link to article on publisher's site</a></p> | |
| dc.identifier.issn | 0887-3585 (Linking) | |
| dc.identifier.doi | 10.1002/prot.23070 | |
| dc.identifier.pmid | 21633973 | |
| dc.identifier.uri | http://hdl.handle.net/20.500.14038/25883 | |
| dc.description.abstract | Proteins often undergo conformational changes when binding to each other. A major fraction of backbone conformational changes involves motion on the protein surface, particularly in loops. Accounting for the motion of protein surface loops represents a challenge for protein-protein docking algorithms. A first step in addressing this challenge is to distinguish protein surface loops that are likely to undergo backbone conformational changes upon protein-protein binding (mobile loops) from those that are not (stationary loops). In this study, we developed a machine learning strategy based on support vector machines (SVMs). Our SVM uses three features of loop residues in the unbound protein structures-Ramachandran angles, crystallographic B-factors, and relative accessible surface area-to distinguish mobile loops from stationary ones. This method yields an average prediction accuracy of 75.3% compared with a random prediction accuracy of 50%, and an average of 0.79 area under the receiver operating characteristic (ROC) curve using cross-validation. Testing the method on an independent dataset, we obtained a prediction accuracy of 70.5%. Finally, we applied the method to 11 complexes that involve members from the Ras superfamily and achieved prediction accuracy of 92.8% for the Ras superfamily proteins and 74.4% for their binding partners. | |
| dc.language.iso | en_US | |
| dc.relation | <p><a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&list_uids=21633973&dopt=Abstract">Link to Article in PubMed</a></p> | |
| dc.relation.url | http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3341935/ | |
| dc.subject | *Artificial Intelligence | |
| dc.subject | Protein Structure, Secondary | |
| dc.subject | Proteins | |
| dc.subject | Amino Acids, Peptides, and Proteins | |
| dc.subject | Bioinformatics | |
| dc.subject | Computational Biology | |
| dc.subject | Systems Biology | |
| dc.title | A machine learning approach for the prediction of protein surface loop flexibility | |
| dc.type | Journal Article | |
| dc.source.journaltitle | Proteins | |
| dc.source.volume | 79 | |
| dc.source.issue | 8 | |
| dc.identifier.legacycoverpage | https://escholarship.umassmed.edu/bioinformatics_pubs/25 | |
| dc.identifier.contextkey | 3761406 | |
| html.description.abstract | <p>Proteins often undergo conformational changes when binding to each other. A major fraction of backbone conformational changes involves motion on the protein surface, particularly in loops. Accounting for the motion of protein surface loops represents a challenge for protein-protein docking algorithms. A first step in addressing this challenge is to distinguish protein surface loops that are likely to undergo backbone conformational changes upon protein-protein binding (mobile loops) from those that are not (stationary loops). In this study, we developed a machine learning strategy based on support vector machines (SVMs). Our SVM uses three features of loop residues in the unbound protein structures-Ramachandran angles, crystallographic B-factors, and relative accessible surface area-to distinguish mobile loops from stationary ones. This method yields an average prediction accuracy of 75.3% compared with a random prediction accuracy of 50%, and an average of 0.79 area under the receiver operating characteristic (ROC) curve using cross-validation. Testing the method on an independent dataset, we obtained a prediction accuracy of 70.5%. Finally, we applied the method to 11 complexes that involve members from the Ras superfamily and achieved prediction accuracy of 92.8% for the Ras superfamily proteins and 74.4% for their binding partners.</p> | |
| dc.identifier.submissionpath | bioinformatics_pubs/25 | |
| dc.contributor.department | Program in Bioinformatics and Integrative Biology | |
| dc.contributor.department | Department of Biochemistry and Molecular Pharmacology | |
| dc.source.pages | 2467-74 |