• A machine learning approach for the prediction of protein surface loop flexibility

      Hwang, Howook; Vreven, Thom; Whitfield, Troy W.; Wiehe, Kevin; Weng, Zhiping (2011-08-01)
      Proteins often undergo conformational changes when binding to each other. A major fraction of backbone conformational changes involves motion on the protein surface, particularly in loops. Accounting for the motion of protein surface loops represents a challenge for protein-protein docking algorithms. A first step in addressing this challenge is to distinguish protein surface loops that are likely to undergo backbone conformational changes upon protein-protein binding (mobile loops) from those that are not (stationary loops). In this study, we developed a machine learning strategy based on support vector machines (SVMs). Our SVM uses three features of loop residues in the unbound protein structures-Ramachandran angles, crystallographic B-factors, and relative accessible surface area-to distinguish mobile loops from stationary ones. This method yields an average prediction accuracy of 75.3% compared with a random prediction accuracy of 50%, and an average of 0.79 area under the receiver operating characteristic (ROC) curve using cross-validation. Testing the method on an independent dataset, we obtained a prediction accuracy of 70.5%. Finally, we applied the method to 11 complexes that involve members from the Ras superfamily and achieved prediction accuracy of 92.8% for the Ras superfamily proteins and 74.4% for their binding partners.
    • Prediction of homologous protein structures based on conformational searches and energetics

      Schiffer, Celia A.; Caldwell, James W.; Kollman, Peter A.; Stroud, Robert M. (1990-01-01)
      A "knowledge-based" method of predicting the unknown structure of a protein from a homologous known structure using energetics to determine a sidechain conformation is proposed. The method consists of exchanging the residues in the known structure for the sequence of the unknown protein. Then a conformational search with molecular mechanics energy minimization is done on the exchanged residues. The lowest energy conformer is the one picked to be the predicted structure. In the structure of bovine trypsin, the importance of including a solvation energy term in the search is demonstrated for solvent accessible residues, while molecular mechanics alone is enough to correctly predict the conformation of internal residues. The correctness of the model is assessed by a volume error overlap of the predicted structure compared to the crystal structure. Finally, the structure of rat trypsin is predicted from the crystal structure of bovine trypsin. The sequences of these two proteins are 74% identical and all of the significant changes between them are on external residues. Thus, the inclusion of solvation energy in the conformational search is necessary to accurately predict the structure of the exchanged residues.