Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility
Rhys Heffernan;Yuedong Yang;Kuldip Paliwal;Yaoqi Zhou;
1Signal Processing Laboratory, Griffith University, Brisbane, QLD 4111, Australia and 2Institute for Glycomics and School of Information and Communication Technology, Griffith University, Southport, QLD 4222, Australia
Checking for direct PDF access through Ovid
Motivation:The accuracy of predicting protein local and global structural properties such as secondary structure and solvent accessible surface area has been stagnant for many years because of the challenge of accounting for non-local interactions between amino acid residues that are close in three-dimensional structural space but far from each other in their sequence positions. All existing machine-learning techniques relied on a sliding window of 10–20 amino acid residues to capture some ‘short to intermediate’ non-local interactions. Here, we employed Long Short-Term Memory (LSTM) Bidirectional Recurrent Neural Networks (BRNNs) which are capable of capturing long range interactions without using a window.Results:We showed that the application of LSTM-BRNN to the prediction of protein structural properties makes the most significant improvement for residues with the most long-range contacts (|i-j| >19) over a previous window-based, deep-learning method SPIDER2. Capturing long-range interactions allows the accuracy of three-state secondary structure prediction to reach 84% and the correlation coefficient between predicted and actual solvent accessible surface areas to reach 0.80, plus a reduction of 5%, 10%, 5% and 10% in the mean absolute error for backbone Symbol, ψ, θ and τ angles, respectively, from SPIDER2. More significantly, 27% of 182724 40-residue models directly constructed from predicted Cα atom-based θ and τ have similar structures to their corresponding native structures (6Å RMSD or less), which is 3% better than models built by Symbol and ψ angles. We expect the method to be useful for assisting protein structure and function prediction.Availability and implementation:The method is available as a SPIDER3 server and standalone package at http://sparks-lab.org.Contact:email@example.com or firstname.lastname@example.orgSupplementary information:Supplementary data are available at Bioinformatics online.