This title appears in the Scientific Report :
2020
Please use the identifier:
http://dx.doi.org/10.1021/acs.jctc.9b00825 in citations.
Please use the identifier: http://hdl.handle.net/2128/24531 in citations.
TopModel: Template-based protein structure prediction at low sequence identity using top-down consensus and deep neural networks
TopModel: Template-based protein structure prediction at low sequence identity using top-down consensus and deep neural networks
Knowledge of protein structures is essential to understand the proteins’ functions, evolution, dynamics, stabilities, interactions, and for data-driven protein- or drug-design. Yet, experimental structure determination rates are far exceeded by that of next-generation sequencing. Computational struc...
Saved in:
Personal Name(s): | Mulnaes, Daniel |
---|---|
Porta, Nicola / Clemens, Rebecca / Apanasenko, Irina / Reiners, Jens / Gremer, Lothar / Neudecker, Philipp / Smits, Sander H. J. / Gohlke, Holger (Corresponding author) | |
Contributing Institute: |
Strukturbiochemie; ICS-6 John von Neumann - Institut für Computing; NIC Jülich Supercomputing Center; JSC |
Published in: | Journal of chemical theory and computation, 16 (2020) 3, S. 1953-1967 |
Imprint: |
Washington, DC
2020
|
DOI: |
10.1021/acs.jctc.9b00825 |
PubMed ID: |
31967823 |
Document Type: |
Journal Article |
Research Program: |
Forschergruppe Gohlke Functional Macromolecules and Complexes Computational Science and Mathematical Methods |
Link: |
Published on 2020-01-22. Available in OpenAccess from 2021-01-22. Restricted Restricted Published on 2020-01-22. Available in OpenAccess from 2021-01-22. Restricted Restricted Restricted Restricted |
Publikationsportal JuSER |
Please use the identifier: http://hdl.handle.net/2128/24531 in citations.
Knowledge of protein structures is essential to understand the proteins’ functions, evolution, dynamics, stabilities, interactions, and for data-driven protein- or drug-design. Yet, experimental structure determination rates are far exceeded by that of next-generation sequencing. Computational structure prediction seeks to alleviate this problem, and the Critical Assessment of protein Structure Prediction (CASP) has shown the value of consensus- and meta-methods that utilize complementary algorithms. However, traditionally, such methods employ majority voting during template selection and model averaging during refinement, which can drive the model away from the native fold if it is underrepresented in the ensemble. Here, we present TopModel, a fully automated meta-method for protein structure prediction. In contrast to traditional consensus- and meta-methods, TopModel uses top-down consensus and deep neural networks to select templates and identify and correct wrongly modeled regions. TopModel combines a broad range of state-of-the-art methods for threading, alignment and model quality estimation and provides a versatile work-flow and toolbox for template-based structure prediction. TopModel shows a superior template selection, alignment accuracy, and model quality for template-based structure prediction on the CASP10-12 datasets. TopModel was validated by prospective predictions of the nisin resistance protein NSR protein from S. agalactiae and LipoP from C. difficile, showing far better agreement with experimental data than any of its constituent primary predictors. These results, in general, demonstrate the utility of TopModel for protein structure prediction and, in particular, show how combining computational structure prediction with sparse or low-resolution experimental data can improve the final model. |