English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Meeting Abstract

PiPred: a deep-learning method for prediction of pi-helices in protein sequences

MPS-Authors
/persons/resource/persons271574

Alva,  V       
Protein Bioinformatics Group, Department Protein Evolution, Max Planck Institute for Developmental Biology, Max Planck Society;
Department Protein Evolution, Max Planck Institute for Developmental Biology, Max Planck Society;

/persons/resource/persons275270

Dunin-Horkawicz,  S       
Department Protein Evolution, Max Planck Institute for Developmental Biology, Max Planck Society;

External Resource
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Ludwiczak, J., Wiński, A., Marinho da Silva Neto, A., Szczepaniak, K., Alva, V., & Dunin-Horkawicz, S. (2019). PiPred: a deep-learning method for prediction of pi-helices in protein sequences. In BioInformatics in Torun 2019 - BIT19 (pp. 61).


Cite as: https://hdl.handle.net/21.11116/0000-000E-528C-8
Abstract
pi-helices are short and unstable protein secondary structure motifs. They are present in 15% of all known protein structures, often in functionally important regions such as ligand-binding sites. Thus, the correct prediction of pi-helices is essential for the detection of potential functional sites and can aid design tasks aiming at the creation of ligand-binding pockets. Due to the similarity of pi-helices to much more abundant alpha-helices, it is a challenging task to predict them based on the sequence data and there are no methods devoted to this problem. We present a deep learning neural network trained with sequences where pi-helical residues were assigned based on high-quality X-ray structures [1]. The model achieved 48% precision and 46% sensitivity in per-residue prediction on a test set. Moreover, in the benchmark on commonly used datasets like CB6133, CB513, CASP10, and CASP11 the model outperforms other state-of-the-art methods used for secondary structure prediction.