Help Privacy Policy Disclaimer
  Advanced SearchBrowse




Journal Article

CCMpred-fast and precise prediction of protein residue-residue contacts from correlated mutations.


Söding,  J.
Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society;

Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

(Publisher version), 143KB

Supplementary Material (public)

(Supplementary material), 499KB


Seemayer, S., Gruber, M., & Söding, J. (2014). CCMpred-fast and precise prediction of protein residue-residue contacts from correlated mutations. Bioinformatics, 30(21), 3128-3130. doi:10.1093/bioinformatics/btu500.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0024-4702-B
Motivation: Recent breakthroughs in protein residue-residue contact prediction have made reliable de novo prediction of protein structures possible. The key was to apply statistical methods that can distinguish direct couplings between pairs of columns in a multiple sequence alignment from merely correlated pairs, i.e. to separate direct from indirect effects. Two classes of such methods exist, either relying on regularized inversion of the covariance matrix or on pseudo-likelihood maximization (PLM). Although PLM-based methods offer clearly higher precision, available tools are not sufficiently optimized and are written in interpreted languages that introduce additional overheads. This impedes the runtime and large-scale contact prediction for larger protein families, multi-domain proteins and protein-protein interactions. Results: Here we introduce CCMpred, our performance-optimized PLM implementation in C and CUDA C. Using graphics cards in the price range of current six-core processors, CCMpred can predict contacts for typical alignments 35-113 times faster and with the same precision as the most accurate published methods. For users without a CUDA-capable graphics card, CCMpred can also run in a CPU mode that is still 4-14 times faster. Thanks to our speed-ups (http://dictionary.cambridge.org/dictionary/british/speed-up) contacts for typical protein families can be predicted in 15-60s on a consumer-grade GPU and 1-6min on a six-core CPU.