English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

MPS-Authors
/persons/resource/persons128572

Söding,  J.
Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society;

External Resource
Fulltext (public)

1944220.pdf
(Publisher version), 256KB

Supplementary Material (public)

1944220_Suppl_1.pdf
(Supplementary material), 131KB

1944220_Suppl_2.pdf
(Supplementary material), 136KB

Citation

Sievers, F., Wilm, A., Dineen, D., Gibson, T. J., Karplus, K., Li, W., et al. (2011). Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology, 7: 539. doi:10.1038/msb.2011.75.


Cite as: http://hdl.handle.net/11858/00-001M-0000-0017-8C09-D
Abstract
Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high‐quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high‐quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam.