  Sequence context-specific profiles for homology searching.

Biegert, A., & Söding, J. (2009). Sequence context-specific profiles for homology searching. Proceedings of the National Academy of Sciences of the United States of America, 106(10), 3770-3775. doi:10.1073/pnas.0810767106.

Files

1944232.pdf (Publisher version), 994KB
Name: 1944232.pdf
Visibility: Public
MIME-Type: application/pdf

1944232_Suppl.pdf (Supplementary material), 563KB
Name: 1944232_Suppl.pdf
Visibility: Public
MIME-Type: application/pdf

Creators

Creators:
Biegert, A. [1], Author
Söding, J. [2], Author
Affiliations:
[1] external, ou_persistent22
[2] Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society, ou_1933286

Content

Free keywords: -
 Abstract: Sequence alignment and database searching are essential tools in biology because a protein's function can often be inferred from homologous proteins. Standard sequence comparison methods use substitution matrices to find the alignment with the best sum of similarity scores between aligned residues. These similarity scores do not take the local sequence context into account. Here, we present an approach that derives context-specific amino acid similarities from short windows centered on each query sequence residue. Our results demonstrate that the sequence context contains much more information about the expected mutations than just the residue itself. By employing our context-specific similarities (CS-BLAST) in combination with NCBI BLAST, we increase the sensitivity more than 2-fold on a difficult benchmark set, without loss of speed. Alignment quality is likewise improved significantly. Furthermore, we demonstrate considerable improvements when applying this paradigm to sequence profiles: Two iterations of CSI-BLAST, our context-specific version of PSI-BLAST, are more sensitive than 5 iterations of PSI-BLAST. The paradigm for biological sequence comparison presented here is very general. It can replace substitution matrices in sequence- and profile-based alignment and search methods for both protein and nucleotide sequences.
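
The idea of deriving context-specific similarities from a short window around each query residue can be illustrated with a minimal toy sketch. It is not the authors' implementation: the function names, the three-residue window, the two hand-made context profiles, and the equal priors are all assumptions chosen for illustration. The sketch scores the window around a residue against each context profile, turns the scores into posterior weights, and mixes the profiles' central columns into one expected-mutation distribution for that position.

```python
import math

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def make_profile(favored, boost=5.0):
    """Build a toy context profile (hypothetical helper).

    `favored` is a list with one string per window column; every residue
    named in a column's string is up-weighted in that column.
    """
    profile = []
    for column in favored:
        weights = {a: 1.0 for a in ALPHABET}
        for residue in column:
            weights[residue] += boost
        total = sum(weights.values())
        profile.append({a: w / total for a, w in weights.items()})
    return profile


def context_specific_distribution(seq, i, library, priors):
    """Mix the central columns of all profiles, weighted by how well
    each profile matches the window centered on seq[i]."""
    window = len(library[0])
    half = window // 2
    log_scores = []
    for profile, prior in zip(library, priors):
        log_p = math.log(prior)
        for j in range(window):
            pos = i - half + j
            if 0 <= pos < len(seq):  # skip columns that fall off the sequence ends
                log_p += math.log(profile[j][seq[pos]])
        log_scores.append(log_p)
    # Normalize in log space to obtain posterior weights over profiles.
    top = max(log_scores)
    unnorm = [math.exp(s - top) for s in log_scores]
    z = sum(unnorm)
    posteriors = [u / z for u in unnorm]
    mixed = {a: 0.0 for a in ALPHABET}
    for profile, weight in zip(library, posteriors):
        for a in ALPHABET:
            mixed[a] += weight * profile[half][a]
    return mixed, posteriors


# Two toy contexts that both contain a central alanine but expect
# different substitutions for it (S in the first, E in the second).
library = [
    make_profile(["G", "AS", "L"]),
    make_profile(["K", "AE", "E"]),
]
priors = [0.5, 0.5]

dist_gal, post_gal = context_specific_distribution("GAL", 1, library, priors)
dist_kae, post_kae = context_specific_distribution("KAE", 1, library, priors)
print(round(dist_gal["S"], 3), round(dist_gal["E"], 3))
print(round(dist_kae["S"], 3), round(dist_kae["E"], 3))
```

In this toy setting the same central residue (A) receives different expected-mutation distributions depending on its neighbors, which a single substitution matrix cannot express because it scores each residue independently of its context.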

Details

Language(s): eng - English
Dates: 2009-03-10
Publication Status: Issued
Rev. Type: Peer
Identifiers: DOI: 10.1073/pnas.0810767106

Source 1

Title: Proceedings of the National Academy of Sciences of the United States of America
Source Genre: Journal
Publ. Info: National Academy of Sciences
Volume / Issue: 106 (10)
Start / End Page: 3770 - 3775
Identifier: ISSN: 0027-8424
CoNE: https://pure.mpg.de/cone/journals/resource/954925427230