Prediction of protein functional residues from sequence by probability density estimation.

Fischer, J. D.; Mayer, C. E.; Söding, J.

doi:10.1093/bioinformatics/btm626

Journal Article

MPG Authors

Söding, J.
Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society;

External Resources

http://bioinformatics.oxfordjournals.org/content/24/5/613.full.pdf+html
(Publisher version)

Full Texts (freely accessible)

No freely accessible full texts are available in PuRe.

Supplementary Material (freely accessible)

1944237_Suppl.pdf
(Supplementary material), 3 MB

Citation

Fischer, J. D., Mayer, C. E., & Söding, J. (2008). Prediction of protein functional residues from sequence by probability density estimation. Bioinformatics, 24(5), 613-620. doi:10.1093/bioinformatics/btm626.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0017-D9B6-9

Abstract

Motivation: The prediction of ligand-binding residues or catalytically active residues of a protein may give important hints that can guide further genetic or biochemical studies. Existing sequence-based prediction methods mostly rank residue positions by evolutionary conservation calculated from a multiple sequence alignment of homologs. A problem hampering more widespread application of these methods is the low per-residue precision, which at 20% sensitivity is around 35% for ligand-binding residues and 20% for catalytic residues.

Results: We combine information from the conservation at each site, its amino acid distribution, as well as its predicted secondary structure (ss) and relative solvent accessibility (rsa). First, we measure conservation by how much the amino acid distribution at each site differs from the distribution expected for the predicted ss and rsa states. Second, we include the conservation of neighboring residues in a weighted linear score by analytically optimizing the signal-to-noise ratio of the total score. Third, we use conditional probability density estimation to calculate the probability that each site is functional given its conservation, the observed amino acid distribution, and the predicted ss and rsa states. We have constructed two large data sets, one based on the Catalytic Site Atlas and the other on PDB SITE records, to benchmark methods for predicting functional residues. The new method FRcons predicts ligand-binding and catalytic residues with higher precision than alternative methods over the entire sensitivity range, reaching 50% and 40% precision at 20% sensitivity, respectively.
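To make two ideas from the abstract concrete, the following is a minimal Python sketch, not the FRcons implementation. It assumes that "how much the amino acid distribution at each site differs from the distribution expected" can be illustrated with a Kullback-Leibler divergence (the abstract does not name the measure), and it shows one common way to read "precision at 20% sensitivity" from a ranked list of residues. The function names `relative_entropy` and `precision_at_sensitivity`, the `floor` parameter, and the toy inputs are all hypothetical.

```python
import math


def relative_entropy(site_freqs, background, floor=1e-6):
    """Kullback-Leibler divergence (in bits) of a site's amino acid
    distribution from a background distribution.

    Assumption: `background` stands in for the distribution expected for
    the site's predicted ss and rsa states; KL divergence is only one
    possible way to measure the difference.
    """
    total = 0.0
    for aa, p in site_freqs.items():
        if p > 0.0:
            # Floor the background so unseen amino acids do not divide by zero.
            q = max(background.get(aa, 0.0), floor)
            total += p * math.log2(p / q)
    return total


def precision_at_sensitivity(scores, labels, sensitivity):
    """Precision among top-ranked residues at the point where the given
    fraction of all true functional residues has been recovered.

    `labels` holds 1 for functional residues and 0 otherwise.
    """
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("labels contain no functional residues")
    needed = max(1, math.ceil(sensitivity * n_pos))
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    true_pos = 0
    for rank, (_, is_functional) in enumerate(ranked, start=1):
        true_pos += is_functional
        if true_pos >= needed:
            return true_pos / rank
    return true_pos / len(ranked)


if __name__ == "__main__":
    # Toy three-letter alphabet; real use would cover all 20 amino acids.
    site = {"H": 0.8, "D": 0.2}
    exposed_coil_background = {"H": 0.1, "D": 0.3, "L": 0.6}
    print(relative_entropy(site, exposed_coil_background))

    scores = [0.9, 0.8, 0.7, 0.6, 0.5]
    labels = [1, 0, 1, 0, 0]
    print(precision_at_sensitivity(scores, labels, 0.5))
```

Conditioning the background on predicted structural states means a residue type that is unremarkable in one context (for example, a hydrophobic residue at a buried position) contributes little to the score, while the same residue type in an unexpected context contributes more. The neighbor weighting and the conditional probability density estimation described in the abstract are not covered by this sketch.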