Graph sharpening plus graph integration: a synergy that improves protein 
functional classification

Shin, H; Lisewski, AM; Lichtarge, O

doi:10.1093/bioinformatics/btm511

Local TagsRelease HistoryDetailsSummary

Graph sharpening plus graph integration: a synergy that improves protein functional classification

Shin, H., Lisewski, A., & Lichtarge, O. (2007). Graph sharpening plus graph integration: a synergy that improves protein functional classification. Bioinformatics, 23(23), 3217-3224. doi:10.1093/bioinformatics/btm511.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-CAE7-B Version Permalink: https://hdl.handle.net/21.11116/0000-0003-B9C3-0

Genre: Journal Article

Files

show Files

Locators

show

hide

Locator:
https://academic.oup.com/bioinformatics/article/23/23/3217/291641 (Publisher version) Open Access status unknown

Description:
-

OA-Status:

Creators

show

hide

Creators:
Shin, H¹, Author
Lisewski, AM, Author
Lichtarge, O, Author

Affiliations:
1External Organizations, ou_persistent22

Content

show

hide

Free keywords: -

Abstract: Motivation: Predicting protein function is a central problem in bioinformatics, and many approaches use partially or fully automated methods based on various combination of sequence, structure and other information on proteins or genes. Such information establishes relationships between proteins that can be modelled most naturally as edges in graphs. A priori, however, it is often unclear which edges from which graph may contribute most to accurate predictions. For that reason, one established strategy is to integrate all available sources, or graphs as in graph integration, in the hope that the positive signals will add to each other. However, in the problem of functional prediction, noise, i.e. the presence of inaccurate or false edges, can still be large enough that integration alone has little effect on prediction accuracy. In order to reduce noise levels and to improve integration efficiency, we present here a recent method in graph-based learning, graph sharpening, which provides a theoretically firm yet intuitive and practical approach for disconnecting undesirable edges from protein similarity graphs. This approach has several attractive features: it is quick, scalable in the number of proteins, robust with respect to errors and tolerant of very diverse types of protein similarity measures.

Results: We tested the classification accuracy in a test set of 599 proteins with remote sequence homology spread over 20 Gene Ontology (GO) functional classes. When compared to integration alone, graph sharpening plus integration of four vastly different molecular similarity measures improved the overall classification by nearly 30% [0.17 average increase in the area under the ROC curve (AUC)]. Moreover, and partially through the increased sparsity of the graphs induced by sharpening, this gain in accuracy came at negligible computational cost: sharpening and integration took on average 4.66 (±4.44) CPU seconds.

Details

show

hide

Language(s):

Dates: Date issued: 2007-12

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: DOI: 10.1093/bioinformatics/btm511
BibTex Citekey: 5017

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Bioinformatics

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Oxford : Oxford University Press

Pages: - Volume / Issue: 23 (23) Sequence Number: - Start / End Page: 3217 - 3224 Identifier: ISSN: 1367-4803
CoNE: https://pure.mpg.de/cone/journals/resource/954926969991