  efam: an expanded, metaproteome-supported HMM profile database of viral protein families

Zayed, A. A., Luecking, D., Mohssen, M., Cronin, D., Bolduc, B., et al. (2021). efam: an expanded, metaproteome-supported HMM profile database of viral protein families. Bioinformatics, 37(22), 4202-4208. doi:10.1093/bioinformatics/btab451.

Basic data

Genre: Journal article

Files

btab451.pdf (publisher version), 652 KB
Name:
btab451.pdf
Description:
-
OA status:
Visibility:
Public
MIME type / checksum:
application/pdf / [MD5]
Technical metadata:
Copyright date:
-
Copyright info:
-
License:
-

Creators

Creators:
Zayed, Ahmed A. (1), Author
Luecking, Dominik (1), Author
Lücking, Dominik (2), Author
Mohssen, Mohamed (1), Author
Cronin, Dylan (1), Author
Bolduc, Ben (1), Author
Gregory, Ann C. (1), Author
Hargreaves, Katherine R. (1), Author
Piehowski, Paul D. (1), Author
White, Richard A., III (1), Author
Huang, Eric L. (1), Author
Adkins, Joshua N. (1), Author
Roux, Simon (1), Author
Moraru, Cristina (1), Author
Sullivan, Matthew B. (1), Author
Affiliations:
(1) external, ou_persistent22
(2) Max Planck Institute for Marine Microbiology, Max Planck Society, ou_2481692

Content

Keywords: -
Abstract: Motivation: Viruses infect, reprogram and kill microbes, leading to profound ecosystem consequences, from elemental cycling in oceans and soils to microbiome-modulated diseases in plants and animals. Although metagenomic datasets are increasingly available, identifying viruses in them is challenging due to poor representation and annotation of viral sequences in databases.
Results: Here, we establish efam, an expanded collection of Hidden Markov Model (HMM) profiles that represent viral protein families conservatively identified from the Global Ocean Virome 2.0 dataset. This resulted in 240 311 HMM profiles, each with at least 2 protein sequences, making efam >7-fold larger than the next largest, pan-ecosystem viral HMM profile database. Adjusting the criteria for viral contig confidence from 'conservative' to 'eXtremely Conservative' resulted in 37 841 HMM profiles in our efam-XC database. To assess the value of this resource, we integrated efam-XC into VirSorter viral discovery software to discover viruses from less-studied, ecologically distinct oxygen minimum zone (OMZ) marine habitats. This expanded database led to an increase in viruses recovered from every tested OMZ virome by ~24% on average (up to ~42%) and especially improved the recovery of often-missed shorter contigs (<5 kb). Additionally, to help elucidate lesser-known viral protein functions, we annotated the profiles using multiple databases from the DRAM pipeline and virion-associated metaproteomic data, which doubled the number of annotations obtainable by standard, single-database annotation approaches. Together, these marine resources (efam and efam-XC) are provided as searchable, compressed HMM databases that will be updated bi-annually to help maximize viral sequence discovery and study from any ecosystem.

Details

Language(s): eng - English
Date: 2021-11-15; 2021
Publication status: Published
Pages: -
Place, publisher, edition: -
Table of contents: -
Review type: -
Identifiers: ISI: 000733835900026
DOI: 10.1093/bioinformatics/btab451
Degree: -

Source 1

Title: BIOINFORMATICS
Source genre: Journal
Creators:
Affiliations:
Place, publisher, edition: -
Pages: -
Volume / Issue: 37 (22)
Article number: -
Start / end page: 4202 - 4208
Identifier: ISSN: 1367-4803