
Journal Article

Application of tetranucleotide frequencies for the assignment of genomic fragments

MPS-Authors
Teeling,  H.
Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

Meyerdierks,  A.
Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

Bauer,  M.
Microbial Genomics Group, Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

Amann,  R.
Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

Glöckner,  F. O.
Microbial Genomics Group, Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

Citation

Teeling, H., Meyerdierks, A., Bauer, M., Amann, R., & Glöckner, F. O. (2004). Application of tetranucleotide frequencies for the assignment of genomic fragments. Environmental Microbiology, 6(9), 938-947.


Cite as: http://hdl.handle.net/21.11116/0000-0001-D0FC-8
Abstract
A basic problem of the metagenomic approach in microbial ecology is the assignment of genomic fragments to a certain species or taxonomic group, when suitable marker genes are absent. Currently, the (G + C)-content together with phylogenetic information and codon adaptation for functional genes is mostly used to assess the relationship of different fragments. These methods, however, can produce ambiguous results. In order to evaluate sequence-based methods for fragment identification, we extensively compared (G + C)-contents and tetranucleotide usage patterns of 9054 fosmid-sized genomic fragments generated in silico from 118 completely sequenced bacterial genomes (40 982 931 fragment pairs were compared in total). The results of this systematic study show that the discriminatory power of correlations of tetranucleotide-derived z-scores is by far superior to that of differences in (G + C)-content and provides reasonable assignment probabilities when applied to metagenome libraries of small diversity. Using six fully sequenced fosmid inserts from a metagenomic analysis of microbial consortia mediating the anaerobic oxidation of methane (AOM), we demonstrate that discrimination based on tetranucleotide-derived z-score correlations was consistent with corresponding data from 16S ribosomal RNA sequence analysis and allowed us to discriminate between fosmid inserts that were indistinguishable with respect to their (G + C)-contents.
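
The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the z-scores are derived, so the sketch assumes a common choice: the expected count of each tetranucleotide is estimated from its overlapping tri- and dinucleotide counts (a maximal-order Markov model), with the matching variance approximation. It also assumes single-strand counting and Pearson correlation. The function names gc_content, kmer_counts, tetra_zscores and pearson are hypothetical.

```python
from itertools import product
from math import sqrt

BASES = "ACGT"


def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of seq."""
    seq = seq.upper()
    gc = sum(1 for b in seq if b in "GC")
    acgt = sum(1 for b in seq if b in BASES)
    return gc / acgt if acgt else 0.0


def kmer_counts(seq, k):
    """Count overlapping k-mers, skipping any that contain a non-ACGT symbol."""
    counts = {}
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if all(b in BASES for b in word):
            counts[word] = counts.get(word, 0) + 1
    return counts


def tetra_zscores(seq):
    """Return 256 z-scores, one per tetranucleotide, in a fixed order.

    Assumption (not stated in the abstract): for a word w1w2w3w4,
    expected = N(w1w2w3) * N(w2w3w4) / N(w2w3), and
    variance = expected * (N(w2w3) - N(w1w2w3)) * (N(w2w3) - N(w2w3w4)) / N(w2w3)^2.
    """
    seq = seq.upper()
    di, tri, tetra = (kmer_counts(seq, k) for k in (2, 3, 4))
    scores = []
    for word in map("".join, product(BASES, repeat=4)):
        n_mid = di.get(word[1:3], 0)
        n_left = tri.get(word[:3], 0)
        n_right = tri.get(word[1:], 0)
        if n_mid == 0:
            scores.append(0.0)  # word cannot be evaluated on this fragment
            continue
        expected = n_left * n_right / n_mid
        variance = expected * (n_mid - n_left) * (n_mid - n_right) / n_mid ** 2
        observed = tetra.get(word, 0)
        scores.append((observed - expected) / sqrt(variance) if variance > 0 else 0.0)
    return scores


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # correlation is undefined for a constant vector
    return cov / sqrt(var_x * var_y)
```

Under these assumptions, two fragments would be compared by calling pearson(tetra_zscores(a), tetra_zscores(b)) and, for reference, abs(gc_content(a) - gc_content(b)). A higher correlation would suggest more similar tetranucleotide usage. Any threshold for assigning fragments to the same organism would have to come from the study itself and is not given in the abstract.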