English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Multivariate Cutoff Level Analysis (MultiCoLA) of large community data sets

Gobet, A., Quince, C., & Ramette, A. (2010). Multivariate Cutoff Level Analysis (MultiCoLA) of large community data sets. Nucleic Acids Research, 38(15), e155-1-e155-9.

Item is

Files

show Files
hide Files
:
Gobet10.pdf (Publisher version), 831KB
Name:
Gobet10.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Gobet, A.1, Author           
Quince, C., Author
Ramette, A.2, Author           
Affiliations:
1Microbial Habitat Group, Max Planck Institute for Marine Microbiology, Max Planck Society, ou_2481709              
2HGF MPG Joint Research Group for Deep Sea Ecology & Technology, Max Planck Institute for Marine Microbiology, Max Planck Society, ou_2481702              

Content

show
hide
Free keywords: -
 Abstract: High-throughput sequencing techniques are becoming attractive to molecular biologists and ecologists as they provide a time- and cost-effective way to explore diversity patterns in environmental samples at an unprecedented resolution. An issue common to many studies is the definition of what fractions of a data set should be considered as rare or dominant. Yet this question has neither been satisfactorily addressed, nor is the impact of such definition on data set structure and interpretation been fully evaluated. Here we propose a strategy, MultiCoLA (Multivariate Cutoff Level Analysis), to systematically assess the impact of various abundance or rarity cutoff levels on the resulting data set structure and on the consistency of the further ecological interpretation. We applied MultiCoLA to a 454 massively parallel tag sequencing data set of V6 ribosomal sequences from marine microbes in temperate coastal sands. Consistent ecological patterns were maintained after removing up to 35-40% rare sequences and similar patterns of beta diversity were observed after denoising the data set by using a preclustering algorithm of 454 flowgrams. This example validates the importance of exploring the impact of the definition of rarity in large community data sets. Future applications can be foreseen for data sets from different types of habitats, e.g. other marine environments, soil and human microbiota.

Details

show
hide
Language(s): eng - English
 Dates: 2010-06-14
 Publication Status: Issued
 Pages: 9
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: eDoc: 534637
ISI: 000281345900004
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Nucleic Acids Research
  Other : Nucleic Acids Res.
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 38 (15) Sequence Number: - Start / End Page: e155-1 - e155-9 Identifier: ISSN: 0301-5610
CoNE: https://pure.mpg.de/cone/journals/resource/1000000000262810