Journal Article

Four high-quality draft genome assemblies of the marine heterotrophic nanoflagellate Cafeteria roenbergensis

MPS-Authors
Hackl, Thomas
Department of Biomolecular Mechanisms, Max Planck Institute for Medical Research, Max Planck Society

Barenhoff, Karina
Department of Biomolecular Mechanisms, Max Planck Institute for Medical Research, Max Planck Society

Duponchel, Sarah
Department of Biomolecular Mechanisms, Max Planck Institute for Medical Research, Max Planck Society

Fischer, Matthias G.
Department of Biomolecular Mechanisms, Max Planck Institute for Medical Research, Max Planck Society

Citation

Hackl, T., Martin, R., Barenhoff, K., Duponchel, S., Heider, D., & Fischer, M. G. (2020). Four high-quality draft genome assemblies of the marine heterotrophic nanoflagellate Cafeteria roenbergensis. Scientific Data, 7: 29, pp. 1-9. doi:10.1038/s41597-020-0363-4.


Cite as: https://hdl.handle.net/21.11116/0000-0005-8542-A
Abstract
The heterotrophic stramenopile Cafeteria roenbergensis is a globally distributed marine bacterivorous protist. This unicellular flagellate is host to the giant DNA virus CroV and the virophage mavirus. We sequenced the genomes of four cultured C. roenbergensis strains and generated 23.53 Gb of Illumina MiSeq data (99–282× coverage per strain) and 5.09 Gb of PacBio RSII data (13–45× coverage). Using the Canu assembler and customized curation procedures, we obtained high-quality draft genome assemblies with a total length of 34–36 Mbp per strain and contig N50 lengths of 148 kbp to 464 kbp. The C. roenbergensis genome has a GC content of ~70%, a repeat content of ~28%, and is predicted to contain approximately 7857–8483 protein-coding genes based on a combination of de novo, homology-based and transcriptome-supported annotation. These first high-quality genome assemblies of a bicosoecid fill an important gap in sequenced stramenopile representatives and enable a more detailed evolutionary analysis of heterotrophic protists.
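
The abstract reports three summary statistics (sequencing coverage, contig N50 and GC content) without defining them. The Python sketch below shows how such numbers are conventionally computed. It is not the authors' pipeline: the function names and the toy inputs are hypothetical and chosen only for illustration, and the definitions used are the standard ones (coverage as sequenced bases divided by genome size, N50 as the contig length at which the cumulative length of the longest contigs reaches half the assembly, GC content as the fraction of G and C among unambiguous bases).

```python
def mean_coverage(total_bases, genome_size):
    """Average sequencing depth: sequenced bases divided by genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases / genome_size


def n50(contig_lengths):
    """Contig N50: walking contigs from longest to shortest, the length of
    the contig at which the running total first reaches half the assembly."""
    ordered = sorted(contig_lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("contig_lengths must not be empty")


def gc_content(sequence):
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return gc / acgt


if __name__ == "__main__":
    # Toy inputs (hypothetical, not data from the paper).
    toy_contigs = [2, 3, 4, 5, 6, 10]
    print(n50(toy_contigs))                # contig N50 of the toy assembly
    print(gc_content("GGCCAT"))            # GC fraction of a toy sequence
    print(mean_coverage(3.5e9, 35e6))      # depth for 3.5 Gb over a 35 Mbp genome
```

Under these definitions, a strain sequenced to 3.5 Gb with a roughly 35 Mbp genome would have about 100× coverage, which is the same order as the per-strain Illumina figures quoted in the abstract.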