English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Sfaira accelerates data and model reuse in single cell genomics

MPS-Authors
/persons/resource/persons262330

Dony,  Leander
Dept. Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Max Planck Society;
IMPRS Translational Psychiatry, Max Planck Institute of Psychiatry, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Fischer, D. S., Dony, L., Konig, M., Moeed, A., Zappia, L., Heumos, L., et al. (2021). Sfaira accelerates data and model reuse in single cell genomics. GENOME BIOLOGY, 22(1): 248. doi:10.1186/s13059-021-02452-6.


Cite as: https://hdl.handle.net/21.11116/0000-0009-2D05-0
Abstract
Single-cell RNA-seq datasets are often first analyzed independently without harnessing model fits from previous studies, and are then contextualized with public data sets, requiring time-consuming data wrangling. We address these issues with sfaira, a single-cell data zoo for public data sets paired with a model zoo for executable pre-trained models. The data zoo is designed to facilitate contribution of data sets using ontologies for metadata. We propose an adaption of cross-entropy loss for cell type classification tailored to datasets annotated at different levels of coarseness. We demonstrate the utility of sfaira by training models across anatomic data partitions on 8 million cells.