English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Paper

Partially Blind Domain Adaptation for Age Prediction from DNA Methylation Data

MPS-Authors
/persons/resource/persons202325

Handl,  Lisa
Computational Biology and Applied Algorithmics, MPI for Informatics, Max Planck Society;

/persons/resource/persons79293

Jalali,  Adrin
Computational Biology and Applied Algorithmics, MPI for Informatics, Max Planck Society;

/persons/resource/persons199763

Scherer,  Michael
Computational Biology and Applied Algorithmics, MPI for Informatics, Max Planck Society;

/persons/resource/persons98170

Pfeifer,  Nico
Computational Biology and Applied Algorithmics, MPI for Informatics, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

arXiv:1612.06650.pdf
(Preprint), 73KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Handl, L., Jalali, A., Scherer, M., & Pfeifer, N. (2016). Partially Blind Domain Adaptation for Age Prediction from DNA Methylation Data. Retrieved from http://arxiv.org/abs/1612.06650.


Cite as: https://hdl.handle.net/11858/00-001M-0000-002C-4CDD-3
Abstract
Over the last years, huge resources of biological and medical data have become available for research. This data offers great chances for machine learning applications in health care, e.g. for precision medicine, but is also challenging to analyze. Typical challenges include a large number of possibly correlated features and heterogeneity in the data. One flourishing field of biological research in which this is relevant is epigenetics. Here, especially large amounts of DNA methylation data have emerged. This epigenetic mark has been used to predict a donor's 'epigenetic age' and increased epigenetic aging has been linked to lifestyle and disease history. In this paper we propose an adaptive model which performs feature selection for each test sample individually based on the distribution of the input data. The method can be seen as partially blind domain adaptation. We apply the model to the problem of age prediction based on DNA methylation data from a variety of tissues, and compare it to a standard model, which does not take heterogeneity into account. The standard approach has particularly bad performance on one tissue type on which we show substantial improvement with our new adaptive approach even though no samples of that tissue were part of the training data.