English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Using interpretable machine learning to understand gene silencing dynamics during X-chromosome inactivation

Barros de Andrade e Sousa, L. (2020). Using interpretable machine learning to understand gene silencing dynamics during X-chromosome inactivation. PhD Thesis. doi:10.17169/refubium-28944.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Barros de Andrade e Sousa, Lisa1, 2, Author           
Marsico, Annalisa3, Referee           
Affiliations:
1Regulatory Networks in Stem Cells (Edda G. Schulz), Independent Junior Research Groups (OWL), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_2117286              
2Fachbereich Mathematik und Informatik der Freien Universität Berlin, ou_persistent22              
3RNA Bioinformatics (Annalisa Marsico), Independent Junior Research Groups (OWL), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_2117285              

Content

show
hide
Free keywords: X-Chromosome Inactivation Interpretable Machine Learning Explainable AI Random Forest Forest-guided Clustering
 Abstract: To equalize gene dosage between sexes, the long non-coding RNA Xist mediates chromosome-wide gene silencing of one X Chromosome in female mammals - a process known as X chromosome inactivation (XCI). The efficiency of gene silencing is highly variable across genes, with some genes even escaping XCI in somatic cells. A gene’s susceptibility to Xist-mediated silencing appears to be determined by a complex interplay of epigenetic and genomic features. However, the underlying rules remain poorly understood. To advance the understanding of Xist-mediated silencing pathways, chromosome-wide gene silencing dynamics at the level of nascent transcriptome were quantified using allele-specific Precision nuclear Run-On sequencing. We have developed a Random Forest machine learning model that is able to predict the measured silencing dynamics based on a large set of epigenetic and genomic features and tested its predictive power experimentally. We introduced a forest-guided clustering approach to uncover the combinatorial rules that control Xist-mediated gene silencing. Results suggest that the genomic distance to the Xist locus, followed by gene density and distance to LINE elements are the prime determinants of silencing velocity. Moreover, a series of features associated with active transcriptional elongation and chromatin 3D structure are enriched at efficiently silenced genes. Generally, silenced genes seem to be separated into two distinct groups, associated with different silencing pathways: one group that requires an AT-rich sequence context and the Xist repeat-A for silencing, which is known to activate the SPEN pathway, and another group where genes are pre-marked by polycomb complexes and tend to rely on the repeat-B in Xist for silencing, known to recruit polycomb complexes during XCI. Our machine learning approach can thus uncover the complex combinatorial rules underlying gene silencing during X chromosome inactivation.

Details

show
hide
Language(s): eng - English
 Dates: 20202021-08-12
 Publication Status: Published online
 Pages: vii, 159 S.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Degree: PhD

Event

show

Legal Case

show

Project information

show

Source

show