Free keywords:
Binding Sites/genetics
*Genetic Variation
Genome, Human
Genomics
Humans
Kruppel-Like Transcription Factors/metabolism
Molecular Sequence Annotation/*methods
Mutation
Neoplasms/*genetics
Polymorphism, Single Nucleotide
Population/genetics
RNA, Untranslated/genetics
Selection, Genetic
Abstract:
Interpreting variants, especially noncoding ones, in the increasing number of personal genomes is challenging. We used patterns of polymorphisms in functionally annotated regions in 1092 humans to identify deleterious variants; then we experimentally validated candidates. We analyzed both coding and noncoding regions, with the former corroborating the latter. We found regions particularly sensitive to mutations ("ultrasensitive") and variants that are disruptive because of mechanistic effects on transcription-factor binding (that is, "motif-breakers"). We also found variants in regions with higher network centrality tend to be deleterious. Insertions and deletions followed a similar pattern to single-nucleotide variants, with some notable exceptions (e.g., certain deletions and enhancers). On the basis of these patterns, we developed a computational tool (FunSeq), whose application to ~90 cancer genomes reveals nearly a hundred candidate noncoding drivers.
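The abstract's idea of using polymorphism patterns to flag sensitive regions can be illustrated with a minimal sketch. It assumes that an excess of rare variants in an annotation category, relative to a background category, is taken as a sign of stronger purifying selection. This is not the FunSeq implementation: the function name, the 0.5% frequency cutoff, the category labels, and the toy data are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Assumed cutoff: a variant counts as "rare" below this derived allele frequency.
RARE_DAF_THRESHOLD = 0.005


def rare_fraction_by_category(variants, threshold=RARE_DAF_THRESHOLD):
    """Return the fraction of rare variants for each annotation category.

    `variants` is an iterable of (category, derived_allele_frequency) pairs.
    """
    total = defaultdict(int)
    rare = defaultdict(int)
    for category, daf in variants:
        total[category] += 1
        if daf < threshold:
            rare[category] += 1
    return {category: rare[category] / total[category] for category in total}


# Hypothetical toy data, not taken from the study.
toy_variants = [
    ("coding", 0.001), ("coding", 0.002), ("coding", 0.004), ("coding", 0.20),
    ("enhancer", 0.001), ("enhancer", 0.003), ("enhancer", 0.10), ("enhancer", 0.30),
    ("background", 0.001), ("background", 0.05), ("background", 0.25), ("background", 0.40),
]

fractions = rare_fraction_by_category(toy_variants)
for category, fraction in sorted(fractions.items(), key=lambda item: -item[1]):
    print(f"{category}: {fraction:.2f}")
```

In this toy input, the category with the highest rare-variant fraction would be read as the most constrained. A real analysis would also need to account for sample size, mutation-rate differences, and statistical significance.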