English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Detecting polymorphic regions in Arabidopsis thaliana with resequencing microarrays

Zeller, G., Clark, R., Schneeberger, K., Bohlen, A., Weigel, D., & Rätsch, G. (2008). Detecting polymorphic regions in Arabidopsis thaliana with resequencing microarrays. Genome Research, 18(6), 918-929. doi:10.1101/gr.070169.107.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Zeller, G1, Author           
Clark, RM, Author           
Schneeberger, K, Author           
Bohlen, A1, Author           
Weigel, D, Author           
Rätsch, G1, Author           
Affiliations:
1Rätsch Group, Friedrich Miescher Laboratory, Max Planck Society, ou_3378052              

Content

show
hide
Free keywords: -
 Abstract: Whole-genome oligonucleotide resequencing arrays have allowed the comprehensive discovery of single nucleotide polymorphisms (SNPs) in eukaryotic genomes of moderate to large size. With this technology, the detection rate for isolated SNPs is typically high. However, it is greatly reduced when other polymorphisms are located near a SNP as multiple mismatches inhibit hybridization to arrayed oligonucleotides. Contiguous tracts of suppressed hybridization therefore typify polymorphic regions (PRs) such as clusters of SNPs or deletions. We developed a machine learning method, designated margin-based prediction of polymorphic regions (mPPR), to predict PRs from resequencing array data. Conceptually similar to hidden Markov models, the method is trained with discriminative learning techniques related to support vector machines, and accurately identifies even very short polymorphic tracts (<10 bp). We applied this method to resequencing array data previously generated for the euchromatic genomes of 20 strains (accessions) of the best-characterized plant, Arabidopsis thaliana. Nonredundantly, 27% of the genome was included within the boundaries of PRs predicted at high specificity ( approximately 97%). The resulting data set provides a fine-scale view of polymorphic sequences in A. thaliana; patterns of polymorphism not apparent in SNP data were readily detected, especially for noncoding regions. Our predictions provide a valuable resource for evolutionary genetic and functional studies in A. thaliana, and our method is applicable to similar data sets in other species. More broadly, our computational approach can be applied to other segmentation tasks related to the analysis of genomic variation.

Details

show
hide
Language(s):
 Dates: 2008-06
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1101/gr.070169.107
PMID: 18323538
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Genome Research
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Cold Spring Harbor, N.Y. : Cold Spring Harbor Laboratory Press
Pages: - Volume / Issue: 18 (6) Sequence Number: - Start / End Page: 918 - 929 Identifier: ISSN: 1088-9051
CoNE: https://pure.mpg.de/cone/journals/resource/954926997202