English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Data mining, neural nets, trees–problems 2 and 3 of Genetic Analysis Workshop 15

Ziegler, A., DeStefano, A. L., König, I. R., Bardel, C., Brinza, D., Bull, S., et al. (2007). Data mining, neural nets, trees–problems 2 and 3 of Genetic Analysis Workshop 15. Genetic Epidemiology, 31(Suppl 1), S51-S60. doi:10.1002/gepi.20280.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Ziegler, Andreas, Author
DeStefano, Anita L., Author
König, Inke R., Author
Bardel, Claire, Author
Brinza, Dumitru, Author
Bull, Shelley, Author
Cai, Zhaohui, Author
Glaser, Beate1, Author
Jiang, Wei, Author
Lee, Kristine E., Author
Li, Chuang Xing, Author
Li, Jing, Author
Li, Xin, Author
Majoram, Paul, Author
Meng, Yan, Author
Nicodemus, Kristin K., Author
Platt, Alexander, Author
Schwarz, Daniel F., Author
Shi, Weilang, Author
Shugart, Yin Yao, Author
Stassen, Hans H., AuthorSun, Yan V., AuthorWon, Sungho, AuthorWang, Wenyi, AuthorWahba, Grace, AuthorZagaar, Usumah A., AuthorZhao, Zhenming, Author more..
Affiliations:
1External Organizations, ou_persistent22              

Content

show
hide
Free keywords: -
 Abstract: Genome-wide association studies using thousands to hundreds of thousands of single nucleotide polymorphism (SNP) markers and region-wide association studies using a dense panel of SNPs are already in use to identify disease susceptibility genes and to predict disease risk in individuals. Because these tasks become increasingly important, three different data sets were provided for the Genetic Analysis Workshop 15, thus allowing examination of various novel and existing data mining methods for both classification and identification of disease susceptibility genes, gene by gene or gene by environment interaction. The approach most often applied in this presentation group was random forests because of its simplicity, elegance, and robustness. It was used for prediction and for screening for interesting SNPs in a first step. The logistic tree with unbiased selection approach appeared to be an interesting alternative to efficiently select interesting SNPs. Machine learning, specifically ensemble methods, might be useful as pre-screening tools for large-scale association studies because they can be less prone to overfitting, can be less computer processor time intensive, can easily include pair-wise and higher-order interactions compared with standard statistical approaches and can also have a high capability for classification. However, improved implementations that are able to deal with hundreds of thousands of SNPs at a time are required.

Details

show
hide
Language(s): eng - English
 Dates: 2007
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1002/gepi.20280
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Genetic Epidemiology
  Other : Genetic Epidemiol.
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: New York, N.Y. : Wiley-Liss, Inc.
Pages: - Volume / Issue: 31 (Suppl 1) Sequence Number: - Start / End Page: S51 - S60 Identifier: ISSN: 0741-0395
CoNE: https://pure.mpg.de/cone/journals/resource/954925539161