English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Bayesian multiple logistic regression for case-control GWAS.

Banerjee, S., Zeng, L., Schunkert, H., & Söding, J. (2018). Bayesian multiple logistic regression for case-control GWAS. PLoS Genetics, 14(12): e1007856.. doi:10.1371/journal.pgen.1007856.

Item is

Basic

show hide
Item Permalink: http://hdl.handle.net/21.11116/0000-0002-B9E1-F Version Permalink: http://hdl.handle.net/21.11116/0000-0003-2A7C-4
Genre: Journal Article

Files

show Files
hide Files
:
3017184.pdf (Publisher version), 4MB
Name:
3017184.pdf
Description:
-
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Banerjee, S.1, Author              
Zeng, L., Author
Schunkert, H., Author
Söding, J.1, Author              
Affiliations:
1Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society, ou_1933286              

Content

show
hide
Free keywords: -
 Abstract: Genetic variants in genome-wide association studies (GWAS) are tested for disease association mostly using simple regression, one variant at a time. Standard approaches to improve power in detecting disease-associated SNPs use multiple regression with Bayesian variable selection in which a sparsity-enforcing prior on effect sizes is used to avoid overtraining and all effect sizes are integrated out for posterior inference. For binary traits, the logistic model has not yielded clear improvements over the linear model. For multi-SNP analysis, the logistic model required costly and technically challenging MCMC sampling to perform the integration. Here, we introduce the quasi-Laplace approximation to solve the integral and avoid MCMC sampling. We expect the logistic model to perform much better than multiple linear regression except when predicted disease risks are spread closely around 0.5, because only close to its inflection point can the logistic function be well approximated by a linear function. Indeed, in extensive benchmarks with simulated phenotypes and real genotypes, our Bayesian multiple LOgistic REgression method (B-LORE) showed considerable improvements (1) when regressing on many variants in multiple loci at heritabilities ≥ 0.4 and (2) for unbalanced case-control ratios. B-LORE also enables meta-analysis by approximating the likelihood functions of individual studies by multivariate normal distributions, using their means and covariance matrices as summary statistics. Our work should make sparse multiple logistic regression attractive also for other applications with binary target variables. B-LORE is freely available from: https://github.com/soedinglab/b-lore.

Details

show
hide
Language(s): eng - English
 Dates: 2018-12-312018-12
 Publication Status: Published in print
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Method: Peer
 Identifiers: DOI: 10.1371/journal.pgen.1007856
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: PLoS Genetics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: 27 Volume / Issue: 14 (12) Sequence Number: e1007856. Start / End Page: - Identifier: -