English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  L2,1-norm regularized multivariate regression model with applications to genomic prediction

Mbebi, A. J., Tong, H., & Nikoloski, Z. (2021). L2,1-norm regularized multivariate regression model with applications to genomic prediction. Bioinformatics, 37(18), 2896-2904. doi:10.1093/bioinformatics/btab212.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Mbebi, A. J.1, Author           
Tong, H.1, Author           
Nikoloski, Z.1, Author           
Affiliations:
1Mathematical Modelling and Systems Biology - Nikoloski, Cooperative Research Groups, Max Planck Institute of Molecular Plant Physiology, Max Planck Society, ou_1753310              

Content

show
hide
Free keywords: -
 Abstract: Genomic selection (GS) is currently deemed the most effective approach to speed up breeding of agricultural varieties. It has been recognized that consideration of multiple traits in GS can improve accuracy of prediction for traits of low heritability. However, since GS forgoes statistical testing with the idea of improving predictions, it does not facilitate mechanistic understanding of the contribution of particular single nucleotide polymorphisms (SNP).Here we propose a L2,1-norm regularized multivariate regression model and devise a fast and efficient iterative optimization algorithm, called L2,1-joint, applicable in multi-trait GS. The usage of the L2,1-norm facilitates variable selection in a penalized multivariate regression that considers the relation between individuals, when the number of SNPs is much larger than the number of individuals. The capacity for variable selection allows us to define master regulators that can be used in a multi-trait GS setting to dissect the genetic architecture of the analyzed traits. Our comparative analyses demonstrate that the proposed model is a favorable candidate compared to existing state-of-the-art approaches. Prediction and variable selection with data sets from Brassica napus, wheat and Arabidopsis thaliana diversity panels are conducted to further showcase the performance of the proposed model.The model is implemented using R programming language and the code is freely available from https://github.com/alainmbebi/L21-norm-GS.Supplementary data are available at Bioinformatics online.

Details

show
hide
Language(s): eng - English
 Dates: 2021
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1093/bioinformatics/btab212
BibTex Citekey: 10.1093/bioinformatics/btab212
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: - Volume / Issue: 37 (18) Sequence Number: - Start / End Page: 2896 - 2904 Identifier: ISSN: 1367-4803
CoNE: https://pure.mpg.de/cone/journals/resource/954926969991