  OBAMA: OBAMA for Bayesian aminoacid model averaging

Bouckaert, R. (2020). OBAMA: OBAMA for Bayesian aminoacid model averaging. PeerJ, 8: 9460. doi:10.7717/peerj.9460.

Files

shh2705.pdf (Publisher version), 827KB
Name:
shh2705.pdf
Description:
OA
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Creators

 Creators:
Bouckaert, Remco (1), Author
Affiliations:
(1) Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Max Planck Society, ou_2074311

Content

Free keywords: Amino acid model, Bayesian analysis, Bayesian model averaging, BEAST, Gamma rate heterogeneity, Phylogenetics, Protein model, Site model, Statistical phylogenetics, Substitution model
 Abstract: Background. Bayesian analyses offer many benefits for phylogenetics and have been popular for the analysis of amino acid alignments. Such analyses require a substitution model and a site model to be specified, and these models, which are typically of no interest to the analysis overall, are often chosen by an ad hoc or likelihood-based method. Methods. We present a method called OBAMA that averages over substitution models and site models, thus letting the data inform model choices and taking model uncertainty into account. It uses trans-dimensional Markov chain Monte Carlo (MCMC) proposals to switch between various empirical substitution models for amino acids, such as Dayhoff, WAG, and JTT. Furthermore, it switches between using the base frequencies from these substitution models and using base frequencies estimated from the alignment. Finally, it switches between using gamma rate heterogeneity or not, and between using a proportion of invariable sites or not. Results. We show that the model performs well in a simulation study. By using appropriate priors, we demonstrate that both the proportion of invariable sites and the shape parameter for gamma rate heterogeneity can be estimated. The OBAMA method allows model uncertainty to be taken into account, thus reducing bias in phylogenetic estimates. The method is implemented in the OBAMA package in BEAST 2, which is open source, licensed under the LGPL, and allows joint tree inference under a wide range of models. © Copyright 2020 Bouckaert.
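
The idea of averaging over substitution models by letting an MCMC chain switch between them can be illustrated with a toy sketch. This is not the OBAMA or BEAST 2 implementation: it assumes a uniform prior over models, holds the tree and all other parameters fixed, and replaces the real phylogenetic likelihoods with made-up log-likelihood values (`TOY_LOG_LIKS`). It is therefore a fixed-dimension Metropolis sampler over a model indicator, not a trans-dimensional one. The function names are hypothetical.

```python
import math
import random

# Hypothetical log-likelihoods for three empirical models.
# These numbers are invented for illustration and do not come from the paper.
TOY_LOG_LIKS = {"Dayhoff": -1002.0, "WAG": -1000.0, "JTT": -1001.0}


def sample_model_indicator(log_liks, n_steps=50000, seed=1):
    """Metropolis sampler over a discrete model indicator (uniform model prior).

    Returns the fraction of steps the chain spent in each model, which
    approximates the posterior model probabilities.
    """
    rng = random.Random(seed)
    k = len(log_liks)
    current = 0
    counts = [0] * k
    for _ in range(n_steps):
        # Symmetric proposal: pick one of the other k - 1 models uniformly.
        proposal = rng.randrange(k - 1)
        if proposal >= current:
            proposal += 1
        # With a uniform prior and a symmetric proposal, the acceptance
        # probability reduces to the likelihood ratio (capped at 1).
        log_ratio = log_liks[proposal] - log_liks[current]
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            current = proposal
        counts[current] += 1
    return [c / n_steps for c in counts]


def exact_posterior(log_liks):
    """Exact posterior model probabilities under a uniform prior."""
    m = max(log_liks)
    weights = [math.exp(x - m) for x in log_liks]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    names = list(TOY_LOG_LIKS)
    values = list(TOY_LOG_LIKS.values())
    sampled = sample_model_indicator(values)
    exact = exact_posterior(values)
    for name, s, e in zip(names, sampled, exact):
        print(f"{name}: sampled {s:.3f}, exact {e:.3f}")
```

In this toy setting the time the chain spends in each model approximates that model's posterior probability, so any quantity logged along the chain is averaged over models with those weights. In a full analysis the likelihood would also depend on the tree, the rate parameters, and the site model settings, all of which would be sampled jointly.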

Details

Language(s): eng - English
 Dates: 2020
 Publication Status: Published online
 Pages: 15
 Publishing info: -
 Table of Contents: Introduction

Methods
- site model
- prior
- MCMC proposals

Results
- Validation of model implementation
-- Simulation study
-- Identifiability of gamma shape and proportion of invariable sites
-- Variants on simulation study
-- Frequency operator

Discussion
- Site models matter

Conclusions
 Rev. Type: Peer
 Identifiers: DOI: 10.7717/peerj.9460
Other: shh2705
 Degree: -

Source 1

Title: PeerJ
  Other : PeerJ
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: London [u.a.] : PeerJ Inc.
Pages: -
Volume / Issue: 8
Sequence Number: 9460
Start / End Page: -
Identifier: ISSN: 2167-8359
CoNE: https://pure.mpg.de/cone/journals/resource/2167-8359