CRBHits: From Conditional Reciprocal Best Hits to Codon Alignments and Ka/Ks in R

Ullrich, K. K. (2020). CRBHits: From Conditional Reciprocal Best Hits to Codon Alignments and Ka/Ks in R. The Journal of Open Source Software, 5(55): 2424. doi:10.21105/joss.02424.

Files

10.21105.joss.02424.pdf (Publisher version), 759KB
Name: 10.21105.joss.02424.pdf
Description: -
OA-Status:
Visibility: Public
MIME-Type / Checksum: application/pdf / [MD5]
Technical Metadata:
Copyright Date: -
Copyright Info: -

Creators

Creators:
Ullrich, Kristian K. [1], Author
Affiliations:
[1] Department Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Max Planck Society, ou_1445635

Content

Free keywords: reciprocal best hit; conditional reciprocal best hit; codon alignment; Ka/Ks; dN/dS; tandem duplicated genes; synteny
Abstract: CRBHits is a coding sequence (CDS) analysis pipeline in R (R Core Team, 2019). It reimplements the Conditional Reciprocal Best Hit (CRBH) algorithm crb-blast and covers all necessary steps from sequence similarity searches and codon alignments to Ka/Ks calculations and synteny. The new R package targets ecology, population and evolutionary biologists working in the field of comparative genomics.

The Reciprocal Best Hit (RBH) approach is commonly used in bioinformatics to show that two sequences evolved from a common ancestral gene. In other words, RBH tries to find orthologous protein sequences within and between species. These orthologous sequences can be further analysed to evaluate protein family evolution, infer phylogenetic trees and annotate protein function (Altenhoff et al., 2019). The initial sequence search step is classically performed with the Basic Local Alignment Search Tool (blast) (Altschul et al., 1990) and, due to evolutionary constraints, in most cases protein coding sequences are compared between two species. Downstream analyses use the resulting RBH to cluster sequence pairs and build so-called orthologous groups, as done by e.g. OrthoFinder (Emms & Kelly, 2015) and other tools.

The CRBH algorithm was introduced by Aubry et al. (2014) and builds upon the traditional RBH approach to find additional orthologous sequences between two sets of sequences. As described earlier (Aubry et al., 2014; Scott, 2017), CRBH uses the sequence search results to fit an expect value (E-value) cutoff given each RBH to subsequently add sequence pairs to the list of bona-fide orthologs given their alignment length.

Unfortunately, as mentioned by Scott (2017), the original implementation of CRBH (crb-blast) lacks improved blast-like search algorithms to speed up the analysis. As a consequence, Scott (2017) ported CRBH to python (shmlast), although shmlast cannot deal with IUPAC nucleotide code so far.

CRBHits constitutes a new R package, which builds upon previous implementations and ports CRBH into the R environment, which is popular among biologists. CRBHits improves CRBH by additional implemented filter steps (Rost, 1999) and the possibility to apply custom filters prior to E-value fitting. Further, the resulting CRBH pairs can be evaluated for the presence of tandem duplicated genes, gene order based syntenic groups and evolutionary rates.
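
The plain RBH criterion that the abstract takes as its starting point can be illustrated with a minimal Python sketch. This is not the CRBHits or crb-blast implementation, and it does not include the conditional E-value fitting step of CRBH. The function names and the (query, subject, score) tuple layout are assumptions made for illustration only; a real pipeline would parse these values from blast-like search output.

```python
def best_hits(hits):
    """Map each query to its best-scoring subject.

    `hits` is an iterable of (query, subject, score) tuples, where a
    higher score means a better hit (hypothetical input layout).
    On ties, the first hit seen for a query is kept.
    """
    best = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (a, b) pairs that are each other's best hit."""
    forward = best_hits(a_vs_b)   # best hit in B for every sequence of A
    backward = best_hits(b_vs_a)  # best hit in A for every sequence of B
    return sorted((a, b) for a, b in forward.items() if backward.get(b) == a)


# Toy search results for two made-up sequence sets A and B.
a_vs_b = [("a1", "b1", 200), ("a1", "b2", 150), ("a2", "b2", 90), ("a3", "b2", 120)]
b_vs_a = [("b1", "a1", 198), ("b2", "a3", 118), ("b2", "a2", 88)]
pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
```

In this toy input, a2 has b2 as its best hit, but b2 prefers a3, so a2 is not part of any reciprocal pair. Sequences of this kind are the ones CRBH tries to recover through its fitted E-value cutoff.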

Details

Language(s): eng - English
Dates: 2020-05-26 / 2020-11-15
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.21105/joss.02424
 Degree: -

Source 1

Title: The Journal of Open Source Software
  Other : Journal of Open Source Software
  Abbreviation : JOSS
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: -
Volume / Issue: 5 (55)
Sequence Number: 2424
Start / End Page: -
Identifier: ISSN: 2475-9066
CoNE: https://pure.mpg.de/cone/journals/resource/2475-9066