Journal Article

The Site/Group Extended Data format and tools

MPS-Authors

Dutheil, Julien Y.
Research Group Molecular Systems Evolution (Dutheil), Department Theoretical Biology (Traulsen), Max Planck Institute for Evolutionary Biology, Max Planck Society;

Supplementary Material (public)

evae011_supplementary_data.pdf
(Supplementary material), 98KB

Citation

Dutheil, J. Y., Hamidi, D., & Pajot, B. (2024). The Site/Group Extended Data format and tools. Genome Biology and Evolution, 16(2): evae011. doi:10.1093/gbe/evae011.


Cite as: https://hdl.handle.net/21.11116/0000-000D-FFCE-D
Abstract
Comparative sequence analysis permits unraveling the molecular processes underlying gene evolution. Many statistical methods generate candidate positions within genes, such as fast- or slow-evolving sites, coevolving groups of residues, sites undergoing positive selection, or changes in evolutionary rates. Understanding the functional causes of these evolutionary patterns requires combining the results of these analyses and mapping them onto molecular structures, a complex task involving distinct coordinate reference systems. To ease this task, we introduce the site/group extended data format, a simple text format to store annotations of sites or groups of sites. We developed a toolset, the SgedTools, which permits manipulating site/group extended data files, creating them from the outputs of various software, and translating coordinates between individual sequences, alignments, and three-dimensional structures. The package also includes a Monte-Carlo procedure to generate random site samples, possibly conditioning on site-specific features. This eases the statistical testing of evolutionary hypotheses while accounting for the structural properties of the encoded molecules.
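
The idea of drawing random site samples conditioned on a site-specific feature can be illustrated with a minimal Python sketch. This is not the SgedTools implementation: the function names, the equal-width binning scheme, and the toy feature values are all assumptions made for illustration. Each random group has the same size as the observed group, and each random site is drawn from the sites that fall in the same feature bin as the corresponding observed site.

```python
import random
from collections import defaultdict


def feature_bin(value, lo, width, n_bins):
    """Return the equal-width bin index of `value` (hypothetical binning scheme)."""
    # Clamp so that the maximum value falls in the last bin.
    return min(int((value - lo) / width), n_bins - 1)


def sample_conditioned_groups(observed, feature, n_bins=3, n_samples=1000, seed=0):
    """Draw random site groups whose per-site feature bins match `observed`.

    observed: list of distinct site identifiers forming the group of interest.
    feature:  dict mapping every candidate site to a numeric feature value
              (for example, solvent accessibility; an assumption here).
    """
    rng = random.Random(seed)
    lo, hi = min(feature.values()), max(feature.values())
    width = (hi - lo) / n_bins or 1.0  # avoid a zero width if all values are equal

    # Index all candidate sites by their feature bin.
    by_bin = defaultdict(list)
    for site, value in feature.items():
        by_bin[feature_bin(value, lo, width, n_bins)].append(site)

    samples = []
    for _ in range(n_samples):
        group = []
        for site in observed:
            b = feature_bin(feature[site], lo, width, n_bins)
            # Sample without replacement within a group.
            candidates = [s for s in by_bin[b] if s not in group]
            group.append(rng.choice(candidates))
        samples.append(group)
    return samples


# Toy usage: 30 sites whose feature value equals their index.
toy_feature = {site: float(site) for site in range(30)}
random_groups = sample_conditioned_groups([0, 15, 29], toy_feature, n_samples=200)
```

A statistic computed on the observed group (for instance, the mean pairwise distance between its sites in a three-dimensional structure) could then be compared with its distribution over `random_groups` to obtain an empirical p-value.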