English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  High-complexity regions in mammalian genomes are enriched for developmental genes

Pirogov, A., Pfaffelhuber, P., Börsch-Haubold, A., & Haubold, B. (2019). High-complexity regions in mammalian genomes are enriched for developmental genes. Bioinformatics, 35(11), 1813-1819. doi:10.1093/bioinformatics/bty922.

Item is

Basic

show hide
Item Permalink: http://hdl.handle.net/21.11116/0000-0002-94D0-B Version Permalink: http://hdl.handle.net/21.11116/0000-0003-D7BB-8
Genre: Journal Article

Files

show Files
hide Files
:
bty922.pdf (Any fulltext), 299KB
Name:
bty922.pdf
Description:
-
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show
hide
Description:
-

Creators

show
hide
 Creators:
Pirogov, Anton, Author
Pfaffelhuber, Peter, Author
Börsch-Haubold, Angelika, Author
Haubold, Bernhard1, Author              
Affiliations:
1Research Group Bioinformatics, Department Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Max Planck Society, ou_1445644              

Content

show
hide
Free keywords: -
 Abstract: MotivationUnique sequence regions are associated with genetic function in vertebrate genomes. However, measuring uniqueness, or absence of long repeats, along a genome is conceptually and computationally difficult. Here we use a previously published variant of the Lempel-Ziv complexity, the match complexity, Cm, and augment it by deriving its null distribution for random sequences. We then apply Cm to the human and mouse genomes to investigate the relationship between sequence complexity and function.ResultsWe implemented Cm in the program macle and show through simulation that the newly derived null distribution of Cm is accurate. This allows us to delineate high-complexity regions in the human and mouse genomes. Using our program macle2go, we find that these regions are two-fold enriched for genes. Moreover, the genes contained in these regions are more than 10-fold enriched for developmental functions.AvailabilitySource code for macle and macle2go is available from www.github.com/evolbioinf/macle and www.github.com/evolbioinf/macle2go, respectively; Cm browser tracks from guanine.evolbio.mgp.de/complexity.

Details

show
hide
Language(s): eng - English
 Dates: 2018-09-172018-07-232018-11-022018-11-082019-06-01
 Publication Status: Published in print
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Method: -
 Identifiers: DOI: 10.1093/bioinformatics/bty922
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: - Volume / Issue: 35 (11) Sequence Number: - Start / End Page: 1813 - 1819 Identifier: ISSN: 1367-4803
CoNE: https://pure.mpg.de/cone/journals/resource/954926969991