The prevalence of conservative evolution in the protein sequence universe

Weidmann, L; Dijkstra, TMH; Kohlbacher, O; Lupas, AN

Lokale TagsFreigabegeschichteDetailsÜbersicht

The prevalence of conservative evolution in the protein sequence universe

Weidmann, L., Dijkstra, T., Kohlbacher, O., & Lupas, A. (2018). The prevalence of conservative evolution in the protein sequence universe. Poster presented at CAS Conference 2018: Molecular Origins of LIFE, München, Germany.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-000B-70E4-5 Versions-Permalink: https://hdl.handle.net/21.11116/0000-000B-70E5-4

Genre: Poster

Dateien

einblenden: Dateien

Externe Referenzen

einblenden:

ausblenden:

externe Referenz:
https://www.emergence-of-life.de/past-events/181011-12_mom_abstractbook.pdf (Zusammenfassung) Open Access Status unbekannt

Beschreibung:
-

OA-Status:
Keine Angabe

Urheber

einblenden:

ausblenden:

Urheber:
Weidmann, L¹, Autor
Dijkstra, TMH¹, Autor
Kohlbacher, O², Autor
Lupas, AN¹, Autor

Affiliations:
1Department Protein Evolution, Max Planck Institute for Developmental Biology, Max Planck Society, ou_3375791
2IMPRS From Molecules to Organisms, Max Planck Institute for Developmental Biology, Max Planck Society, ou_3376131

Inhalt

einblenden:

ausblenden:

Schlagwörter: -

Zusammenfassung: The genesis of a structured and functional protein by random processes is exceedingly unlikely. However, once a functioning protein emerges, it can easily gain acceptance [1]. The evolution of natural proteins therefore often proceeds through the amplification of already established protein sequences. Copies of the same sequence evolve over time, leading to the co-existence of similar
sequences that might also have diversified in function [2]. We investigate the prevalence of such conservative evolution by analyzing reuse in the protein sequence universe. 1300 non-redundant bacterial genomes of distinct genera with exemplars from most bacterial classes are chosen as a representative for this study. We use statistical modeling in order to distinguish sequence similarities arising through reuse, as opposed to mere chance. For this purpose we derive the distribution of point mutation distances between randomly drawn k-mers. For long point mutation distances, the distribution can be described by a binomial distribution based on the amino acid composition of the underlying data. The frequency of shorter distances is significantly increased relative to the binomial distribution and can be explained by reuse. In the example of 100mers, we find that most sequence fragments (>90%) are at least reused once (p-value of 10-5). More than 10% of all sequence fragments are extensively reused and reoccur more than thousand times. Pairwise genome comparison reveals an overlap of around 19% common sequences on average. This demonstrates that the pressure to conserve sequences is strong enough to cause such significant sequence overlap, even after billions of years have passed.

Details

einblenden:

ausblenden:

Sprache(n):

Datum: Online veröffentlicht: 2018-10

Publikationsstatus: Online veröffentlicht

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: -

Art des Abschluß: -

Veranstaltung

einblenden:

ausblenden:

Titel: CAS Conference 2018: Molecular Origins of LIFE

Veranstaltungsort: München, Germany

Start-/Enddatum: 2018-10-11 - 2018-10-12

ausblenden:

Titel: CAS Conference 2018: Molecular Origins of LIFE

Genre der Quelle: Konferenzband

Urheber:

Affiliations:

Ort, Verlag, Ausgabe: -

Seiten: - Band / Heft: - Artikelnummer: P37 Start- / Endseite: 58 Identifikator: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle 1