English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Mapping and classifying molecules from a high-throughput structural database

De, S., Musil, F., Ingram, T., Baldauf, C., & Ceriotti, M. (2017). Mapping and classifying molecules from a high-throughput structural database. Journal of Cheminformatics, 9: 6. doi:10.1186/s13321-017-0192-4.

Item is

Files

show Files
hide Files
:
art_10.1186_s13321-017-0192-4.pdf (Publisher version), 16MB
Name:
art_10.1186_s13321-017-0192-4.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2017
Copyright Info:
© The Author(s)

Locators

show

Creators

show
hide
 Creators:
De, Sandip1, 2, Author
Musil, Felix1, 2, Author
Ingram, Teresa3, Author           
Baldauf, Carsten3, Author           
Ceriotti, Michele1, 2, Author
Affiliations:
1National Center for Computational Design and Discovery of Novel Materials (MARVEL), Lausanne, Switzerland, ou_persistent22              
2Laboratory of Computational Science and Modelling, Institute of Materials, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, ou_persistent22              
3Theory, Fritz Haber Institute, Max Planck Society, ou_634547              

Content

show
hide
Free keywords: -
 Abstract: High-throughput computational materials design promises to greatly accelerate the process of discovering new materials and compounds, and of optimizing their properties. The large databases of structures and properties that result from computational searches, as well as the agglomeration of data of heterogeneous provenance leads to considerable challenges when it comes to navigating the database, representing its structure at a glance, understanding structure–property relations, eliminating duplicates and identifying inconsistencies. Here we present a case study, based on a data set of conformers of amino acids and dipeptides, of how machine-learning techniques can help addressing these issues. We will exploit a recently-developed strategy to define a metric between structures, and use it as the basis of both clustering and dimensionality reduction techniques—showing how these can help reveal structure–property relations, identify outliers and inconsistent structures, and rationalise how perturbations (e.g. binding of ions to the molecule) affect the stability of different conformers.

Details

show
hide
Language(s):
 Dates: 2016-09-292017-01-172017-02-02
 Publication Status: Published online
 Pages: 14
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1186/s13321-017-0192-4
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Journal of Cheminformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: BioMed Central
Pages: 14 Volume / Issue: 9 Sequence Number: 6 Start / End Page: - Identifier: Other: 1758-2946
CoNE: https://pure.mpg.de/cone/journals/resource/1758-2946