English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Simultaneous learning of several materials properties from incomplete databases with multi-task SISSO

MPS-Authors
/persons/resource/persons188977

Ouyang,  Runhai
Theory, Fritz Haber Institute, Max Planck Society;

/persons/resource/persons203189

Ahmetcik,  Emre
Theory, Fritz Haber Institute, Max Planck Society;

/persons/resource/persons21413

Carbogno,  Christian
Theory, Fritz Haber Institute, Max Planck Society;

/persons/resource/persons22064

Scheffler,  Matthias
Theory, Fritz Haber Institute, Max Planck Society;

/persons/resource/persons21549

Ghiringhelli,  Luca M.
Theory, Fritz Haber Institute, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

arXiv:1901.00948.pdf
(Preprint), 2MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Ouyang, R., Ahmetcik, E., Carbogno, C., Scheffler, M., & Ghiringhelli, L. M. (2019). Simultaneous learning of several materials properties from incomplete databases with multi-task SISSO. JPhys Materials, 2(2): 024002. doi:10.1088/2515-7639/ab077b.


Cite as: https://hdl.handle.net/21.11116/0000-0003-02D4-B
Abstract
The identification of descriptors of materials properties and functions that
capture the underlying physical mechanisms is a critical goal in data-driven
materials science. Only such descriptors will enable a trustful and efficient
scanning of materials spaces and possibly the discovery of new materials.
Recently, the sure-independence screening and sparsifying operator (SISSO) has
been introduced and was successfully applied to a number of materials-science
problems. SISSO is a compressed-sensing based methodology yielding predictive
models that are expressed in form of analytical formulas, built from simple
physical properties. These formulas are systematically selected from an immense
number (billions or more) of candidates. In this work, we describe a powerful
extension of the methodology to a 'multi-task learning' approach, which
identifies a single descriptor capturing multiple target materials properties
at the same time. This approach is specifically suited for a heterogeneous
materials database with scarce or partial data, e.g., in which not all
properties are reported for all materials in the training set. As showcase
examples, we address the construction of materials-properties maps for the
relative stability of octet-binary compounds, considering several crystal
phases simultaneously, and the metal/insulator classification of binary
materials distributed over many crystal-prototypes.