English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Data-efficient machine learning for molecular crystal structure prediction

MPS-Authors
/persons/resource/persons22000

Reuter,  Karsten
Chair of Theoretical Chemistry, Technische Universität München;
Theory, Fritz Haber Institute, Max Planck Society;

/persons/resource/persons257500

Margraf,  Johannes
Chair of Theoretical Chemistry, Technische Universität München;
Theory, Fritz Haber Institute, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

d0sc05765g.pdf
(Publisher version), 2MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Wengert, S., Csányi, G., Reuter, K., & Margraf, J. (2021). Data-efficient machine learning for molecular crystal structure prediction. Chemical Science, 12(12), 4536-4546. doi:10.1039/D0SC05765G.


Cite as: https://hdl.handle.net/21.11116/0000-0008-1F4E-0
Abstract
The combination of modern machine learning (ML) approaches with high-quality data from quantum mechanical (QM) calculations can yield models with an unrivalled accuracy/cost ratio. However, such methods are ultimately limited by the computational effort required to produce the reference data. In particular, reference calculations for periodic systems with many atoms can become prohibitively expensive for higher levels of theory. This trade-off is critical in the context of organic crystal structure prediction (CSP). Here, a data-efficient ML approach would be highly desirable, since screening a huge space of possible polymorphs in a narrow energy range requires the assessment of a large number of trial structures with high accuracy. In this contribution, we present tailored Δ-ML models that allow screening a wide range of crystal candidates while adequately describing the subtle interplay between intermolecular interactions such as H-bonding and many-body dispersion effects. This is achieved by enhancing a physics-based description of long-range interactions at the density functional tight binding (DFTB) level—for which an efficient implementation is available—with a short-range ML model trained on high-quality first-principles reference data. The presented workflow is broadly applicable to different molecular materials, without the need for a single periodic calculation at the reference level of theory. We show that this even allows the use of wavefunction methods in CSP.