English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels

MPS-Authors
/persons/resource/persons179722

Dral,  Pavlo O.
Research Department Thiel, Max-Planck-Institut für Kohlenforschung, Max Planck Society;

/persons/resource/persons179802

Owens,  Alec
Research Department Thiel, Max-Planck-Institut für Kohlenforschung, Max Planck Society;
Department of Physics and Astronomy, University College London;

/persons/resource/persons59045

Thiel,  Walter
Research Department Thiel, Max-Planck-Institut für Kohlenforschung, Max Planck Society;

External Resource
No external resources are shared
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Dral, P. O., Owens, A., Yurchenko, S. N., & Thiel, W. (2017). Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels. The Journal of Chemical Physics, 146(24): 244108. doi:10.1063/1.4989536.


Cite as: http://hdl.handle.net/11858/00-001M-0000-002D-82E7-4
Abstract
We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level ab initio energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a published high-level ab initio PES of methyl chloride with 44 819 points. The ML model is trained on sets of different sizes and then used to predict the energies for tens of thousands of nuclear configurations within seconds. The resulting datasets are utilized in variational calculations of the vibrational energy levels of CH3Cl. By using both structure-based sampling and self-correction, the size of the training set can be kept small (e.g., 10% of the points) without any significant loss of accuracy. In ab initio rovibrational spectroscopy, it is thus possible to reduce the number of computationally costly electronic structure calculations through structure-based sampling and self-correcting KRR-based machine learning by up to 90%.