  Improved vapor pressure predictions using group contribution-assisted graph convolutional neural networks (GC2NN)

Krüger, M., Galeazzo, T., Eremets, I., Schmidt, B., Pöschl, U., Shiraiwa, M., et al. (2025). Improved vapor pressure predictions using group contribution-assisted graph convolutional neural networks (GC2NN). EGUsphere. doi:10.5194/egusphere-2025-1191.

Creators

Creators:
Krüger, Matteo (1), Author
Galeazzo, Tommaso, Author
Eremets, Ivan (1), Author
Schmidt, Bertil, Author
Pöschl, Ulrich (1), Author
Shiraiwa, Manabu, Author
Berkemeier, Thomas (1), Author
Affiliations:
(1) Multiphase Chemistry, Max Planck Institute for Chemistry, Max Planck Society, ou_1826290

Content

Free keywords: -
Abstract: The vapor pressures (p_vap) of organic molecules play a crucial role in the partitioning of secondary organic aerosol (SOA). Given the vast diversity of atmospheric organic compounds, experimentally determining p_vap for each compound is infeasible. Machine learning (ML) algorithms allow the prediction of physicochemical properties based on complex representations of molecular structure, but their performance depends crucially on the availability of sufficient training data. We propose a novel approach to predict p_vap using group contribution-assisted graph convolutional neural networks (GC2NN). The models use molecular descriptors such as molar mass alongside molecular graphs containing atom and bond features as representations of molecular structure. Molecular graphs allow the ML model to infer molecular connectivity better than methods that use other, non-structural embeddings. We achieve the best results with an adaptive-depth GC2NN, in which the number of evaluated graph layers depends on molecular size. We present two vapor pressure estimation models that achieve strong agreement between predicted and experimentally determined p_vap. The first is a general model with broad scope that is suitable for both organic and inorganic molecules and achieves a mean absolute error (MAE) of 0.67 log-units (R² = 0.86). The second model is specialized for organic compounds with functional groups often encountered in atmospheric SOA and achieves an even stronger correlation with the test data (MAE = 0.36 log-units, R² = 0.97). The adaptive-depth GC2NN models clearly outperform existing methods, including parameterizations and group-contribution methods, demonstrating that graph-based ML techniques are powerful tools for the estimation of physicochemical properties, even when experimental data are scarce.
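
The error metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes that "log-units" means differences in log10 of the vapor pressure and that R² is the coefficient of determination computed on those log10 values; the record does not state either definition, so both are assumptions. The function names and the sample values are hypothetical and are not taken from the paper.

```python
import math

def mae_log_units(p_pred, p_obs):
    """Mean absolute error of log10(vapor pressure), assumed meaning of 'log-units'."""
    errors = [abs(math.log10(p) - math.log10(o)) for p, o in zip(p_pred, p_obs)]
    return sum(errors) / len(errors)

def r_squared_log(p_pred, p_obs):
    """Coefficient of determination on log10 values (assumed definition of R²)."""
    log_pred = [math.log10(p) for p in p_pred]
    log_obs = [math.log10(o) for o in p_obs]
    mean_obs = sum(log_obs) / len(log_obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(log_obs, log_pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in log_obs)
    return 1.0 - ss_res / ss_tot

# Made-up illustrative pressures (arbitrary units), NOT data from the paper.
p_obs = [1e-2, 1e-4, 1e-6]
p_pred = [1e-3, 1e-4, 1e-5]

mae = mae_log_units(p_pred, p_obs)
r2 = r_squared_log(p_pred, p_obs)
print(f"MAE = {mae:.2f} log-units, R2 = {r2:.2f}")
```

Under this reading, an MAE of 0.36 log-units corresponds to predictions that are off by a factor of roughly 10^0.36, or about 2.3, on average.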

Details

Language(s): eng - English
 Dates: 2025-03-20
 Publication Status: Published online
 Pages: 22
 Publishing info: -
 Table of Contents: -
 Rev. Type: No review
 Identifiers: DOI: 10.5194/egusphere-2025-1191
 Degree: -

Source 1

Title: EGUsphere
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: -
Identifier: -