English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  SELFIES and the future of molecular string representations

Krenn, M., Ai, Q., Barthel, S., Carson, N., Frei, A., Frey, N. C., et al. (2022). SELFIES and the future of molecular string representations. Patterns, 3(10): 100588. doi:10.1016/j.patter.2022.100588.

Item is

Files

show Files
hide Files
:
Screenshot 2022-04-05 at 13.59.34.png (Supplementary material), 32KB
Name:
Screenshot 2022-04-05 at 13.59.34.png
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
image/png / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-
:
PIIS2666389922002069.pdf (Publisher version), 5MB
Name:
PIIS2666389922002069.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Krenn, Mario1, Author           
Ai, Qianxiang2, Author
Barthel, Senja2, Author
Carson, Nessa2, Author
Frei, Angelo2, Author
Frey, Nathan C.2, Author
Friederich, Pascal2, Author
Gaudin, Théophile2, Author
Gayle, Alberto Alexander2, Author
Jablonka, Kevin Maik2, Author
Lameiro, Rafael F.2, Author
Lemm, Dominik2, Author
Lo, Alston2, Author
Moosavi, Seyed Mohamad2, Author
Nápoles-Duarte, José Manuel2, Author
Nigam, AkshatKumar2, Author
Pollice, Robert2, Author
Rajan, Kohulan2, Author
Schatzschneider, Ulrich2, Author
Schwaller, Philippe2, Author
Skreta, Marta2, AuthorSmit, Berend2, AuthorStrieth-Kalthoff, Felix2, AuthorSun, Chong2, AuthorTom, Gary2, Authorvon Rudorff, Guido Falk2, AuthorWang, Andrew2, AuthorWhite, Andrew2, AuthorYoung, Adamo2, AuthorYu, Rose2, AuthorAspuru-Guzik, Alán2, Author more..
Affiliations:
1Krenn Research Group, Marquardt Division, Max Planck Institute for the Science of Light, Max Planck Society, ou_3345237              
2external, ou_persistent22              

Content

show
hide
Free keywords: inverse design, molecular design, molecular representation, deep learning
 Abstract: Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool to represent molecular graphs, and the most popular molecular string representation, SMILES, has powered cheminformatics since the late 1980s. However, in the context of AI and ML in chemistry, SMILES has several shortcomings -- most pertinently, most combinations of symbols lead to invalid results with no valid chemical interpretation. To overcome this issue, a new language for molecules was introduced in 2020 that guarantees 100\% robustness: SELFIES (SELF-referencIng Embedded Strings). SELFIES has since simplified and enabled numerous new applications in chemistry. In this manuscript, we look to the future and discuss molecular string representations, along with their respective opportunities and challenges. We propose 16 concrete Future Projects for robust molecular representations. These involve the extension toward new chemical domains, exciting questions at the interface of AI and robust languages and interpretability for both humans and machines. We hope that these proposals will inspire several follow-up works exploiting the full potential of molecular string representations for the future of AI in chemistry and materials science.

Details

show
hide
Language(s): eng - English
 Dates: 2022-10-14
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1016/j.patter.2022.100588
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Patterns
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Amsterdam : Elsevier
Pages: - Volume / Issue: 3 (10) Sequence Number: 100588 Start / End Page: - Identifier: ISSN: 2666-3899
CoNE: https://pure.mpg.de/cone/journals/resource/2666-3899