• LOGIN
    Login with username and password
Repository logo

BORIS Portal

Bern Open Repository and Information System

  • Publications
  • Projects
  • Research Data
  • Organizations
  • Researchers
  • Statistics
  • More
  • LOGIN
    Login with username and password
Repository logo
Unibern.ch
  1. Home
  2. Publications
  3. Prediction of chemical reaction yields using deep learning
 

Prediction of chemical reaction yields using deep learning

Options
  • Details
  • Files
BORIS DOI
10.48350/163004
Publisher DOI
10.1088/2632-2153/abc81d
Description
Artificial intelligence is driving one of the most important revolutions in organic chemistry. Multiple platforms, including tools for reaction prediction and synthesis planning based on machine learning, have successfully become part of the organic chemists' daily laboratory, assisting in domain-specific synthetic problems. Unlike reaction prediction and retrosynthetic models, the prediction of reaction yields has received less attention in spite of the enormous potential of accurately predicting reaction conversion rates. Reaction yields models, describing the percentage of the reactants converted to the desired products, could guide chemists and help them select high-yielding reactions and score synthesis routes, reducing the number of attempts. So far, yield predictions have been predominantly performed for high-throughput experiments using a categorical (one-hot) encoding of reactants, concatenated molecular fingerprints, or computed chemical descriptors. Here, we extend the application of natural language processing architectures to predict reaction properties given a text-based representation of the reaction, using an encoder transformer model combined with a regression layer. We demonstrate outstanding prediction performance on two high-throughput experiment reactions sets. An analysis of the yields reported in the open-source USPTO data set shows that their distribution differs depending on the mass scale, limiting the data set applicability in reaction yields predictions.
Date of Publication
2021-03-31
Publication Type
Article
Subject(s)
500 - Science::570 - Life sciences; biology
500 - Science::540 - Chemistry
Language(s)
en
Contributor(s)
Schwaller, Philippe
Vaucher, Alain C
Laino, Teodoro
Reymond, Jean-Louisorcid-logo
Departement für Chemie, Biochemie und Pharmazie (DCBP)
Additional Credits
Departement für Chemie, Biochemie und Pharmazie (DCBP)
Series
Machine learning: science and technology
Publisher
IOP Publishing
ISSN
2632-2153
Access(Rights)
open.access
Show full item
BORIS Portal
Bern Open Repository and Information System
Build: ae9592 [15.12. 16:43]
Explore
  • Projects
  • Funding
  • Publications
  • Research Data
  • Organizations
  • Researchers
  • Audiovisual Material
  • Software & other digital items
More
  • About BORIS Portal
  • Send Feedback
  • Cookie settings
  • Service Policy
Follow us on
  • Mastodon
  • YouTube
  • LinkedIn
UniBe logo