Design and experimental validation of a photocatalyst recommender based on a large language model

Francis Millward, Michal Kulczykowski, Jay Badland-Shaw, Sara Szymkuc, Rajan Suraksha, Aniket Kumar Srivastawa, Violaine Manet, Máire Griffin, Megan Bryden, Thomas Comerford, Lea Hämmerling, Aminata Mariko, Bartosz A. Grzybowski*, Eli Zysman-Colman*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Utilizing an extensive library of literature on photocatalytic transformations, we disclose the development of a machine learning (ML) model for the recommendation of photocatalysts most suitable for reactions of interest. The model is trained on > 36 000 such literature examples and uses an architecture inspired by the Bidirectional Encoder Representations from Transformer (BERT) large language model. Under cross-validation, it can suggest the “correct” photocatalysts with ∼90% accuracy. When experimentally tested on five out-of-box reactions, this algorithm consistently suggested photocatalysts that gave yields competitive to those chosen by human researchers and frequently suggested alternative photocatalysts that are potentially more appealing than the originally selected photocatalyst. Altogether, this platform serves as a valuable tool for researchers undertaking reaction optimization programs. The model is free to use at https://photocatals.grzybowskigroup.pl/predict/.
Original languageEnglish
Article numbere14544
Number of pages10
JournalAngewandte Chemie International Edition
VolumeEarly View
Early online date9 Dec 2025
DOIs
Publication statusE-pub ahead of print - 9 Dec 2025

Keywords

  • Large language models
  • Machine learning
  • Photocatalysis

Fingerprint

Dive into the research topics of 'Design and experimental validation of a photocatalyst recommender based on a large language model'. Together they form a unique fingerprint.

Cite this