Motivation: Molecular representation learning (MRL) models molecules with low-dimensional vectors to support biological and chemical applications. Current methods primarily rely on intrinsic molecular information to learn molecular representations, but they often overlook effectively integrating domain knowledge into MRL. Results: In this paper, we develop a reaction-enhanced graph learning (RXGL) framework for MRL, utilizing chemical reactions as domain knowledge. RXGL introduces dual graph learning modules to model molecule representation. One module employs graph convolutions on molecular graphs to capture molecule structures. The other module constructs a reaction-aware graph from chemical reactions and designs a novel graph attention network on this graph to integrate reaction-level relations into molecular modeling. To refine molecule representations, we design a reaction-based relation learning task, which considers the relations between the reactant and product sides in reactions. In addition, we introduce a cross-view contrastive task to strengthen the cooperative associations between molecular and reaction-aware graph learning. Experiment results show that our RXGL achieves strong performance in various downstream tasks, including product prediction, reaction classification, and molecular property prediction.

Chemical Reaction Enhanced Graph Learning for Molecule Representation / A. Li, E. Casiraghi, J. Rousu. - In: BIOINFORMATICS. - ISSN 1367-4811. - (2024), pp. btae558.1-btae558.15. [Epub ahead of print] [10.1093/bioinformatics/btae558]

Chemical Reaction Enhanced Graph Learning for Molecule Representation

E. Casiraghi
Penultimo
;
2024

Abstract

Motivation: Molecular representation learning (MRL) models molecules with low-dimensional vectors to support biological and chemical applications. Current methods primarily rely on intrinsic molecular information to learn molecular representations, but they often overlook effectively integrating domain knowledge into MRL. Results: In this paper, we develop a reaction-enhanced graph learning (RXGL) framework for MRL, utilizing chemical reactions as domain knowledge. RXGL introduces dual graph learning modules to model molecule representation. One module employs graph convolutions on molecular graphs to capture molecule structures. The other module constructs a reaction-aware graph from chemical reactions and designs a novel graph attention network on this graph to integrate reaction-level relations into molecular modeling. To refine molecule representations, we design a reaction-based relation learning task, which considers the relations between the reactant and product sides in reactions. In addition, we introduce a cross-view contrastive task to strengthen the cooperative associations between molecular and reaction-aware graph learning. Experiment results show that our RXGL achieves strong performance in various downstream tasks, including product prediction, reaction classification, and molecular property prediction.
Settore INFO-01/A - Informatica
Settore IBIO-01/A - Bioingegneria
   Adaptive AI methods for Digital Health (AIDH)
   AIDH
   POLITECNICO DI MILANO

   AI technologies for interaction prediction in biomedicine / Consortium: AIB
   Research Council of Finland
   345802

   Machine Learning for Systems Pharmacology / Consortium: MASF
   Research Council of Finland
   339421
2024
13-set-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
Bioinformatics_Anchen_btae558.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 4.05 MB
Formato Adobe PDF
4.05 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1100788
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact