Uni-MRL: Unified MultiModal Molecular Representation Learning with Large Language Models and Graph Neural Networks
File version
Author(s)
Zheng, Yizhen
Koh, Huan Yee
Pan, Shirui
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Sydney, Australia
License
Abstract
Molecular representation learning is essential for molecular property prediction, a key step in drug design and material discovery. While graph neural networks (GNNs) effectively capture structural information from molecular graphs, they often fail to incorporate broader contextual knowledge that goes beyond the local graph structure, such as chemical reactivity and relationships derived from domain-specific knowledge. Conversely, large language models (LLMs) excel at processing sequential molecular representations like SMILES strings. However, current methods inadequately integrate these complementary modalities, particularly for complex tasks in quantum and physical chemistry. To address this challenge, we propose Uni-MRL, a unified multi-modal framework that integrates GNN-based structural embeddings with task-specific features defined by LLMs. By leveraging the reasoning ability of LLMs to generate chemically meaningful features and combining them with graph-based representations through task-driven fusion and contrastive alignment, Uni-MRL captures both structural and chemical intricacies of molecular representations. We show that this unified framework achieves state-of-the-art performance in quantum and physical chemistry tasks. Our ablation studies further demonstrate the importance of its components, highlighting the effectiveness of multi-modal integration.
Journal Title
Conference Title
Advances in Knowledge Discovery and Data Mining: 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, NSW, Australia, June 10–13, 2025, Proceedings, Part V
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Ju, J; Zheng, Y; Koh, HY; Pan, S, Uni-MRL: Unified MultiModal Molecular Representation Learning with Large Language Models and Graph Neural Networks, 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025, Sydney, NSW, Australia, June 10–13, 2025, Proceedings, Part V, 2025, pp. 275-287