Brief · PulseAugur

TOOL · arXiv cs.AI English(EN) · 1w

Rethinking Molecular Text Representations for LLMs: An Empirical Study

A new study on arXiv benchmarks the performance of 16 large language models across nine molecular representations for eight chemical tasks. The research found that model performance is heavily dependent on the molecular representation used, with explicit structured text formats like CML and MolJSON excelling in structural tasks, while IUPAC proved best for semantic tasks. Chemistry-specialized models showed strong performance with SMILES but struggled with structured formats, indicating a potential bias in their evaluation. AI

IMPACT Highlights the critical need for task-specific molecular representations in LLMs for chemistry applications.

arXiv
Large Language Models
SMILES
InChI
MolJSON