Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Kim, Heegyu; Jeon, Taeyang; Choi, Seungtaek; Hong, Jihoon; Jeon, Dongwon; Cho, Sungbum; Baek, Ga-Yeon; Kwak, Kyung-Won; Lee, Dong-Hee; Choi, Sun-Jin; Bae, Jisu; Lee, Chihoon; Kim, Yunseo; Park, Jinsung; Cho, Hyunsouk

Computer Science > Computation and Language

arXiv:2502.16457 (cs)

[Submitted on 23 Feb 2025]

Title:Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Authors:Heegyu Kim, Taeyang Jeon, Seungtaek Choi, Jihoon Hong, Dongwon Jeon, Sungbum Cho, Ga-Yeon Baek, Kyung-Won Kwak, Dong-Hee Lee, Sun-Jin Choi, Jisu Bae, Chihoon Lee, Yunseo Kim, Jinsung Park, Hyunsouk Cho

View PDF HTML (experimental)

Abstract:Materials synthesis is vital for innovations such as energy storage, catalysis, electronics, and biomedical devices. Yet, the process relies heavily on empirical, trial-and-error methods guided by expert intuition. Our work aims to support the materials science community by providing a practical, data-driven resource. We have curated a comprehensive dataset of 17K expert-verified synthesis recipes from open-access literature, which forms the basis of our newly developed benchmark, AlchemyBench. AlchemyBench offers an end-to-end framework that supports research in large language models applied to synthesis prediction. It encompasses key tasks, including raw materials and equipment prediction, synthesis procedure generation, and characterization outcome forecasting. We propose an LLM-as-a-Judge framework that leverages large language models for automated evaluation, demonstrating strong statistical agreement with expert assessments. Overall, our contributions offer a supportive foundation for exploring the capabilities of LLMs in predicting and guiding materials synthesis, ultimately paving the way for more efficient experimental design and accelerated innovation in materials science.

Comments:	under review
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.16457 [cs.CL]
	(or arXiv:2502.16457v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.16457

Submission history

From: Heegyu Kim [view email]
[v1] Sun, 23 Feb 2025 06:16:23 UTC (7,899 KB)

Computer Science > Computation and Language

Title:Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators