Steel property prediction based on a materials language model

process image

SteelScientist is a reliable AI expert based on large language models that represents an end-to-end pipeline from materials text to properties, enabling quantitative predictions of mechanical properties including yield strength (YS), ultimate tensile strength (UTS), and total elongation (EL) with high accuracy, as well as the exploration of new steels. The pipeline comprises a materials language encoder named SteelBERT, which is pretrained on a comprehensive corpus comprising 4.2 million abstracts related to materials science and 55,000 full texts from the steel literature, along with a multimodal deep learning framework that maps the composition and text sequence of complex fabrication processes to mechanical properties. This model achieves R2 scores of about 80% on test data for mechanical properties prediction.

Input

Steel composition (Wt.%)



















Processing route

How to cite

Shaohan Tian, Xue Jiang, Weiren Wang, Zhihua Jing, Chi Zhang, Cheng Zhang, Turab Lookman, Yanjing Su. 2025. “Steel Design Based on a Large Language Model.” Acta Materialia 285:120663. doi: 10.1016/j.actamat.2024.120663.
Code: https://github.com/MGEdata/SteelScientist.
SteelBERT: https://huggingface.co/MGE-LLMs/SteelBERT.

© 2024 University of Science and Technology Beijing.

Powered by Sun Praise, Inc.