Compositional Design of Multicomponent Alloys Using Reinforcement Learning

Xian, Yuehui; Dang, Pengfei; Tian, Yuan; Jiang, Xue; Zhou, Yumei; Ding, Xiangdong; Sun, Jun; Lookman, Turab; Xue, Dezhen

doi:10.2139/ssrn.4707535

Download This Paper

Open PDF in Browser

Add Paper to My Library

Compositional Design of Multicomponent Alloys Using Reinforcement Learning

AMI: Acta Materialia

18 Pages Posted: 29 Jan 2024 Publication Status: Published

See all articles by Yuehui Xian

Yuehui Xian

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Pengfei Dang

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Yuan Tian

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials; Shanghai University

Xue Jiang

University of Science and Technology Beijing

Yumei Zhou

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Xiangdong Ding

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Jun Sun

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Dezhen Xue

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials

Abstract

The design of alloys has typically involved adaptive experimental synthesis and characterization guided by machine learning models fitted to available data. Often a bottleneck for sequential design, e.g., by Bayesian Global Optimization (BGO), be it for self-driven or manual synthesis, is that the search space becomes intractable as the number of alloy elements and its compositions exceeds a threshold. Here we overcome this limitation by performing compositional design of alloys using reinforcement learning (RL) within a closed loop with manual synthesis and characterization. Moreover, the training efficiency is increased by incorporating uncertainty within the reward. Existing alloy data is often limited, however, with pre-training the agent can access regions of higher reward values more frequently. In addition, the experimental feedback enables the agent to gradually explore new regions with higher rewards, compositionally different from the initial dataset. We demonstrate this strategy by designing a phase change multicomponent alloy (Ti27.2Ni47Hf13.8Zr12) with the highest transformation enthalpy (ΔH) -37.1 J/g within the TiNi-based family of alloys from a space of over 2×108 candidates, although the initial training is only on a compact dataset of 112 alloys. The approach directly applies to processing conditions where the actions would be performed in a given order.

Keywords: Compositional design, Reinforcement learning, Multicomponent alloys, Transformational enthalpy, phase change materials

Suggested Citation: Suggested Citation

Xian, Yuehui and Dang, Pengfei and Tian, Yuan and Jiang, Xue and Zhou, Yumei and Ding, Xiangdong and Sun, Jun and Lookman, Turab and Xue, Dezhen, Compositional Design of Multicomponent Alloys Using Reinforcement Learning. Available at SSRN: https://ssrn.com/abstract=4707535 or http://dx.doi.org/10.2139/ssrn.4707535

Yuehui Xian

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Pengfei Dang

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Yuan Tian

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Shanghai University ( email )

149 Yanchang Road
SHANGDA ROAD 99
Shanghai 200072, 200444
China

Xue Jiang

University of Science and Technology Beijing ( email )

30 Xueyuan Road, Haidian District
beijing, 100083
China

Yumei Zhou

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

26 Xianning W Rd.
Xi'an Jiao Tong University
Xi'an, Shaanxi 710049
China

Xiangdong Ding

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

26 Xianning W Rd.
Xi'an Jiao Tong University
Xi'an, Shaanxi 710049
China

Jun Sun

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Turab Lookman (Contact Author)

AiMaterials Research, LLC ( email )

Dezhen Xue

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Download This Paper

Open PDF in Browser

Do you have a job opening that you would like to promote on SSRN?

Place Job Opening

Paper statistics

Downloads

89

Abstract Views

386

PlumX Metrics

Feedback

Compositional Design of Multicomponent Alloys Using Reinforcement Learning

Abstract

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Shanghai University ( email )

University of Science and Technology Beijing ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

AiMaterials Research, LLC ( email )

Xi'an Jiaotong University (XJTU) - State Key Laboratory for Mechanical Behavior of Materials ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Related eJournals

Materials Engineering eJournal

Acta Materialia, Inc.

Computational Materials Science eJournal

Materials Processing & Manufacturing eJournal

Engineering Physics eJournal