Compositional design and phase formation capability of high-entropy rare-earth disilicates from machine learning and decision fusion

Fan, Yun; Bai, Yuelei; Li, Qian; Lu, Zhiyao; Chen, Dong; Liu, Yuchen; Li, Wenxian; Liu, Bin

doi:10.1038/s41524-024-01282-x

Download PDF

Article
Open access
Published: 07 May 2024

Compositional design and phase formation capability of high-entropy rare-earth disilicates from machine learning and decision fusion

Yun Fan^1,2,
Yuelei Bai ORCID: orcid.org/0000-0003-1738-3274¹,
Qian Li³,
Zhiyao Lu¹,
Dong Chen ORCID: orcid.org/0000-0002-7486-2549⁴,
Yuchen Liu²,
Wenxian Li⁵ &
…
Bin Liu ORCID: orcid.org/0000-0003-2764-5135^2,6

npj Computational Materials volume 10, Article number: 95 (2024) Cite this article

243 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

A key strategy for designing environmental barrier coatings is to incorporate multiple rare-earth (RE) components into β- and γ-RE₂Si₂O₇ to achieve multifunctional performance optimization. However, the polymorphic phase presents significant challenges for the design of multicomponent RE disilicates. Here, employing decision fusion, a machine learning (ML) method is crafted to identify multicomponent RE disilicates, showcasing notable accuracy in prediction. The well-trained ML models evaluated the phase formation capability of 117 (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ and (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O_7, which are unreported in experiments and validated by first-principles calculations. Utilizing model visualization, essential factors governing the formation of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ are pinpointed, including the average radius of RE³⁺ and variations in different RE³⁺ combinations. On the other hand, (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ must take into account the average mass and the electronegativity deviation of RE³⁺. This work combines material-oriented ML methods with formation mechanisms of multicomponent RE disilicates, enabling the efficient design of superior materials with exceptional properties for the application of environmental barrier coatings.

Phase formation capability and compositional design of β-phase multiple rare-earth principal component disilicates

Article Open access 08 March 2023

Machine-learning-guided descriptor selection for predicting corrosion resistance in multi-principal element alloys

Article Open access 31 January 2022

Discovery of novel Li SSE and anode coatings using interpretable machine learning and high-throughput multi-property screening

Article Open access 13 August 2021

Introduction

Environmental barrier coatings (EBCs) have received significant attention in gas turbine technology, aiming to shield turbine components crafted from SiC_f/SiC ceramic matrix composites (CMCs) against corrosion damage from CaO-MgO-Al₂O₃-SiO₂ (CMAS) and water vapor in high-temperature combustion environments, thus enabling higher inlet temperatures, greater core power, and improving fuel efficiency in turbine engines^1,2,3,4,5. Rare earth disilicates (RE₂Si₂O₇) show great potential for EBC applications, because of their excellent resistance to molten CMAS/water vapor, low thermal conductivity, and compatible thermal expansion coefficients (CTEs) with CMC substrates^6,7. Particularly, the thermal expansion coefficients ((4–5)×10^–6K⁻¹) of the β- and γ-RE₂Si₂O₇ are close to that of CMCs ((4.5–5.5)×10^–6K⁻¹)^6,7,8,9,10. However, previous studies have shown that their performance decreases when subjected to the strong coupled attack of thermal steam, molten CMAS, and thermal stresses^8,11,12.

Most RE₂Si₂O₇ experience intricate phase transformations (α, β, γ, δ, A, F, and G) at various temperatures. Especially, the β-γ polymorphic transition may result in catastrophic cracking or delamination of the EBCs^13,14,15. Multi-component systems often exhibit interesting characteristics, such as high configurational entropy, slow diffusion kinetics, severe lattice distortions, and mixing effects. Through combining the outstanding characteristics of each element, multicomponent RE₂Si₂O₇ materials allow for exploring and optimizing the properties of single-phase multi-RE principal component RE₂Si₂O₇ (namely (nRE_xi)₂Si₂O₇) beyond known single one^{16,17,18,19,20,21,22}. Currently, β-(nRE_xi)₂Si₂O₇ with good phase stability and co-doped solid solutions are commonly used for EBC applications²³. A key challenge in designing (nRE_xi)₂Si₂O₇ is to determine if a specific composition can eventually form a single-phase (nRE_xi)₂Si₂O₇.

Recently, Machine Learning (ML) methods have achieved great success in many fields, especially in predicting new materials. ML can predict high-performance materials directly from high-dimensional input data (descriptors) through learning, rather than extracting limited information from linear combinations of different descriptors^{24,25,26,27,28,29}. In the field of multicomponent materials, ML models, including Artificial Neural Networks (ANN)^30,31,32, Random Forest Classification (RFC)^32,33,34,35, and Support Vector Machines (SVM)^{31,32,33,34,35}, have been successfully applied for predicting their single-phase formation ability, with their outstanding predictive capabilities and flexibility in handling new material predictions. For example, Kaufmann et al. developed a regression ML model based on hundreds of properties and CALPHAD features³⁶, in which the results are consistent with density functional theory (DFT)^37,38,39 calculations and experiments. In contrast, the effective ML model for designing and identifying the β- and γ-(nRE_xi)₂Si₂O₇ is still rare. Recently, Luo et al. found that the configurational entropy of mixing serves as a dependable descriptor for β-(nRE_xi)₂Si₂O₇ formation, but extensive experiments and first-principles calculations are needed for the configurational entropy of mixing¹³. Therefore, it is essential to develop a visualized ML model that utilizes the characteristics of potential (nRE_xi)₂Si₂O₇ and RE₂Si₂O₇ features to predict the phase of (nRE_xi)₂Si₂O₇.

Moreover, the (nRE_xi)₂Si₂O₇ synergistically optimizes its structural stability, mechanical/thermal properties, and corrosion resistance by combining key RE components. The performance of rare earth elements in CMAS corrosion reactions can be categorized into three groups: inert elements (Yb and Lu), active elements (Gd, Tb, and Dy), and neutral elements (Ho, Er, and Tm)¹⁵. For instance, the Lu element shows the ability to diminish the activity of CMAS, thereby improving the resistance of environmental barrier coatings to CMAS^23,40,41. Through customization of these elements, (nRE_xi)₂Si₂O₇ may enhance the formation of corrosion-resistant products (apatite) and decrease the activity of CMAS, resulting in a slower corrosion rate¹⁵. Especially, Lu₂Si₂O₇ and Yb₂Si₂O₇ exhibit good phase stability and comprehensive performance in EBCs. Therefore, the SVC, ANN, and RFC models are firstly developed to predict the β- and γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ (RE= La, Ce, Eu, Gd, Tb, Dy, Ho, Er, Tm, and Y) in this work (Fig. 1), where some structural characteristics are extracted from the (nRE_xi)₂Si₂O₇ and RE₂Si₂O₇. Moreover, the well-trained SVC, ANN, and RFC models are used to predict the β- and γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ with a validating by DFT calculations, in which the correlation between the average RE³⁺ radius and deviation with different RE³⁺ combinations are analyzed by model visualization.

**Fig. 1: Machine Learning framework used in this work.**

Meanwhile, by combining the Bayesian theory and majority voting methods as decision fusion approaches, the decision fusion-based RFC model is optimized and extended for the single-phase (6RE_xi)₂Si₂O₇. Due to the atomic radius of Gd located in the middle of lanthanide RE elements, it plays a dual role in improving both the formation of apatite and facilitating the synthesis of single-phase (nRE_xi)₂Si₂O₇. (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ (RE = Y, La, Ce, Eu, Tb, Dy, Ho, Er, and Tm) are focused on this work to enhance their performance as EBCs, where the formation of regular patterns for single-phase (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ are deduced through model visualization and elemental analysis. Finally, the effectiveness of the decision fusion strategy and robustness of the optimized model are demonstrated by re-predicting the phase formation ability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ using them. It is important that the good consistency between the predictions and reported experiments demonstrates the accuracy of this ML model for the rapid evaluation and design of multicomponent RE disilicates, which will encourage and facilitate both the exploitation and design of innovative single-phase β-(nRE_xi)₂Si₂O₇ and γ-(nRE_xi)₂Si₂O₇.

Results

Feature selection

As shown in Table 1, a list of commonly used descriptors that may influence the phase stability of multicomponent RE disilicates, such as ion radius-related features, is considered for this ML models²². It should be noticed that the features which can be obtained directly from the database to replace the input descriptors calculated by DFT is selected. To mitigate the risk of overfitting caused by highly correlated features and enhance the fitting efficiency, the features are eliminated with Pearson coefficients exceeding 0.90^18,42. The Pearson correlation coefficient (r) is calculated using the following formula:

$$r=\frac{\sum \left({{\bf{x}}}_{i}-\bar{{\bf{x}}}\right)\left({{\bf{y}}}_{i}-\bar{{\bf{y}}}\right)}{\sqrt{\sum {\left({{\bf{x}}}_{i}-\bar{{\bf{x}}}\right)}^{2}{\left({{\bf{y}}}_{i}-\bar{{\bf{y}}}\right)}^{2}}}$$

(1)

where, ${{\bf{x}}}_{i}$ and ${{\bf{y}}}_{i}$ represent the ith values of two different input features, respectively, while $\bar{{\bf{x}}}$ and $\bar{{\bf{y}}}$ represent the expectations of these two input features, respectively. As depicted in Fig. 2, all Pearson correlation coefficients are below 0.8, indicating the rationality of the feature value selection in this work. To expedite the convergence of the ML models, the features are additionally normalized via the formula:

$${\hat{{\bf{x}}}}_{i}=\frac{{{\bf{x}}}_{i}-{{\bf{x}}}_{{\rm{mean}}}}{{{\bf{x}}}_{{\rm{std}}}}$$

(2)

where, ${{\bf{x}}}_{{\rm{std}}}$ and ${{\bf{x}}}_{{\rm{mean}}}$ are the variance and mean values, respectively. The processed array shows a mean of 0 and a variance of 1 for each column. ${{\bf{x}}}_{i}$ is ith value of the feature (${\bf{x}}$).

Table 1 Input features and their corresponding descriptions

Full size table

**Fig. 2: Pearson correlation coefficients for all features.**

Phase identification of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ via ML

During the training process, a random splitting technique is employed to divide the data into training and testing sets. Then, the performance of each model is assessed by measuring its validation accuracy at three proportions of training and testing data. Here, the proportions for the test set are 0.15, 0.2, and 0.25, while the corresponding proportions for the training set are 0.85, 0.8, and 0.75, respectively. For the ANN model, the hyper-parameters play a crucial role in the ability of the model to capture intricate patterns from input samples. Few hidden layers and nodes may lead to incomplete feature acquisition. Conversely, excessive hidden layers and nodes can introduce unnecessary noise and lead to overfitting. Therefore, a grid search was conducted on the number of hidden layers and the nodes in each layer. In the case of the SVC model, the kernel type, and the regularization parameter (denoted as C) are optimized. The C parameter determines the tolerance for misclassification errors. For the RFC model, the random state and the number of decision trees are refined to enhance its performance. The random state ensures reproducibility of results, while the number of decision trees affects the model’s complexity and predictive power. These three methods, including SVC with hyperplane, ANN with a brain-like structure, and RFC with a tree structure, provide different perspectives. By fine-tuning these parameters and methods, we aim to improve the model’s ability to classify data accurately.

As shown in Fig. 3, compared to the testing-to-training ratios of 0.15, 0.20, and 0.25, all three models achieved their respective optimal validation accuracies at a ratio of 0.15. Both the RFC and SVC models achieved a validation accuracy score of 1.000, while the ANN model shows a relatively lower validation accuracy score of 0.857, as depicted in Fig. 3. Meanwhile, the ANN under the PyTorch framework and under the scikit-learn Python package exhibit similar accuracies, as shown in Supplementary Figure 1. To determine the most suitable model for this study, the evaluation metrics of RFC and SVC are further compared, such as the confusion matrices, the receiver operator characteristic (ROC) curve, and AUC (area under the curve). The confusion matrices of RFC and SVC models display similar patterns (Fig. 4a and b), as evidenced by the summation of true positive rate (TPR) and true negative rate (TNR). Additionally, from the AUC-ROC curve (Fig. 4c and d), RFC exhibits higher classification quality with an AUC score of 1.00, in contrast to the AUC score of 0.93 for SVC. Therefore, RFC demonstrates better performance than SVC.

**Fig. 3: Grid search of Machine Learning models.**

**Fig. 4: The evaluation metrics of (a, c) SVC and (b, d) RFC models.**

Then, the trained SVC, RFC, and ANN models are used to identify the formation capability of the pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇, with their phase composition predicted by the RFC model in Table 2, as well as the ones for SVC and ANN in Supplementary Table 1. Among the 33 predicted samples in the RFC model, 17 samples are identified as pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇, in which three compounds are β-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ and 14 ones are γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O_7. The ML framework offers a high accuracy in prediction but is often perceived as a black-box model. To interpret the prediction process of the RFC model, researchers commonly employ feature importance, which visually reflects the contribution of each feature in enhancing the model’s predictive ability^43,44. Meanwhile, although the Pearson coefficient identifies influential features, it does not provide an explanation of how these features affect the predictions. To address this limitation, the SHAP (Shapley Additive exPlanetions) is introduced to explain the key mechanism of the formation of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O_7.

Table 2 Predicted phase composition for some selected (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ via RFC models

Full size table

The SHAP value and its importance for all features are visualized in Fig. 5. The importance of features is reflected by their average absolute SHAP values (|SHAP|), where the larger the |SHAP| value is, the greater the impact of the feature on the prediction of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is. Figure 5 (a) illustrates the impact of input features on the prediction by graphing the SHAP values for each feature-sample pair, with each row and dot representing a feature and sample, respectively. The position of the point on the x-axis is determined by its SHAP value, reflecting the impact of features on target attribute prediction. In addition, the point color corresponds to the feature values represented by the color code from low to high, and their specific analysis is described in Supplementary Note 1 and Supplementary Figure 2. The importance of input features that influence the formation capability of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is ranked in increasing order and displayed in Fig. 5 (b). It is identified that the value of $\bar{r}$ is the most critical factor in predicting the formation capability of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇. In addition, the value of ${\sigma }_{r}$ also holds significance in the prediction results. The specific values of ${\sigma }_{r}$ and $\bar{r}$ for each predicted (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ are summarized in Supplementary Table 2.

**Fig. 5: The feature visualization via the SHAP value.**

As shown in Fig. 6, β-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ corresponds to smaller $\bar{r}$ values, while γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ tends to form as the $\bar{r}$ increases. There is a potential boundary between the dominant regions of the β and γ phases at “$\bar{r}$ = 8.885 Å”. This suggests that to design β-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇, the $\bar{r}$ feature for multiple combinations of RE³⁺ should be kept below 0.885 Å, which is consistent with the experimental result inferred by Luo et al. ¹³. Furthermore, as indicated by Fig. 6, the formation of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ compound can only occur when the value of ${\sigma }_{r}$ is small enough, while large ${\sigma }_{r}$ values lead to element segregation and phase separation. In the ML prediction results, the ${\sigma }_{r}$ values for pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ are all smaller than 0.066, which is consistent with the reported results⁴⁵. In fact, when multiple RE elements reside within the target lattice, a phenomenon known as “imperfect isomorphism” occurs⁴⁶. For example, RE elements apart from Yb and Lu can be incorporated into the β-type lattice. Therefore, it is expected that the size mismatch and property differences among multiple RE³⁺ will introduce extensive disturbances in the lattice, resulting in competition among different phases during the formation of solid solutions. The lattice can tolerate small differences in size between various RE³⁺, thus forming a thermodynamically favorable pure multicomponent solid solution. However, excessive differences will lead to element segregation and phase separation.

It can be concluded that the insights obtained through SHAP analysis are of great significance for the design of synthetic pure materials. Specifically, the formation ability of the pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ depends on the average $\bar{r}$ and the ${\sigma }_{r}$, following the potential phase formation and possible phase transitions at sufficiently high temperatures in experimental conditions. Since there are no reports on these predicted (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ compounds before, the first-principles calculation is used to validate our prediction via the ML models.

Verification of pure (RE1_0.25 RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇

Based on the predicted results of the RFC model (as shown in Table 2), the structures of 17 pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ compounds are constructed, including three β phases and fourteen γ ones. As described in Fig. 7, the β-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ consists of corner-sharing Si-O tetrahedra that form [Si₂O₇] units, which are stacked along the y-axis. The rare earth atoms are coordinated with six oxygen atoms, with a Si-O_bridge-Si bond angle of 180°. The bridging oxygen atoms in the [Si₂O₇] unit do not bond with the rare earth atoms. The γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is similar to the β phase, while the [Si₂O₇] units are arranged in a staggered pattern, resembling a sinusoidal waveform. The lattice parameters of 17 materials and related RE₂Si₂O₇ are listed in Supplementary Table 3, in which the calculated lattice parameters are close to the reported values⁸.

**Fig. 7: The crystal structure of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇.**

The thermodynamic stability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is analyzed through a linear optimization program to verify the stable phases screened by the ML method. Supplementary Table 4 summarizes the calculated formation enthalpies $\Delta {H}_{{\rm{comp}}}$ of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ via a linear optimization procedure. As illustrated in Fig. 8, the $\Delta {H}_{{\rm{comp}}}$ values of predicted (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ are all negative, indicating their thermodynamic stability. Among the γ phases, the $\Delta {H}_{{\rm{comp}}}$ values follow the order of (Dy_0.25Y_0.25Yb_0.25Lu_0.25)₂Si₂O₇ < (Tb_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ < (Gd_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ < (Eu_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ with the increasing of $\bar{r}$ and ${\sigma }_{r}$ (Supplementary Table 2). It means that the thermodynamics stability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ synthesis raises with the increasing $\bar{r}$ and ${\sigma }_{r}$, while the multiple-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ will be formed above a certain threshold of $\bar{r}$ and ${\sigma }_{r}$. This is consistent with reported results¹³ and the present ML analysis.

**Fig. 8: Calculated formation enthalpies (eV per atom) of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇.**

Then, the mixing Gibbs free energy is also introduced to calculate the phase stability of materials against temperatures⁴⁷, which can be expressed as:

$${G}_{{\rm{mix}}}={H}_{{\rm{mix}}}-T\triangle S$$

(3)

where, ${H}_{{\rm{mix}}}$ and $\Delta S$ represent the mixing enthalpy and mixing entropy, respectively. T is the temperature. The mixing enthalpy of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is defined as the energy relative to four single-component RE disilicates with the same space group, according to the following equation:

$${H}_{{\rm{mix}}}=\frac{{E}_{{\rm{total}}}\left({({{\rm{RE}}1}_{0.25}{{\rm{RE}}2}_{0.25}{{\rm{Yb}}}_{0.25}{{\rm{Lu}}}_{0.25})}_{2}{{\rm{Si}}}_{2}{{\rm{O}}}_{7}\right)-{E}_{{\rm{total}}}\left({{\rm{RE}}}_{2}{{\rm{Si}}}_{2}{{\rm{O}}}_{7}\right)}{n}$$

(4)

where E_total((RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇) and E_total(RE₂Si₂O₇) are the total energy of multicomponent and single rare-earth disilicates, respectively. n represents the atom number of the calculated (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O_7. In addition, the mixing entropy of the supercells (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is calculated by Eq. (5). As depicted in Fig. 9, the mixing Gibbs free energies are negative, suggesting the stability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ above 0 K. Also, Chen et. al successfully synthesized the γ-(Gd_1/4Dy_/4Yb_1/4Lu_1/4)₂Si₂O₇, which verifies our ML prediction of this material⁴⁸.

**Fig. 9: The mixing Gibbs free energy of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ at different temperatures.**

In summary, all 17 predicted (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ selected by the RFC model exhibit stability through $\Delta {H}_{{\rm{comp}}}$ and mixing Gibbs free energy. This verifies the reliability of the present ML model.

Extended phase prediction of (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ based on decision fusion

As the component of RE elements increases, the difficulty of synthesizing multi-component rare earth silicates escalates. Therefore, based on the validated data of quaternary rare earth disilicates, we retrained the RFC model in ML to predict the single-phase formation capability of (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇. To replace computationally expensive DFT verification calculations, two decision fusion theories based on cross-validation, i.e., relative voting and Bayesian theory, are incorporated into the retrained model to enhance its reliability in this study. Compared with the proportional division approach used in (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇, the k-fold strategy is considered to take full advantage of decision fusion and improves the results by accentuating decision fusion based on the validated effective algorithms. After the parameter testing, the training model with high precision (0.875), namely k = 80 and n = 200 (Fig. 10) are selected. The retrained RFC model exhibits a favorable performance, which indicates its generalization ability for high-throughput predictions.

**Fig. 10: Grid search of optimized RFC Machine Learning models.**

To enhance the accuracy of phase stability prediction, decision fusion is applied to integrate and leverage multiple well-trained models. We mainly evaluate two distinct fusion strategies: a voting-based approach and a Bayesian fusion-based method. The former utilizes the common majority voting scheme, treating the classification result of each model as one vote and integrating the results. The latter effectively combines experimental results and prior information and provides specific prediction probabilities for each class^49,50,51. Table 3 presents the results obtained from the above two decision fusion methods, revealing that the outputs from both methods are consistent. Among them, P_Mul, P_β, and P_γ represent the probabilities of forming multiple, β, and γ classes after Bayesian fusion, respectively, while V_Mul, V_β, and V_γ denote the number of votes obtained by multiple, β, and γ classes after majority voting, respectively. Among the studied 84 (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ materials, 35 ones are predicted to be single phases, including 7 β and 28 γ phases. In experiments, Sun et al. have successfully synthesized γ-(Tb_1/6Dy_1/6Tm_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ that is predicted as a single γ phase in this work⁴⁰, validating our ML prediction results.

Table 3 Predicted phase composition for some selected (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ via RFC models based on decision fusion

Full size table

To further analyze the theoretical implications of the ML prediction results, we selected one of the cross-validation models with high validation accuracy to analyze its SHAP values. As shown in Fig. 11a, there is a negative correlation between ${\sigma }_{r}$ and the formation ability of pure (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇. Additionally, Fig. 11b shows that being similar to the SHAP model of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇, the features related to ionic radii still play a crucial role, with the greatly increased importance of ${\sigma }_{X}$.

**Fig. 11: The feature visualization via the SHAP value.**

The specific values of ${\sigma }_{r}$, $\bar{r}$, $\bar{m}$ and ${\sigma }_{X}$ for each predicted (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ are plotted in Supplementary Table 5. It can be found that ${\sigma }_{r}$ plays a decisive role in the formation of pure (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇, as the predicted ${\sigma }_{r}$ values for the 35 pure materials are all below 4.0% (as displayed in Fig. 12a), which is consistent with the reported literature⁴⁷. The reported key transition point for the β- and γ-(RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ is $\bar{r}$ = 0.885 Å¹³. According to the SHAP model, $\bar{m}$ and ${\sigma }_{X}$ also play a crucial role in (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ located at the boundary for $\bar{r}$ = 0.885 Å. As plotted in Fig. 12b, the $\bar{r}$ values of β-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ are all smaller than 0.900 Å (around 0.885 Å), and the ${\sigma }_{X}$ values of them are less than 0.046. Furthermore, the ${\sigma }_{X}$ values of all the materials predicted as β phase are less than 478.5 g·mol⁻¹. This also explains why the materials satisfied the ${\sigma }_{X}$, ${\sigma }_{r}$ and $\bar{r}$ conditions (as listed in Supplementary Table 5). For example, (Dy_1/6Er_1/6Tm_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ and (Dy_1/6Ho_1/6Tm_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O_7, are predicted as γ phases. Figure 13 presents the frequency of each RE element in predicted β-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇. The Y element shows the highest frequency, indicating its positive contribution to the formation of β-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇, while the frequency of Tb, La, and Ce elements is 0, suggesting that these elements hinder the formation of β phases.

**Fig. 12: The relevant features and phase formation of (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇.**

**Fig. 13: Frequencies of various RE elements appeared in the β-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇.**

In a word, the formation of pure (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ cannot be determined solely based on $\bar{r}$, with increase of the RE component. It requires considering various factors, such as $\bar{m}$ and ${\sigma }_{X}$ of the materials. Moreover, by examining the predicted probabilities of different phases via Bayesian theory in Table 3, there is small difference in the predicted probabilities of β- and γ-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ by the decision fusion-based RFC model for these β-(RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇. This indicates that the prediction results show a certain degree of tolerance for the prediction errors.

Discussion

To verify the accuracy and robustness of the RFC model based on decision fusion for the prediction of multicomponent rare earth disilicates, the results of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ phase stability are re-predicted by an optimized RFC model. After parameter testing, the results of the model with the highest accuracy are listed in Table 4. Specifically, the model achieved the optimal performance when the k = 20 is selected.

Table 4 Results of predicted (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ via RFC models based on decision fusion

Full size table

Combined with Table 2, the inclusion of decision fusion does not change the results of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ with true labels. This indicates that the RFC model based on decision fusion exhibits high accuracy and generalization in predicting multicomponent disilicates. Additionally, it has been reported that the results obtained through the model based on decision fusion are equal to and/or better than those of the individual models^49,50. Therefore, the RFC model based on decision fusion demonstrates theoretical and experimental feasibility in the prediction of multicomponent disilicates. In addition, although ML methods are applied to predict the mechanical and thermal properties of different material systems^19,20,21,24, research on mechanical and thermal properties of high-entropy rare earth disilicates via ML methods is still rare. The foundation for exploring the mechanical and thermal properties of high-entropy rare earth disilicates lies in the investigation of their compositional design and phase formation capability (i.e. phase stability). Thus, this work is expected to lay the foundation for further studying the mechanical and thermal properties of stable high-entropy rare earth silicates.

This work successfully establishes a series of models based on Artificial Neural Network, Random Forest Classification, and Support Vector Machine with a high accuracy for predicting the single-phase formation ability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ compounds. The results indicate that ${\sigma }_{r}$ and $\bar{r}$ are the most significant impact on the formation of pure (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇. Specifically, β phases can be formed and further stabilized at high temperatures when satisfying the criteria of (1) $\bar{r}$ < 0.885 Å and (2) sufficiently small ${\sigma }_{r}$. Then, the predictions validated via DFT calculations show excellent agreement with experimental results. Furthermore, by combining two decision fusion approaches (Bayesian theory and the majority voting method), we further optimize the Random Forest Classification model and successfully predict the single-phase formation ability of 84 un-synthesized (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ compounds. Among them, 7 β and 28 γ phases are predicted. The results reveal that, unlike (RE1_1/4RE2_1/4Yb_1/4Lu_1/4)₂Si₂O₇, (RE1_1/6RE2_1/6RE3_1/6Gd_1/6Yb_1/6Lu_1/6)₂Si₂O₇ also require additional attention to the factors of ${\sigma }_{X}$ and $\bar{m}$. The β phases can be formed when satisfying the criteria of (1) $\bar{r}$ < 0.900 Å, (2) sufficiently small ${\sigma }_{r}$, (3) ${\sigma }_{X}$ < 0.046, and (4) $\bar{m}$ < 478 g·mol⁻¹. Finally, by re-predicting the phase formation ability of (RE1_0.25RE2_0.25Yb_0.25Lu_0.25)₂Si₂O₇ via the Random Forest Classification model based on decision fusion, the effectiveness of the decision fusion strategy and the robustness of the optimized model is demonstrated. In summary, this paper explores a materials-oriented machine learning approach that incorporates the mechanisms behind the phase formation of multicomponent rare-earth disilicates into materials design. It also opens avenues for rational design of multicomponent rare-earth disilicates within the embedded phase space, thereby allowing for effective tuning of their properties.

Methods

Machine Learning

The training dataset was derived from various published sources^{7,8,13,23,40,41,52,53,54,55,56}. In the process of ML model training, samples representing the mix phase, β phase, and γ phase were marked as “0”, “1”, and “2” respectively. These samples were characterized by 7 input features (Table 1), which captured the properties of potential multicomponent RE₂Si₂O₇. Notably, these input features were selected based on their proven effectiveness in predicting other multicomponent materials^57,58. In the calculations, the RE₂Si₂O₇ was all composed of equiatomic ratios of the constituent elements, whose mixing entropy was estimated as:

$$\Delta S=-R\mathop{\sum }\limits_{i}\left({c}_{i}{\rm{In}}{c}_{i}\right)$$

(5)

where R represented the molar gas constant, and c_i was the molecular concentration of the i-th RE₂Si₂O₇. The mean value of a specific property ($\overline{{\rm{prop}}}$) was expressed by:

$$\overline{{\rm{prop}}}=\mathop{\sum }\limits_{1}^{n}{c}_{i}{{\rm{prop}}}_{i}$$

(6)

where ${{\rm{prop}}}_{i}$ was considered as the values of each constituent RE₂Si₂O₇. To account for the variation in the constituent RE₂Si₂O₇, the deviation (${\sigma }_{{\rm{prop}}}$) of the considered properties was calculated as following:

$${\sigma }_{{\rm{prop}}}=\sqrt{{\sum }_{1}^{n}{c}_{i}{\left(1-\frac{{{\rm{prop}}}_{i}}{\overline{{\rm{prop}}})}\right)}^{2}}$$

(7)

As shown in Fig. 1, the collected samples were used to extract features, which were then transformed into vectors. These input vectors were randomly shuffled and supplied as input to the ANN, SVC, and RFC models. These trained ML models were employed to predict the phase formation ability of multicomponent RE disilicates, with the subsequent validation using first-principles calculations for selected quaternary multicomponent RE disilicates. The validation results could be leveraged to improve the robustness of the ML models and extended the prediction capability to six-RE-principal-component disilicates by including the ML models based on decision fusion in the future training dataset.

The SVC and RFC models were employed utilizing the Scikit-learn (sklearn) Python package^59,60. The former used a linear kernel with a C parameter of 1.5, which enabled it to effectively find a hyperplane for separating the input samples by mapping them to a higher-dimensional space, while the latter consisted of 200 decision trees, with the predictions by combining the results from all the trees. The ANN models were implemented using two approaches: one utilizing the PyTorch framework and the other employing the sklearn Python package, both with three densely connected hidden layers. The calculation process for obtaining the output vectors in each layer was described as follows:

$${{\bf{X}}}_{l+1}=\sigma \left({\bf{W}}{{\bf{X}}}_{l}+{\bf{b}}\right)$$

(8)

where ${{\bf{X}}}_{l+1}$ and ${{\bf{X}}}_{l}$ corresponded to the feature representations in the (l + 1)-th and lth layers, respectively. When l was 0, ${{\bf{X}}}_{l}$ represented the input feature vector. The weight matrix and bias vector were denoted by W and b, respectively. The softmax activation function ($\sigma \left({\bf{z}}\right)$) was applied to map the perceptron outputs into a non-linear space. During the training process, the PyTorch-based ANN model employed the Adam optimizer to iteratively minimize the cross-entropy loss function. The loss ($H\left(p,q\right)$) between the labels and predictions was calculated as follows:

$$H\left(p,q\right)=-\mathop{\sum }\limits_{x}\left[p\left(x\right)\log q\left(x\right)\right]$$

(9)

where $p\left(x\right)$ represented the one-hot encoding of real label, while $q\left(x\right)$ was the predict distribution.

The paper employed two training strategies, namely k-fold cross-validation and random sampling⁶¹, to make optimal use of the limited available data. In the former, the number of folds was set to 20 for the selected quaternary multicomponent RE disilicates and 80 for six-RE-principal-component disilicates, based on training accuracy. Multiple models obtained from these strategies could be further combined for the subsequent decision fusion to achieve more accurate results. In the latter, a fixed seed was set to ensure that different models can be compared under the same data partitioning conditions.

The ROC curve, AUC, and confusion matrix were commonly utilized for classifier performance evaluation. The former plotted the true positive rate (TPR) against the false positive rate (FPR) at different discrimination thresholds, while a higher AUC value of the ROC curve, particularly in the top-left region, generally signified better statistical performance of the classifier⁶².

Decision fusion is a process of forming unified decisions by integrating the outputs of multiple sensors, systems, or algorithms. The decision fusion used in this work is implemented by synthesizing outcomes from diverse trained models through majority voting and Bayesian fusion. It can maximize the information contained in the current data, provide a comprehensive and accurate decision-making, and construct an appropriate material-oriented modeling approach. For the majority voting method, each classifier casted a vote for a class label, which with the highest number of votes was selected as the final output⁵¹. The ensemble’s output class label was represented as follows:

$$H\left(x\right)={c}_{{\rm{arg }}\mathop{\max }\limits_{j}\mathop{\sum }\limits_{i=1}^{T}{h}_{i}^{j}\left(x\right)}$$

(10)

where ${h}_{i}^{j}\left(x\right)$ represented predictive output for classifier ${h}_{i}$ on category marker ${c}_{j}$. T was the number of a classifier.

For the Bayesian fusion algorithm, the validation accuracy obtained under the k-fold cross-validation method was served as prior information in this paper, while the predicted probabilities was used as the experimental information⁵⁰. The probability fusion formula was as follows and the specific derivation process is shown in Supplementary Note 2:

$$P\left({B}_{1},{B}_{2},\ldots ,{B}_{n}{\rm{|}}{A}_{j}\right)=\frac{{\sum }_{i=1}^{n}P\left({A}_{j}{\rm{|}}{B}_{i}\right)P\left({B}_{i}\right)}{{\sum }_{j=1}^{c}{\sum }_{i=1}^{n}\left({A}_{j}{\rm{|}}{B}_{i}\right)P\left({B}_{i}\right)}$$

(11)

where ${A}_{j}$ and ${B}_{i}$ represented the jth category and the ith model, respectively. n was consistent with the number of cross-validation, which was the number of models. During the implementation of the Bayesian fusion algorithm, the validation accuracy of models was taken as the weights for each model, with the adjusted results to calculate the final predicted probabilities.

As a post-hoc model interpretation method, the SHAP model analysis is a component of interpretable machine learning^63,64. It establishes a connection between the model’s output and input, offering insights into “black box models” from both overall and local situations. Regarded as a paramount method for visual analysis and model interpretation, the SHAP was approached for interpreting predictions generated by the RFC and SVC models to shed light on complex models, with assigning an important value to each feature of a sample in the dataset. These SHAP values acted as sensitivity coefficients, providing insights into the most significant features of the ML model and illustrating how each feature influenced the predictions⁶⁴. Notably, a positive SHAP value indicated that a feature increased the predicted value and had a positive effect, and vice versa. In this way, these SHAP values allowed for a delicately understanding of feature contributions and their impact on the predictions.

DFT calculation

All calculations in this study were conducted utilizing the Vienna Ab Initio Simulation Package (VASP), which implemented the density functional theory (DFT). The projector augmented wave (PAW) potentials were employed for an accurate description of the electron-ion interactions, with an energy cutoff of 520 eV that converged the energy uncertainty below 1 meV per atom. Valence electrons were 5p⁶5d¹6s², 3s²3p², and 2s²2p⁴ for RE, Si, and O atoms, respectively. The Perdew-Burke-Ernzerhof (PBE) functional was employed to describe the exchange and correlation interactions within the generalized gradient approximation (GGA)^37,38,39. The Brillouin zone (BZ) was sampled using a Γ- centered 2×3×3 and 3×2×2 Monkhorst-Pack k- mesh for β and γ phase multicomponent rare earth disilicates and RE₂Si₂O₇. During the geometric optimization process, electronic self-consistency was obtained when the energy difference fell below 10^–6 eV. As for ionic optimization, it was terminated when the forces acting on atoms reached a value less than 0.01 eV ·Å⁻¹. During the geometrical optimization, the atomic positions, super-cell volume, and supercell shapes were fully relaxed.

To construct special quasirandom structures (SQS) of multicomponent RE disilicates, Monte-Carlo simulations were employed^65,66. The random swap process was confined to the cation sublattice and excluded swaps between cations and anions. To ensure the convergence of lattice constants and energies concerning the local environments, three independent SQS (Special Quasirandom Structure) structures were computed for each rare earth (RE) disilicate.

The phase stability was analyzed by examining their thermodynamics, in which the targeted phases should show a lower energy than all the other competing ones and their combinations. In the study, a linear optimization procedure introduced by Dahlqvist et al. ⁶⁷. was employed:

$$\triangle {H}_{{\rm{comp}}}=E-{E}_{{\rm{comp}}}\left({b}^{{\rm{RE}}1},{b}^{{\rm{RE}}2},{b}^{{\rm{Yb}}},{b}^{{\rm{Lu}}},{b}^{{\rm{Si}}},{b}^{{\rm{O}}}\right)=E-\min \left\{{\sum }_{i}^{n}{x}_{i}{E}_{i}\right.$$

(12)

where E and E_i represented the total energy of predicted and competing phases, respectively. ${b}^{{\rm{RE}}1}$, ${b}^{{\rm{RE}}2}$, ${b}^{{\rm{Yb}}}$, ${b}^{{\rm{Lu}}}$, ${b}^{{\rm{Si}}}$ and ${b}^{{\rm{O}}}$ correspond to the elemental fraction of the RE1, RE2, Yb, Lu, Si, and O elements, respectively. E_comp should be minimized subject to the constraints:

$${x}_{i}\ge 0,\mathop{\sum }\limits_{i}^{n}{x}_{i}=1$$

(13)

$$\left\{\begin{array}{c}{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{RE}}1}={b}_{{\rm{RE}}1},{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{RE}}2}={b}_{{\rm{RE}}2},{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{Yb}}}={b}_{{\rm{Yb}}}\\ \,{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{Lu}}}={b}_{{\rm{Lu}}},{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{Si}}}={b}_{{\rm{Si}}},{\sum }_{i}^{n}{x}_{i}{b}_{i}^{{\rm{O}}}={b}_{{\rm{O}}}\end{array}\right.$$

(14)

$${b}_{{\rm{RE}}1}+{b}_{{\rm{RE}}2}{+b}_{{\rm{Yb}}}+{b}_{{\rm{Lu}}}+{b}_{{\rm{Si}}}+{b}_{{\rm{O}}}=1$$

(15)

where, ${b}_{i}^{{\rm{RE}}1}$ was the proportion of RE1 atomic number in compound i, etc.

Data availability

All data used in this work are publicly available. Original datasets could be found in corresponding literature^{7,8,13,23,40,41,52,53,54,55,56}. Besides, the original and processed datasets used in this work are also available at https://github.com/Yun-Fann/ML-HEC.

Code availability

The codes developed for this work are available at https://github.com/Yun-Fann/ML-HEC.

References

Turcer, L. R. & Padture, N. P. Towards multifunctional thermal environmental barrier coatings (TEBCs) based on rare-earth pyrosilicate solid-solution ceramics. Scr. Mater. 154, 111–117 (2018).
Article CAS Google Scholar
Liu, B. et al. Advances on strategies for searching for next generation thermal barrier coating materials. J. Mater. Sci. Technol. 35, 833–851 (2019).
Article CAS Google Scholar
Liu, B. et al. Application of high-throughput first-principles calculations in ceramic innovation. J. Mater. Sci. Technol. 88, 143–157 (2021).
Article CAS Google Scholar
Dang, X. L. et al. Oxidation behaviors of carbon fiber reinforced multilayer SiC-Si₃N₄ matrix composites. J. Adv. Ceram. 11, 354–364 (2022).
Article CAS Google Scholar
Dong, L. et al. Pressure infiltration of molten aluminum for densification of environmental barrier coatings. J. Adv. Ceram. 11, 145–157 (2022).
Article CAS Google Scholar
Fernandez-Carrion, A. J., Allix, M. & Becerro, A. I. Thermal expansion of rare-earth pyrosilicates. J. Am. Ceram. Soc. 96, 2298–2305 (2013).
Article CAS Google Scholar
Xu, Y., Hu, X. X., Xu, F. F. & Li, K. W. Rare earth silicate environmental barrier coatings: Present status and prospective. Ceram. Int. 43, 5847–5855 (2017).
Article CAS Google Scholar
Luo, Y. X. et al. Material-genome perspective towards tunable thermal expansion of rare-earth di-silicates. J. Eur. Ceram. Soc. 38, 3547–3554 (2018).
Article CAS Google Scholar
Lv, X. R. et al. Rare earth monosilicates as oxidation resistant interphase for SiC_f/SiC CMC: Investigation of SiC_f/Yb₂SiO₅ model composites. J. Adv. Ceram. 11, 702–711 (2022).
Article CAS Google Scholar
Yang, L. W. et al. Dynamic oxidation mechanism of carbon fiber reinforced SiC matrix composite in high-enthalpy and high-speed plasmas. J. Adv. Ceram. 11, 365–377 (2022).
Article CAS Google Scholar
Poerschke, D. L., Van Sluytman, J. S., Wong, K. B. & Levi, C. G. Thermochemical compatibility of ytterbia-(hafnia/silica) multilayers for environmental barrier coatings. Acta Mater. 61, 6743–6755 (2013).
Article CAS Google Scholar
Richards, B. T. et al. Response of ytterbium disilicate-silicon environmental barrier coatings to thermal cycling in water vapor. Acta Mater. 106, 1–14 (2016).
Article CAS Google Scholar
Luo, Y. X. et al. Phase formation capability and compositional design of β-phase multiple rare-earth principal component disilicates. Nat. Commun. 14, 1275 (2023).
Article CAS PubMed PubMed Central Google Scholar
Soetebier, F. & Urland, W. Crystal structure of lutetium disilicate, Lu₂Si₂O₇. Z. Krist.-N. Cryst. St. 217, 22 (2002). 22.
CAS Google Scholar
Poerschke, D. L., Jackson, R. W. & Levi, C. G. Silicate deposit degradation of engineered coatings in gas turbines: progress toward models and materials solutions. Annu. Rev. Mater. Res. 47, 297–330 (2017).
Article CAS Google Scholar
Yeh, J. W. et al. Nanostructured high-entropy alloys with multiple principal elements: Novel alloy design concepts and outcomes. Adv. Eng. Mater. 6, 299–303 (2004).
Article CAS Google Scholar
Rost, C. M. et al. Entropy-stabilized oxides. Nat. Commun. 6, 8485 (2015).
Article CAS PubMed Google Scholar
Gild, J. et al. High-entropy metal diborides: a new class of high-entropy materials and a new type of ultrahigh temperature ceramics. Sci. Rep. 6, 37946 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sarker, P. et al. High-entropy high-hardness metal carbides discovered by entropy descriptors. Nat. Commun. 9, 4980 (2018).
Article PubMed PubMed Central Google Scholar
Castle, E., Csanadi, T., Grasso, S., Dusza, J. & Reece, M. Processing and properties of high-entropy ultra-high temperature carbides. Sci. Rep. 8, 8609 (2018).
Article PubMed PubMed Central Google Scholar
Harrington, T. J. et al. Phase stability and mechanical properties of novel high entropy transition metal carbides. Acta Mater. 166, 271–280 (2019).
Article CAS Google Scholar
Zhang, J. et al. Design high-entropy carbide ceramics from machine learning. Npj Comput. Mater. 8, 5 (2022).
Article Google Scholar
Sun, L. C. et al. High temperature corrosion of (Er_0.25Tm_0.25Yb_0.25Lu_0.25)₂Si₂O₇ environmental barrier coating material subjected to water vapor and molten calcium-magnesium-aluminosilicate (CMAS). Corros. Sci. 175, 108881 (2020).
Article CAS Google Scholar
Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. Npj Comput. Mater. 5, 83 (2019).
Article Google Scholar
Jablonka, K. M., Ongari, D., Moosavi, S. M. & Smit, B. Big-data science in porous materials: materials genomics and machine learning. Chem. Rev. 120, 8066–8129 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. et al. A critical review of machine learning of energy materials. Adv. Energy Mater. 10, 1903242 (2020).
Article CAS Google Scholar
Liu, H., Fu, Z. P., Yang, K., Xu, X. Y. & Bauchy, M. Machine learning for glass science and engineering: A review. J. Non-Cryst. Solids 557, 119419 (2021).
Article CAS Google Scholar
Hart, G. L. W., Mueller, T., Toher, C. & Curtarolo, S. Machine learning for alloys. Nat. Rev. Mater. 6, 730–755 (2021).
Article Google Scholar
Guo, Y. N. et al. Cracking behavior of newly-developed high strength eutectic high entropy alloy matrix composites manufactured by laser powder b e d fusion. J. Mater. Sci. Technol. 163, 81–91 (2023).
Article CAS Google Scholar
Huang, W. J., Martin, P. & Zhuang, H. L. L. Machine-learning phase prediction of high-entropy alloys. Acta Mater. 169, 225–236 (2019).
Article CAS Google Scholar
Islam, N., Huang, W. J. & Zhuang, H. L. L. Machine learning for phase selection in multi-principal element alloys. Comp. Mater. Sci. 150, 230–235 (2018).
Article CAS Google Scholar
Zhou, Z. Q. et al. Machine learning guided appraisal and exploration of phase design for high entropy alloys. Npj Comput. Mater. 5, 128 (2019).
Article CAS Google Scholar
Kaufmann, K. & Vecchio, K. S. Searching for high entropy alloys: A machine learning approach. Acta Mater. 198, 178–222 (2020).
Article CAS Google Scholar
Zhang, L. et al. Machine learning reveals the importance of the formation enthalpy and atom-size difference in forming phases of high entropy alloys. Mater. Des. 193, 108835 (2020).
Article CAS Google Scholar
Zhang, Y. et al. Phase prediction in high entropy alloys with a rational selection of materials descriptors and machine learning models. Acta Mater. 185, 528–539 (2020).
Article CAS Google Scholar
Kaufmann, K. et al. Discovery of high-entropy ceramics via machine learning. Npj Comput. Mater. 6, 42 (2020).
Article Google Scholar
Kresse, G. & Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B. 59, 1758–1775 (1999).
Article CAS Google Scholar
Perdew, J. P. et al. Restoring the density-gradient expansion for exchange in solids and surfaces. Phys. Rev. Lett. 100, 136406 (2008).
Article PubMed Google Scholar
Zhao, J. L. et al. Native point defects and oxygen migration of rare earth zirconate and stannate pyrochlores. J. Mater. Sci. Technol. 73, 23–30 (2021).
Article CAS Google Scholar
Sun, L. C. et al. A multicomponent γ-type (Gd_1/6Tb_1/6Dy_1/6Tm_1/6Yb_1/6Lu_1/6)₂Si₂O₇ disilicate with outstanding thermal stability. Mater. Res. Lett. 8, 424–430 (2020).
Article CAS Google Scholar
Sun, L. C. et al. High entropy engineering: new strategy for the critical property optimizations of rare earth silicates. J. Inorg. Mater. 36, 339–346 (2021).
Article Google Scholar
Jung, H. W., Sauerland, L., Stocker, S., Reuter, K. & Margraf, J. T. Machine-learning driven global optimization of surface adsorbate geometries. Npj Comput. Mater. 9, 114 (2023).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Zhu, X. Z. et al. Machine learning exploration of the critical factors for CO₂ adsorption capacity on porous carbon materials at different pressures. J. Clean. Prod. 273, 122915 (2020).
Article CAS Google Scholar
Yang, X. & Zhang, Y. Prediction of high-entropy stabilized solid-solution in multi-component alloys. Mater. Chem. Phys. 132, 233–238 (2012).
Article CAS Google Scholar
Bondar, I. A. Rare-earth silicates. Ceram. Int. 8, 83–89 (1982).
Article CAS Google Scholar
Wang, J. et al. High-entropy ferroelastic rare-earth tantalite ceramic: (Y_0.2Ce_0.2Sm_0.2Gd_0.2Dy_0.2)TaO₄. J. Am. Ceram. Soc. 104, 5873–5882 (2021).
Article CAS Google Scholar
Chen, Z. Y. et al. Mechanism of enhanced corrosion resistance against molten CMAS for pyrosilicates by high-entropy design. J. Am. Ceram. Soc. 106, 6000–6013 (2023).
Article CAS Google Scholar
Chen, B. & Varshney, P. K. A Bayesian sampling approach to decision fusion using hierarchical models. IEEE T. Signal Proces. 50, 1809–1818 (2002).
Article Google Scholar
He, J. P., Tu, Y. Y. & Shi, Y. Q. Fusion model of multi monitoring points on dam based on Bayes theory. Procedia Eng. 15, 2133–2138 (2011).
Article Google Scholar
Kittler, J. & Alkoot, F. M. Sum versus vote fusion in multiple classifier systems. IEEE T. Pattern Anal. 25, 110–115 (2003).
Article Google Scholar
Wang, X. et al. Preparation and corrosion resistance of high-entropy disilicate (Y_0.25Yb_0.25Er_0.25Sc_0.25)Si₂O₇ ceramics. Corros. Sci. 192, 109786 (2021).
Article CAS Google Scholar
Fujii, S., Ioki, A., Yokoi, T. & Yoshiya, M. Role of phonons on phase stabilization of RE₂Si₂O₇ over wide temperature range (RE = Yb, Gd). J. Eur. Ceram. Soc. 40, 780–788 (2020).
Article CAS Google Scholar
Guo, X. T. et al. High-entropy rare-earth disilicate (Lu_0.2Yb_0.2Er_0.2Tm_0.2Sc_0.2)Si₂O₇: A potential environmental barrier coating material. J. Eur. Ceram. Soc. 42, 3570–3578 (2022).
Article CAS Google Scholar
Stokes, J. L., Harder, B. J., Wiesner, V. L. & Wolfe, D. E. Effects of crystal structure and cation size on molten silicate reactivity with environmental barrier coating materials. J. Am. Ceram. Soc. 103, 622–634 (2020).
Article CAS Google Scholar
Salanova, A., Brummel, I. A., Yakovenko, A. A., Opila, E. J. & Ihlefeld, J. F. Phase stability and tensorial thermal expansion properties of single to high-entropy rare-earth disilicates. J. Am. Ceram. Soc. 106, 3228–3238 (2023).
Article CAS Google Scholar
Troparevsky, M. C., Morris, J. R., Kent, P. R. C., Lupini, A. R. & Stocks, G. M. Criteria for predicting the formation of single-phase high-entropy alloys. Phys. Rev. X. 5, 011041 (2015).
Google Scholar
Li, Y. & Guo, W. L. Machine-learning model for predicting phase formations of high-entropy alloys. Phys. Rev. Mater. 3, 095005 (2019).
Article CAS Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Google Scholar
Hao, J. G. & Ho, T. K. Machine learning made easy: a review of Scikit-learn Package in Python programming language. J. Educ. Behav. Stat. 44, 348–361 (2019).
Article Google Scholar
Oyedele, O. Determining the optimal number of folds to use in a K-fold cross-validation: A neural network classification experiment. Res. Math. 10, 2201015 (2023).
Article Google Scholar
Marzban, C. The ROC curve and the area under it as performance measures. Weather Forecast 19, 1106–1114 (2004).
Article Google Scholar
Mangalathu, S., Hwang, S. H. & Jeon, J. S. Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach. Eng. Struct. 219, 110927 (2020).
Article Google Scholar
Rodriguez-Perez, R. & Bajorath, J. Interpretation of compound activity predictions from complex machine learning models using local approximations and shapley values. J. Med. Chem. 63, 8761–8777 (2020).
Article CAS PubMed Google Scholar
Zhang, J., Ma, S. H., Xiong, Y. X., Xu, B. A. & Zhao, S. J. Elemental partitions and deformation mechanisms of L1₂-type multicomponent intermetallics. Acta Mater. 219, 117238 (2021).
Article CAS Google Scholar
Zhao, S. J., Stocks, G. M. & Zhang, Y. W. Stacking fault energies of face-centered cubic concentrated solid solution alloys. Acta Mater. 134, 334–345 (2017).
Article CAS Google Scholar
Dahlqvist, M., Alling, B. & Rosen, J. Stability trends of MAX phases from first principles. Phys. Rev. B. 81, 220102 (2010).
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. U21A2063, 52172071 and 51972080). Bin Liu acknowledges research support from Shanghai Technical Service Center for Advanced Ceramics Structure Design and Precision Manufacturing (No. 20DZ2294000).

Author information

Authors and Affiliations

National Key Laboratory of Science and Technology on Advanced Composites in Special Environments and Center for Composite Materials and Structure, Harbin Institute of Technology, Harbin, 150080, China
Yun Fan, Yuelei Bai & Zhiyao Lu
School of Materials Science and Engineering, Shanghai University, Shanghai, 200444, China
Yun Fan, Yuchen Liu & Bin Liu
National Engineering Research Center for Magnesium Alloy, Chongqing University, Chongqing, 400044, China
Qian Li
School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150080, China
Dong Chen
UNSW Materials & Manufacturing Futures Institute, School of Materials Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia
Wenxian Li
Institute of Coating Technology for Hydrogen Gas Turbines, Liaoning Academy of Materials, Shenyang, 110004, China
Bin Liu

Authors

Yun Fan
View author publications
You can also search for this author in PubMed Google Scholar
Yuelei Bai
View author publications
You can also search for this author in PubMed Google Scholar
Qian Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyao Lu
View author publications
You can also search for this author in PubMed Google Scholar
Dong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yuchen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wenxian Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Yun Fan: Conceptualization, Investigation, Formal analysis, Data Curation, Mading and training the ML model, Writing-Original draft preparation. Yuelei Bai: Writing-Reviewing and Editing, Supervision, Funding acquisition. Qian Li: Writing-Reviewing and Editing. Zhiyao Lu: Visualization, Data Curation. Dong Chen: Mading and training the ML model. Yuchen Liu: Investigation, Formal analysis. Wenxian Li: Writing-Reviewing and Editing, Visualization. Bin Liu: Investigation, Resources, Supervision, Funding acquisition, Writing-Reviewing and Editing.

Corresponding authors

Correspondence to Yuelei Bai or Bin Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fan, Y., Bai, Y., Li, Q. et al. Compositional design and phase formation capability of high-entropy rare-earth disilicates from machine learning and decision fusion. npj Comput Mater 10, 95 (2024). https://doi.org/10.1038/s41524-024-01282-x

Download citation

Received: 04 January 2024
Accepted: 19 April 2024
Published: 07 May 2024
DOI: https://doi.org/10.1038/s41524-024-01282-x