Publications
Full list in `google scholar`
2025
- Using data-driven methods to analyze the roles of different elements in liquid metal batteriesZiming Wang, Yipeng Liu, Yu Hong, and 4 more authorsJournal of Energy Storage, 2025
As an excellent energy storage technology, liquid metal batteries (LMBs) have gained significant attention recently. For these batteries, because lithium is commonly used as the anode, the specific energy capacity and charge–discharge performance mainly depend on the cathode. Therefore, the selection of the cathode material is extremely important. Designing this material involves finding an optimal combination of elements. Despite limited data, machine learning techniques can still be employed to conduct meaningful research. In this work, we developed a new evaluation method that not only focuses on energy density, which has been the primary focus in previous studies, but also considers rate performance of the battery system. The entire workflow of the method includes regression with different models, numerical differentiation, correlation coefficient calculation, and variance inflation factor evaluation. Based on the elements in our current dataset (including Te, Se, Sb, Cu, Zn, Bi, Sn and Pb), the importance degree sequences of these elements in terms of energy density and rate capability are delivered. Considering the overall performance, we suggest that Te and Se are particularly promising candidates for cathode materials. Finally, we discuss the quantitative results in the context of conventional qualitative understanding in electrochemistry, and we find that they are consistent and without contradiction. We have open-sourced the curated dataset and all relevant codes and we believe that as more research is conducted and data are accumulated, expanding the dataset to include more elements will yield even more promising results.
2024
- Size-Dependent Thermodynamic Stability of Copper Sulfide NanoparticlesMeimin Hu, Jinjia Liu, Wenping Guo, and 3 more authorsChemistry of Materials, 2024
Copper sulfide nanoparticles are extensively employed in the field of functional materials. However, synthesizing the desired nanoparticles in a controlled manner is challenging due to the variety of copper sulfide phases and their potential transformations. Here, we utilize a unified theoretical approach combining a high-throughput computational workflow, ab initio atomistic thermodynamics, and the Wulff theorem to study the thermodynamic stability of copper sulfide nanoparticles. Theoretical size-dependent phase diagrams are constructed for the first time, considering various sulfur chemical potentials. This study unveils the evolution of crystal morphology under varying external conditions and underlines the crucial role of surface energy in maintaining the stability of copper sulfide nanoparticles. Our findings offer a theoretical guide for experimental endeavors aimed at synthesizing the desired surface morphology and phases of copper sulfide nanomaterials.
- AlchemBERT: Exploring Lightweight Language Models for Materials InformaticsXiaotong Liu, Xingchen Liu, and Xiaodong WenChemRxiv, 2024This content is a preprint and has not been peer-reviewed
The advent of large language models (LLMs) has spurred numerous applications across various domains, including material design. However, we assert that without accurate prediction capabilities, effective material design is unattainable, as generating necessary structures becomes futile if their quality cannot be reliably evaluated. Matbench provides an excellent foundation for predictive tasks, yet prior efforts with LLMs have primarily focused on composition-related tasks using models such as GPT or LLaMA. In this study, we revisit BERT, a relatively small LLM with 110 million parameters, which is significantly smaller than GPT or LLaMA models containing billions of parameters. Remarkably, we demonstrate that BERT-base achieves comparable performance to these larger models in material property prediction tasks. Extending beyond composition tasks, we introduce BERT’s application in structure prediction using CIF (Crystallographic Information File) data and natural language descriptions of structures. Our results rival state-of-the-art composition models such as CrabNet and, in several tasks, even surpass traditional structure-based models like CGCNN, DeeperGATGNN, MegNet, DimeNet++, and knowledge-driven models such as MODNet. Notably, despite its parameter count, our approach excels on small datasets with minimal overfitting, indicating that fine-tuned LLMs can genuinely capture meaningful material insights. Our findings provide a new reference point for future LLM applications in material design, offering valuable insights for leveraging LLMs in this domain and introducing a paradigm shift for physicists and material scientists, emphasizing natural language descriptions over conventional model-centric design. We term this application of BERT in material design AlchemBERT, signifying its novel role in bridging natural language and structural representations.
2023
- Exploring Multi-Fidelity Data in Materials Science: Challenges, Applications, and Optimized Learning StrategiesZiming Wang, Xiaotong Liu, Haotian Chen, and 2 more authorsApplied Sciences, 2023
Machine learning techniques offer tremendous potential for optimizing resource allocation in solving real-world problems. However, the emergence of multi-fidelity data introduces new challenges. This paper offers an overview of the definition, applications, data preprocessing methodologies, and learning approaches associated with multi-fidelity data. To validate the algorithms, we examine three widely-used learning methods relevant to multi-fidelity data through the design of multi-fidelity datasets that encompass various types of noise. As we expected, employing multi-fidelity data learning methods yields better results compared to solely using high-fidelity data learning methods. Additionally, considering the inherent various types of noise within datasets, the comprehensive correction strategy proves to be the most effective. Moreover, multi-fidelity learning methods facilitate effective decision-making processes by enabling the combination of datasets from various sources. They extract knowledge from lower fidelity data, improving model accuracy compared to models solely relying on high-fidelity data.
- Speeding up the prediction of C–O cleavage through bond valence and charge on iron carbidesYurong He, Kuan Lu, Jinjia Liu, and 7 more authorsInternational Journal of Minerals, Metallurgy and Materials, 2023
The activation of CO on iron-based materials is a key elementary reaction for many chemical processes. We investigate CO adsorption and dissociation on a series of Fe, Fe3C, Fe5C2, and Fe2C catalysts through density functional theory calculations. We detect dramatically different performances for CO adsorption and activation on diverse surfaces and sites. The activation of CO is dependent on the local coordination of the molecule to the surface and on the bulk phase of the underlying catalyst. The bulk properties and the different local bonding environments lead to varying interactions between the adsorbed CO and the surface and thus yielding different activation levels of the C–O bond. We also examine the prediction of CO adsorption on different types of Fe-based catalysts by machine learning through linear regression models. We combine the features originating from surfaces and bulk phases to enhance the prediction of the activation energies and perform eight different linear regressions utilizing the feature engineering of polynomial representations. Among them, a ridge linear regression model with 2nd-degree polynomial feature generation predicted the best CO activation energy with a mean absolute error of 0.269 eV.
- 多保真度数据学习算法的定量噪声评价刘晓彤, 王滋明, 欧阳嘉华, and 1 more author硅酸盐学报, 2023
多保真度数据是当前材料领域数据的主要存在形式。在数据生产端,不同量化方法在材料同种属性的计算上存在较大差距。对于数据消费端的机器学习算法,研究人员为最大化提取数据中知识设计了各种方法。采用定量噪声添加的方法,评价不同噪声强度、类型对不同多保真度数据学习方法的影响,通过迭代降噪验证数据修正方法的适用场景。结果表明:多保真度数据的利用方式至关重要,需对各子数据集中数据量及含噪情况进行综合考量。在使用不同噪声类型与强度构造出的多种数据集上,得益于数据间的协同效应,逐步删除低保真度数据的―Onion”训练方式明显优于按数据集所含噪声减小方向逐个进行的训练方式。在多保真度数据训练中,无论何种噪声强度及训练方式,线性噪声对模型的影响更小。对于采样噪声来说,在各环节更好地模拟了真实多保真度数据,建议被后续研究采用。此外,复杂噪声难以让少量真值数据发挥―纠偏”作用,更适合进行迭代降噪处理。
- 基于零知识证明的匿名投票方案于筌, 刘晓彤, 刁恩虎, and 1 more authorScience Technology & Engineering, 2023
针对现有电子投票与问卷调查系统中公正性与匿名性这两项最根本需求,提出了运行于以太坊上的智能合约投票方案。方案满足可信、透明的要求,剥离了对可信第三方的依赖。随后进一步结合零知识证明与数字签名技术,实现了方案的匿名性。通过合理设计算术电路及智能合约,该方案可满足一人一票或一人多票的应用场景。通过安全性分析,本方案完全满足业界对电子投票方案公认的各种要求,加之相关代码已完整开源,为后续移植及具体应用提供了借鉴与参考。
2022
- A simple denoising approach to exploit multi-fidelity data for machine learning materials propertiesXiaotong Liu, Pierre-Paul De Breuck, Linghui Wang, and 1 more authornpj Computational Materials, 2022
Machine-learning models have recently encountered enormous success for predicting the properties of materials. These are often trained based on data that present various levels of accuracy, with typically much less high- than low-fidelity data. In order to extract as much information as possible from all available data, we here introduce an approach which aims to improve the quality of the data through denoising. We investigate the possibilities that it offers in the case of the prediction of the band gap using both limited experimental data and density-functional theory relying on different exchange-correlation functionals. After analyzing the raw data thoroughly, we explore different ways to combine the data into training sequences and analyze the effect of the chosen denoiser. We also study the effect of applying the denoising procedure several times until convergence. Finally, we compare our approach with various existing methods to exploit multi-fidelity data and show that it provides an interesting improvement.
2021
- 无模型强化学习研究综述秦智慧, 李宁, 刘晓彤, and 3 more authors计算机科学, 2021
强化学习(Reinforcement Learning,RL)作为机器学习领域中与监督学习、无监督学习并列的第三种学习范式,通过与环境进行交互来学习,最终将累积收益最大化。常用的强化学习算法分为模型化强化学习(Model-based Reinforcement Lear-ning)和无模型强化学习(Model-free Reinforcement Learning)。模型化强化学习需要根据真实环境的状态转移数据来预定义环境动态模型,随后在通过环境动态模型进行策略学习的过程中无须再与环境进行交互。在无模型强化学习中,智能体通过与环境进行实时交互来学习最优策略,该方法在实际任务中具有更好的通用性,因此应用范围更广。文中对无模型强化学习的最新研究进展与发展动态进行了综述。首先介绍了强化学习、模型化强化学习和无模型强化学习的基础理论;然后基于价值函数和策略函数归纳总结了无模型强化学习的经典算法及各自的优缺点;最后概述了无模型强化学习在游戏AI、化学材料设计、自然语言处理和机器人控制领域的最新研究现状,并对无模型强化学习的未来发展趋势进行了展望。
2020
- Solving chemistry problems via an end-to-end approach: A proof of conceptXiaotong Liu, Tianfu Zhang, Tao Yang, and 7 more authorsThe Journal of Physical Chemistry A, 2020
Traditionally, chemistry problems are solved by means of a deductive approach. The question to be addressed is typically related to the value of a property that is either measured experimentally, computed using quantum-chemistry software, or (more recently) predicted using a machine-learned model. In this paper, we demonstrate that an inductive approach can be adopted using End-to-End (E2E) machine learning. This approach is illustrated for tackling the following chemistry problems: (i) determine the fully coordinated (FC) and undercoordinated (UC) atoms in a molecule with one missing atom, (ii) identify the type of atom that is missing in such an incomplete molecule, and (iii) predict the direction of a reaction between two molecules according to an existing dataset. The E2E approach leads to accuracies higher than 99%, 98%, and 93% for these three problems, respectively. Finally, in order to achieve such accuracies, a descriptor for the molecules, called bag of clusters, is introduced and compared with a series previously proposed descriptors, highlighting a series of advantages.
2011
- Lattice characteristics, structure stability and oxygen permeability of \(BaFe_{1-x}Y_xO_{3−δ}\) ceramic membranesXiaotong Liu, Hailei Zhao, Jianying Yang, and 5 more authorsJournal of membrane science, 2011
\(BaFe_{1−x}Y_xO_{3−δ}\) (x=0–0.2) materials were synthesized by conventional solid-state reaction process for oxygen separation application. The effects of Y-doping on the crystal structure development, electrical conductivity and oxygen permeability were evaluated. Yttrium introduction effectively stabilize the cubic structure of BaFe1−xYxO3−δ. With Y-doping, the oxidation state of Fe ions reduces, resulting in the increase in oxygen vacancy concentration as charge compensation and the decreases in electrical conductivity. Y-doping enhances the structural stability of BaFe1−xYxO3−δ in reducing atmosphere but decreases the oxygen permeability. Both of them are attributed to the strong binding energy of Y–O bond. The cobalt free membrane BaFe0.95Y0.05O3−δ shows good structural stability under reducing atmosphere and acceptable oxygen permeation flux of 0.798 ml (STP) min−1 cm−2 at 900 °C for 1.1 mm thick membrane, making it a promising candidate for future practical applications.