PDF
Abstract
Perovskite materials are central to the fields of energy conversion and storage, especially for fuel cells. However, they are challenged by overcomplexity, coupled with a strong desire for new materials discovery at high speed and high precision. Herein, we propose a new approach involving a combination of extreme feature engineering and automated machine learning to adaptively learn the structure-composition-property relationships of perovskite oxide materials for energy conversion and storage. Structure-composition-property relationships between stability and other features of perovskites are investigated. Extreme feature engineering is used to construct a great quantity of fresh descriptors, and a crucial subset of 23 descriptors is acquired by sequential forward selection algorithm. The best descriptor
for stability of perovskites is determined with linear regression. The results demonstrate a high-efficient and non-priori-knowledge investigation of structure-composition-property relationships for perovskite materials, providing a new road to discover advanced energy materials.
Keywords
Perovskites
/
structure-composition-property relationships
/
stability
/
descriptors
/
automated machine learning
Cite this article
Download citation ▾
Qin Deng, Bin Lin.
Automated machine learning structure-composition-property relationships of perovskite materials for energy conversion and storage.
Energy Materials, 2021, 1(1): 100006 DOI:10.20517/energymater.2021.10
| [1] |
Hong Y,Jiang H.Machine learning and artificial neural network accelerated computational discoveries in materials science..WIREs Comput Mol Sci2020;10:e1450
|
| [2] |
Sparks TD,Parry ME,Brgoch J.Machine learning for structural materials..Annu Rev Mater Res2020;50:27-48
|
| [3] |
Hwang J,Giordano L,Yu Y.Perovskites in catalysis and electrocatalysis..Science2017;358:751-6
|
| [4] |
Butler K T,Cartwright H.Machine learning for molecular and materials science..Nature2018;559:547-55
|
| [5] |
Juan Y,Yang Y.Accelerating materials discovery using machine learning..J Mater Sci Mater Med2021;79:178-90
|
| [6] |
Fischer CC,Morgan D.Predicting crystal structure by merging data mining with quantum mechanics..Nat Mater2006;5:641-6
|
| [7] |
Raccuglia P,Adler PD.Machine-learning-assisted materials discovery using failed experiments..Nature2016;533:73-6
|
| [8] |
Sanchez-Lengeling B.Inverse molecular design using machine learning: generative models for matter engineering..Science2018;361:360-5
|
| [9] |
Ong SP.Accelerating materials science with high-throughput computations and machine learning..Computational Materials Science2019;161:143-50
|
| [10] |
Balachandran PV.Machine learning guided design of functional materials with targeted properties..Computational Materials Science2019;164:82-90
|
| [11] |
Peña MA.Chemical structures and performance of perovskite oxides..Chem Rev2001;101:1981-2017
|
| [12] |
Lino A,Sizo A.Virtual teaching and learning environments: automatic evaluation with symbolic regression..IFS2016;31:2061-72
|
| [13] |
Yuan S,Quddus N,Mashuga CV.Developing quantitative structure-property relationship models to predict the upper flammability limit using machine learning..Ind Eng Chem Res2019;58:3531-7
|
| [14] |
Ouyang R,Ahmetcik E,Ghiringhelli LM.SISSO: a compressed-sensing method for identifying the best low-dimensional descriptor in an immensity of offered candidates..Phys Rev Materials2018;2:083802
|
| [15] |
Zhang Y.Machine learning lattice constants for cubic perovskite compounds..Chemistry Select2020;5:9999-10009
|
| [16] |
de Franca FO.Interaction-transformation symbolic regression with extreme learning machine..Neurocomputing2021;423:609-19
|
| [17] |
Li Z,Sun Q.Stability engineering of halide perovskite via machine learning..arXiv preprint arXiv2018;1803.06042:
|
| [18] |
Li W,Morgan D.Predicting the thermodynamic stability of perovskite oxides using machine learning models..Computational Materials Science2018;150:454-63
|
| [19] |
Deng Q.Exploring structure-composition relationships of cubic perovskite oxides via extreme feature engineering and automated machine learning..Materials Today Communications2021;28:102590
|
| [20] |
Gardner S,Griffin J.Constrained multi-objective optimization for automated machine learning. 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)2019;2019 Oct 5-8Washington, DC, USA. IEEE364-73
|
| [21] |
Masrom S,Jamil N S.Automated machine learning based on genetic programming: a case study on a real house pricing dataset. 2019 1st International Conference on Artificial Intelligence and Data Sciences (AiDAS)2019;2019 Sep 19Ipoh, Malaysia. IEEE48-52
|
| [22] |
Chauhan K,Thakkar D.Automated machine learning: the new wave of machine learning. 2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA)2020;2020 Mar 5-7Bangalore, India. IEEE205-12
|
| [23] |
Ge P.Analysis on approaches and structures of automated machine learning frameworks. 2020 International Conference on Communications, Information System and Computer Engineering (CISCE)2020;2020 Jul 3-5Kuala Lumpur, Malaysia. IEEE474-7
|
| [24] |
Han J,Lee KM.An automated machine learning platform for non-experts. Proceedings of the International Conference on Research in Adaptive and Convergent Systems.2020;Association for Computing Machinery, New York, NY, USA84-6
|
| [25] |
Umamahesan A.2020;Association for Computing Machinery, New York, NY, USA3495
|
| [26] |
Waring J,Umeton R.Automated machine learning: review of the state-of-the-art and opportunities for healthcare..Artif Intell Med2020;104:101822
|
| [27] |
Zeineddine H,Farah A.Enhancing prediction of student success: automated machine learning approach..2021;89:106903
|
| [28] |
Sun Y.Feature engineering for search advertising recognition. 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)2019;2019 Mar 15-17Chengdu, China. IEEE1859-64
|
| [29] |
Li Z,Xin H.Feature engineering of machine-learning chemisorption models for catalyst design..Catalysis Today2017;280:232-8
|
| [30] |
Emery AA.High-throughput DFT calculations of formation energy, stability and oxygen vacancy formation energy of ABO3 perovskites..Sci Data2017;4:170153
|
| [31] |
Kotsiantis SB,Pintelas PE.Data preprocessing for supervised leaning..International journal of computer science2006;1:111-7
|
| [32] |
Liu N,Liu G.Data Preprocessing based on partially supervised learning. Proceedings of the 6th International Conference on Information Engineering for Mechanics and Materials2016;2016 Nov.Atlantis Press678-83
|
| [33] |
Zainuddin Z.A comparative study of missing value estimation methods: which method performs better? 2008 International Conference on Electronic Design2008;2008 Dec 1-3Penang, Malaysia. IEEE1-5
|
| [34] |
Kang H.The prevention and handling of the missing data..Korean J Anesthesiol2013;64:402-6 PMCID:PMC3668100
|
| [35] |
Li J,Wang S.Feature Selection: A Data Perspective..ACM Comput Surv2018;50:1-45
|
| [36] |
Mangal A.A comparative study of feature selection methods for stress hotspot classification in materials..Integrating Materials and Manufacturing Innovation2018;7:87-95
|
| [37] |
Zhou H,Xia Y.A new sampling method in particle filter based on Pearson correlation coefficient..Neurocomputing2016;216:208-15
|
| [38] |
Hauke J.Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data..Quaestiones Geographicae2011;30:87-93
|
| [39] |
Khurana U,Turaga D.Feature engineering for predictive modeling using reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence..Thirty-Second AAAI Conference on Artificial Intelligence2018;32:3407-14
|
| [40] |
Heaton J.An empirical analysis of feature engineering for predictive modeling. SoutheastCon. 20162016;2016 Mar 30-Apr 3Norfolk, VA, USA. IEEE1-6
|
| [41] |
Zheng A.Feature engineering for machine learning: principles and techniques for data scientists.2018;‘O’Reilly Media, Inc.
|
| [42] |
Dai D,Wei X.Using machine learning and feature engineering to characterize limited material datasets of high-entropy alloys..Computational Materials Science2020;175:109618
|
| [43] |
Nargesian F,Khurana U.Learning feature engineering for classification. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence Main track..IJCAI2017;2529-35
|
| [44] |
Hou J.A simple feature combination method based on dominant sets..Pattern Recognition2013;46:3129-39
|
| [45] |
Ghiringhelli LM,Levchenko SV,Scheffler M.Big data of materials science: critical role of the descriptor..Phys Rev Lett2015;114:105503
|
| [46] |
Fox J.Regression diagnostics: an introduction.2019;Sage publications
|
| [47] |
Montgomery DC,Vining GG.Introduction to linear regression analysis.2021;John Wiley & Sons
|
| [48] |
Weisberg S.Applied linear regression.2005;John Wiley & Sons
|
| [49] |
Dai D,Hu R.Method construction of structure-property relationships from data by machine learning assisted mining for materials design applications..2020;196:109194
|
| [50] |
Pedregosa F,Gramfort A.Scikit-learn: machine learning in Python..The Journal of machine Learning research2011;12:2825-30
|
| [51] |
Abraham A,Eickenberg M.Machine learning for neuroimaging with scikit-learn..Front Neuroinform2014;8:14 PMCID:PMC3930868
|
| [52] |
Jović A,Bogunović N.A review of feature selection methods with applications. 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO)2015;2015 May 25-29Opatija, Croatia. IEEE1200-5
|
| [53] |
Remeseiro B.A review of feature selection methods in medical applications..Comput Biol Med2019;112:103375
|
| [54] |
Dash M.Feature selection for classification..Intelligent Data Analysis1997;1:131-56
|
| [55] |
Chandrashekar G.A survey on feature selection methods..2014;40:16-28
|
| [56] |
Rückstieß T,van der Smagt P.Wang D.Sequential feature selection for classification..AI 2011: advances in artificial intelligence.2011;BerlinSpringer Berlin Heidelberg132-41
|
| [57] |
Lee CY.Mutually-exclusive-and-collectively-exhaustive feature selection scheme..Applied Soft Computing2018;68:961-71
|
| [58] |
Su R,Wei L.MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy-defined energy..Brief Bioinform2020;21:687-98
|
| [59] |
Yang F,Xu F,Tsui K.Lifespan prediction of lithium-ion batteries based on various extracted features and gradient boosting regression tree model..J Power Sources2020;476:228654
|
| [60] |
Sun X.Experiments with two new boosting algorithms..IIM2010;02:386-90
|
| [61] |
Schapire RE.Denison DD,Holmes CC,Yu B.The boosting approach to machine learning: an overview..Nonlinear estimation and classification.2003;New YorkSpringer149-71
|
| [62] |
Oza N C.Online bagging and boosting. 2005 IEEE International Conference on Systems, Man and Cybernetics2005;2005 Oct 12Waikoloa, HI, USA. IEEE2340-5
|
| [63] |
Elith J,Hastie T.A working guide to boosted regression trees..J Anim Ecol2008;77:802-13
|
| [64] |
Friedman JH.Greedy function approximation: a gradient boosting machine..Ann Statist2001;29:
|
| [65] |
Abe GL,Katata C.Fabrication of novel poly(lactic acid/caprolactone) bilayer membrane for GBR application..Dent Mater2020;36:626-34
|
| [66] |
Sharafati A,Hosseinzadeh M.The potential of new ensemble machine learning models for effluent quality parameters prediction and related uncertainty..Process Safety and Environmental Protection2020;140:68-78
|
| [67] |
Friedman JH.Stochastic gradient boosting..2002;38:367-78
|
| [68] |
Domingos P.A few useful things to know about machine learning..Commun ACM2012;55:78-87
|
| [69] |
Lu S,Guo Y,Wu Y.Coupling a crystal graph multilayer descriptor to active learning for rapid discovery of 2D Ferromagnetic Semiconductors/Half-Metals/Metals..Adv Mater2020;32:e2002658
|
| [70] |
Bartel CJ,Goldsmith BR.New tolerance factor to predict the stability of perovskite oxides and halides..Sci Adv2019;5:eaav0693 PMCID:PMC6368436
|
| [71] |
Uyanık GK.A study on multiple linear regression analysis..Procedia - Social and Behavioral Sciences2013;106:234-40
|