Automated machine learning structure-composition-property relationships of perovskite materials for energy conversion and storage

Qin Deng , Bin Lin

Energy Materials ›› 2021, Vol. 1 ›› Issue (1) : 100006

PDF
Energy Materials ›› 2021, Vol. 1 ›› Issue (1) :100006 DOI: 10.20517/energymater.2021.10
Article
Article

Automated machine learning structure-composition-property relationships of perovskite materials for energy conversion and storage

Author information +
History +
PDF

Abstract

Perovskite materials are central to the fields of energy conversion and storage, especially for fuel cells. However, they are challenged by overcomplexity, coupled with a strong desire for new materials discovery at high speed and high precision. Herein, we propose a new approach involving a combination of extreme feature engineering and automated machine learning to adaptively learn the structure-composition-property relationships of perovskite oxide materials for energy conversion and storage. Structure-composition-property relationships between stability and other features of perovskites are investigated. Extreme feature engineering is used to construct a great quantity of fresh descriptors, and a crucial subset of 23 descriptors is acquired by sequential forward selection algorithm. The best descriptor for stability of perovskites is determined with linear regression. The results demonstrate a high-efficient and non-priori-knowledge investigation of structure-composition-property relationships for perovskite materials, providing a new road to discover advanced energy materials.

Keywords

Perovskites / structure-composition-property relationships / stability / descriptors / automated machine learning

Cite this article

Download citation ▾
Qin Deng, Bin Lin. Automated machine learning structure-composition-property relationships of perovskite materials for energy conversion and storage. Energy Materials, 2021, 1(1): 100006 DOI:10.20517/energymater.2021.10

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Hong Y,Jiang H.Machine learning and artificial neural network accelerated computational discoveries in materials science..WIREs Comput Mol Sci2020;10:e1450

[2]

Sparks TD,Parry ME,Brgoch J.Machine learning for structural materials..Annu Rev Mater Res2020;50:27-48

[3]

Hwang J,Giordano L,Yu Y.Perovskites in catalysis and electrocatalysis..Science2017;358:751-6

[4]

Butler K T,Cartwright H.Machine learning for molecular and materials science..Nature2018;559:547-55

[5]

Juan Y,Yang Y.Accelerating materials discovery using machine learning..J Mater Sci Mater Med2021;79:178-90

[6]

Fischer CC,Morgan D.Predicting crystal structure by merging data mining with quantum mechanics..Nat Mater2006;5:641-6

[7]

Raccuglia P,Adler PD.Machine-learning-assisted materials discovery using failed experiments..Nature2016;533:73-6

[8]

Sanchez-Lengeling B.Inverse molecular design using machine learning: generative models for matter engineering..Science2018;361:360-5

[9]

Ong SP.Accelerating materials science with high-throughput computations and machine learning..Computational Materials Science2019;161:143-50

[10]

Balachandran PV.Machine learning guided design of functional materials with targeted properties..Computational Materials Science2019;164:82-90

[11]

Peña MA.Chemical structures and performance of perovskite oxides..Chem Rev2001;101:1981-2017

[12]

Lino A,Sizo A.Virtual teaching and learning environments: automatic evaluation with symbolic regression..IFS2016;31:2061-72

[13]

Yuan S,Quddus N,Mashuga CV.Developing quantitative structure-property relationship models to predict the upper flammability limit using machine learning..Ind Eng Chem Res2019;58:3531-7

[14]

Ouyang R,Ahmetcik E,Ghiringhelli LM.SISSO: a compressed-sensing method for identifying the best low-dimensional descriptor in an immensity of offered candidates..Phys Rev Materials2018;2:083802

[15]

Zhang Y.Machine learning lattice constants for cubic perovskite compounds..Chemistry Select2020;5:9999-10009

[16]

de Franca FO.Interaction-transformation symbolic regression with extreme learning machine..Neurocomputing2021;423:609-19

[17]

Li Z,Sun Q.Stability engineering of halide perovskite via machine learning..arXiv preprint arXiv2018;1803.06042:

[18]

Li W,Morgan D.Predicting the thermodynamic stability of perovskite oxides using machine learning models..Computational Materials Science2018;150:454-63

[19]

Deng Q.Exploring structure-composition relationships of cubic perovskite oxides via extreme feature engineering and automated machine learning..Materials Today Communications2021;28:102590

[20]

Gardner S,Griffin J.Constrained multi-objective optimization for automated machine learning. 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)2019;2019 Oct 5-8Washington, DC, USA. IEEE364-73

[21]

Masrom S,Jamil N S.Automated machine learning based on genetic programming: a case study on a real house pricing dataset. 2019 1st International Conference on Artificial Intelligence and Data Sciences (AiDAS)2019;2019 Sep 19Ipoh, Malaysia. IEEE48-52

[22]

Chauhan K,Thakkar D.Automated machine learning: the new wave of machine learning. 2020 2nd International Conference on Innovative Mechanisms for Industry Applications (ICIMIA)2020;2020 Mar 5-7Bangalore, India. IEEE205-12

[23]

Ge P.Analysis on approaches and structures of automated machine learning frameworks. 2020 International Conference on Communications, Information System and Computer Engineering (CISCE)2020;2020 Jul 3-5Kuala Lumpur, Malaysia. IEEE474-7

[24]

Han J,Lee KM.An automated machine learning platform for non-experts. Proceedings of the International Conference on Research in Adaptive and Convergent Systems.2020;Association for Computing Machinery, New York, NY, USA84-6

[25]

Umamahesan A.2020;Association for Computing Machinery, New York, NY, USA3495

[26]

Waring J,Umeton R.Automated machine learning: review of the state-of-the-art and opportunities for healthcare..Artif Intell Med2020;104:101822

[27]

Zeineddine H,Farah A.Enhancing prediction of student success: automated machine learning approach..2021;89:106903

[28]

Sun Y.Feature engineering for search advertising recognition. 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)2019;2019 Mar 15-17Chengdu, China. IEEE1859-64

[29]

Li Z,Xin H.Feature engineering of machine-learning chemisorption models for catalyst design..Catalysis Today2017;280:232-8

[30]

Emery AA.High-throughput DFT calculations of formation energy, stability and oxygen vacancy formation energy of ABO3 perovskites..Sci Data2017;4:170153

[31]

Kotsiantis SB,Pintelas PE.Data preprocessing for supervised leaning..International journal of computer science2006;1:111-7

[32]

Liu N,Liu G.Data Preprocessing based on partially supervised learning. Proceedings of the 6th International Conference on Information Engineering for Mechanics and Materials2016;2016 Nov.Atlantis Press678-83

[33]

Zainuddin Z.A comparative study of missing value estimation methods: which method performs better? 2008 International Conference on Electronic Design2008;2008 Dec 1-3Penang, Malaysia. IEEE1-5

[34]

Kang H.The prevention and handling of the missing data..Korean J Anesthesiol2013;64:402-6 PMCID:PMC3668100

[35]

Li J,Wang S.Feature Selection: A Data Perspective..ACM Comput Surv2018;50:1-45

[36]

Mangal A.A comparative study of feature selection methods for stress hotspot classification in materials..Integrating Materials and Manufacturing Innovation2018;7:87-95

[37]

Zhou H,Xia Y.A new sampling method in particle filter based on Pearson correlation coefficient..Neurocomputing2016;216:208-15

[38]

Hauke J.Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data..Quaestiones Geographicae2011;30:87-93

[39]

Khurana U,Turaga D.Feature engineering for predictive modeling using reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence..Thirty-Second AAAI Conference on Artificial Intelligence2018;32:3407-14

[40]

Heaton J.An empirical analysis of feature engineering for predictive modeling. SoutheastCon. 20162016;2016 Mar 30-Apr 3Norfolk, VA, USA. IEEE1-6

[41]

Zheng A.Feature engineering for machine learning: principles and techniques for data scientists.2018;‘O’Reilly Media, Inc.

[42]

Dai D,Wei X.Using machine learning and feature engineering to characterize limited material datasets of high-entropy alloys..Computational Materials Science2020;175:109618

[43]

Nargesian F,Khurana U.Learning feature engineering for classification. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence Main track..IJCAI2017;2529-35

[44]

Hou J.A simple feature combination method based on dominant sets..Pattern Recognition2013;46:3129-39

[45]

Ghiringhelli LM,Levchenko SV,Scheffler M.Big data of materials science: critical role of the descriptor..Phys Rev Lett2015;114:105503

[46]

Fox J.Regression diagnostics: an introduction.2019;Sage publications

[47]

Montgomery DC,Vining GG.Introduction to linear regression analysis.2021;John Wiley & Sons

[48]

Weisberg S.Applied linear regression.2005;John Wiley & Sons

[49]

Dai D,Hu R.Method construction of structure-property relationships from data by machine learning assisted mining for materials design applications..2020;196:109194

[50]

Pedregosa F,Gramfort A.Scikit-learn: machine learning in Python..The Journal of machine Learning research2011;12:2825-30

[51]

Abraham A,Eickenberg M.Machine learning for neuroimaging with scikit-learn..Front Neuroinform2014;8:14 PMCID:PMC3930868

[52]

Jović A,Bogunović N.A review of feature selection methods with applications. 2015 38th international convention on information and communication technology, electronics and microelectronics (MIPRO)2015;2015 May 25-29Opatija, Croatia. IEEE1200-5

[53]

Remeseiro B.A review of feature selection methods in medical applications..Comput Biol Med2019;112:103375

[54]

Dash M.Feature selection for classification..Intelligent Data Analysis1997;1:131-56

[55]

Chandrashekar G.A survey on feature selection methods..2014;40:16-28

[56]

Rückstieß T,van der Smagt P.Wang D.Sequential feature selection for classification..AI 2011: advances in artificial intelligence.2011;BerlinSpringer Berlin Heidelberg132-41

[57]

Lee CY.Mutually-exclusive-and-collectively-exhaustive feature selection scheme..Applied Soft Computing2018;68:961-71

[58]

Su R,Wei L.MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy-defined energy..Brief Bioinform2020;21:687-98

[59]

Yang F,Xu F,Tsui K.Lifespan prediction of lithium-ion batteries based on various extracted features and gradient boosting regression tree model..J Power Sources2020;476:228654

[60]

Sun X.Experiments with two new boosting algorithms..IIM2010;02:386-90

[61]

Schapire RE.Denison DD,Holmes CC,Yu B.The boosting approach to machine learning: an overview..Nonlinear estimation and classification.2003;New YorkSpringer149-71

[62]

Oza N C.Online bagging and boosting. 2005 IEEE International Conference on Systems, Man and Cybernetics2005;2005 Oct 12Waikoloa, HI, USA. IEEE2340-5

[63]

Elith J,Hastie T.A working guide to boosted regression trees..J Anim Ecol2008;77:802-13

[64]

Friedman JH.Greedy function approximation: a gradient boosting machine..Ann Statist2001;29:

[65]

Abe GL,Katata C.Fabrication of novel poly(lactic acid/caprolactone) bilayer membrane for GBR application..Dent Mater2020;36:626-34

[66]

Sharafati A,Hosseinzadeh M.The potential of new ensemble machine learning models for effluent quality parameters prediction and related uncertainty..Process Safety and Environmental Protection2020;140:68-78

[67]

Friedman JH.Stochastic gradient boosting..2002;38:367-78

[68]

Domingos P.A few useful things to know about machine learning..Commun ACM2012;55:78-87

[69]

Lu S,Guo Y,Wu Y.Coupling a crystal graph multilayer descriptor to active learning for rapid discovery of 2D Ferromagnetic Semiconductors/Half-Metals/Metals..Adv Mater2020;32:e2002658

[70]

Bartel CJ,Goldsmith BR.New tolerance factor to predict the stability of perovskite oxides and halides..Sci Adv2019;5:eaav0693 PMCID:PMC6368436

[71]

Uyanık GK.A study on multiple linear regression analysis..Procedia - Social and Behavioral Sciences2013;106:234-40

PDF

319

Accesses

0

Citation

Detail

Sections
Recommended

/