Machine learning transforms the inference of the nuclear equation of state

Yongjia Wang, Qingfeng Li

Front. Phys. ›› 2023, Vol. 18 ›› Issue (6) : 64402.

PDF(3007 KB)
Front. Phys. All Journals
PDF(3007 KB)
Front. Phys. ›› 2023, Vol. 18 ›› Issue (6) : 64402. DOI: 10.1007/s11467-023-1313-3
VIEW & PERSPECTIVE
VIEW & PERSPECTIVE

Machine learning transforms the inference of the nuclear equation of state

Author information +
History +

Abstract

Our knowledge of the properties of dense nuclear matter is usually obtained indirectly via nuclear experiments, astrophysical observations, and nuclear theory calculations. Advancing our understanding of the nuclear equation of state (EOS, which is one of the most important properties and of central interest in nuclear physics) has relied on various data produced from experiments and calculations. We review how machine learning is revolutionizing the way we extract EOS from these data, and summarize the challenges and opportunities that come with the use of machine learning.

Graphical abstract

Keywords

nuclear equation of state / machine learning / nuclear theory / nuclear experiment

Cite this article

Download citation ▾
Yongjia Wang, Qingfeng Li. Machine learning transforms the inference of the nuclear equation of state. Front. Phys., 2023, 18(6): 64402 https://doi.org/10.1007/s11467-023-1313-3
The field of nuclear physics relies on experiments and theoretical models to understand the structure and properties of nuclei and their reactions in order to bring insights into the nuclear forces that holds protons and neutrons together in an atomic nucleus. The nuclear equation of state (EOS) usually describes the thermodynamic relationship between the binding energy per nucleon E and density ρ, as well as the isospin asymmetry δ in nuclear matter (ideally, a uniform and infinite system with neutrons and protons), it also reveals rich information about the nature of nuclear forces in the nuclear medium. The knowledge of the nuclear EOS is essential for our understanding of the properties and structures of both nuclei and neutron stars, the dynamics of heavy ion collisions (HICs), as well as supernova explosions and neutron-star mergers. Thus, the nuclear EOS has attracted considerable attention from both nuclear physics and astrophysics communities since long time ago [16]. From experimental point of view, the nuclear EOS cannot be directly measured in terrestrial laboratory because the created nuclear matter in HICs exists only for a very short time (typically several fm/c). From theoretical point of view, the calculation of EOS relies heavily on microscopic nuclear many-body theory and nuclear force. Neither of them are completely understood, although decades of efforts have been devoted to study them, and great progress has been made [3, 7]. Alternatively, the nuclear EOS can be extracted or inferred from measurements on nuclear structure, astrophysical observations of neutron stars, and observables from HICs [8, 9].
To quantitatively describe the nuclear EOS, the following expression is frequently used: E(ρ,δ)=E(ρ,δ=0)+Esym(ρ)δ2+O(δ4). The first term E(ρ,δ=0) is the binding energy per nucleon in the isospin-symmetric nuclear matter, Esym(ρ) is the density-dependent nuclear symmetry energy. Investigating how E(ρ,δ=0) and Esym(ρ) vary with density, is one of the scientific goals of the current and upcoming nuclear facilities (e.g., the CSR and HIAF in China, the FRIB in the United States, the RIBF in Japan, the SPIRAL2 in France, the FAIR in Germany, and the NICA in Russia) around the world. Great endeavors have been made, however a precise picture of the nuclear EOS in a broad range of densities and isospin asymmetries has still not been obtained [2, 3, 7].
The inference of the nuclear EOS usually relies on data from both experimental measurements and theoretical calculations, which is similar to the process of mining information from the Big Data. Machine learning (ML) algorithm has a strong capability to infer and mine information from data. Indeed, ML has been a branch of artificial intelligence and computer science since 1980s, and mainly focuses on the use of data and algorithms to imitate the way that humans learn based on various mathematical models [10]. Thanks to the appearance of many modern and highly integrated programming tools, high performance personal computers and open-source tools and platforms, ML or deep learning (which is a subset of ML but with a deeper and more complex mathematical structure) has become very popular in many fields of natural science (such as physics, see Ref. [11] for recent review), and brings the fourth paradigm of science, which is so-called data-driven discovery [12].
In the conventional framework for studying nuclear physics, usually huge amounts of data that consist of both signal (which is the information one wants to obtain) and noise (which is something one wishes to forgo) are acquired from nuclear experiments. Then, the acquired big data will be processed and analyzed in order to obtain observables with defined physical meanings according to the prior human knowledge. These observables usually only have a few data points and will be used to infer physics of interest with the combination of nuclear theory. A well-known example of the conventional framework in nuclear physics is the studying of nuclear EOS with HICs. The size of data acquired per second from detectors in HIC experiments can reach as high as several terabytes, and these data will be processed to obtain observables, such as the collective flows which are only a few data points. The information of nuclear EOS is then extracted from the comparison of the collective flows obtained with transport model simulations to those with experimental measurements. Apparently, the extent of data used in the conventional framework is very limited.
The application of ML has changed the way we study nuclear physics. Indeed, pioneering work on studying nuclear physics with ML started in the early 1990s, where artificial neural networks (ANN) were used to study global nuclear properties [13]. Later on, Bass et al. [14] demonstrated that ANN is capable of improving the accuracy of the impact parameter determination in HICs, which is the pioneering application of ML in heavy-ion physics. Since then such studies are continually performed and are rapidly becoming popular research topics especially in the recent decade [1518]. As illustrated in Fig.1, ML can alter the full chain of the conventional framework. For example, it has been found that ML not only can speeding-up data acquisition and analysis in nuclear experiments, but also can refine model, accelerate computation by emulators, optimize model parameters and quantify uncertainties in nuclear theory. For example, various ML algorithms have been shown to be rather powerful in improving the performance of nuclear models on the prediction of nuclear masses, nuclear charged radii, half-lives, and so on. See, e.g., Refs. [1518] for recent reviews. In addition, quantitative analysis methods, such as Bayesian inference, are also widely used to infer physical quantities of interest from the comparison of observables determined experimentally with that obtained from theoretical model calculations [1921]. Extraction of the nuclear EOS is ideally suited for the Bayesian inference. According to the Bayes’ theorem, constraints on a physical quantity θ (or vector of parameters) from a set of experimental measurements yexp are described by the posterior probability P(θ|yexp) and can be written as
Fig.1 Conventional framework (dashed-line box) and machine learning based framework (solid-line box) in the study of nuclear physics.

Full size|PPT slide

P(θ|yexp)P(yexp|θ)P(θ).
Here the prior P(θ) reflects any preexisting information/belief about the distribution of the physical quantity θ. P(yexp|θ) is likelihood function that denotes the probability of a theoretical model being matched with the data yexp at a given value of its parameter θ. Both experimental and theoretical model uncertainties can be considered in the construction of P(yexp|θ). Usually, it requires massive computational work of running theoretical model with different choices of θ, especially when θ is a vector of parameters (which is common in nuclear physics), so, this is time consuming and not feasible until the last decade. In a work recently reported in Nature, Huth et al. [22] extracted the EOS of dense nuclear matter by using Bayesian inference which combines data from HICs and astrophysical observations with microscopic nuclear theory calculations. This work is novel in that it demonstrates how the interdisciplinary analyses can shed light on the nuclear EOS, and shows that EOS constraints from HIC experiments and multi-messenger astrophysics are remarkably consistent with each other, which provides important information on the nuclear EOS at supra-saturation densities.
Besides Bayesian inference, another emblematic example, for which modern ML methodology is also ideally suited, is the use of various supervised machine learning algorithms to extract information from event-by-event HIC data. The application of ML in heavy-ion physics is expected to be promising because there is a huge amount of event-by-event data either from experimental measurements or from theoretical model simulations. Event-by-event data in HICs encodes rich information about relevant physics and correlations among observables which can be sensitive to the physics of interest but independent of other model parameters. With the help of ML algorithm, physical information can be extracted from observables in HICs on the event-by-event analysis, see e.g., Refs. [23, 24], instead of traditional method which usually relies on the expectation values of observables over all considered events. As demonstrated in our recent work [24], by using a modern ML algorithm and data generated with a microscopic transport model, we presented an ML based framework for inferring the Esym(ρ) from observables in HICs on the event-by-event analysis, and the framework is found to be robust against variations in model parameters. Note that unlike the conventional framework in which observables are usually constructed according to the prior human knowledge and their number is limited, observables (or features) in the ML based framework can be constructed by ML algorithms themselves without any prior knowledge from human and their number can be infinite in general. Apparently, the extent of data used in the ML based framework is much higher than that in conventional framework. Moreover, the most important observables that drive ML predictions can be identified by various feature attribution methods, thereby offering valuable insights on the physics of interest. This ML based framework as a data-driven methodology, is extensible and may offer a new paradigm to study other issues in HICs. Besides studying the nuclear EOS, various ML algorithms have been used to study diverse topics in HICs from low and intermediate energies up to relativistic energies, such as, the impact parameter, the nuclear liquid-gas phase transition, initial state nuclear structure, and QCD phase transition. See, e.g., Refs. [1618, 25] for recent reviews and comprehensive lists of references.
The marriage of ML and heavy-ion physics brings a ML based framework which can be a powerful tool and may open a new venue to study the underlying physics in HICs, and has revolutionized the way we infer the nuclear EOS. In the recent years, the number of reported applications of ML is growing at an extraordinary rate, several promising results have been shown, but we are still in the earliest stages of exploring the application of ML in the study of heavy-ion physics. Many open questions remain, such as, whether the output of ML is interpretable or explainable (the interpretability or explainability, it is the degree to which a human can understand the cause of a decision. This is important, particularly for physicists, because understanding what happens when ML algorithms make predictions could help us make better use of the outputs), whether ML algorithms have the ability to infer information from vastly different data sets (generalizability, it refers to the capacity of the trained ML algorithms to adequately predict on data which sources differ from those involved in training. ML algorithms which have high generalizability are usually preferred, especially when the data are generated with models), whether knowledge can be learned from small datasets (data-hungry, it matters especially when the cost of data generation is high).
Within the next decade, new heavy-ion facilities, such as HIAF, NICA, and FAIR, will generate data at far greater rates than the present facilities. The interpretation of these big data will become even more challenging. Fortunately, ML is one of today’s most rapidly growing technical fields and continuously producing tools that are potentially applicable to a wide array of tasks in heavy-ion physics. Moving forward, the mergence of ML and heavy-ion physics can be improved in at least four aspects. First, improving the quality of the generated synthetic training data. Because real data do not come with labels, model simulations provide the only possible way to get data sets have truth labels for training. More sophisticated models are expected to be more appropriate to generate synthetic data that can closely mimic the real data. Alternatively, using data that generated from different models may be of benefit to achieve an model-independent extraction of the nuclear EOS with ML algorithms. Second, introducing physical information into ML algorithms. This can be done either by using input features with defined physical meanings or by considering physical symmetries and laws when constructing architectures of ML algorithms [26]. Third, using successful experiences of ML applications in other fields, such as condensed matter physics and particle physics. The use of ML in those fields is more advanced than in heavy-ion physics, many promising techniques can be extended to study heavy-ion physics. Fourth, changing the latest developments in ML into tools for studying heavy-ion physics. ML is a rapidly growing and flourishing field fueled by the successful applications in many aspects of our daily life. Nowadays, a diverse array of ML algorithm has been developed and continue to be refined to cover a wide variety of data types and tasks, this is a sufficiently large and diverse pool of tools feasible to study heavy-ion physics.

References

[1]
C. Drischler , J. Holt , C. Wellenhofer . Chiral effective field theory and the high-density nuclear equation of state. Annu. Rev. Nucl. Part. Sci., 2021, 71(1): 403
CrossRef ADS Google scholar
[2]
B. A. Li , L. W. Chen , C. M. Ko . Recent progress and new challenges in isospin physics with heavy-ion reactions. Phys. Rep., 2008, 464(4−6): 113
CrossRef ADS Google scholar
[3]
M. Oertel , M. Hempel , T. Klähn , S. Typel . Equations of state for supernovae and compact stars. Rev. Mod. Phys., 2017, 89(1): 015007
CrossRef ADS Google scholar
[4]
A.SorensenK.AgarwalK.W. Brown, ., Dense nuclear matter equation of state from heavy-ion collisions, arXiv: 2301.13253 (2023)
[5]
J. Lattimer . Neutron stars and the nuclear matter equation of state. Annu. Rev. Nucl. Part. Sci., 2021, 71(1): 433
CrossRef ADS Google scholar
[6]
J. A. López , C. O. Dorso , G. A. Frank . Properties of nuclear pastas. Front. Phys. (Beijing), 2021, 16(2): 24301
CrossRef ADS Google scholar
[7]
M. Tsang , J. Stone , F. Camera , P. Danielewicz , S. Gandolfi , K. Hebeler , C. J. Horowitz , J. Lee , W. G. Lynch , Z. Kohley , R. Lemmon , P. Möller , T. Murakami , S. Riordan , X. Roca-Maza , F. Sammarruca , A. W. Steiner , I. Vidaña , S. J. Yennello . Constraints on the symmetry energy and neutron skins from experiments and theory. Phys. Rev. C, 2012, 86(1): 015803
CrossRef ADS Google scholar
[8]
H. Wolter , M. Colonna , D. Cozma . . Transport model comparison studies of intermediate-energy heavy-ion collisions. Prog. Part. Nucl. Phys., 2022, 125: 103962
CrossRef ADS Google scholar
[9]
Y. J. Wang , Q. F. Li . Application of microscopic transport model in the study of nuclear equation of state from heavy ion collisions at intermediate energies. Front. Phys. (Beijing), 2020, 15(4): 44302
CrossRef ADS Google scholar
[10]
M. I. Jordan , T. M. Mitchell . Machine learning: Trends, perspectives, and prospects. Science, 2015, 349(6245): 255
CrossRef ADS Google scholar
[11]
G. Carleo , I. Cirac , K. Cranmer , L. Daudet , M. Schuld , N. Tishby , L. Vogt-Maranto , L. Zdeborová . Machine learning and the physical sciences. Rev. Mod. Phys., 2019, 91(4): 045002
CrossRef ADS Google scholar
[12]
A.J. HeyS.TansleyK.M. Tolle, ., The Fourth Paradigm: Data-Intensive Scientific Discovery, Vol. 1, Microsoft research Redmond, WA, 2009
[13]
S. Gazula , J. Clark , H. Bohr . Learning and prediction of nuclear stability by neural networks. Nucl. Phys. A., 1992, 540(1−2): 1
CrossRef ADS Google scholar
[14]
S. Bass , A. Bischoff , C. Hartnack , J. A. Maruhn , J. Reinhardt , H. Stocker , W. Greiner . Neural networks for impact parameter determination. J. Phys. G Nucl. Part. Phys., 1994, 20(1): L21
CrossRef ADS Google scholar
[15]
P. Bedaque , A. Boehnlein , M. Cromaz , M. Diefenthaler , L. Elouadrhiri , T. Horn , M. Kuchera , D. Lawrence , D. Lee , S. Lidia , R. McKeown , W. Melnitchouk , W. Nazarewicz , K. Orginos , Y. Roblin , M. Scott Smith , M. Schram , X. N. Wang . A. I. for nuclear physics. Eur. Phys. J. A, 2021, 57(3): 100
CrossRef ADS Google scholar
[16]
A. Boehnlein , M. Diefenthaler , N. Sato , M. Schram , V. Ziegler , C. Fanelli , M. Hjorth-Jensen , T. Horn , M. P. Kuchera , D. Lee , W. Nazarewicz , P. Ostroumov , K. Orginos , A. Poon , X. N. Wang , A. Scheinker , M. S. Smith , L. G. Pang . Colloquium: Machine learning in nuclear physics. Rev. Mod. Phys., 2022, 94(3): 031003
CrossRef ADS Google scholar
[17]
W.B. HeY.G. MaL.G. PangH.C. SongK.Zhou, High energy nuclear physics meets Machine Learning, arXiv: 2303.06752 (2023)
[18]
W.HeQ.LiY.MaZ.M. NiuJ.C. PeiY.X. Zhang, Machine learning in nuclear physics at low and intermediate energies, arXiv: 2301.06396 (2023)
[19]
P. Morfouace , C. Tsang , Y. Zhang , W. G. Lynch , M. B. Tsang , D. D. S. Coupland , M. Youngs , Z. Chajecki , M. A. Famiano , T. K. Ghosh , G. Jhang , J. Lee , H. Liu , A. Sanetullaev , R. Showalter , J. Winkelbauer . Constraining the symmetry energy with heavy-ion collisions and Bayesian analyses. Phys. Lett. B, 2019, 799: 135045
CrossRef ADS Google scholar
[20]
M.O. KuttanJ.SteinheimerK.ZhouH.Stoecker, The QCD EoS of dense nuclear matter from Bayesian analysis of heavy ion collision data, arXiv: 2211.11670 (2022)
[21]
B.A. LiW.J. Xie, Bayesian inference of in-medium baryon−baryon scattering cross sections from HADES proton flow data, arXiv: 2303.10474 (2023)
[22]
S. Huth , P. T. Pang , I. Tews , T. Dietrich , A. Le Fèvre , A. Schwenk , W. Trautmann , K. Agarwal , M. Bulla , M. W. Coughlin , C. Van Den Broeck . Constraining neutron-star matter with microscopic and macroscopic collisions. Nature, 2022, 606(7913): 276
CrossRef ADS Google scholar
[23]
L. G. Pang , K. Zhou , N. Su , H. Petersen , H. Stöcker , X. N. Wang . An equation-of-state-meter of quantum chromodynamics transition from deep learning. Nat. Commun., 2018, 9(1): 210
CrossRef ADS Google scholar
[24]
Y. Wang , Z. Gao , H. Lü , Q. Li . Decoding the nuclear symmetry energy event-by-event in heavy-ion collisions with machine learning. Phys. Lett. B, 2022, 835: 137508
CrossRef ADS Google scholar
[25]
K.ZhouL.WangL.G. PangS.Z. Shi, Exploring QCD matter in extreme conditions with Machine Learning, arXiv: 2303.15136 (2023)
[26]
M. Raissi , P. Perdikaris , G. E. Karniadakis . Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys., 2019, 378: 686
CrossRef ADS Google scholar

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

The work was supported in part by the National Natural Science Foundation of China (Grant Nos. U2032145 and 11875125) and the National Key Research and Development Program of China (Grant No. 2020YFE0202002).

RIGHTS & PERMISSIONS

2023 Higher Education Press
AI Summary AI Mindmap
PDF(3007 KB)

545

Accesses

10

Citations

Detail

Sections
Recommended

/