PDF
Abstract
This paper introduces two variational inference (VI) approaches for infinite-dimensional inverse problems, developed through gradient descent with a constant learning rate. The proposed methods enable efficient approximate sampling from the target posterior distribution using a constant-rate stochastic gradient descent (cSGD) iteration. Specifically, we introduce a randomization strategy that incorporates stochastic gradient noise, allowing the cSGD iteration to be viewed as a discrete-time process. This transformation establishes key relationships between the covariance operators of the approximate and true posterior distributions, thereby validating cSGD as a VI method. We also investigate the regularization properties of the cSGD iteration and provide a theoretical analysis of the discretization error between the approximated posterior mean and the true background function. Building on this framework, we develop a preconditioned version of cSGD to further improve sampling efficiency. Finally, we apply the proposed methods to two practical inverse problems: one governed by a simple smooth equation and the other by the steady-state Darcy flow equation. Numerical results confirm our theoretical findings and compare the sampling performance of the two approaches for solving linear and non-linear inverse problems.
Keywords
Inverse problems
/
Infinite-dimensional variational inference (iVI)
/
Bayesian analysis for functions
/
Partial differential equations (PDEs)
/
Stochastic gradient descent (SGD)
/
65L09
/
35R30
/
49N45
/
62F15
Cite this article
Download citation ▾
Jiaming Sui, Junxiong Jia, Jinglai Li.
Stochastic Gradient Descent Based Variational Inference for Infinite-Dimensional Inverse Problems.
Communications on Applied Mathematics and Computation 1-30 DOI:10.1007/s42967-026-00574-x
| [1] |
Arendt W, Chalendar I, Eymard R. Galerkin approximation of linear problems in Banach and Hilbert spaces. IMA J. Numer. Anal., 2020, 42(1): 165-198
|
| [2] |
Arridge, S., Maass, P., Öktem, O., Schönlieb, C.-B.: Solving inverse problems using data-driven models. Acta Numer. 28, 1–174 (2019)
|
| [3] |
Beskos A, Girolami M, Lan S, Farrell PE, Stuart AM. Geometric MCMC for infinite-dimensional inverse problems. J. Comput. Phys., 2017, 335: 327-351
|
| [4] |
Blei DM, Kucukelbir A, McAuliffe JD. Variational inference: a review for statisticians. J. Am. Stat. Assoc., 2017, 112(518): 859-877
|
| [5] |
Bui-Thanh, T., Ghattas, O., Martin, J., Stadler, G.: A computational framework for infinite-dimensional Bayesian inverse problems. Part I: the linearized case, with application to global seismic inversion. SIAM J. Sci. Comput. 35(6), A2494–A2523 (2013)
|
| [6] |
Bui-Thanh T, Nguyen QP. FEM-based discretization-invariant MCMC methods for PDE-constrained Bayesian inverse problems. Inverse Probl. Imaging, 2016, 10(4): 943-975
|
| [7] |
Cotter SL, Dashti M, Robinson JC, Stuart AM. Bayesian inverse problems for functions and applications to fluid mechanics. Inverse Probl., 2009, 25(11): Article ID: 115008
|
| [8] |
Cotter SL, Roberts GO, Stuart AM, White D. MCMC methods for functions: modifying old algorithms to make them faster. Stat. Sci., 2013, 28(3): 424-446
|
| [9] |
Cui T, James M, Marzouk YM, Antti S, Spantini A. Likelihood-informed dimension reduction for nonlinear inverse problems. Inverse Probl., 2014, 30(11): Article ID: 114015
|
| [10] |
Cui T, Marzouk Y, Willcox K. Scalable posterior approximations for large-scale Bayesian inverse problems via likelihood-informed parameter and state reduction. J. Comput. Phys., 2016, 315: 363-387
|
| [11] |
Cui T, Zahm O. Data-free likelihood-informed dimension reduction of Bayesian inverse problems. Inverse Probl., 2021, 37(4): Article ID: 045009
|
| [12] |
Dashti, M., Stuart, A.M.: The Bayesian approach to inverse problems. In: Handbook of Uncertainty Quantification. Springer, New York (2017)
|
| [13] |
Dunlop MM, Iglesias MA, Stuart AM. Hierarchical Bayesian level set inversion. Stat. Comput., 2017, 27(6): 1555-1584
|
| [14] |
Feng Z, Li J. An adaptive independence sampler MCMC algorithm for Bayesian inferences of functions. SIAM J. Sci. Comput., 2018, 40(3): A1301-A1321
|
| [15] |
Fichtner AFull Seismic Waveform Modelling and Inversion, 2010HeidelbergSpringer
|
| [16] |
Guha, N., Wu, X.Q., Efendiev, Y., Jin, B., Mallick, B.K.: A variational Bayesian approach for inverse problems with skew-t error distribution. J. Comput. Phys. 301, 377–393 (2015)
|
| [17] |
Hinze M, Pinnau R, Ulbrich M, Ulbrich SOptimization with PDE Constraints, 2008New YorkSpringer
|
| [18] |
Hu Z, Yao Z, Li J. On an adaptive preconditioned Crank-Nicolson MCMC algorithm for infinite dimensional Bayesian inferences. J. Comput. Phys., 2017, 332: 492-503
|
| [19] |
Jia J, Li P, Meng D. Stein variational gradient descent on infinite-dimensional space and applications to statistical inverse problems. SIAM J. Numer. Anal., 2022, 60(4): 2225-2252
|
| [20] |
Jia, J.,Wu, Y., Li, P., Meng, D.: Variational inverting network for statistical inverse problems of partial differential equations. J. Mach. Learn. Res. 24(201), 1–60 (2023)
|
| [21] |
Jia, J., Zhao, Q., Xu, Z.B., Meng, D., Leung, Y.: Variational Bayes’ method for functions with applications to some inverse problems. SIAM J. Sci. Comput. 43(1), A355–A383 (2021)
|
| [22] |
Jin B, Zou J. Hierarchical Bayesian inference for ill-posed problems via variational method. J. Comput. Phys., 2010, 229(19): 7317-7343
|
| [23] |
Kaipio J, Somersalo EStatistical and Computational Inverse Problems, 2005New YorkSpringer
|
| [24] |
Knapik BT, Van Der Vaart AW, van Zanten JH. Bayesian inverse problems with Gaussian priors. Ann. Stat., 2011, 39(5): 2626-2657
|
| [25] |
Lassas M, Siltanen S. Can one use total variation prior for edge-preserving Bayesian inversion?. Inverse Probl., 2004, 20(5): 1537-1563
|
| [26] |
Lu, S., Mathé, P.: Stochastic gradient descent for linear inverse problems in Hilbert spaces. Math. Comput. 91(336), 1763–1788 (2022)
|
| [27] |
Mandt, S., Hoffman, M.D., Blei, D.M.: A variational analysis of stochastic gradient algorithms. In: ICML’16: Proceedings of the 33rd International Conference on International Conference on Machine Learning, vol. 48, pp. 354–363 (2016)
|
| [28] |
Mandt S, Hoffman MD, Blei DM. Stochastic gradient descent as approximate Bayesian inference. J. Mach. Learn. Res., 2017, 18(134): 1-35
|
| [29] |
Mathé P, Pereverzev SV. Discretization strategy for linear ill-posed problems in variable Hilbert scales. Inverse Probl., 2003, 19(6): 1263
|
| [30] |
Pillai NS, Stuart AM, Thiéry AH. Noisy gradient flow from a random walk in Hilbert space. Stoch. Partial Differ. Equ. Anal. Comput., 2014, 2(2): 196-232
|
| [31] |
Pinski, F.J., Simpson, G., Stuart, A.M., Weber, H.: Algorithms for Kullback-Leibler approximation of probability measures in infinite dimensions. SIAM J. Sci. Comput. 37(6), A2733–A2757 (2015)
|
| [32] |
Pinski, F.J., Simpson, G., Stuart, A.M., Weber, H.: Kullback-Leibler approximation for probability measures on infinite dimensional space. SIAM J. Math. Anal. 47(6), 4091–4122 (2015)
|
| [33] |
Stuart AM. Inverse problems: a Bayesian perspective. Acta Numer., 2010, 19: 451-559
|
| [34] |
Sui J, Jia J. Non-centered parametric variational Bayes’ approach for hierarchical inverse problems of partial differential equations. Math. Comput., 2024, 93(348): 1715-1760
|
| [35] |
Wang Y, Blei DM. Frequentist consistency of variational Bayes. J. Am. Stat. Assoc., 2019, 114(527): 1147-1161
|
| [36] |
Weglein AB, Araújo FV, Carvalho PM, Stolt RH, Matson KH, Coates RT, Corrigan D, Foster DJ, Shaw SA, Zhang H. Inverse scattering series and seismic exploration. Inverse Probl., 2003, 19(6): R27-R83
|
| [37] |
Yao Z, Hu Z, Li J. A TV-Gaussian prior for infinite-dimensional Bayesian inverse problems and its numerical implementations. Inverse Probl., 2016, 32(7): Article ID: 075006
|
| [38] |
Zhang C, Bütepage J, Kjellström H, Mandt S. Advances in variational inference. IEEE Trans. Pattern Anal. Mach. Intell., 2018, 41(8): 2008-2026
|
| [39] |
Zhang F, Gao C. Convergence rates of variational posterior distributions. Ann. Stat., 2020, 48(4): 2180-2207
|
| [40] |
Zhou, Q.P., Hu, Z.X., Yao, Z.W., Li, J.L.: A hybrid adaptive MCMC algorithm in function spaces. SIAM/ASA J. Uncertainty. Quantification. 5(1), 621–639 (2017)
|
| [41] |
Zhou, Q.P., Yu, T.C., Zhang, X.Q., Li, J.L.: Bayesian inference and uncertainty quantification for medical image reconstruction with Poisson data. SIAM J. Imaging Sci. 13(1), 29–52 (2020)
|
Funding
National Natural Science Foundation of China(12322116)
UK Research and Innovation
China Scholarship Council
RIGHTS & PERMISSIONS
Shanghai University