Abstract
In geodesy, Tikhonov regularization and truncated singular value decomposition (TSVD) are commonly used to derive a well-defined solution for ill-conditioned observation equations. However, as single-parameter regularization methods, they may face some limitations in application due to their lack of flexibility. In this contribution, a kind of multiparameter regularization method is considered, called generalized ridge regression (GRR). Generally, GRR projects observations into several orthogonal spectral domains and then uses different regularization parameters to minimize the mean squared error of the estimated parameters in corresponding spectral domains. To find suitable regularization parameters for GRR, an iterative and shrinking generalized ridge regression (IS-GRR) is proposed. The IS-GRR procedure starts by introducing a predetermined approximation of unknown parameters. Subsequently, in each spectral domain, the signal and noise of the observations are estimated in an iterative and shrinking manner, and the regularization parameters are updated according to the estimated signal-to-noise ratio. Compared to conventional regularization schemes, IS-GRR has the following advantages: Tikhonov regularization usually oversmooths signals in the low-spectral domains and undersuppresses noise in the high-spectral domains, whereas TSVD usually undersuppresses noise in the low-spectral domains and oversmooths signals in the high-spectral domains. However, IS-GRR strikes a balance between retaining signals and suppressing noise in different spectral domains, thereby exhibiting better performance. Two experiments (simulation and mascon modelling examples) verify the effectiveness of IS-GRR for solving ill-conditioned equations in geodesy.
Similar content being viewed by others
Data availability
The GIA model ICE6G-D was available at https://www.atmosp.physics.utoronto.ca/~peltier/data.php; The CSR RL06 SH coefficients data can be downloaded from DOI: https://podaac.jpl.nasa.gov/dataset/GRACE_GSM_L2_GRAV_CSR_RL06; The CSR RL06 mascon solutions can be downloaded from https://www2.csr.utexas.edu/grace/RL06_mascons.html.
References
Arashi M, Saleh AME, Kibria BG (2019) Theory of ridge regression estimation with applications. Wiley, Hoboken
Arsenin VY, Krianev AV (1992) Generalized maximum likelihood method and its application for solving ill-posed problems A. In: Tikhonov (ed) Ill-posed problems in natural sciences. TVP Science Publishers, Moscow, pp 3–12
Baur O, Sneeuw N (2011) Assessing Greenland ice mass loss by means of point-mass modeling: a viable methodology. J Geodesy 85(9):607–615. https://doi.org/10.1007/s00190-011-0463-1
Belge M, Kilmer ME, Miller EL (2002) Efficient determination of multiple regularization parameters in a generalized L-curve framework. Inverse Probl 18(4):1161. https://doi.org/10.1088/0266-5611/18/4/314
Berger JO (1985) Statistical decision theory and Bayesian analysis. Springer Verlag, New York
Bonesky T (2008) Morozov’s discrepancy principle and Tikhonov-type functionals. Inverse Probl 25(1):015015. https://doi.org/10.1088/0266-5611/25/1/015015
Bovik AC (2010) Handbook of image and video processing. Academic Press
Brezinski C, Redivo-Zaglia M, Rodriguez G, Seatzu S (2003) Multi-parameter regularization techniques for ill-conditioned linear systems. Numer Math 94(2):203–228. https://doi.org/10.1007/s00211-002-0435-8
Byrne MJ, Renaut RA (2021) Learning multiple regularization parameters for generalized Tikhonov regularization using multiple data sets without true data. arXiv preprint arXiv:2112.12344
Calvetti D, Reichel L (2003) Tikhonov regularization of large linear problems. BIT Numer Math 43(2):263–283. https://doi.org/10.1023/A:1026083619097
Calvetti D, Morigi S, Reichel L, Sgallari F (2000) Tikhonov regularization and the L-curve for large discrete ill-posed problems. J Comput Appl Math 123(1–2):423–446. https://doi.org/10.1016/S0377-0427(00)00414-3
Chen JL, Wilson CR, Tapley BD, Yang ZL, Niu GY (2009) 2005 drought event in the Amazon River basin as measured by GRACE and estimated by climate models. J Geophys Res Solid Earth. https://doi.org/10.1029/2008JB006056
Chen T, Kusche J, Shen Y, Chen Q (2020) A combined use of TSVD and tikhonov regularization for mass flux solution in tibetan plateau. Remote Sens 12(12):2045. https://doi.org/10.3390/rs12122045
Chen Q, Shen Y, Kusche J, Chen W, Chen T, Zhang X (2021) High-resolution GRACE monthly spherical harmonic solutions. J Geophys Res Solid Earth 126:e2019JB018892. https://doi.org/10.1029/2019JB018892
Chung J, Español MI (2017) Learning regularization parameters for general-form Tikhonov. Inverse Probl 33(7):074004. https://doi.org/10.1088/1361-6420/33/7/074004
Ditmar P (2022) How to quantify the accuracy of mass anomaly time-series based on GRACE data in the absence of knowledge about true signal? J Geod 96:54. https://doi.org/10.1007/s00190-022-01640-x
Ditmar P, Tangdamrongsub N, Ran J, Klees R (2018) Estimation and reduction of random noise in mass anomaly time-series from satellite gravity data by minimization of month-to-month year-to-year double differences. J Geodyn 119:9–22. https://doi.org/10.1016/j.jog.2018.05.003
Dykes L, Noschese S, Reichel L (2015) Rescaling the GSVD with application to ill-posed problems. Numer Algor 68(3):531–545. https://doi.org/10.1007/s11075-014-9859-3
Fenu C, Reichel L, Rodriguez G, Sadok H (2017) GCV for Tikhonov regularization by partial SVD. BIT Numer Math 57(4):1019–1039. https://doi.org/10.1007/s10543-017-0662-0
Fuhry M, Reichel L (2012) A new Tikhonov regularization method. Numer Algor 59(3):433–445. https://doi.org/10.1007/s11075-011-9498-x
Gazzola S, Novati P (2013) Multi-parameter Arnoldi–Tikhonov methods. Electron Trans Numer Anal 40:452–475
Golub GH, Heath M, Wahba G (1979) Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 21(2):215–223. https://doi.org/10.1080/00401706.1979.10489751
Hanke M (1996) Limitations of the L-curve method on ill-posed problems. BIT 36:287–301. https://doi.org/10.1007/BF01731984
Hansen PC (1989) Regularization, GSVD and truncated GSVD. BIT Numer Math 29(3):491–504. https://doi.org/10.1007/BF02219234
Hansen PC (1990) Truncated singular value decomposition solutions to discrete ill-posed problems with ill-determined numerical rank. SIAM J Sci Comput 11:503–518. https://doi.org/10.1137/091102
Hansen PC (1992) Analysis of discrete ill-posed problems by means of the L-curve. SIAM Rev 34(4):561–580. https://doi.org/10.1137/1034115
Hanson RJ (1971) A numerical method for solving Fredholm integral equations of the first kind using singular values. SIAM J Numer Anal 8(3):616–622. https://doi.org/10.1137/0708058
Hemmerle W (1975) An explicit solution for generalized ridge regression. Technometrics 17:309–314. https://doi.org/10.1080/00401706.1975.10489333
Hemmerle W, Brantle TF (1978) Explicit and constrained generalized ridge estimation. Technometrics 20:109–120. https://doi.org/10.1080/00401706.1978.10489634
Hoerl AE, Kennard RW (1970a) Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12:55–67. https://doi.org/10.1080/00401706.2000.10485983
Hoerl AE, Kennard RW (1970b) Ridge regression: applications to nonorthogonal problems. Technometrics 12:69–82. https://doi.org/10.1080/00401706.1970.10488635
Hoerl AE, Kennard RW (1976) Ridge regression iterative estimation of the biasing parameter. Commun Stat-Theory Methods 5(1):77–88
Jing W, Zhang P, Zhao X (2019) A comparison of different GRACE solutions in terrestrial water storage trend estimation over Tibetan Plateau. Sci Rep 9(1):1–10. https://doi.org/10.1038/s41598-018-38337-1
Kibria BG (2003) Performance of some new ridge regression estimators. Commun in Stat-Simul Comput 32(2):419–435. https://doi.org/10.1081/SAC-120017499
Koch KR (1999) Parameter estimation and hypothesis testing in linear models. Springer Science & Business Media, New York
Koch KR (2007) Introduction to Bayesian statistics. Springer Science & Business Media, Heidelberg
Koch KR, Kusche J (2002) Regularization of geopotential determination from satellite data by variance components. J Geodesy 76(5):259–268. https://doi.org/10.1007/s00190-002-0245-x
Kusche J, Klees R (2002) Regularization of gravity field estimation from satellite gravity gradients. J Geodesy 76(6):359–368. https://doi.org/10.1007/s00190-002-0257-6
Loomis BD, Rachlin KE, Luthcke SB (2019) Improved Earth oblateness rate reveals increased ice sheet losses and mass-driven sea level rise. Geophys Res Lett 46:6910–6917. https://doi.org/10.1029/2019GL082929
Lu S, Pereverzev SV (2011) Multi-parameter regularization and its numerical realization. Numer Math 118(1):1–31. https://doi.org/10.1007/s00211-010-0318-3
Moritz H (1980) Advanced physical geodesy. Herbert Wichmann Verlag, Karlsruhe
Morozov VA (1966) On the solution of functional equations by the method of regularization. In Doklady Akademii Nauk. Russian Academy of Sciences, vol 167, No 3, pp 510–512
Morozov VA (1984) Methods for solving incorrectly posed problems (translation ed.: Nashed MZ). Springer, Wien
Mu D, Yan H, Feng W, Peng P (2017) GRACE leakage error correction with regularization technique: case studies in Greenland and Antarctica. Geophys J Int 208(3):1775–1786. https://doi.org/10.1093/gji/ggw494
Nair M (2009) On Morozov’s discrepancy principle for nonlinear ill-posed equations. Bull Aust Math Soc 79(2):337–342. https://doi.org/10.1017/S0004972708001342
Peltier WR, Argus DF, Drummond R (2018) Comment on the paper by Purcell et al. 2016 entitled An assessment of ICE-6G_C (VM5a) glacial isostatic adjustment model. J Geophys Res Solid Earth 122:2019–2028. https://doi.org/10.1002/2016JB013844
Rezghi M, Hosseini SM (2009) A new variant of L-curve for Tikhonov regularization. J Comput Appl Math 231(2):914–924. https://doi.org/10.1016/j.cam.2009.05.016
Rummel R, Schwarz KP, Gerstl M (1979) Least squares collocation and regularization. Bull Geod 53:343–361. https://doi.org/10.1007/BF02522276
Sakumura C, Bettadpur S, Bruinsma S (2014) Ensemble prediction and intercomparison analysis of grace time variable gravity field models. Geophys Res Lett 41:1389–1397. https://doi.org/10.1002/2013GL058632
Save H, Bettadpur S, Tapley BD (2016) High-resolution CSR GRACE RL05 mascons. J Geophys Res Solid Earth 121(10):7547–7569. https://doi.org/10.1002/2016JB013007
Shen Y, Liu D (2002) An unbiased estimate of the variance of unit weight after regularization. Geomat Inf Sci Wuhan Univ 27:604–606
Shen Y, Xu P, Li B (2012) Bias-corrected regularized solution to inverse ill-posed models. J Geodesy 86(8):597–608. https://doi.org/10.1007/s00190-012-0542-y
Shen Y, Xu G (2013) Regularization and adjustment. In Sciences of geodesy-II. Springer, Berlin, Heidelberg, pp 293–337
Swenson S, Wahr J (2006) Post-processing removal of correlated errors in GRACE data. Geophys Res Lett 33:L08402. https://doi.org/10.1029/2005GL025285
Tapley B, Watkins M, Flechtner F et al (2019) Contributions of GRACE to understanding climate change. Nat Clim Change 5(5):358–369. https://doi.org/10.1038/s41558-019-0456-2
Tarantola A (1987) Inverse problem theory. Elsevier, Amsterdam
Tikhonov AN (1963a) Regularization of ill-posed problems. Dokl Akad Nauk SSSR 151(1):49–52
Tikhonov AN (1963b) Solution of incorrectly formulated problems and the regularization method. Dokl Akad Nauk SSSR 151(3):501–504
Vogel CR (1996) Non-convergence of the L-curve regularization parameter selection method. Inverse Probl 12:535–547. https://doi.org/10.1088/0266-5611/12/4/013
Vovk V (2013) Kernel ridge regression. In Empirical inference. Springer, Berlin, Heidelberg, pp 105–116
Wahr J, Molenaar M, Bryan F (1998) Time variability of the Earth’s gravity field: Hydrological and oceanic effects and their possible detection using GRACE. J Geophys Res Solid Earth 103:30205–30229. https://doi.org/10.1029/98JB02844
Wang Z (2012) Multi-parameter Tikhonov regularization and model function approach to the damped Morozov principle for choosing regularization parameters. J Comput Appl Math 236(7):1815–1832. https://doi.org/10.1016/j.cam.2011.10.014
Watkins MM, Wiese DN, Yuan D-N, Boening C, Landerer FW (2015) Improved methods for observing Earth’s time variable mass distribution with GRACE using spherical cap mascons. J Geophys Res Solid Earth 120:2648–2671. https://doi.org/10.1002/2014JB011547
Wood SN (2000) Modelling and smoothing parameter estimation with multiple quadratic penalties. J R Stat Soc Ser B (stat Methodol) 62(2):413–428. https://doi.org/10.1111/1467-9868.00240
Xu P (1992) Determination of surface gravity anomalies using gradiometric observables. Geophys J Int 110:321–332. https://doi.org/10.1111/j.1365-246X.1992.tb00877.x
Xu P (1998) Truncated SVD methods for discrete linear ill-posed problems. Geophys J Int 135:505–514. https://doi.org/10.1046/j.1365-246X.1998.00652.x
Xu P (2009) Iterative generalized cross-validation for fusing heteroscedastic data of inverse ill-posed problems. Geophys J Int 179(1):182–200. https://doi.org/10.1111/j.1365-246X.2009.04280.x
Xu P, Rummel R (1994) Generalized ridge regression method with applications in determination of potential fields. Manus Geod 20:8–20
Xu P, Fukuda Y, Liu Y (2006a) Multiple parameter regularization: numerical solutions and applications to the determination of geopotential from precise satellite orbits. J Geodesy 80(1):17–27. https://doi.org/10.1007/s00190-006-0025-0
Xu P, Shen Y, Fukuda Y, Liu Y (2006b) Variance component estimation in linear inverse ill-posed models. J Geodesy 80(2):69–81. https://doi.org/10.1007/s00190-006-0032-1
Xu P, Rummel R (1992) A generalized regularization method with applications in determination of potential fields. In: Holota P, Vermeer M (eds) Proceedings of 1st continental workshop on the geoid in Europe, Prague, pp 444–457
Acknowledgements
This work is sponsored by the Natural Science Foundation of China (42192532, 42274005, 41974002). The authors would like to thank, with gratitude, Prof Peiliang Xu from Kyoto University for providing valuable suggestions and for polishing the English which lead to a significant improvement and clarification of the paper.
Author information
Authors and Affiliations
Contributions
YY proposed the key idea, designed the research, processed data, and wrote the paper draft; YS supervised the research, revised the manuscript, and checked all the formulate; LY, BL, and QC revised the manuscript; WW provided and processed data.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Appendix of Proof
Appendix of Proof
1.1 Proof of Theorem 1
To find the value of \({\alpha }_{i}\) which make \(\mathrm{Tr}\left[\mathrm{MSE}\left({\hat{{\varvec{x}}}}_{\mathrm{GRR}}^{\prime}\right)\right]\) to be minimized, we differentiate \(\mathrm{Tr}\left[\mathrm{MSE}\left({\hat{{\varvec{x}}}}_{\mathrm{GRR}}^{\prime}\right)\right]\) with respect to \({\alpha }_{i}\), there is (Hoerl and Kennard 1970a, b).
It is easy to find that the first derivative function has one and only one zero point which is \(\frac{{\sigma }^{2}}{{\left({{\varvec{v}}}_{i}^{\mathrm{T}}{{\varvec{x}}}^{\prime}\right)}^{2}}\). Furthermore, at the point of \({\alpha }_{i}=\frac{{\sigma }^{2}}{{\left({{\varvec{v}}}_{i}^{\mathrm{T}}{\varvec{x}^\prime}\right)}^{2}}\), the second derivative function is given as
The second derivative function is greater than zero, which indicates that this point is a unique minimum point of the function \(\mathrm{Tr}\left[\mathrm{MSE}\left({\hat{{\varvec{x}}}}_{\mathrm{GRR}}^{\prime}\right)\right]\). Therefore, the optimal \({\alpha }_{i}\) which minimizes the \(\mathrm{Tr}\left[\mathrm{MSE}\left({\hat{{\varvec{x}}}}_{\mathrm{GRR}}^{\prime}\right)\right]\) is given by
1.2 Proof of Theorem 2
On the one hand, considering the proof by contradiction, if there is a null hypothesis that.
in which
If we set \({\alpha }_{0}=0, \left(i=1,\dots ,n\right)\), then there is
which is contradictory. It means that the null hypothesis is wrong, thus we have
Also, if there is a null hypothesis that
in which
If we set \({\alpha }_{j}=0,\left(i=1,\dots ,n\right)\), then there is
which is contradictory. It means that the null hypothesis is wrong, thus we have
Combining Eqs. (A17) and (A22), we have
On the other hand, we also consider the proof by contradiction. If there is a null hypothesis that
in which
and
As \({\alpha }_{i}\) could be any non-negative value, if we set \({\alpha }_{i}={\alpha }_{0}, \left(i=1,\dots ,n\right)\), then there is
which is contradictory. It means that the null hypothesis is wrong, thus we have
Also, if there is a null hypothesis that
in which
and
As \({\alpha }_{i}\) could be any non-negative value, if we set \({\alpha }_{i}={\alpha }_{j},\left(i=1,\dots ,n\right)\), then there is
which is contradictory. It means that the null hypothesis is wrong, thus we have
Combining Eqs. (A17) and (A22), we have
Finally, combining Eqs. (A17) and (A23), we have
1.3 Proof of Theorem 3
Assuming that \({\alpha }_{i}^{\left(t\right)}>0\) for all i and that the iterative procedure is convergent such that (Hemmerle 1975).
According to Eq. (53), we must then have the relationship
where
Solving Eq. (A26) for \({\alpha }_{i}^{\left(*\right)}\) we have
It is easy to find that the necessary condition for convergence is \({T}_{i}-4>0\).
Therefore, on the one hand, when \({T}_{i}-4>0\), if the initial value satisfied \(0\le {\alpha }_{i}^{\left(0\right)}/{\lambda }_{i}^{2}\le \frac{1}{2}{T}_{i}-1-\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}\), then the \({\alpha }_{i}^{\left(t\right)}/{\lambda }_{i}^{2}\) would experience an increasing tendency and finally converge to \(\frac{1}{2}{T}_{i}-1-\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}\), which is
Else if \(\frac{1}{2}{T}_{i}-1-\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}<{\alpha }_{i}^{\left(0\right)}/{\lambda }_{i}^{2}<\frac{1}{2}{T}_{i}-1+\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}\), then the \({\alpha }_{i}^{\left(t\right)}/{\lambda }_{i}^{2}\) would experience a decreasing tendency and finally converge to \(\frac{1}{2}{T}_{i}-1-\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}\) as the same, which is
However, if the initial value \({\alpha }_{i}^{\left(0\right)}/{\lambda }_{i}^{2}>\frac{1}{2}{T}_{i}-1+\sqrt{\frac{1}{4}{T}_{i}^{2}-{T}_{i}}\), then the \({\alpha }_{i}^{\left(t\right)}\) would experience a divergence tendency, which is
On the other hand, when \({T}_{i}-4>0\), then the \({\alpha }_{i}^{\left(t\right)}\) would always experience a divergence tendency, which is
1.4 Proof of Corollary 3.1
According to Eqs. (36) and (54), given a certain \({\hat{\sigma }}^{2}\), the expectation of \({T}_{i}\) is given by
1.5 Proof of Corollary 3.2
On the one hand, when \({T}_{i}>4\), the derivation of \({B}_{i}\) to \({T}_{i}\) satisfies that
which means that \({B}_{i}\) increases monotonically with \({T}_{i}\). Thus \({B}_{i}\) get the minimum value in case of \({T}_{i}=4\), which can be written as
Thus, we have
On the other hand
1.6 Proof of Corollary 3.3
On the one hand, when \({T}_{i}>4\), the derivation of \({P}_{i}\) to \({T}_{i}\) satisfies that
which means that \({P}_{i}\) decreases monotonically with \({T}_{i}\). Thus \({P}_{i}\) get the maximum value in case of \({T}_{i}=4\) which can be written as
Thus, we have
On the other hand, to approve \({P}_{i}>\frac{1}{{T}_{i}-1}\) is equivalent to approve
Since both sides of the inequality are positive when \({T}_{i}>4\), it is equivalent to
in which the left side can be expanded to
Thus, the original inequality is proved.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yu, Y., Yang, L., Shen, Y. et al. An iterative and shrinking generalized ridge regression for ill-conditioned geodetic observation equations. J Geod 98, 3 (2024). https://doi.org/10.1007/s00190-023-01795-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00190-023-01795-1