Search | arXiv e-print repository

Recycling MMGKS for large-scale dynamic and streaming data

Authors: Mirjeta Pasha, Eric de Sturler, Misha E. Kilmer

Abstract: Reconstructing high-quality images with sharp edges requires the use of edge-preserving constraints in the regularized form of the inverse problem. The use of the $\ell_q$-norm on the gradient of the image is a common such constraint. For implementation purposes, the $\ell_q$-norm term is typically replaced with a sequence of $\ell_2$-norm weighted gradient terms with the weights determined from t… ▽ More Reconstructing high-quality images with sharp edges requires the use of edge-preserving constraints in the regularized form of the inverse problem. The use of the $\ell_q$-norm on the gradient of the image is a common such constraint. For implementation purposes, the $\ell_q$-norm term is typically replaced with a sequence of $\ell_2$-norm weighted gradient terms with the weights determined from the current solution estimate. While (hybrid) Krylov subspace methods can be employed on this sequence, it would require generating a new Krylov subspace for every new two-norm regularized problem. The majorization-minimization Krylov subspace method (MM-GKS) addresses this disadvantage by combining norm reweighting with generalized Krylov subspaces (GKS). After projecting the problem using a small dimensional subspace - one that expands each iteration - the regularization parameter is selected. Basis expansion repeats until a sufficiently accurate solution is found. Unfortunately, for large-scale problems that require many expansion steps to converge, storage and the cost of repeated orthogonalizations presents overwhelming memory and computational requirements. In this paper we present a new method, recycled MM-GKS (RMM-GKS), that keeps the memory requirements bounded through recycling the solution subspace. Specifically, our method alternates between enlarging and compressing the GKS subspace, recycling directions that are deemed most important via one of our tailored compression routines. We further generalize the RMM-GKS approach to handle experiments where the data is either not all available simultaneously, or needs to be treated as such because of the extreme memory requirements. Numerical examples from dynamic photoacoustic tomography and streaming X-ray computerized tomography (CT) imaging are used to illustrate the effectiveness of the described methods. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: 25 pages, 13 figures, 3 tables

arXiv:2306.15049 [pdf, other]

Subspace Recycling for Sequences of Shifted Systems with Applications in Image Recovery

Authors: Misha E. Kilmer, Eric de Sturler

Abstract: For many applications involving a sequence of linear systems with slowly changing system matrices, subspace recycling, which exploits relationships among systems and reuses search space information, can achieve huge gains in iterations across the total number of linear system solves in the sequence. However, for general (i.e., non-identity) shifted systems with the shift value varying over a wide… ▽ More For many applications involving a sequence of linear systems with slowly changing system matrices, subspace recycling, which exploits relationships among systems and reuses search space information, can achieve huge gains in iterations across the total number of linear system solves in the sequence. However, for general (i.e., non-identity) shifted systems with the shift value varying over a wide range, the properties of the linear systems vary widely as well, which makes recycling less effective. If such a sequence of systems is embedded in a nonlinear iteration, the problem is compounded, and special approaches are needed to use recycling effectively. In this paper, we develop new, more efficient, Krylov subspace recycling approaches for large-scale image reconstruction and restoration techniques that employ a nonlinear iteration to compute a suitable regularization matrix. For each new regularization matrix, we need to solve regularized linear systems, ${\bf A} + γ_\ell {\bf E}_k$, for a sequence of regularization parameters, $γ_\ell$, to find the optimally regularized solution that, in turn, will be used to update the regularization matrix. In this paper, we analyze system and solution characteristics to choose appropriate techniques to solve each system rapidly. Specifically, we use an inner-outer recycling approach with a larger, principal recycle space for each nonlinear step and smaller recycle spaces for each shift. We propose an efficient way to obtain good initial guesses from the principle recycle space and smaller shift-specific recycle spaces that lead to fast convergence. Our method is substantially reduces the total number of matrix-vector products that would arise in a naive approach. Our approach is more generally applicable to sequences of shifted systems where the matrices in the sum are positive semi-definite. △ Less

Submitted 26 June, 2023; originally announced June 2023.

MSC Class: 65F10; 65F22

arXiv:2305.10965 [pdf, other]

Stop** Criteria for the Conjugate Gradient Algorithm in High-Order Finite Element Methods

Authors: Yichen Guo, Eric de Sturler, Tim Warburton

Abstract: We introduce three new stop** criteria that balance algebraic and discretization errors for the conjugate gradient algorithm applied to high-order finite element discretizations of Poisson problems. The current state of the art stop** criteria compare a posteriori estimates of discretization error against estimates of the algebraic error. Firstly, we propose a new error indicator derived from… ▽ More We introduce three new stop** criteria that balance algebraic and discretization errors for the conjugate gradient algorithm applied to high-order finite element discretizations of Poisson problems. The current state of the art stop** criteria compare a posteriori estimates of discretization error against estimates of the algebraic error. Firstly, we propose a new error indicator derived from a recovery-based error estimator that is less computationally expensive and more reliable. Secondly, we introduce a new stop** criterion that suggests stop** when the norm of the linear residual is less than a small fraction of an error indicator derived directly from the residual. This indicator shares the same mesh size and polynomial degree scaling as the norm of the residual, resulting in a robust criterion regardless of the mesh size, the polynomial degree, and the shape regularity of the mesh. Thirdly, in solving Poisson problems with highly variable piecewise constant coefficients, we introduce a subdomain-based criterion that recommends stop** when the norm of the linear residual restricted to each subdomain is smaller than the corresponding indicator also restricted to that subdomain. Numerical experiments, including tests with anisotropic meshes and highly variable piecewise constant coefficients, demonstrate that the proposed criteria efficiently avoid both premature termination and over-solving. △ Less

Submitted 18 May, 2023; originally announced May 2023.

Comments: 22 pages, 11 figures

MSC Class: 65N30; 65N22; 65F10

arXiv:2210.12053 [pdf, other]

Analysis of GMRES for Low-Rank and Small-Norm Perturbations of the Identity Matrix

Authors: Arielle K. Carr, Eric de Sturler, Mark Embree

Abstract: In many applications, linear systems arise where the coefficient matrix takes the special form ${\bf I} + {\bf K} + {\bf E}$, where ${\bf I}$ is the identity matrix of dimension $n$, ${\rm rank}({\bf K}) = p \ll n$, and $\|{\bf E}\| \leq ε< 1$. GMRES convergence rates for linear systems with coefficient matrices of the forms ${\bf I} + {\bf K}$ and ${\bf I} + {\bf E}$ are guaranteed by well-known… ▽ More In many applications, linear systems arise where the coefficient matrix takes the special form ${\bf I} + {\bf K} + {\bf E}$, where ${\bf I}$ is the identity matrix of dimension $n$, ${\rm rank}({\bf K}) = p \ll n$, and $\|{\bf E}\| \leq ε< 1$. GMRES convergence rates for linear systems with coefficient matrices of the forms ${\bf I} + {\bf K}$ and ${\bf I} + {\bf E}$ are guaranteed by well-known theory, but only relatively weak convergence bounds specific to matrices of the form ${\bf I} + {\bf K} + {\bf E}$ currently exist. In this paper, we explore the convergence properties of linear systems with such coefficient matrices by considering the pseudospectrum of ${\bf I} + {\bf K}$. We derive a bound for the GMRES residual in terms of $ε$ when approximately solving the linear system $({\bf I} + {\bf K} + {\bf E}){\bf x} = {\bf b}$ and identify the eigenvalues of ${\bf I} + {\bf K}$ that are sensitive to perturbation. In particular, while a clustered spectrum away from the origin is often a good indicator of fast GMRES convergence, that convergence may be slow when some of those eigenvalues are ill-conditioned. We show there can be at most $2p$ eigenvalues of ${\bf I} + {\bf K}$ that are sensitive to small perturbations. We present numerical results when using GMRES to solve a sequence of linear systems of the form $({\bf I} + {\bf K}_j + {\bf E}_j){\bf x}_j = {\bf b}_j$ that arise from the application of Broyden's method to solve a nonlinear partial differential equation. △ Less

Submitted 21 October, 2022; originally announced October 2022.

arXiv:2204.09815 [pdf, other]

Parametric Level-sets Enhanced To Improve Reconstruction (PaLEnTIR)

Authors: Ege Ozsar, Misha Kilmer, Eric Miller, Eric de Sturler, Arvind Saibaba

Abstract: We introduce PaLEnTIR, a significantly enhanced parametric level-set (PaLS) method addressing the restoration and reconstruction of piecewise constant objects. Our key contribution involves a unique PaLS formulation utilizing a single level-set function to restore scenes containing multi-contrast piecewise-constant objects without requiring knowledge of the number of objects or their contrasts. Un… ▽ More We introduce PaLEnTIR, a significantly enhanced parametric level-set (PaLS) method addressing the restoration and reconstruction of piecewise constant objects. Our key contribution involves a unique PaLS formulation utilizing a single level-set function to restore scenes containing multi-contrast piecewise-constant objects without requiring knowledge of the number of objects or their contrasts. Unlike standard PaLS methods employing radial basis functions (RBFs), our model integrates anisotropic basis functions (ABFs), thereby expanding its capacity to represent a wider class of shapes. Furthermore, PaLEnTIR improves the conditioning of the Jacobian matrix, required as part of the parameter identification process, and consequently accelerates optimization methods. We validate PaLEnTIR's efficacy through diverse experiments encompassing sparse and limited angle of view X-ray computed tomography (2D and 3D), nonlinear diffuse optical tomography (DOT), denoising, and deconvolution tasks using both real and simulated data sets. △ Less

Submitted 13 February, 2024; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: 28 pages, 35 figures

MSC Class: 65F22; 65F99; 65N21

arXiv:2107.05727 [pdf, other]

Efficient edge-preserving methods for dynamic inverse problems

Authors: Mirjeta Pasha, Arvind K. Saibaba, Silvia Gazzola, Malena I. Espanol, Eric de Sturler

Abstract: We consider efficient methods for computing solutions to dynamic inverse problems, where both the quantities of interest and the forward operator (measurement process) may change at different time instances but we want to solve for all the images simultaneously. We are interested in large-scale ill-posed problems that are made more challenging by their dynamic nature and, possibly, by the limited… ▽ More We consider efficient methods for computing solutions to dynamic inverse problems, where both the quantities of interest and the forward operator (measurement process) may change at different time instances but we want to solve for all the images simultaneously. We are interested in large-scale ill-posed problems that are made more challenging by their dynamic nature and, possibly, by the limited amount of available data per measurement step. To remedy these difficulties, we apply regularization methods that enforce simultaneous regularization in space and time (such as edge enhancement at each time instant and proximity at consecutive time instants) and achieve this with low computational cost and enhanced accuracy. More precisely, we develop iterative methods based on a majorization-minimization (MM) strategy with quadratic tangent majorant, which allows the resulting least squares problem to be solved with a generalized Krylov subspace (GKS) method; the regularization parameter can be defined automatically and efficiently at each iteration. Numerical examples from a wide range of applications, such as limited-angle computerized tomography (CT), space-time image deblurring, and photoacoustic tomography (PAT), illustrate the effectiveness of the described approaches. △ Less

Submitted 12 July, 2021; originally announced July 2021.

Comments: 30 pages, 10 figure, 3 tables

arXiv:2010.11447 [pdf, other]

Krylov Subspace Recycling for Evolving Structures

Authors: Matthias Bolten, Eric de Sturler, Camilla Hahn

Abstract: Krylov subspace recycling is a powerful tool for solving long series of large, sparse linear systems that change slowly. In PDE constrained shape optimization, these appear naturally, as hundreds or more optimization steps are needed with only small changes in the geometry. In this setting, however, applying Krylov subspace recycling can be difficult. As the geometry evolves, so does the finite el… ▽ More Krylov subspace recycling is a powerful tool for solving long series of large, sparse linear systems that change slowly. In PDE constrained shape optimization, these appear naturally, as hundreds or more optimization steps are needed with only small changes in the geometry. In this setting, however, applying Krylov subspace recycling can be difficult. As the geometry evolves, so does the finite element mesh, especially if re-meshing is needed. As a result, the number of algebraic degrees of freedom in the system may change from one optimization step to the next, and with it the size of the finite element system matrix. Changes in the mesh also lead to structural changes in the matrices. In the case of remeshing, even if the geometry changes only a little, the corresponding mesh might differ substantially from the previous one. This prevents any straightforward map** of the approximate invariant subspace of the linear system matrix (the focus of recycling in this paper) from one step to the next; similar problems arise for other selected subspaces. We present an algorithm for general meshes to map an approximate invariant subspace of the system matrix for the previous optimization step to an approximate invariant subspace of the system matrix for the current optimization step. We exploit the map from coefficient vectors to finite element functions on the mesh combined with function approximation on the finite element mesh. In addition, we develop a straightforward warm-start adaptation of the Krylov-Schur algorithm [G.W. Stewart, SIAM J. Matrix Anal. Appl. 23, 2001] to improve the approximate invariant subspace at the start of a new optimization step if needed. We demonstrate the effectiveness of our approach numerically with several proof of concept studies for a specific meshing technique. △ Less

Submitted 22 October, 2020; originally announced October 2020.

MSC Class: 65F10; 65N22

arXiv:2007.06027 [pdf, other]

Randomization for the Efficient Computation of Parametric Reduced Order Models for Inversion

Authors: Selin Aslan, Eric de Sturler, Serkan Gugercin

Abstract: Nonlinear parametric inverse problems appear in many applications. Here, we focus on diffuse optical tomography (DOT) in medical imaging to recover unknown images of interest, such as cancerous tissue in a given medium, using a mathematical (forward) model. The forward model in DOT is a diffusion-absorption model for the photon flux. The main bottleneck in these problems is the repeated evaluation… ▽ More Nonlinear parametric inverse problems appear in many applications. Here, we focus on diffuse optical tomography (DOT) in medical imaging to recover unknown images of interest, such as cancerous tissue in a given medium, using a mathematical (forward) model. The forward model in DOT is a diffusion-absorption model for the photon flux. The main bottleneck in these problems is the repeated evaluation of the large-scale forward model. For DOT, this corresponds to solving large linear systems for each source and frequency at each optimization step. Moreover, Newton-type methods, often the method of choice, require additional linear solves with the adjoint to compute derivative information. Emerging technology allows for large numbers of sources and detectors, making these problems prohibitively expensive. Reduced order models (ROM) have been used to drastically reduce the system size in each optimization step, while solving the inverse problem accurately. However, for large numbers of sources and detectors, just the construction of the candidate basis for the ROM projection space incurs a substantial cost, as matching the full parameter gradient matrix in interpolatory model reduction requires large linear solves for all sources and frequencies and all detectors and frequencies for each parameter interpolation point. As this candidate basis numerically has low rank, this construction is followed by a rank-revealing factorization that typically reduces the number of vectors in the candidate basis substantially. We propose to use randomization to approximate this basis with a drastically reduced number of large linear solves. We also provide a detailed analysis for the low-rank structure of the candidate basis for our problem of interest. Even though we focus on the DOT problem, the ideas presented are relevant to many other large scale inverse problems and optimization problems. △ Less

Submitted 12 July, 2020; originally announced July 2020.

Comments: 18 pages, 5 figures

MSC Class: 65F22; 65N22; 93A15; 93C05

arXiv:2007.00207 [pdf, other]

Hybrid Projection Methods with Recycling for Inverse Problems

Authors: Julianne Chung, Eric de Sturler, Jiahua Jiang

Abstract: Iterative hybrid projection methods have proven to be very effective for solving large linear inverse problems due to their inherent regularizing properties as well as the added flexibility to select regularization parameters adaptively. In this work, we develop Golub-Kahan-based hybrid projection methods that can exploit compression and recycling techniques in order to solve a broad class of inve… ▽ More Iterative hybrid projection methods have proven to be very effective for solving large linear inverse problems due to their inherent regularizing properties as well as the added flexibility to select regularization parameters adaptively. In this work, we develop Golub-Kahan-based hybrid projection methods that can exploit compression and recycling techniques in order to solve a broad class of inverse problems where memory requirements or high computational cost may otherwise be prohibitive. For problems that have many unknown parameters and require many iterations, hybrid projection methods with recycling can be used to compress and recycle the solution basis vectors to reduce the number of solution basis vectors that must be stored, while obtaining a solution accuracy that is comparable to that of standard methods. If reorthogonalization is required, this may also reduce computational cost substantially. In other scenarios, such as streaming data problems or inverse problems with multiple datasets, hybrid projection methods with recycling can be used to efficiently integrate previously computed information for faster and better reconstruction. Additional benefits of the proposed methods are that various subspace selection and compression techniques can be incorporated, standard techniques for automatic regularization parameter selection can be used, and the methods can be applied multiple times in an iterative fashion. Theoretical results show that, under reasonable conditions, regularized solutions for our proposed recycling hybrid method remain close to regularized solutions for standard hybrid methods and reveal important connections among the resulting projection matrices. Numerical examples from image processing show the potential benefits of combining recycling with hybrid projection methods. △ Less

Submitted 30 June, 2020; originally announced July 2020.

arXiv:2003.10938 [pdf, other]

Robust Parameter Inversion using Adaptive Reduced Order Models

Authors: Drayton Munster, Eric de Sturler

Abstract: Nonlinear parametric inverse problems appear in many applications and are typically very expensive to solve, especially if they involve many measurements. These problems pose huge computational challenges as evaluating the objective function or misfit requires the solution of a large number of parameterized partial differential equations, typically one per source term. Newton-type algorithms, whic… ▽ More Nonlinear parametric inverse problems appear in many applications and are typically very expensive to solve, especially if they involve many measurements. These problems pose huge computational challenges as evaluating the objective function or misfit requires the solution of a large number of parameterized partial differential equations, typically one per source term. Newton-type algorithms, which may be required for fast convergence, typically require the additional solution of a large number of adjoint problems. The use of parametric model reduction may substantially alleviate this problem. In [de Sturler, E., Gugercin, S., Kilmer, M. E., Chaturantabut, S., Beattie, C., and O'Connell, M. (2015). Nonlinear Parametric Inversion Using Interpolatory Model Reduction. SIAM Journal on Scientific Computing, 37(3)], interpolatory model reduction was successfully used to drastically speed up inversion for Diffuse Optical Tomography (DOT). However, when using model reduction in high dimensional parameter spaces, obtaining error bounds in parameter space is typically intractable. In this paper, we propose to use stochastic estimates to remedy this problem. At the cost of one (randomized) full-scale linear solve per optimization step we obtain a robust algorithm. Moreover, since we can now update the model when needed, this robustness allows us to further reduce the order of the reduced order model and hence the cost of computing and using it, further decreasing the cost of inversion. We also propose a method to update the model reduction basis that reduces the number of large linear solves required by 46%-98% compared with the fixed reduced-order model. We demonstrate that this leads to a highly efficient and robust inversion method. △ Less

Submitted 24 March, 2020; originally announced March 2020.

MSC Class: 65N21; 65N22; 65M32; 65K05; 65L20; 90C15; 90C30; 90C90

arXiv:2001.10347 [pdf, ps, other]

A survey of subspace recycling iterative methods

Authors: Kirk M. Soodhalter, Eric de Sturler, Misha Kilmer

Abstract: This survey concerns subspace recycling methods, a popular class of iterative methods that enable effective reuse of subspace information in order to speed up convergence and find good initial guesses over a sequence of linear systems with slowly changing coefficient matrices, multiple right-hand sides, or both. The subspace information that is recycled is usually generated during the run of an it… ▽ More This survey concerns subspace recycling methods, a popular class of iterative methods that enable effective reuse of subspace information in order to speed up convergence and find good initial guesses over a sequence of linear systems with slowly changing coefficient matrices, multiple right-hand sides, or both. The subspace information that is recycled is usually generated during the run of an iterative method (usually a Krylov subspace method) on one or more of the systems. Following introduction of definitions and notation, we examine the history of early augmentation schemes along with deflation preconditioning schemes and their influence on the development of recycling methods. We then discuss a general residual constraint framework through which many augmented Krylov and recycling methods can both be viewed. We review several augmented and recycling methods within this framework. We then discuss some known effective strategies for choosing subspaces to recycle before taking the reader through more recent developments that have generalized recycling for (sequences of) shifted linear systems, some of them with multiple right-hand sides in mind. We round out our survey with a brief review of application areas that have seen benefit from subspace recycling methods. △ Less

Submitted 29 July, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

Comments: 21 pages of text plus pages for references (27 total)

MSC Class: 65F10

arXiv:1706.05586 [pdf, other]

Randomized Approach to Nonlinear Inversion Combining Simultaneous Random and Optimized Sources and Detectors

Authors: Selin Aslan, Eric de Sturler, Misha E. Kilmer

Abstract: In partial differential equations-based (PDE-based) inverse problems with many measurements, many large-scale discretized PDEs must be solved for each evaluation of the misfit or objective function. In the nonlinear case, evaluating the Jacobian requires solving an additional set of systems. This leads to a tremendous computational cost, and this is by far the dominant cost for these problems. Sev… ▽ More In partial differential equations-based (PDE-based) inverse problems with many measurements, many large-scale discretized PDEs must be solved for each evaluation of the misfit or objective function. In the nonlinear case, evaluating the Jacobian requires solving an additional set of systems. This leads to a tremendous computational cost, and this is by far the dominant cost for these problems. Several authors have proposed randomization and stochastic programming techniques to drastically reduce the number of system solves by estimating the objective function using only a few appropriately chosen random linear combinations of the sources. While some have reported good solution quality at a greatly reduced cost, for our problem of interest, diffuse optical tomography, the approach often does not lead to sufficiently accurate solutions. We propose two improvements. First, to efficiently exploit Newton-type methods, we modify the stochastic estimates to include random linear combinations of detectors, drastically reducing the number of adjoint solves. Second, after solving to a modest tolerance, we compute a few simultaneous sources and detectors that maximize the Frobenius norm of the sampled Jacobian to improve the rate of convergence and obtain more accurate solutions. We complement these optimized simultaneous sources and detectors by random simultaneous sources and detectors constrained to a complementary subspace. Our approach leads to solutions of the same quality as obtained using all sources and detectors but at a greatly reduced computational cost, as the number of large-scale linear systems to be solved is significantly reduced. △ Less

Submitted 17 July, 2018; v1 submitted 17 June, 2017; originally announced June 2017.

Comments: The algorithm to select optimized sources and detectors and combining these with randomized sources and detectors has been substantially revised from the previous version

MSC Class: 65F22; 65N21; 65N22; 65M32; 62L20; 90C15

arXiv:1609.03099 [pdf, other]

doi 10.1016/j.cma.2017.06.035

Stochastic Sampling for Structural Topology Optimization with Many Load Cases: Density-Based and Ground Structure Approaches

Authors: Xiaojia Zhang, Eric de Sturler, Glaucio H. Paulino

Abstract: We propose an efficient probabilistic method to solve a deterministic problem -- we present a randomized optimization approach that drastically reduces the enormous computational cost of optimizing designs under many load cases for both continuum and truss topology optimization. Practical structural designs by topology optimization typically involve many load cases, possibly hundreds or more. The… ▽ More We propose an efficient probabilistic method to solve a deterministic problem -- we present a randomized optimization approach that drastically reduces the enormous computational cost of optimizing designs under many load cases for both continuum and truss topology optimization. Practical structural designs by topology optimization typically involve many load cases, possibly hundreds or more. The optimal design minimizes a, possibly weighted, average of the compliance under each load case (or some other objective). This means that in each optimization step a large finite element problem must be solved for each load case, leading to an enormous computational effort. On the contrary, the proposed randomized optimization method with stochastic sampling requires the solution of only a few (e.g., 5 or 6) finite element problems (large linear systems) per optimization step. Based on simulated annealing, we introduce a dam** scheme for the randomized approach. Through numerical examples in two and three dimensions, we demonstrate that the stochastic algorithm drastically reduces computational cost to obtain similar final topologies and results (e.g., compliance) compared with the standard algorithms. The results indicate that the dam** scheme is effective and leads to rapid convergence of the proposed algorithm. △ Less

Submitted 10 September, 2016; originally announced September 2016.

MSC Class: 65K10; 74P05; 49Q10

arXiv:1602.00138 [pdf, ps, other]

Computing Reduced Order Models via Inner-Outer Krylov Recycling in Diffuse Optical Tomography

Authors: Meghan O'Connell, Misha E. Kilmer, Eric de Sturler, Serkan Gugercin

Abstract: In nonlinear imaging problems whose forward model is described by a partial differential equation (PDE), the main computational bottleneck in solving the inverse problem is the need to solve many large-scale discretized PDEs at each step of the optimization process. In the context of absorption imaging in diffuse optical tomography, one approach to addressing this bottleneck proposed recently (de… ▽ More In nonlinear imaging problems whose forward model is described by a partial differential equation (PDE), the main computational bottleneck in solving the inverse problem is the need to solve many large-scale discretized PDEs at each step of the optimization process. In the context of absorption imaging in diffuse optical tomography, one approach to addressing this bottleneck proposed recently (de Sturler, et al, 2015) reformulates the viewing of the forward problem as a differential algebraic system, and then employs model order reduction (MOR). However, the construction of the reduced model requires the solution of several full order problems (i.e. the full discretized PDE for multiple right-hand sides) to generate a candidate global basis. This step is then followed by a rank-revealing factorization of the matrix containing the candidate basis in order to compress the basis to a size suitable for constructing the reduced transfer function. The present paper addresses the costs associated with the global basis approximation in two ways. First, we use the structure of the matrix to rewrite the full order transfer function, and corresponding derivatives, such that the full order systems to be solved are symmetric (positive definite in the zero frequency case). Then we apply MOR to the new formulation of the problem. Second, we give an approach to computing the global basis approximation dynamically as the full order systems are solved. In this phase, only the incrementally new, relevant information is added to the existing global basis, and redundant information is not computed. This new approach is achieved by an inner-outer Krylov recycling approach which has potential use in other applications as well. We show the value of the new approach to approximate global basis computation on two DOT absorption image reconstruction problems. △ Less

Submitted 6 March, 2016; v1 submitted 30 January, 2016; originally announced February 2016.

MSC Class: 65F10; 65F22

arXiv:1601.05883 [pdf, other]

Preconditioning Parametrized Linear Systems

Authors: Arielle Grim-McNally, Eric de Sturler, Serkan Gugercin

Abstract: Preconditioners are generally essential for fast convergence in the iterative solution of linear systems of equations. However, the computation of a good preconditioner can be expensive. So, while solving a sequence of many linear systems, it is advantageous to recycle preconditioners, that is, update a previous preconditioner and reuse the updated version. In this paper, we introduce a simple and… ▽ More Preconditioners are generally essential for fast convergence in the iterative solution of linear systems of equations. However, the computation of a good preconditioner can be expensive. So, while solving a sequence of many linear systems, it is advantageous to recycle preconditioners, that is, update a previous preconditioner and reuse the updated version. In this paper, we introduce a simple and effective method for doing this. Although our approach can be used for matrices changing slowly in any way, we focus on the important case of sequences of the type $(s_k\textbf{E}(\textbf{p}) + \textbf{A}(\textbf{p}))\textbf{x}_k = \textbf{b}_k$, where the right hand side may or may not change. More general changes in matrices will be discussed in a future paper. We update preconditioners by defining a map from a new matrix to a previous matrix, for example the first matrix in the sequence, and combine the preconditioner for this previous matrix with the map to define the new preconditioner. This approach has several advantages. The update is entirely independent from the original preconditioner, so it can be applied to any preconditioner. The possibly high cost of an initial preconditioner can be amortized over many linear solves. The cost of updating the preconditioner is more or less constant and independent of the original preconditioner. There is flexibility in balancing the quality of the map with the computational cost. In the numerical experiments section we demonstrate good results for several applications, in particular when using an algebraic multigrid preconditioner. △ Less

Submitted 17 December, 2020; v1 submitted 22 January, 2016; originally announced January 2016.

Comments: V2 Model Reduction. V3 analysis of sparsity patterns, Topopt added. Flow replace rail. Only early THT shifts. ILUTP, SAM implementations more efficient. ILUTP m-file included. V4 New pattern added to top opt sparsity pattern analysis/results. ILUTP m-file minor updates V5 Add AMG, remove THT V6 NO CHANGE TO PDF added ilutp anc file

arXiv:1501.03358 [pdf, other]

doi 10.1016/j.jcp.2015.09.040

Recycling Krylov subspaces for CFD applications and a new hybrid recycling solver

Authors: Amit Amritkar, Eric de Sturler, Katarzyna Świrydowicz, Danesh Tafti, Kapil Ahuja

Abstract: We focus on robust and efficient iterative solvers for the pressure Poisson equation in incompressible Navier-Stokes problems. Preconditioned Krylov subspace methods are popular for these problems, with BiCGStab and GMRES(m) most frequently used for nonsymmetric systems. BiCGStab is popular because it has cheap iterations, but it may fail for stiff problems, especially early on as the initial gues… ▽ More We focus on robust and efficient iterative solvers for the pressure Poisson equation in incompressible Navier-Stokes problems. Preconditioned Krylov subspace methods are popular for these problems, with BiCGStab and GMRES(m) most frequently used for nonsymmetric systems. BiCGStab is popular because it has cheap iterations, but it may fail for stiff problems, especially early on as the initial guess is far from the solution. Restarted GMRES is better, more robust, in this phase, but restarting may lead to very slow convergence. Therefore, we evaluate the rGCROT method for these systems. This method recycles a selected subspace of the search space (called recycle space) after a restart. This generally improves the convergence drastically compared with GMRES(m). Recycling subspaces is also advantageous for subsequent linear systems, if the matrix changes slowly or is constant. However, rGCROT iterations are still expensive in memory and computation time compared with those of BiCGStab. Hence, we propose a new, hybrid approach that combines the cheap iterations of BiCGStab with the robustness of rGCROT. For the first few time steps the algorithm uses rGCROT and builds an effective recycle space, and then it recycles that space in the rBiCGStab solver. We evaluate rGCROT on a turbulent channel flow problem, and we evaluate both rGCROT and the new, hybrid combination of rGCROT and rBiCGStab on a porous medium flow problem. We see substantial performance gains on both problems. △ Less

Submitted 25 September, 2015; v1 submitted 1 January, 2015; originally announced January 2015.

Comments: 26 pages, 7 figures

arXiv:1406.2831 [pdf, ps, other]

Recycling BiCGSTAB with an Application to Parametric Model Order Reduction

Authors: Kapil Ahuja, Peter Benner, Eric de Sturler, Lihong Feng

Abstract: Krylov subspace recycling is a process for accelerating the convergence of sequences of linear systems. Based on this technique, the recycling BiCG algorithm has been developed recently. Here, we now generalize and extend this recycling theory to BiCGSTAB. Recycling BiCG focuses on efficiently solving sequences of dual linear systems, while the focus here is on efficiently solving sequences of sin… ▽ More Krylov subspace recycling is a process for accelerating the convergence of sequences of linear systems. Based on this technique, the recycling BiCG algorithm has been developed recently. Here, we now generalize and extend this recycling theory to BiCGSTAB. Recycling BiCG focuses on efficiently solving sequences of dual linear systems, while the focus here is on efficiently solving sequences of single linear systems (assuming non-symmetric matrices for both recycling BiCG and recycling BiCGSTAB). As compared with other methods for solving sequences of single linear systems with non-symmetric matrices (e.g., recycling variants of GMRES), BiCG based recycling algorithms, like recycling BiCGSTAB, have the advantage that they involve a short-term recurrence, and hence, do not suffer from storage issues and are also cheaper with respect to the orthogonalizations. We modify the BiCGSTAB algorithm to use a recycle space, which is built from left and right approximate invariant subspaces. Using our algorithm for a parametric model order reduction example gives good results. We show about 40% savings in the number of matrix-vector products and about 35% savings in runtime. △ Less

Submitted 25 January, 2015; v1 submitted 11 June, 2014; originally announced June 2014.

Comments: 18 pages, 5 figures, Extended version of Max Planck Institute report (MPIMD/13-21)

MSC Class: 65F10; 65N22; 93A15; 93C05

arXiv:1311.0922 [pdf, other]

doi 10.1137/130946320

Nonlinear Parametric Inversion using Interpolatory Model Reduction

Authors: Eric de Sturler, Serkan Gugercin, Misha E. Kilmer, Saifon Chaturantabut, Christopher Beattie, Meghan O'Connell

Abstract: Nonlinear parametric inverse problems appear in several prominent applications; one such application is Diffuse Optical Tomography (DOT) in medical image reconstruction. Such inverse problems present huge computational challenges, mostly due to the need for solving a sequence of large-scale discretized, parametrized, partial differential equations (PDEs) in the forward model. In this paper, we sho… ▽ More Nonlinear parametric inverse problems appear in several prominent applications; one such application is Diffuse Optical Tomography (DOT) in medical image reconstruction. Such inverse problems present huge computational challenges, mostly due to the need for solving a sequence of large-scale discretized, parametrized, partial differential equations (PDEs) in the forward model. In this paper, we show how interpolatory parametric model reduction can significantly reduce the cost of the inversion process in DOT by drastically reducing the computational cost of solving the forward problems. The key observation is that function evaluations for the underlying optimization problem may be viewed as transfer function evaluations along the imaginary axis; a similar observation holds for Jacobian evaluations as well. This motivates the use of system-theoretic model order reduction methods. We discuss the construction and use of interpolatory parametric reduced models as surrogates for the full forward model. Within the DOT setting, these surrogate models can approximate both the cost functional and the associated Jacobian with very little loss of accuracy while significantly reducing the cost of the overall inversion process. Four numerical examples illustrate the efficiency of the proposed approach. Although we focus on DOT in this paper, we believe that our approach is applicable much more generally. △ Less

Submitted 4 November, 2013; originally announced November 2013.

MSC Class: 65F10; 65N22; 93A15; 93C05

Journal ref: SIAM Journal on Scientific Computing, Vol. 37, Issue: 3, B495--B517, 2015

arXiv:1305.5796 [pdf, other]

Efficient methods for computing observation impact in 4D-Var data assimilation

Authors: Alexandru Cioaca, Adrian Sandu, Eric de Sturler

Abstract: This paper presents a practical computational approach to quantify the effect of individual observations in estimating the state of a system. Such an analysis can be used for pruning redundant measurements, and for designing future sensor networks. The mathematical approach is based on computing the sensitivity of the reanalysis (unconstrained optimization solution) with respect to the data. The c… ▽ More This paper presents a practical computational approach to quantify the effect of individual observations in estimating the state of a system. Such an analysis can be used for pruning redundant measurements, and for designing future sensor networks. The mathematical approach is based on computing the sensitivity of the reanalysis (unconstrained optimization solution) with respect to the data. The computational cost is dominated by the solution of a linear system, whose matrix is the Hessian of the cost function, and is only available in operator form. The right hand side is the gradient of a scalar cost function that quantifies the forecast error of the numerical model. The use of adjoint models to obtain the necessary first and second order derivatives is discussed. We study various strategies to accelerate the computation, including matrix-free iterative solvers, preconditioners, and an in-house multigrid solver. Experiments are conducted on both a small-size shallow-water equations model, and on a large-scale numerical weather prediction model, in order to illustrate the capabilities of the new methodology. △ Less

Submitted 18 July, 2013; v1 submitted 24 May, 2013; originally announced May 2013.

Report number: CSL-TR-2-2013

arXiv:1010.0762 [pdf, ps, other]

Recycling BiCG with an Application to Model Reduction

Authors: Kapil Ahuja, Eric de Sturler, Serkan Gugercin, Eun R. Chang

Abstract: Science and engineering problems frequently require solving a sequence of dual linear systems. Besides having to store only few Lanczos vectors, using the BiConjugate Gradient method (BiCG) to solve dual linear systems has advantages for specific applications. For example, using BiCG to solve the dual linear systems arising in interpolatory model reduction provides a backward error formulation in… ▽ More Science and engineering problems frequently require solving a sequence of dual linear systems. Besides having to store only few Lanczos vectors, using the BiConjugate Gradient method (BiCG) to solve dual linear systems has advantages for specific applications. For example, using BiCG to solve the dual linear systems arising in interpolatory model reduction provides a backward error formulation in the model reduction framework. Using BiCG to evaluate bilinear forms -- for example, in quantum Monte Carlo (QMC) methods for electronic structure calculations -- leads to a quadratic error bound. Since our focus is on sequences of dual linear systems, we introduce recycling BiCG, a BiCG method that recycles two Krylov subspaces from one pair of dual linear systems to the next pair. The derivation of recycling BiCG also builds the foundation for develo** recycling variants of other bi-Lanczos based methods, such as CGS, BiCGSTAB, QMR, and TFQMR. We develop an augmented bi-Lanczos algorithm and a modified two-term recurrence to include recycling in the iteration. The recycle spaces are approximate left and right invariant subspaces corresponding to the eigenvalues closest to the origin. These recycle spaces are found by solving a small generalized eigenvalue problem alongside the dual linear systems being solved in the sequence. We test our algorithm in two application areas. First, we solve a discretized partial differential equation (PDE) of convection-diffusion type. Such a problem provides well-known test cases that are easy to test and analyze further. Second, we use recycling BiCG in the Iterative Rational Krylov Algorithm (IRKA) for interpolatory model reduction. IRKA requires solving sequences of slowly changing dual linear systems. We show up to 70% savings in iterations, and also demonstrate that for a model reduction problem BiCG takes (about) 50% more time than recycling BiCG. △ Less

Submitted 16 October, 2011; v1 submitted 5 October, 2010; originally announced October 2010.

Comments: 25 pages, 6 figures, 3 tables

MSC Class: 65F10; 65N22; 93A15; 93C05

arXiv:1009.4975 [pdf, ps, other]

Dynamic Adaptive Mesh Refinement for Topology Optimization

Authors: Shun Wang, Eric de Sturler, Glaucio H. Paulino

Abstract: We present an improved method for topology optimization with both adaptive mesh refinement and derefinement. Since the total volume fraction in topology optimization is usually modest, after a few initial iterations the domain of computation is largely void. Hence, it is inefficient to have many small elements, in such regions, that contribute significantly to the overall computational cost but co… ▽ More We present an improved method for topology optimization with both adaptive mesh refinement and derefinement. Since the total volume fraction in topology optimization is usually modest, after a few initial iterations the domain of computation is largely void. Hence, it is inefficient to have many small elements, in such regions, that contribute significantly to the overall computational cost but contribute little to the accuracy of computation and design. At the same time, we want high spatial resolution for accurate three-dimensional designs to avoid postprocessing or interpretation as much as possible. Dynamic adaptive mesh refinement (AMR) offers the possibility to balance these two requirements. We discuss requirements on AMR for topology optimization and the algorithmic features to implement them. The numerical design problems demonstrate (1) that our AMR strategy for topology optimization leads to designs that are equivalent to optimal designs on uniform meshes, (2) how AMR strategies that do not satisfy the postulated requirements may lead to suboptimal designs, and (3) that our AMR strategy significantly reduces the time to compute optimal designs. △ Less

Submitted 25 September, 2010; originally announced September 2010.

Comments: adaptive mesh refinement, topology optimization, iterative solvers

arXiv:1008.5113 [pdf, ps, other]

Improved Scaling for Quantum Monte Carlo on Insulators

Authors: Kapil Ahuja, Bryan K. Clark, Eric de Sturler, David M. Ceperley, Jeongnim Kim

Abstract: Quantum Monte Carlo (QMC) methods are often used to calculate properties of many body quantum systems. The main cost of many QMC methods, for example the variational Monte Carlo (VMC) method, is in constructing a sequence of Slater matrices and computing the ratios of determinants for successive Slater matrices. Recent work has improved the scaling of constructing Slater matrices for insulators so… ▽ More Quantum Monte Carlo (QMC) methods are often used to calculate properties of many body quantum systems. The main cost of many QMC methods, for example the variational Monte Carlo (VMC) method, is in constructing a sequence of Slater matrices and computing the ratios of determinants for successive Slater matrices. Recent work has improved the scaling of constructing Slater matrices for insulators so that the cost of constructing Slater matrices in these systems is now linear in the number of particles, whereas computing determinant ratios remains cubic in the number of particles. With the long term aim of simulating much larger systems, we improve the scaling of computing the determinant ratios in the VMC method for simulating insulators by using preconditioned iterative solvers. The main contribution of this paper is the development of a method to efficiently compute for the Slater matrices a sequence of preconditioners that make the iterative solver converge rapidly. This involves cheap preconditioner updates, an effective reordering strategy, and a cheap method to monitor instability of ILUTP preconditioners. Using the resulting preconditioned iterative solvers to compute determinant ratios of consecutive Slater matrices reduces the scaling of QMC algorithms from O(n^3) per sweep to roughly O(n^2), where n is the number of particles, and a sweep is a sequence of n steps, each attempting to move a distinct particle. We demonstrate experimentally that we can achieve the improved scaling without increasing statistical errors. Our results show that preconditioned iterative solvers can dramatically reduce the cost of VMC for large(r) systems. △ Less

Submitted 6 May, 2011; v1 submitted 30 August, 2010; originally announced August 2010.

Comments: 24 pages, 10 figures

Showing 1–22 of 22 results for author: de Sturler, E