An incomplete generalized minimum backward perturbation algorithm for large nonsymmetric linear systems

Lei SUN

Front. Math. China, 2023, 18(3): 203-222. DOI: 10.3868/s140-DDD-023-0014-x

RESEARCH ARTICLE

Abstract

This paper presents the truncated version of the generalized minimum backward error algorithm (GMBACK): the incomplete generalized minimum backward perturbation algorithm (IGMBACK) for large nonsymmetric linear systems. It is based on an incomplete orthogonalization of the Krylov vectors in question, and gives an approximate or quasi-minimum backward perturbation solution over the Krylov subspace. Theoretical properties of IGMBACK, including finite termination and the existence and uniqueness of the solution, are discussed in detail, and practical implementation issues of the IGMBACK algorithm are considered. Numerical experiments show that the IGMBACK method is usually more efficient than GMBACK and GMRES, and that IGMBACK and GMBACK often have better convergence performance than GMRES. In particular, for sensitive matrices and right-hand sides parallel to the left singular vectors corresponding to the smallest singular values of the coefficient matrices, GMRES does not necessarily converge, while IGMBACK and GMBACK usually converge and outperform GMRES.


Keywords

Nonsymmetric linear systems / Krylov subspace methods / minimum backward perturbation / incomplete orthogonalization process / GMBACK / GMRES

Cite this article

Lei SUN. An incomplete generalized minimum backward perturbation algorithm for large nonsymmetric linear systems. Front. Math. China, 2023, 18(3): 203-222 DOI:10.3868/s140-DDD-023-0014-x


1 Introduction

In many applied sciences and engineering computations it is often necessary to solve large nonsymmetric sparse linear systems:

$$Ax = b, \tag{1}$$

where $A \in \mathbb{R}^{n\times n}$ is nonsingular and $x, b \in \mathbb{R}^n$. Krylov subspace methods [20], such as the generalized minimum residual method (GMRES [4, 22]), are very effective for solving (1); they often use the residual norm in the termination criterion. If the approximate solution is accurate, then the residual norm is small. Conversely, a small residual norm does not imply that the approximate solution is accurate, especially when $A$ is ill-conditioned. To overcome this shortcoming of the residual norm as a termination criterion, the idea of using the backward perturbation norm [10] as the termination criterion was proposed in [1]. For nonsymmetric linear systems, Kasenally minimized the backward perturbation norm $\|\Delta A\|_F$ on $A$ by seeking an approximate solution of the perturbed equation $(A - \Delta A)x_m = b$; that is, he solved

$$\min_{x_m \in x_0 + K_m(A, r_0)} \|\Delta A\|_F \quad\text{subject to}\quad (A - \Delta A)x_m = b. \tag{2}$$

This yields the generalized minimum backward perturbation method (GMBACK algorithm [13]) for solving large nonsymmetric linear systems. In this notation, $x_m$ denotes the approximate solution of (1), of the form $x_m = x_0 + t_m$ with $t_m \in K_m(A, r_0)$; $x_0$ denotes the initial estimate of the solution, and $r_0 := b - Ax_0$. $\Delta A$ and $\Delta b$ denote perturbations of the matrix $A$ and the vector $b$, respectively, forming the joint backward perturbation matrix $[\Delta A, \Delta b]$. In the late 1990s, Kasenally, Simoncini and Zhihao Cao generalized the GMBACK algorithm by perturbing $b$ in addition to $A$, i.e., by solving

$$\min_{x_m \in x_0 + K_m(A, r_0)} \|[\Delta A, \Delta b]\|_F \quad\text{subject to}\quad (A - \Delta A)x_m = b + \Delta b.$$

This gives the minimum joint backward perturbation method (Minpert algorithm [8, 14]) for solving large nonsymmetric linear systems. The GMRES algorithm can be regarded as minimizing the perturbation norm $\|\Delta b\|_2$ on $b$ by seeking an approximate solution of $Ax_m = b - \Delta b$; that is, it solves

$$\min_{x_m \in x_0 + K_m(A, r_0)} \|\Delta b\|_2 \quad\text{subject to}\quad Ax_m = b - \Delta b.$$

Therefore, the GMBACK and GMRES algorithms can be regarded as minimizing the perturbation norm on the coefficient matrix $A$ and on the vector $b$, respectively, while the Minpert algorithm minimizes the norm of the joint backward perturbation matrix $[\Delta A, \Delta b]$ on both $A$ and $b$.

The GMBACK, GMRES, and Minpert algorithms all use the Arnoldi process to generate a set of basis vectors $v_1, v_2, \ldots, v_m$ of $K_m(A, r_0)$. This means that these algorithms use long recurrences, so the computational work and storage grow rapidly with the number of steps, and the algorithms become unusable once the number of steps is large enough. Therefore these algorithms usually need to be restarted; but for difficult problems, even when combined with preconditioning techniques, the restart length required to ensure convergence can still be quite large. To overcome the drawbacks of long recurrences, a popular technique is to resort to truncation strategies: only a few, rather than all, of the previously computed vectors are used to construct a new short recurrence for computing the subsequent vectors, which greatly reduces the computational work and storage. The truncated forms of the GMRES algorithm were given in [5, 6]: the quasi-GMRES method (QGMRES), or incomplete generalized minimum residual method (IGMRES). Sun Lei gave the truncated form of the Minpert algorithm in [25]: the incomplete minimum joint backward perturbation method (IMinpert algorithm). However, the truncated form of the GMBACK algorithm has not yet been given. In this paper, a set of basis vectors $v_1, v_2, \ldots, v_m$ of $K_m(A, r_0)$ is generated by means of the incomplete orthogonalization process [12, 20, 21], and the truncated form of the GMBACK algorithm is given: the incomplete generalized minimum backward perturbation method (IGMBACK). As with GMBACK, the approximate solution of IGMBACK is generated by solving the minimization problem (2).
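
To make the truncation idea concrete, the following is a minimal sketch of one step of the incomplete orthogonalization process with depth $q$: the new Krylov vector is orthogonalized only against the previous $q$ basis vectors, giving a short recurrence. The sketch is in Python/NumPy (not the notation or code of this paper); the function name and array layout are illustrative assumptions.

```python
import numpy as np

def iom_step(A, V, H, j, q):
    """One step of the incomplete orthogonalization process IOM(q):
    orthogonalize A v_j only against the last q basis vectors (short recurrence).
    V is n x (m+1) with columns v_1, v_2, ...; H is (m+1) x m; j is 0-based."""
    w = A @ V[:, j]
    for i in range(max(0, j - q + 1), j + 1):     # only the previous q vectors
        H[i, j] = V[:, i] @ w
        w -= H[i, j] * V[:, i]
    H[j + 1, j] = np.linalg.norm(w)
    if H[j + 1, j] != 0.0:                        # otherwise: lucky breakdown
        V[:, j + 1] = w / H[j + 1, j]
    return H[j + 1, j]
```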

The structure of this paper is as follows. Section 2 gives the theoretical derivation of the IGMBACK algorithm, states the algorithm, and presents some theoretical results, including the finite termination of the algorithm and the existence and uniqueness of the solution. Section 3 describes the practical implementation of the IGMBACK algorithm. In Section 4 we show with numerical examples that IGMBACK is generally more efficient than GMBACK and GMRES, and that IGMBACK and GMBACK generally converge better than GMRES; in particular, the restarted GMRES algorithm does not necessarily converge when the coefficient matrix is a sensitive matrix and the right-hand side $b$ is parallel to the left singular vector corresponding to the smallest singular value of $A$, whereas IGMBACK and GMBACK generally do converge and converge better than GMRES. Section 5 summarizes the paper and discusses future work.

2 IGMBACK algorithm

2.1 Analysis of the backward perturbation matrix

The following lemma gives the general form of the backward perturbation matrix $\Delta A$.

Lemma 2.1  Suppose that $m$ steps of the incomplete orthogonalization process have been performed and a set of basis vectors $v_1, v_2, \ldots, v_m$ and $v_{m+1}$ has been obtained; denote $V_m = [v_1, v_2, \ldots, v_m]$, $V_{m+1} = [v_1, v_2, \ldots, v_m, v_{m+1}]$. At the same time, an upper Hessenberg matrix $H_m \in \mathbb{R}^{(m+1)\times m}$ is obtained satisfying

$$AV_m = V_{m+1}H_m = V_m\bar H_m + h_{m+1,m}v_{m+1}e_m^T,$$

where $\bar H_m$ is obtained from $H_m$ by deleting its last row. The approximate solution of equation (1) can be written as

$$x_m = x_0 + V_my_m, \qquad y_m \in \mathbb{R}^m. \tag{3}$$

Denote $\beta = \|r_0\|_2$. Then the set $S = \{\Delta A\}$ of backward perturbation matrices satisfying $(A - \Delta A)x_m = b$ can be expressed as

$$S = \bigl\{\, V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T + R(I - w_mw_m^T) \;:\; R \in \mathbb{R}^{n\times n} \,\bigr\}, \tag{4}$$

where $w_m = x_m\|x_m\|_2^{-1}$.

Proof  Substituting $AV_m = V_{m+1}H_m$ and (3) into $(A - \Delta A)x_m = b$, we get

$$\Delta A\,w_m = V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-1}.$$

Since the general solution of $\Delta A\,w_m = c$ with $\|w_m\|_2 = 1$ is $\Delta A = c\,w_m^T + R(I - w_mw_m^T)$ for arbitrary $R \in \mathbb{R}^{n\times n}$, we immediately obtain (4) (see [3]). □

By Lemma 2.1 we can further determine the backward perturbation matrix $\Delta_{\min}$ in $S$ with the smallest Frobenius norm, together with its norm $\|\Delta_{\min}\|_F$.

Theorem 2.1  The backward perturbation matrix $\Delta_{\min}$ of minimum Frobenius norm in $S$ can be expressed as

$$\Delta_{\min} := V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T = -\frac{r_mw_m^T}{\|x_m\|_2},$$

where $r_m = b - Ax_m$. Consequently, the backward perturbation norm is $\|\Delta_{\min}\|_F = \|r_m\|_2/\|x_m\|_2$.

Proof  We seek the matrix of minimum Frobenius norm in $S$. According to Lemma 2.1, let

$$g(y_m, R) = \bigl\| V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T + R(I - w_mw_m^T) \bigr\|_F^2.$$

We can write $g(y_m, R)$ as

$$g(y_m, R) = \bigl\| \mathrm{vec}\{V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T\} + \{(I - w_mw_m^T)\otimes I\}\,\mathrm{vec}(R) \bigr\|_2^2,$$

where $\mathrm{vec}(\cdot)$ stacks the columns of a matrix into a vector and $\otimes$ denotes the Kronecker (tensor) product. We minimize $g(y_m, R)$ over $R$: by optimization theory, $\nabla_{\mathrm{vec}(R)}\,g(y_m, R) = 0$, and since $(I - w_mw_m^T)x_m = 0$ the first term is orthogonal to the range of $(I - w_mw_m^T)\otimes I$, so the stationarity condition reduces to $R(I - w_mw_m^T) = 0$. Thus the minimizer is the first term in (4), and the conclusion of the theorem holds. □
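
As a quick numerical sanity check of Theorem 2.1 (a sketch assuming NumPy; the matrix and vectors below are arbitrary illustrative data, not produced by the algorithm), the rank-one matrix $\Delta_{\min} = -r_mw_m^T/\|x_m\|_2$ indeed satisfies $(A - \Delta_{\min})x_m = b$ and $\|\Delta_{\min}\|_F = \|r_m\|_2/\|x_m\|_2$:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 8))
b = rng.standard_normal(8)
x_m = rng.standard_normal(8)                 # any trial approximate solution

r_m = b - A @ x_m                            # residual
w_m = x_m / np.linalg.norm(x_m)              # unit vector along x_m
Delta_min = -np.outer(r_m, w_m) / np.linalg.norm(x_m)

print(np.allclose((A - Delta_min) @ x_m, b))                  # perturbed system solved exactly
print(np.isclose(np.linalg.norm(Delta_min, 'fro'),
                 np.linalg.norm(r_m) / np.linalg.norm(x_m)))  # Frobenius norm as in Theorem 2.1
```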

2.2 Derivation of the IGMBACK algorithm

This section derives the IGMBACK algorithm. Lemma 2.1 gives the general form (4) of the backward perturbation matrix $\Delta A$, and the proof of Theorem 2.1 introduces the function $g(y_m, R)$ built from (4). The only free variables in $g(y_m, R)$ are $R$ and $y_m$, so the minimization problem (2) can be transformed into $\min_{y_m\in\mathbb{R}^m,\,R\in\mathbb{R}^{n\times n}} g(y_m, R)$, i.e.,

$$\min_{y_m\in\mathbb{R}^m,\;R\in\mathbb{R}^{n\times n}} \bigl\| V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T + R(I - w_mw_m^T) \bigr\|_F^2. \tag{5}$$

According to Theorem 2.1, problem (5) can be further reduced to

$$\min_{y_m\in\mathbb{R}^m} \bigl\| V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T \bigr\|_F^2. \tag{6}$$

Since $V_{m+1} = [v_1, v_2, \ldots, v_m, v_{m+1}]$ does not have orthonormal columns, minimizing (6) directly can be computationally expensive, and since

$$\bigl\| V_{m+1}(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T \bigr\|_F \le \|V_{m+1}\|_F\,\frac{\|H_my_m - \beta e_1\|_2}{\|x_m\|_2} = \sqrt{m+1}\;\frac{\|H_my_m - \beta e_1\|_2}{\|x_m\|_2}, \tag{7}$$

we can choose $y_m$ such that $\|H_my_m - \beta e_1\|_2/\|x_m\|_2$ is minimal. That is, problem (6) is approximated by

$$\min_{y_m\in\mathbb{R}^m} \frac{\|H_my_m - \beta e_1\|_2^2}{\|x_m\|_2^2}. \tag{8}$$

For convenience, we define two matrices $L_m \in \mathbb{R}^{(m+1)\times(m+1)}$ and $G_m \in \mathbb{R}^{n\times(m+1)}$ by

$$L_m = [H_m,\ -\beta e_1], \qquad G_m = [V_m,\ x_0]. \tag{9}$$

The following theorem gives the solution of the minimization problem (8).

Theorem 2.2  Assume that the $m$-step incomplete orthogonalization process has been performed and that $x_m$ in problem (8) has the form (3). Let $\{\lambda_i, \mu_i\}_{i=1,2,\ldots,m+1}$ be all generalized eigenvalues and corresponding eigenvectors of $(L_m^TL_m, G_m^TG_m)$, ordered so that $\lambda_i \ge \lambda_{i+1}$, $i = 1, 2, \ldots, m$. If the last component of the vector $\mu_{m+1}$ is nonzero, i.e., $\mu_{m+1,m+1} \ne 0$, then the solution $y_m$ of the minimization problem (8) satisfies

$$\begin{bmatrix} y_m \\ 1 \end{bmatrix} = \frac{1}{\mu_{m+1,m+1}}\,\mu_{m+1} \tag{10}$$

and

$$\|\Delta_{\min}\|_F \le \sqrt{m+1}\;\lambda_{m+1}^{1/2}. \tag{11}$$

Proof  The preceding derivation transforms problem (2) into the minimization problem (8). By

$$\min_{y_m\in\mathbb{R}^m}\frac{\|H_my_m - \beta e_1\|_2^2}{\|x_m\|_2^2} = \min_{y_m\in\mathbb{R}^m}\frac{\|[H_m,\ -\beta e_1][y_m^T\ 1]^T\|_2^2}{\|[V_m,\ x_0][y_m^T\ 1]^T\|_2^2} = \min_{\mu\in\mathbb{R}^{m+1}}\frac{\|L_m\mu\|_2^2}{\|G_m\mu\|_2^2}$$

and the Courant–Fischer theorem [24], we have

$$\min_{\mu\in\mathbb{R}^{m+1}}\frac{\|L_m\mu\|_2^2}{\|G_m\mu\|_2^2} = \lambda_{m+1}, \tag{12}$$

where $\lambda_{m+1}$ is the smallest generalized eigenvalue of $(L_m^TL_m, G_m^TG_m)$. Since $\mu_{m+1,m+1} \ne 0$, the solution $y_m$ of (8) can be computed from equation (10). From (7) we then obtain (11). □
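
The content of Theorem 2.2 can be checked numerically: build $L_m$ and $G_m$ as in (9), take the eigenvector of the smallest generalized eigenvalue of $(L_m^TL_m, G_m^TG_m)$, and recover $y_m$ from (10). The sketch below (assuming SciPy; the matrices are random stand-ins, not produced by an actual incomplete orthogonalization) confirms that the recovered $y_m$ attains the minimum value $\lambda_{m+1}$ of (8):

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(1)
n, m, beta = 50, 6, 2.0
H = np.triu(rng.standard_normal((m + 1, m)), k=-1)    # Hessenberg stand-in for H_m
V = np.linalg.qr(rng.standard_normal((n, m)))[0]      # stand-in for V_m
x0 = rng.standard_normal(n)                           # generic x0, not in span(V)

e1 = np.zeros(m + 1); e1[0] = 1.0
L = np.hstack([H, -beta * e1[:, None]])               # L_m = [H_m, -beta*e1], eq. (9)
G = np.hstack([V, x0[:, None]])                       # G_m = [V_m, x0],       eq. (9)

lam, U = eigh(L.T @ L, G.T @ G)                       # ascending; lam[0] is lambda_{m+1}
mu = U[:, 0]
y = mu[:-1] / mu[-1]                                  # eq. (10): [y_m; 1] = mu / mu_{m+1,m+1}

ratio = lambda yy: (np.linalg.norm(H @ yy - beta * e1) / np.linalg.norm(x0 + V @ yy)) ** 2
print(np.isclose(ratio(y), lam[0]))                   # the minimum of (8) equals lambda_{m+1}
print(all(ratio(y + 0.1 * rng.standard_normal(m)) >= lam[0] - 1e-12
          for _ in range(5)))                         # nearby y do no better
```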

By Theorem 2.2 we arrive at the IGMBACK algorithm. In order to reduce its storage and computational requirements, the algorithm is used in a restarted form with restart length $m$. The main steps of IGMBACK(m) are as follows.

Algorithm 1  Restarted IGMBACK(m):

① Choose the initial estimate $x_0$ for (1) and the parameter $q$ with $2 \le q \le m$; compute $r_0 := b - Ax_0$ and $v_1 := r_0/\beta$, where $\beta := \|r_0\|_2$.

② Incomplete orthogonalization process (q): perform $m$ steps of the incomplete orthogonalization process to compute $V_{m+1}$ and $H_m$; for $j = 1, 2, \ldots, m$:

i. Compute $\hat v_{j+1} := Av_j$.

ii. For $i = \max\{1, j-q+1\}, \ldots, j-1, j$, compute $h_{ij} = v_i^T\hat v_{j+1}$, $\hat v_{j+1} := \hat v_{j+1} - h_{ij}v_i$.

iii. Compute $h_{j+1,j} = \|\hat v_{j+1}\|_2$. If $h_{j+1,j} \ne 0$, set $v_{j+1} := \hat v_{j+1}/h_{j+1,j}$; otherwise stop.

③ Solve the generalized eigenvalue problem

$$L_m^TL_mu = \lambda\, G_m^TG_mu, \tag{13}$$

where $L_m$ and $G_m$ are defined in (9).

④ Compute the approximate solution $x_m^{IGB} := x_0 + V_my_m$ from equation (10) of Theorem 2.2.

⑤ Restart: set $\bigl\|(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T\bigr\|_F = \lambda_{m+1}^{1/2}$. If the termination condition is met, stop; otherwise set $x_0 := x_m^{IGB}$, compute $r_0 := b - Ax_0$ and $v_1 := r_0/\beta$, where $\beta := \|r_0\|_2$, and return to step ②.
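
A compact, self-contained sketch of Algorithm 1 in Python/SciPy is given below. It only illustrates the structure of the restarted IGMBACK(m) iteration under stated assumptions (dense NumPy arrays, no breakdown handling, generalized eigensolve by scipy.linalg.eigh); it is not the authors' MATLAB implementation.

```python
import numpy as np
from scipy.linalg import eigh

def igmback(A, b, x0, m=20, q=15, tol=1e-7, max_restarts=200):
    """Restarted IGMBACK(m) with incomplete orthogonalization depth q (Algorithm 1).
    Minimal sketch: lucky breakdowns and other safeguards are omitted."""
    n = A.shape[0]
    x = x0.astype(float).copy()
    for _ in range(max_restarts):
        r = b - A @ x
        beta = np.linalg.norm(r)
        # step 2: incomplete orthogonalization process IOM(q)
        V = np.zeros((n, m + 1))
        H = np.zeros((m + 1, m))
        V[:, 0] = r / beta
        for j in range(m):
            w = A @ V[:, j]
            for i in range(max(0, j - q + 1), j + 1):   # only the last q vectors
                H[i, j] = V[:, i] @ w
                w -= H[i, j] * V[:, i]
            H[j + 1, j] = np.linalg.norm(w)
            V[:, j + 1] = w / H[j + 1, j]
        # steps 3-4: smallest generalized eigenpair of (L^T L, G^T G), eqs. (9)-(10)
        e1 = np.zeros(m + 1); e1[0] = 1.0
        L = np.hstack([H, -beta * e1[:, None]])
        G = np.hstack([V[:, :m], x[:, None]])
        lam, U = eigh(L.T @ L, G.T @ G)                 # ascending eigenvalues
        mu = U[:, 0]
        y = mu[:-1] / mu[-1]                            # eq. (10)
        x = x + V[:, :m] @ y
        # step 5: restart test on the monitor lambda_{m+1}^{1/2}
        if np.sqrt(max(lam[0], 0.0)) <= tol:
            break
    return x
```

The stopping monitor here is $\lambda_{m+1}^{1/2}$ as in step ⑤; in the experiments of Section 4 the backward perturbation norm $\|r_m\|_2/\|x_m\|_2$ is monitored instead.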

Denote

$$P = L_m^TL_m = \begin{bmatrix} H_m^TH_m & -\beta H_m^Te_1 \\ -\beta e_1^TH_m & \beta^2 \end{bmatrix}, \qquad Q = G_m^TG_m = \begin{bmatrix} V_m^TV_m & V_m^Tx_0 \\ x_0^TV_m & x_0^Tx_0 \end{bmatrix}. \tag{14}$$

The generalized eigenvalue problem (13) can then be written as $Pu = \lambda Qu$.

The minimization problem (12) can be rewritten as

$$\lambda_{m+1}^{1/2} = \min_{\mu\in\mathbb{R}^{m+1}} \left(\frac{\|G_m\mu\|_2}{\|\mu\|_2}\right)^{-1}\frac{\|L_m\mu\|_2}{\|\mu\|_2}. \tag{15}$$

Using equation (15), we can estimate the range of $\bigl\|(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T\bigr\|_F = \lambda_{m+1}^{1/2}$, the quantity appearing in the termination condition of the algorithm. This is shown in the following theorem. Here $x_m = x_0 + V_my_m$ is the approximate solution produced by the IGMBACK algorithm.

Theorem 2.3  Assume that $m$ steps of incomplete orthogonalization have been performed. Let $\sigma_1(G_m)$ denote the largest singular value of $G_m$, and let $\sigma_{m+1}(L_m)$, $\sigma_{m+1}(G_m)$ denote the smallest singular values of $L_m$, $G_m$, respectively. If $\sigma_{m+1}(G_m) > 0$, then

$$\sigma_{m+1}(L_m)\,\sigma_1^{-1}(G_m) \;\le\; \lambda_{m+1}^{1/2} \;\le\; \sigma_{m+1}(L_m)\,\sigma_{m+1}^{-1}(G_m).$$

Proof  By equation (15), we have

$$\lambda_{m+1}^{1/2} \ge \min_{z_1\in\mathbb{R}^{m+1}}\left(\frac{\|G_mz_1\|_2}{\|z_1\|_2}\right)^{-1}\cdot\min_{z_2\in\mathbb{R}^{m+1}}\frac{\|L_mz_2\|_2}{\|z_2\|_2} = \sigma_1^{-1}(G_m)\,\sigma_{m+1}(L_m)$$

and

$$\lambda_{m+1}^{1/2} \le \max_{z_1\in\mathbb{R}^{m+1}}\left(\frac{\|G_mz_1\|_2}{\|z_1\|_2}\right)^{-1}\cdot\min_{z_2\in\mathbb{R}^{m+1}}\frac{\|L_mz_2\|_2}{\|z_2\|_2} = \sigma_{m+1}^{-1}(G_m)\,\sigma_{m+1}(L_m).$$

Thus the conclusion of the theorem holds. □
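
A direct numerical check of these bounds (assuming SciPy; $L$ and $G$ below are random stand-ins for $L_m$ and $G_m$, not matrices produced by the algorithm):

```python
import numpy as np
from scipy.linalg import eigh, svdvals

rng = np.random.default_rng(2)
L = rng.standard_normal((7, 7))             # stands in for L_m
G = rng.standard_normal((30, 7))            # stands in for G_m (full column rank)

lam_min = eigh(L.T @ L, G.T @ G, eigvals_only=True)[0]    # lambda_{m+1}
sL, sG = svdvals(L), svdvals(G)                           # singular values, descending
print(sL[-1] / sG[0] <= np.sqrt(lam_min) <= sL[-1] / sG[-1])   # Theorem 2.3 bounds
```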

2.3 Theoretical analysis of the IGMBACK algorithm

This section studies the IGMBACK algorithm theoretically, in terms of finite termination and the existence and uniqueness of the solution.

2.3.1 Finite termination of the algorithm

Suppose that after $m$ steps of incomplete orthogonalization we obtain $h_{m+1,m} = 0$, i.e., $P$ is singular; then the vector $v_{m+1}$ cannot be formed. When this happens we call it a lucky breakdown [22].

Suppose the columns of $H_{m-1}$ are linearly independent and the minimum-Frobenius-norm backward perturbation matrices $\Delta_i$ generated by the IGMBACK algorithm satisfy $\Delta_i \ne 0$ for $i < m$. According to Theorem 2.1, a sufficient condition for $\Delta_{\min} = \Delta_m = 0$ is $H_my_m - \beta e_1 = 0$; then we have

$$\Delta_{\min} = 0 \iff h_{m+1,m} = 0.$$

In fact, assume $h_{m+1,m} = 0$ and take $y_m = \bar H_m^{-1}\beta e_1$; then $\Delta_{\min} = 0$, and thus $x_m = x_0 + V_m\bar H_m^{-1}\beta e_1$ is an exact solution of the system (1). Conversely, if $\Delta_{\min} = 0$, then $H_my_m - \beta e_1 = 0$. Write $y_m = [(y_m^{(1)})^T, y_m^{(2)}]^T$ with $y_m^{(2)} \in \mathbb{R}$. From the last row of $H_my_m - \beta e_1 = 0$ we get $h_{m+1,m}y_m^{(2)} = 0$. If $y_m^{(2)} = 0$, then $H_{m-1}y_m^{(1)} - \beta e_1 = 0$, hence $\Delta_{m-1} = 0$, which contradicts the assumption above; therefore $h_{m+1,m} = 0$.

Since a lucky breakdown must occur after at most $m = n$ steps, the following corollary holds.

Corollary 2.1  For any $A \in \mathbb{R}^{n\times n}$ and $b \in \mathbb{R}^n$, the IGMBACK algorithm converges in at most $n$ steps.

In view of the above discussion, $h_{i+1,i} \ne 0$ for $i \le m$ is assumed in the rest of this paper; that is, $P$ is assumed to be nonsingular.

2.3.2 Existence and uniqueness of solutions

There are two situations in which the solution of the algorithm may fail to exist: one is when the generalized eigenvalue problem $Pu = \lambda Qu$ has degenerate generalized eigenvalues; the other is when $\mu_{m+1,m+1} = 0$ in (10).

Lemma 2.2  Suppose the $m$-step incomplete orthogonalization process has been performed. Then $Q = G_m^TG_m$ is singular if and only if $x_0 = 0$ or $x_0 \in K_m(A, r_0)$.

Proof  By elementary linear algebra, $Q = G_m^TG_m$ is singular if and only if $\mathrm{rank}(G_m) < m+1$, where $G_m$ is as in (9). On the one hand, if $Q = G_m^TG_m$ is singular, then $\mathrm{rank}(G_m) < m+1$, i.e., the columns of $G_m = [V_m, x_0]$ are linearly dependent. Since the columns of $V_m$ are linearly independent, $x_0$ can be written (uniquely) as a linear combination of the columns of $V_m$, i.e., $x_0 \in K_m(A, r_0)$. On the other hand, if $x_0 = 0$ or $x_0 \in K_m(A, r_0)$, i.e., $x_0$ is a linear combination of the columns of $V_m$, then the columns of $G_m$ are linearly dependent, so $\mathrm{rank}(G_m) < m+1$, i.e., $Q$ is singular. □

By Lemma 2.2, $Q$ may be singular. In the practical implementation of the IGMBACK algorithm we choose $x_0$ such that $x_0 \notin K_m(A, r_0)$, which guarantees that $Q$ is nonsingular. The generalized eigenvalue problem $Pu = \lambda Qu$ is then a non-degenerate generalized eigenvalue problem [7], without degenerate generalized eigenvalues and eigenvectors. In general, the dimension $m$ of $K_m(A, r_0)$ used in the IGMBACK algorithm is much smaller than the order $n$ of $A$, so there is ample freedom to choose $x_0$ in $\mathbb{R}^n$ with $x_0 \notin K_m(A, r_0)$. Throughout the rest of the paper we assume that $x_0 \notin K_m(A, r_0)$, i.e., that $Q$ is nonsingular.

The following theorem gives a necessary and sufficient condition for the IGMBACK algorithm to fail to form an approximate solution when the multiplicity of $\lambda_{m+1}$ is 1.

Theorem 2.4  Suppose $\{\lambda_{m+1}, \mu_{m+1}\}$ is the smallest generalized eigenvalue of $(P, Q)$ together with its corresponding eigenvector, $Q$ is nonsingular, and the multiplicity of $\lambda_{m+1}$ is 1. Write $\mu_{m+1} = [\hat\mu^T, \mu_{m+1,m+1}]^T$. Then

$$\mu_{m+1,m+1} = 0 \iff H_m^TH_m\hat\mu = \lambda_{m+1}V_m^TV_m\hat\mu \ \ \text{and}\ \ \hat\mu \perp (\lambda_{m+1}V_m^Tx_0 + \beta H_m^Te_1).$$

Proof  Suppose first that $\mu_{m+1,m+1} = 0$. Substituting the expressions for $P, Q$ in (14) into $Pu = \lambda Qu$, we have

$$\begin{bmatrix} H_m^TH_m & -\beta H_m^Te_1 \\ -\beta e_1^TH_m & \beta^2 \end{bmatrix}\begin{bmatrix} \hat\mu \\ 0 \end{bmatrix} = \lambda_{m+1}\begin{bmatrix} V_m^TV_m & V_m^Tx_0 \\ x_0^TV_m & x_0^Tx_0 \end{bmatrix}\begin{bmatrix} \hat\mu \\ 0 \end{bmatrix}.$$

From the first block row of this matrix equation we get $H_m^TH_m\hat\mu = \lambda_{m+1}V_m^TV_m\hat\mu$. From the second row we get $-\beta e_1^TH_m\hat\mu = \lambda_{m+1}x_0^TV_m\hat\mu$, i.e., $(\lambda_{m+1}V_m^Tx_0 + \beta H_m^Te_1)^T\hat\mu = 0$, that is,

$$\hat\mu \perp (\lambda_{m+1}V_m^Tx_0 + \beta H_m^Te_1).$$

Conversely, assume $H_m^TH_m\hat\mu = \lambda_{m+1}V_m^TV_m\hat\mu$ and $(\lambda_{m+1}V_m^Tx_0 + \beta H_m^Te_1)^T\hat\mu = 0$. Then $[\hat\mu^T, 0]^T$ is an eigenvector of $Pu = \lambda Qu$ corresponding to the generalized eigenvalue $\lambda_{m+1}$. Since $Q$ is nonsingular, $Pu = \lambda Qu$ has no degenerate generalized eigenvalues or eigenvectors, and since the multiplicity of $\lambda_{m+1}$ is 1, every eigenvector of $Pu = \lambda Qu$ corresponding to $\lambda_{m+1}$ has the form $k[\hat\mu^T, 0]^T$ ($k \in \mathbb{R}$, $k \ne 0$). Therefore $\mu_{m+1,m+1} = 0$. □

The existence and uniqueness of the solution are discussed further below. For convenience, following (14) and (8), denote

$$P_m = P = L_m^TL_m, \quad Q_m = Q = G_m^TG_m = \begin{bmatrix} V_m^TV_m & V_m^Tx_0 \\ x_0^TV_m & x_0^Tx_0 \end{bmatrix}, \quad \tilde\Delta_m = \min_{y_m\in\mathbb{R}^m}\frac{[y_m^T\ 1]\,P_m\,[y_m^T\ 1]^T}{[y_m^T\ 1]\,Q_m\,[y_m^T\ 1]^T}.$$

Let’s start with a lemma.

Lemma 2.3  Suppose that $m+1$ steps of the incomplete orthogonalization process have been performed. Let $\{\lambda_i\}_{i=1,2,\ldots,m+1}$ and $\{\hat\lambda_i\}_{i=1,2,\ldots,m+2}$ denote all generalized eigenvalues of $(P_m, Q_m)$ and $(P_{m+1}, Q_{m+1})$, respectively, in non-increasing order. Then

$$\hat\lambda_i \ge \lambda_i \ge \hat\lambda_{i+1}, \qquad i = 1, 2, \ldots, m+1.$$

Proof  Let $\{\lambda_i, \mu_i\}_{i=1,2,\ldots,m+1}$ be all generalized eigenpairs of $(P_m, Q_m)$. According to Subsection 2.3.1, if $h_{m+2,m+1} \ne 0$, then $L_{m+1}$ and $L_m$ are nonsingular. Therefore we can write $Pu = \lambda Qu$ as $(L_m^{-1})^TQ_mL_m^{-1}z_i = \xi_iz_i$, where $z_i = L_m\mu_i$ and $\xi_i$ is the reciprocal of $\lambda_i$; the ordering of $\{\lambda_i\}_{i=1,2,\ldots,m+1}$ gives $0 < \xi_1 \le \xi_2 \le \cdots \le \xi_{m+1}$. From

$$L_{m+1} = [H_{m+1},\ -\beta e_1] = \begin{bmatrix} L_m & \hat h_{m+1} \\ 0 & h_{m+2,m+1} \end{bmatrix}P_{m+2}(m+1, m+2),$$

where $\hat h_{m+1} = [I_{m+1}\ 0]H_{m+1}e_{m+1}$ and $P_{m+2}(m+1, m+2)$ denotes the elementary permutation matrix obtained by interchanging columns $m+1$ and $m+2$ of the identity matrix $I_{m+2}$, one gets that

$$L_{m+1}^{-1} = P_{m+2}(m+1, m+2)\begin{bmatrix} L_m^{-1} & -L_m^{-1}\hat h_{m+1}h_{m+2,m+1}^{-1} \\ 0 & h_{m+2,m+1}^{-1} \end{bmatrix}.$$

Substituting the above equation into $(L_{m+1}^{-1})^TQ_{m+1}L_{m+1}^{-1}$, we have

$$(L_{m+1}^{-1})^TQ_{m+1}L_{m+1}^{-1} = \begin{bmatrix} (L_m^{-1})^TQ_mL_m^{-1} & f \\ f^T & g \end{bmatrix},$$

where

$$g = h_{m+2,m+1}^{-2} + 2h_{m+2,m+1}^{-1}d_{m+1}h_{m+1} + d_{m+1}Q_md_{m+1}^T, \qquad f = h_{m+2,m+1}^{-1}(L_m^{-1})^T(h_{m+1} - Q_mL_m^{-1}\hat h_{m+1}),$$
$$h_{m+1} = \begin{bmatrix} V_m^Tv_{m+1} \\ x_0^Tv_{m+1} \end{bmatrix} \in \mathbb{R}^{(m+1)\times 1}, \qquad d_{m+1} = -h_{m+2,m+1}^{-1}\hat h_{m+1}^T(L_m^{-1})^T.$$

Thus, according to Theorem IV.4.2 in reference [24], one obtains

$$\hat\xi_{m+2} \ge \xi_{m+1} \ge \hat\xi_{m+1} \ge \xi_m \ge \cdots \ge \hat\xi_2 \ge \xi_1 \ge \hat\xi_1,$$

where $\{\hat\xi_i\}_{i=1,2,\ldots,m+2}$ are all the eigenvalues of $(L_{m+1}^{-1})^TQ_{m+1}L_{m+1}^{-1}$. Since $\xi_i$ and $\hat\xi_i$ are the reciprocals of $\lambda_i$ and $\hat\lambda_i$, respectively, the conclusion of the lemma holds. □
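
The interlacing property of Lemma 2.3 can also be observed numerically. The sketch below (SciPy assumed; the banded Hessenberg matrix, unit-norm basis vectors and $x_0$ are random stand-ins for the quantities produced by the incomplete orthogonalization) compares the generalized eigenvalues of $(P_m, Q_m)$ and $(P_{m+1}, Q_{m+1})$:

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(4)
n, m, beta = 40, 5, 1.5
Vfull = rng.standard_normal((n, m + 1))
Vfull /= np.linalg.norm(Vfull, axis=0)                        # unit columns, as in IOM(q)
Hfull = np.triu(rng.standard_normal((m + 2, m + 1)), k=-1)    # Hessenberg stand-in
x0 = rng.standard_normal(n)

def gen_eigs(k):
    """Generalized eigenvalues of (P_k, Q_k), in non-increasing order."""
    e1 = np.zeros(k + 1); e1[0] = 1.0
    L = np.hstack([Hfull[:k + 1, :k], -beta * e1[:, None]])
    G = np.hstack([Vfull[:, :k], x0[:, None]])
    return eigh(L.T @ L, G.T @ G, eigvals_only=True)[::-1]

lam, lam_hat = gen_eigs(m), gen_eigs(m + 1)
print(all(lam_hat[i] >= lam[i] >= lam_hat[i + 1] for i in range(m + 1)))   # Lemma 2.3
```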

According to Lemma 2.3, we obtain the following conclusions about the convergence, existence and uniqueness of the solution.

Corollary 2.2  Assume that $m+1$ steps of the incomplete orthogonalization process have been performed. Then the following hold:

(a) $\tilde\Delta_{m+1} \le \tilde\Delta_m$.

(b) If $\hat\lambda_{m+2} < \hat\lambda_{m+1}$ and $\mu_{m+2,m+2} \ne 0$, then the IGMBACK algorithm has a unique approximate solution.

(c) If $\hat\lambda_{m+2} = \hat\lambda_{m+1}$ and an approximate solution of the IGMBACK algorithm exists, then the approximate solution is not unique and $\tilde\Delta_{m+1} = \tilde\Delta_m$, i.e., the IGMBACK algorithm stagnates from step $m$ to step $m+1$.

Now suppose the smallest generalized eigenvalue $\lambda_{m+1}$ of $(P_m, Q_m)$ has multiplicity greater than 1; say $\lambda_{m-k+1} > \lambda_{m-k+2} = \cdots = \lambda_{m+1}$, i.e., $\lambda_{m+1}$ has multiplicity $k$, with corresponding orthonormal eigenvectors $\mu_{m-k+2}, \mu_{m-k+3}, \ldots, \mu_{m+1}$. Let

$$U_k = [\mu_{m-k+2}, \mu_{m-k+3}, \ldots, \mu_{m+1}] \in \mathbb{R}^{(m+1)\times k}.$$

Then for any $z = [z_1^T, z_2]^T \in \mathrm{span}\{U_k\}$ with $z_2 \in \mathbb{R}$ and $z_2 \ne 0$, an approximate solution of the IGMBACK algorithm has the form

$$x_m = x_0 + V_m\frac{z_1}{z_2}.$$

In the practical implementation of the algorithm we choose $z_1/z_2$ such that $\|z_1/z_2\|_2^2$ is minimal, which makes the resulting approximate solution unique: among all unit vectors in $\mathrm{span}\{U_k\}$, i.e., in the set $\{z = [z_1^T, z_2]^T \in \mathrm{span}\{U_k\} : \|z\|_2 = 1\}$, choosing the vector that maximizes $|z_2|$ minimizes $\|z_1/z_2\|_2$.

3 Execution of the IGMBACK algorithm

This section considers how to solve the generalized eigenvalue problem (13), i.e., $Pu = \lambda Qu$.

In the implementation of Algorithm 1, the main work lies in solving the generalized eigenvalue problem (13). By Theorem 2.2, it suffices to compute the smallest generalized eigenvalue of $(L_m^TL_m, G_m^TG_m)$ and the corresponding eigenvector, i.e., the pair $\{\lambda_{m+1}, \mu_{m+1}\}$. Since $L_m^TL_m - \lambda G_m^TG_m$ is symmetric, we could compute $\{\lambda_{m+1}, \mu_{m+1}\}$ by inverse iteration. However, when the condition numbers of $L_m^TL_m$ and $G_m^TG_m$ are large, computing $\{\lambda_{m+1}, \mu_{m+1}\}$ by inverse iteration becomes difficult. For this reason, in the GMBACK algorithm [13] the corresponding generalized eigenvalue problem is solved via an associated singular value decomposition problem, which avoids the difficulties caused by the large condition numbers of $L_m^TL_m$ and $G_m^TG_m$. Can Algorithm 1 also compute $\{\lambda_{m+1}, \mu_{m+1}\}$ by solving an associated singular value decomposition problem?

Noting that the matrix $Q$ contains $V_m^TV_m$, compute the Cholesky factorization $V_m^TV_m = XX^T$, where $X$ is lower triangular. Then

$$Q = \begin{bmatrix} X & 0 \\ x_0^TV_m(X^{-1})^T & \sqrt{\|x_0\|_2^2 - \|X^{-1}V_m^Tx_0\|_2^2} \end{bmatrix}\begin{bmatrix} X^T & X^{-1}V_m^Tx_0 \\ 0 & \sqrt{\|x_0\|_2^2 - \|X^{-1}V_m^Tx_0\|_2^2} \end{bmatrix} = ZZ^T,$$

where

$$Z = \begin{bmatrix} X & 0 \\ x_0^TV_m(X^{-1})^T & \sqrt{\|x_0\|_2^2 - \|X^{-1}V_m^Tx_0\|_2^2} \end{bmatrix}.$$

Substituting $Q = ZZ^T$ into $Pu = \lambda Qu$ transforms the original generalized eigenvalue problem into an ordinary eigenvalue problem for a symmetric matrix:

$$Z^{-1}P(Z^{-1})^Tv = \lambda v, \tag{16}$$

where $v = Z^Tu$. When $\lambda$ is small, since $P = [H_m,\ -\beta e_1]^T[H_m,\ -\beta e_1]$, it is not easy to compute the smallest eigenvalue $\lambda_{\min}$ of $Z^{-1}P(Z^{-1})^T$ and the corresponding eigenvector $v$ accurately. We therefore transform problem (16) into the computation of the smallest singular value, and the corresponding right singular vector, of the matrix

$$\bigl[\,H_m(X^{-1})^T,\ \ -H_m(V_m^TV_m)^{-1}V_m^Tx_0\,x_l - \beta x_le_1\,\bigr], \tag{17}$$

which solves the original problem very well. In equation (17), $x_l = 1/\sqrt{\|x_0\|_2^2 - \|X^{-1}V_m^Tx_0\|_2^2}$. The matrix in (17) is precisely $[H_m,\ -\beta e_1](Z^{-1})^T$, whose squared singular values are the eigenvalues of $Z^{-1}P(Z^{-1})^T$.

This yields the practical implementation of the IGMBACK algorithm.

Algorithm 2  Restarted IGMBACK algorithm, IGMBACK(m):

① Choose the initial estimate $x_0$ for (1) and the parameter $q$ with $2 \le q \le m$; compute $r_0 := b - Ax_0$ and $v_1 := r_0/\beta$, where $\beta := \|r_0\|_2$.

② Incomplete orthogonalization process (q): perform $m$ steps of the incomplete orthogonalization process to compute $V_{m+1}$ and $H_m$; for $j = 1, 2, \ldots, m$:

i. Compute $\hat v_{j+1} := Av_j$.

ii. For $i = \max\{1, j-q+1\}, \ldots, j$, compute $h_{ij} = v_i^T\hat v_{j+1}$, $\hat v_{j+1} := \hat v_{j+1} - h_{ij}v_i$.

iii. Compute $h_{j+1,j} = \|\hat v_{j+1}\|_2$. If $h_{j+1,j} \ne 0$, set $v_{j+1} := \hat v_{j+1}/h_{j+1,j}$; otherwise stop.

③ Compute the smallest singular value $\sigma$ of the matrix $[H_m,\ -\beta e_1](Z^{-1})^T$ and the corresponding right singular vector $v$. Compute $u = (Z^{-1})^Tv = [\tilde y_m^T, \eta]^T$ and normalize $u$ to obtain $y_m := \tilde y_m/\eta$. Form the approximate solution $x_m^{IGB} := x_0 + V_my_m$.

④ Restart: set $\bigl\|(H_my_m - \beta e_1)\|x_m\|_2^{-2}x_m^T\bigr\|_F = \sigma$. If the termination condition is met, stop; otherwise set $x_0 := x_m^{IGB}$, compute $r_0 := b - Ax_0$ and $v_1 := r_0/\beta$, where $\beta := \|r_0\|_2$, and return to step ②.
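
The SVD-based computation in step ③ can be organized as follows (a sketch assuming SciPy; the function name is illustrative, and breakdown and rank safeguards are omitted): factor $V_m^TV_m = XX^T$, assemble $Z$, take the smallest right singular vector of $[H_m,\ -\beta e_1](Z^{-1})^T$, and map it back through $(Z^{-1})^T$.

```python
import numpy as np
from scipy.linalg import cholesky, solve_triangular

def igmback_ym_svd(H, V, x0, beta):
    """Step 3 of Algorithm 2: recover y_m from the smallest right singular
    vector of [H_m, -beta*e1] (Z^{-1})^T, where Q = Z Z^T (Section 3)."""
    m = H.shape[1]
    e1 = np.zeros(m + 1); e1[0] = 1.0
    X = cholesky(V.T @ V, lower=True)                 # V_m^T V_m = X X^T
    c = solve_triangular(X, V.T @ x0, lower=True)     # c = X^{-1} V_m^T x0
    Z = np.zeros((m + 1, m + 1))
    Z[:m, :m] = X
    Z[m, :m] = c                                      # row x0^T V_m (X^{-1})^T
    Z[m, m] = np.sqrt(x0 @ x0 - c @ c)                # so that Q = Z Z^T
    L = np.hstack([H, -beta * e1[:, None]])           # L_m = [H_m, -beta*e1]
    M = np.linalg.solve(Z, L.T).T                     # M = L_m (Z^{-1})^T, the matrix (17)
    _, s, Vt = np.linalg.svd(M)
    u = np.linalg.solve(Z.T, Vt[-1])                  # u = (Z^{-1})^T v, smallest right sing. vec.
    y = u[:-1] / u[-1]                                # normalize so the last entry is 1
    return y, s[-1]                                   # sigma = lambda_{m+1}^{1/2}
```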

4 Numerical experiments

It is well known that when the coefficient matrix $A$ is ill-conditioned, a small perturbation of $A$ or $b$ causes a large change in the approximate solution of (1). This is further exacerbated [9] when $b$ is parallel to the left singular vector corresponding to the smallest singular value of $A$. In this section we illustrate with several numerical examples that the IGMBACK(m) algorithm is effective for this kind of problem, and that it is often more efficient than the GMBACK(m) and GMRES(m) algorithms on many problems. The numerical experiments in this paper were carried out in MATLAB 7.0.

We mainly compare the following three algorithms:

1) IGMBACK(m, q) ($q$ is the parameter in the incomplete orthogonalization process);

2) GMBACK(m);

3) GMRES(m). The different convergence behavior of the three algorithms is shown by comparing how the backward perturbation norm $\|\Delta_{\min}\|_F$ of the approximate solutions changes as the number of restarts increases. For the GMRES(m) algorithm, the backward perturbation norm is taken as $\|\Delta_{\min}\|_F = \|r_m\|_2/\|x_m\|_2$. The stopping criterion is $\|\Delta_{\min}\|_F \le 10^{-7}$.
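
For all three methods the monitored quantity can be evaluated directly from the current iterate; a one-line sketch (NumPy assumed, function name illustrative):

```python
import numpy as np

def backward_perturbation_norm(A, b, x):
    """||Delta_min||_F = ||b - A x||_2 / ||x||_2 (Theorem 2.1); this is also the
    quantity monitored for GMRES(m) in the experiments of Section 4."""
    return np.linalg.norm(b - A @ x) / np.linalg.norm(x)
```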

4.1 Numerical experiments with sensitive matrices

This part of the numerical experiments considers two matrices that are sensitive to small perturbations (called sensitive matrices).

We write the singular value decomposition of the matrix $A$ as $A = USV^T$, where

$$U = [u_1, u_2, \ldots, u_n], \quad V = [v_1, v_2, \ldots, v_n], \quad S = \mathrm{diag}(s_1, s_2, \ldots, s_n),$$

with $s_i \ge s_{i+1}$ for $i = 1, 2, \ldots, n-1$. Thus $u_1$ and $v_1$ are the left and right singular vectors corresponding to the largest singular value $s_1$ of the matrix $A$, and $u_n$ and $v_n$ are those corresponding to the smallest singular value $s_n$.
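
The right-hand sides and initial guesses used in the tests below can be generated from the SVD as sketched here (NumPy assumed; the function name and the random test matrix are illustrative, not the Toeplitz matrices of the examples):

```python
import numpy as np

def sensitive_rhs_setup(A):
    """Test-2 style data of Section 4.1: b = u_n and x0 = v_n, so that
    r0 = b - A x0 = (1 - s_n) u_n is parallel to the left singular vector
    associated with the smallest singular value s_n."""
    U, s, Vt = np.linalg.svd(A)            # singular values in descending order
    u_n, v_n = U[:, -1], Vt[-1, :]
    b, x0 = u_n, v_n
    r0 = b - A @ x0
    assert np.isclose(abs(u_n @ r0), np.linalg.norm(r0))   # r0 is parallel to u_n
    return b, x0

b, x0 = sensitive_rhs_setup(np.random.default_rng(3).standard_normal((100, 100)))
```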

Example 4.1  Consider a $100\times 100$ Toeplitz matrix $A = \mathrm{Toeplitz}([\underline{1}, 1, 1])$, where the underlined entry lies on the main diagonal. The largest singular value of $A$ is $s_1 = 2.9990$ and the smallest is $s_n = 2.6784\times 10^{-2}$. In the first experiment (Test 1) we set $b = u_1$, $x_0 = v_1$; thus $r_0 = u_1 - Av_1$, i.e., $r_0$ is parallel to $u_1$ (written $r_0 \parallel u_1$). In the second experiment (Test 2) we set $b = u_n$, $x_0 = v_n$; thus $r_0 = u_n - Av_n$, i.e., $r_0 \parallel u_n$. In both experiments, $m = 20$ and $q = 15$. The results are shown in Fig. 1(a) and Fig. 1(b), respectively; on the vertical axis, $-10$ denotes $10^{-10}$, and so on.

Fig. 1(a) shows that the convergence rates of the three algorithms in the first experiment are essentially the same. From Fig. 1(b) it can be seen that in the second experiment the convergence rates of IGMBACK(m) and GMBACK(m) are comparable (IGMBACK(m) converges slightly faster), and both converge significantly faster than GMRES(m). The coefficient matrix is the same in both experiments (and sensitive in both); what makes IGMBACK(m) and GMBACK(m) so much better than GMRES(m) in the second experiment? This is mainly because the second experiment sets $b = u_n$ and $x_0 = v_n$, making $r_0 \parallel u_n$.

Example 4.2  Consider another $100\times 100$ matrix $A = \mathrm{Toeplitz}([1, \underline{1}, 1, 1, 1])$. The matrix $A$ has sensitive eigenvalues. Its largest singular value is $s_1 = 4.2394$ and its smallest is $s_n = 9.0205\times 10^{-1}$. The first experiment (Test 1) solves $Ax = b$ with $b = u_n$, $x_0 = v_n$, and $m = 20$, $q = 15$. The result is shown in Fig. 2(a).

From Fig. 2(a) it can be seen that the convergence rates of IGMBACK(m), GMBACK(m), and GMRES(m) are virtually identical; that is, the choice of the right-hand side $b$ and of the initial estimate does not by itself produce the differences in convergence rate seen in Example 4.1 (even though $b = u_n$). This is mainly because the coefficient matrix $A$ is well conditioned. However, if the smallest singular value of $A$ is reduced, the convergence rates of the three algorithms become different. We perturb $A$ with $E = 0.89\,u_nv_n^T$. The first $n-1$ singular values of the perturbed coefficient matrix $A - E$ are the same as those of $A$, while the smallest singular value decreases to $1.2048\times 10^{-2}$; the left and right singular vectors remain unchanged. The second experiment (Test 2) solves the system $(A - E)x = b$ with the three algorithms above, again with $b = u_n$, $x_0 = v_n$, $m = 20$, $q = 15$. The result is shown in Fig. 2(b).

As shown in Fig. 2(b), GMRES(m) stagnates, while IGMBACK(m) and GMBACK(m) converge, IGMBACK(m) at a rate comparable to GMBACK(m). The latter two algorithms are clearly superior to restarted GMRES(m), which is largely attributable to the setting $b = u_n$. In the second experiment, reducing the smallest singular value of $A$ worsens the conditioning of the coefficient matrix, which becomes a sensitive matrix.

From the previous two examples we conclude that the GMRES(m) algorithm, which derives its approximate solution from a least-squares problem, is not guaranteed to converge when the coefficient matrix is a sensitive matrix and the right-hand side $b$ is parallel to the left singular vector corresponding to the smallest singular value of the coefficient matrix. The GMBACK(m) and IGMBACK(m) algorithms, by minimizing or approximately minimizing the backward perturbation norm, achieve better convergence in this situation. Interestingly, a similar conclusion arises when comparing the total least squares (TLS) method with the classical least squares method [26].

To further show that the IGMBACK(m) algorithm is competitive with GMBACK(m), we compare the CPU time taken by the two algorithms in Example 4.1 and Example 4.2. The results are shown in Tab. 1. In the table, CPU and CPUa denote the total CPU time and the average CPU time per restart, respectively, in seconds; iter denotes the number of restarts; ratio is the ratio of the CPU time used by IGMBACK(m) to that used by GMBACK(m); BPN denotes the backward perturbation norm. Note that IGMBACK(m) reduces to GMBACK(m) when $q = m$.

From Tab.1, we can see that the algorithm IGMBACK(m) is more efficient than GMBACK(m).

4.2 Numerical experiments related to partial differential equations

In practical applications, a large number of large-scale nonsymmetric linear systems arise from the discretization of partial differential equations.

Example 4.3  Consider the convection-diffusion equation on the region $(0,1)\times(0,1)$:

$$-\frac{\partial^2u}{\partial x^2} - \frac{\partial^2u}{\partial y^2} + \gamma\left(x\frac{\partial u}{\partial x} + y\frac{\partial u}{\partial y}\right) + \beta u = f,$$

where $\gamma = 1000$, $\beta = 10$, with boundary condition $u(x,y) = 0$. Discretizing this equation by the central difference method with mesh size $h = 1/32$ yields a nonsymmetric matrix $A$ of order $31^2 = 961$. The right-hand side $b$ is chosen so that $x = [1, 1, \ldots, 1]^T$ is the exact solution of $Ax = b$. The initial guess $x_0$ is a random vector of order $961\times 1$ with entries in $(0.0, 1.0)$, and we set $m = 15$, $q = 10$. The results are shown in Fig. 3.
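
A sparse assembly of this test matrix can be sketched as follows (SciPy assumed). The grid ordering, the treatment of the homogeneous Dirichlet boundary and the function name are assumptions made for illustration; the paper's MATLAB construction is not specified in this detail.

```python
import numpy as np
import scipy.sparse as sp

def convection_diffusion_matrix(h=1/32, gamma=1000.0, beta=10.0):
    """Central-difference discretization of
       -u_xx - u_yy + gamma*(x*u_x + y*u_y) + beta*u = f
    on (0,1)^2 with u = 0 on the boundary; order (1/h - 1)^2 = 961 for h = 1/32."""
    m = round(1 / h) - 1                               # 31 interior points per direction
    pts = np.arange(1, m + 1) * h                      # interior grid coordinates
    I = sp.identity(m, format='csr')
    D2 = sp.diags([-1, 2, -1], [-1, 0, 1], shape=(m, m)) / h**2   # -d^2/ds^2
    D1 = sp.diags([-1, 1], [-1, 1], shape=(m, m)) / (2 * h)       # d/ds (central)
    S = sp.diags(pts)                                  # multiplication by the coordinate
    A = (sp.kron(I, D2) + sp.kron(D2, I)
         + gamma * (sp.kron(I, S @ D1) + sp.kron(S @ D1, I))
         + beta * sp.identity(m * m))
    return A.tocsr()

A = convection_diffusion_matrix()                      # 961 x 961, nonsymmetric
b = A @ np.ones(A.shape[0])                            # so that x = [1, ..., 1]^T is exact
```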

In this example, with $m = 15$ the GMRES algorithm does not converge within 400 steps, while Fig. 3 shows that IGMBACK and GMBACK reduce the backward perturbation norm to about $10^{-8}$ within 40 steps. This indicates that the latter two algorithms are significantly better than the GMRES algorithm. Tab. 2 shows the CPU time spent by the three algorithms for $m = 15$ and $m = 25$ in Example 4.3.

Tab. 2 shows that IGMBACK is more efficient than GMBACK and that both algorithms are significantly better than the GMRES algorithm.

Example 4.4  Consider the second-order elliptic partial differential equation on the region $[0,1]\times[0,1]$:

$$-e^{y}\frac{\partial^2u}{\partial x^2} - e^{x}\frac{\partial^2u}{\partial y^2} + (x+y)\frac{\partial u}{\partial x} + (x-y)\frac{\partial u}{\partial y} + u = -(y^2e^y + x^2e^x)e^{xy} + (y^2 + x^2 + 1)e^{xy},$$

with boundary conditions $u(0,y) = 1$, $u(1,y) = e^y$, $u(x,0) = 1$, $u(x,1) = e^x$. Discretizing this equation by the central difference method on two grids yields nonsymmetric linear systems $Ax = b$ with coefficient matrices of orders $39^2 = 1521$ and $69^2 = 4761$, respectively. Let $x_0 = [1, 1, \ldots, 1]^T$. The experimental results are shown in Tab. 3 and Tab. 4, respectively. The symbols in Tab. 3 and Tab. 4 are used as in Tab. 1, except that ratio now denotes the ratio of the CPU time used by IGMBACK(m) or GMRES(m) to that used by GMBACK(m) for the same value of $m$.

As seen in Tab. 3 and Tab. 4, IGMBACK(m) and GMBACK(m) overall converge faster than GMRES(m); GMBACK(m) is more effective than GMRES(m), and IGMBACK(m) is more efficient than GMBACK(m). We have additionally carried out a number of further numerical experiments and found that IGMBACK(m) and GMBACK(m) often converge better than GMRES(m), and that IGMBACK(m) is generally more efficient than GMBACK(m) and GMRES(m).

5 Summary and outlook

Against the background of the increasing popularity of Krylov subspace methods [15], this paper presents another Krylov subspace method for solving large sparse linear systems: the truncated form of the GMBACK algorithm, i.e., the IGMBACK algorithm. The IGMBACK algorithm uses an incomplete orthogonalization process to generate a set of basis vectors for $K_m(A, r_0)$, which overcomes the large computational and storage cost of the Arnoldi process used in the GMBACK algorithm. A detailed theoretical derivation and some theoretical analysis of the new algorithm are given. Numerical experiments show that IGMBACK is generally more effective than GMBACK and GMRES.

It should be noted that the GMRES algorithm has been greatly improved in recent years through the efforts of many authors and has become a major iterative method for solving large nonsymmetric linear systems. Reference [18] surveys some of the major variants [11, 17, 19, 23] and theoretical developments [2, 16] of the GMRES algorithm in recent years. The study of the GMRES algorithm continues to deepen internationally, and the GMRES algorithm itself has become a relatively independent research field; at the same time, it is widely used in many areas of scientific and engineering computation. Against this background, the advantages of IGMBACK and GMBACK over GMRES on many problems illustrate the value of IGMBACK and GMBACK.

The convergence theory of the GMBACK algorithm has not yet been established. In the truncated version of the GMBACK algorithm, the price paid is the loss of some important properties of the original method, such as orthogonality or $A$-orthogonality and the minimization of the backward perturbation norm, which are the main tools for analyzing the convergence of the method; this makes the convergence analysis of the IGMBACK algorithm more complicated. Therefore, one of our main future research directions is to establish the convergence theory of these two algorithms and to compare them with the GMRES algorithm theoretically, so as to determine, from a theoretical standpoint, for which classes of systems the GMBACK family is superior to restarted GMRES. In addition, in many situations and applications, direct use of an iterative method does not converge or converges very slowly; the usual remedy is to combine iterative methods with preconditioning techniques. Therefore, another main future research direction is to combine the IGMBACK and GMBACK algorithms with preconditioning techniques, so as to obtain new algorithms with better convergence and higher accuracy.

References

[1]

Arioli M, Duff I, Ruiz D. Stopping criteria for iterative solvers. SIAM J Matrix Anal Appl 1992; 13(1): 138–144

[2]

Baker A H, Jessup E R, Kolev T V. A simple strategy for varying the restart parameter in GMRES(m). J Comput Appl Math 2009; 230(2): 751–761

[3]

Ben-Israel A, Greville T N E. Generalized Inverses: Theory and Applications. New York: Wiley-Interscience, 1974

[4]

Brown P N. A theoretical comparison of the Arnoldi and GMRES algorithms. SIAM J Sci Stat Comput 1991; 12(1): 58–78

[5]

Brown P N, Hindmarsh A C. Reduced storage matrix methods in stiff ODE systems. Appl Math Comput 1989; 31: 40–91

[6]

Brown P N, Saad Y. Hybrid Krylov methods for nonlinear systems of equations. SIAM J Sci Stat Comput 1990; 11(3): 450–481

[7]

Cao Z H. On a deflation method for the symmetric generalized eigenvalue problem. Linear Algebra Appl 1987; 92: 187–196

[8]

Cao Z H. Total generalized minimum backward error algorithm for solving nonsymmetric linear systems. J Comput Math 1998; 16(6): 539–550

[9]

Golub G H, Van Loan C F. Matrix Computations, 2nd ed. Baltimore, MD: Johns Hopkins Univ Press, 1990

[10]

Higham D J, Higham N J. Backward error and condition of structured linear systems. SIAM J Matrix Anal Appl 1992; 13(1): 162–175

[11]

Huhtanen M, Perämäki A. Orthogonal polynomials of the R-linear generalized minimal residual method. J Approx Theory 2013; 167(3): 220–239

[12]

Jia Z X. On IOM(q): the incomplete orthogonalization method for large unsymmetric linear systems. Numer Linear Algebra Appl 1996; 3(6): 491–512

[13]

Kasenally E M. GMBACK: a generalized minimum backward error algorithm for nonsymmetric linear systems. SIAM J Sci Comput 1995; 16(3): 698–719

[14]

Kasenally E M, Simoncini V. Analysis of a minimum perturbation algorithm for nonsymmetric linear systems. SIAM J Numer Anal 1997; 34(1): 48–66

[15]

Li X A, Chen Y H, Zhang Y, Wang X P. Development of the Krylov subspace method for solving large sparse linear systems. Science & Technology Review 2013; 11: 68–73

[16]

Liesen J, Tichy P. The field of values bound on ideal GMRES. 2012, arXiv: 1211.5969v1

[17]

Liu Y Q, Yin K X, Wu E H. Fast GMRES-GPU algorithm for solving large scale sparse linear systems. Journal of Computer-Aided Design & Computer Graphics 2011; 23(4): 553–560

[18]

Ma X F. An overview of recent developments and applications of the GMRES method. Pure Math 2013; 3: 181–187

[19]

Najafi H S, Zareamonghaddam H. A new computational GMRES method. Appl Math Comput 2008; 199(2): 527–534

[20]

Saad Y. Krylov subspace methods for solving large unsymmetric linear systems. Math Comp 1981; 37(155): 105–126

[21]

Saad Y. Practical use of some Krylov subspace methods for solving indefinite and nonsymmetric linear systems. SIAM J Sci Stat Comput 1984; 5(1): 203–228

[22]

Saad Y, Schultz M H. GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J Sci Stat Comput 1986; 7(3): 856–869

[23]

De Sterck H. Steepest descent preconditioning for nonlinear GMRES optimization. Numer Linear Algebra Appl 2013; 20(3): 453–471

[24]

Stewart G W, Sun J-G. Matrix Perturbation Theory. New York: Academic Press, 1990

[25]

Sun L, Wang X H, Guan Y. IMinpert: an incomplete minimum perturbation algorithm for large unsymmetric linear systems. Numer Math J Chinese Univ 2007; 16(4): 300–312

[26]

Van Huffel S, Vandewalle J. The Total Least Squares Problem: Computational Aspects and Analysis. Frontiers in Applied Mathematics, Vol 9. Philadelphia, PA: SIAM, 1991

RIGHTS & PERMISSIONS

Higher Education Press 2023
