A primal-dual approximation algorithm for the <i>k</i>-prize-collecting minimum vertex cover problem with submodular penalties

Xiaofei LIU; Weidong LI; Jinhua YANG

doi:10.1007/s11704-022-1665-9

Frontiers of Computer Science >

2023 , Vol. 17 >Issue 3: 173404

DOI: https://doi.org/10.1007/s11704-022-1665-9

RESEARCH ARTICLE

A primal-dual approximation algorithm for the k-prize-collecting minimum vertex cover problem with submodular penalties

Xiaofei LIU ¹ ,
Weidong LI ^,² ,
Jinhua YANG ³

Expand

¹. School of Information Science and Engineering, Yunnan University, Kunming 650500, China
². School of Mathematics and Statistics, Yunnan University, Kunming 650500, China
³. Dianchi College of Yunnan University, Kunming 650228, China

Received date: 23 Nov 2021

Accepted date: 15 Apr 2022

Copyright

2023 Higher Education Press

Fold

Abstract

In this paper, we consider the $k$ -prize-collecting minimum vertex cover problem with submodular penalties, which generalizes the well-known minimum vertex cover problem, minimum partial vertex cover problem and minimum vertex cover problem with submodular penalties. We are given a cost graph $G = (V, E; c)$ and an integer $k$ . This problem determines a vertex set $S ⊆ V$ such that $S$ covers at least $k$ edges. The objective is to minimize the total cost of the vertices in $S$ plus the penalty of the uncovered edge set, where the penalty is determined by a submodular function. We design a two-phase combinatorial algorithm based on the guessing technique and the primal-dual framework to address the problem. When the submodular penalty cost function is normalized and nondecreasing, the proposed algorithm has an approximation factor of $3$ . When the submodular penalty cost function is linear, the approximation factor of the proposed algorithm is reduced to $2$ , which is the best factor if the unique game conjecture holds.

Key words： vertex cover; k-prize-collecting; primal-dual; approximation algorithm

Cite this article

Xiaofei LIU , Weidong LI , Jinhua YANG . A primal-dual approximation algorithm for the k-prize-collecting minimum vertex cover problem with submodular penalties[J]. Frontiers of Computer Science, 2023 , 17(3) : 173404 . DOI: 10.1007/s11704-022-1665-9

1 Introduction

The minimum vertex cover problem (MVC) is one of the most important and fundamental problems in graph theory and combinatorial optimization [1,2]. In the MVC, we are given a graph

G = (V, E)

with vertex set

V = {v 1, v 2, …, v n}

and edge set

E = {e 1, e 2, …, e m}

. For each vertex

v ∈ V

, there is an associated nonnegative cost

c (v)

. The MVC finds a vertex set

S

to cover all the edges of the graph such that the total cost of the vertices in

S

is minimized, where an edge is said to be covered by

S

if at least one of its incident vertices is in

S

. For this problem, Karp [1] proved it is

N P

-hard. This result is improved by Khot and Regev [3], who proved that the MVC cannot be approximated within

2 − ε

for any

ε > 0

under the unique game conjecture. Based on the LP-rounding, Hochbaum [4] presented a

2

-approximation algorithm with a running time of

O (n 3)

. Based on the primal-dual framework, Bar-Yehuda and Even [5] proposed a linear-time

2

-approximation algorithm.

Bshouty and Burroughs [6] proposed the minimum partial vertex cover problem (MPVC) which is a generalization of the MVC. In this variant, we are given an additional parameter

k

, which is called the covering requirement. The objective of the MPVC is to find a minimum cost subset of

V

that covers at least

k

edges in

E

. If

k = m

, the MPVC is exactly the MVC. For the MPVC, Bshouty and Burroughs [6] presented a

2

-approximation algorithm with a running time of

O (n 3)

based on the LP-rounding. On the basis of Lagrangian relaxation, Hochbaum [7] presented a

2

-approximation algorithm with a running time of

O (n m log ⁡ n 2 m log ⁡ n)

. Based on the local-ratio technique, Bar-Yehuda [8] presented a

2

-approximation algorithm with a running time of

O (n 2)

. Furthermore, on the basis of the primal-dual framework, Gandhi et al. [9] presented a

2

-approximation algorithm with a running time of

O (n (n log ⁡ n + m))

and Julián [10] reduced the running time to

O (n log ⁡ n + m)

based on the pruning technique.

Karp [1] proposed the prize-collecting minimum vertex cover problem (PCMVC), which is another generalization of the MVC. In this variant, each edge

e

can be either covered by a vertex set or uncovered and a penalty

π (e)

is paid. The objective is to minimize the total cost of vertices in this set plus the overall penalty of uncovered edges. If

π (e) = ∞

for every edge

e

, the PCMVC is exactly the MVC. For the PCMVC, based on the LP-rounding technique, Hochbaum [11] presented a

2

-approximation algorithm with a running time of

O (n 3)

. Based on the primal-dual framework, Bar-Yehuda and Rawitz [12] proposed a linear-time

2

-approximation algorithm.

Submodular function is a set function with the property of decreasing marginal return, and plays a key role in combinatorial optimization somewhat similar to that played by convex/concave function in continuous optimization. There has been a lot of work on the submodular function optimization [13-16]. The minimum submodular vertex cover problem (MSVC) is one type of submodular function optimization problem and its objective is to find a minimum cost subset

S

V

to cover all the edges, where the cost of

S

is determined by a submodular function. Iwata and Nagano [17] presented a combinatorial

2

-approximation algorithm based on the primal-dual framework. If the submodular function is linear, the MSVC is exactly the MVC. Recently, Xu et al. [18] considered the minimum submodular vertex cover problem with submodular penalties (MSVCS), which is a generalization of the PCMVC and MSVC. In the MSVCS, the cost of vertex set and the penalty of uncovered edge set are determined by two submodular function, respectively. If the penalty of any edge set is infinity, the MSVCS is exactly the MSVC; If two submodular functions are all linear, the MSVCS is exactly the PCMVC. Xu et al. [18] presented a combinatorial

4

-approximation algorithm by relaxing the dual program to a slightly weaker version, which was improved by Kamiyama [19] with a

3

-approximation algorithm.

Motivated by the above work, in this paper, we consider the

k

-prize-collecting minimum vertex cover problem with submodular penalties (

k

-PCVCS), which is a generalization of the MVC [1], MPVC [6], and PCMVC [11]. The

k

-PCVCS finds a vertex set

S

that covers at least

k

edges, and the objective is to minimize the total cost of the vertices in

S

plus the penalty of the uncovered edge set, where

k ≤ m

, and the penalty is determined by a submodular function. In this paper, we design a two-phase combinatorial algorithm based on the guessing technique and the primal-dual framework to address the

k

-PCVCS. When the submodular penalty cost function is normalized and nondecreasing, the proposed algorithm has an approximation factor of

3

. When the submodular penalty cost function is linear, this problem is a special case of the

P

-prize-collecting set cover problem [20]. For this problem, the approximation factor of the algorithm in [20] is

4

, which is the current best factor. In this paper, we prove that the approximation factor of the two-phase combinatorial algorithm is

2

, which is the best factor if the unique game conjecture holds.

The remainder of this paper is structured as follows. In Section 2, we provide basic definitions and a formal problem statement. In Section 3, we focus on the

k

-PCVCS and propose the two-phase combinatorial algorithm. In Section 4, we provide a brief conclusion.

2 Preliminaries

Let

E

be a given ground set, and let

π (⋅) : 2 E → R ≥ 0

be a real-valued set function defined on all subsets of

E

. If

π (E 1) + π (E 2) ≥ π (E 1 ∪ E 2) + π (E 1 ∩ E 2), ∀ E 1, E 2 ⊆ E,

then

π (⋅)

is called a submodular function. The submodular function is nondecreasing if

π (E 1) ≤ π (E 2)

∀ E 1 ⊆ E 2 ⊆ E

. The submodular function

π (⋅)

is normalized if

π (∅) = 0

. If

π (⋅)

is normalized and

π (E 1) + π (E 2) = π (E 1 ∪ E 2) + π (E 1 ∩ E 2), ∀ E 1, E 2 ⊆ E,

then

π (⋅)

is called a linear function.

The

k

-prize-collecting minimum vertex cover problem with submodular penalties (

k

-PCVCS) is defined as follows: consider an instance

I = (G; c, π; k)

, where

G = (V, E)

is a graph with

V = {v 1, v 2, …, v n}

and

E = {e 1, e 2, …,

e m}

k

is an integer satisfying

0 ≤ k ≤ m

c : V → R +

is a cost function and

π (⋅) : 2 E → R ≥ 0

is a submodular penalty cost function. An edge

e

is covered by a set if at least one of the incident vertices of

e

is in this set. The

k

-PCVCS finds a pair

(S, R)

, where

S ⊆ V

is the vertex set that covers at least

k

edges, and

R ⊆ E

is the set of edges uncovered by

S

. The objective is to minimize the cost

c (S) + π (R),

where

c (S) = ∑ v : v ∈ S c (v) .

To obtain the expected approximation factor, a preprocessing step is required: guessing the maximum cost vertex

v m a x ∗

in an optimal solution, where

v m a x ∗ = arg ⁡ max v ∈ S ∗ c (v),

and

(S ∗, R ∗)

is an optimal solution for instance

I

. Using a preprocessing step: guessing the maximum cost of a vertex in an optimal solution. We can assume that

v m a x ∗

is known.

Using

v m a x ∗

, we construct a graph

G ∖ {v m a x ∗}

by removing vertex

v m a x ∗

, i.e.,

G ∖ {v m a x ∗} = (V ∖ {v m a x ∗}, E ∖ δ ({v m a x ∗}))

, where

δ ({v m a x ∗})

is the set of edges covered by vertex

v m a x ∗

. Then, we construct an auxiliary instance

I ∖ {v m a x ∗} = (G ∖ {v m a x ∗}; c ′, π; k −

| δ ({v m a x ∗}) |)

I = (G; c, π; k)

, where for any vertex

v ∈

V ∖ {v m a x ∗}

(1)

c ′ (v) = {c (v), i f c (v) ≤ c (v m a x ∗); + ∞, otherwise.

Let

O P T I ∖ {v m a x ∗}

be the optimal objective value of instance

I ∖ {v m a x ∗}

. We can obtain the following lemma.

Lemma 2.1

O P T I ∖ {v m a x ∗} + c (v m a x ∗) = O P T I,

where

O P T I =

c (S ∗) + π (R ∗)

is the objective value of the optimal solution

(S ∗, R ∗)

of instance

I

Proof By the definition of

c ′ (⋅)

and

v m a x ∗ = arg ⁡ max v ∈ S ∗ c (v)

, we have

O P T I = ∑ v : v ∈ S ∗ c (v) = ∑ v : v ∈ S ∗ c ′ (v) .

Since

S ∗

covers at least

k

edges in

E

S ∗ ∖ {v m a x ∗}

covers at least

k − | δ (v m a x ∗) |

edges in

E ∖ δ ({v m a x ∗})

, and

(S ∗ ∖ {v m a x ∗}, R ∗)

is a feasible solution of instance

I ∖ {v m a x ∗}

, and we have

(2)

O P T I ∖ {v m a x ∗} ≤ c ′ (S ∗) − c ′ (v m a x ∗) + π (R ∗) = O P T I − c (v m a x ∗),

where the inequality follows from the

O P T I ∖ {v m a x ∗}

is the optimal objective value of instance

I ∖ {v m a x ∗}

, and the equality follows from the definition of

c ′ (⋅)

in Eq. (1).

Let

(S ′, R ′)

be an optimal solution of instance

I ∖ {v m a x ∗}

. By the definition of

c ′ (⋅)

and

I ∖ {v m a x ∗} ≤ O P T I − c (v m a x ∗) ≤ + ∞

, it is not hard to obtain that

c (v) = c ′ (v), ∀ v ∈ S ′ .

For instance

I

S ′

covers at least

k − | δ (v m a x ∗) |

edges in

E ∖ δ ({v m a x ∗})

, and

S ′ ∪ {v m a x ∗}

covers at least

k

edges in

E

. Thus,

(S ′ ∪ {v m a x ∗}, R ′)

is a feasible solution of instance

I

, and the optimal objective value of instance

I

O P T I ≤ c (S ′ ∪ {v m a x ∗}) + π (R ′) = c (S ′) + c (v m a x ∗) + π (R ′) = c ′ (S ′) + π (R ′) + c (v m a x ∗) = O P T I ∖ {v m a x ∗} + c (v m a x ∗) .

This statement and inequality Eq. (2) imply that the lemma holds. 　　　　　　　　　　　　　　　　　　　　　　□

3 A combinatorial algorithm for $k$ -PCVCS

In this section, we first present a two-phase primal-dual approximation algorithm for instance

I ∖ {v m a x ∗}

. Then, using this two-phase primal-dual algorithm and the guessing technique, we present a combinatorial approximation algorithm for instance

I

. If the submodular penalty cost function

π (⋅)

is normalized and nondecreasing, we prove that the approximation factor of this combinatorial algorithm is

3

. In particular, if

π (⋅)

is linear, we prove that this approximation factor is reduced to

2

3.1 A two-phase algorithm for instance $I ∖ {v m a x ∗}$

For simplicity of notation, we use

I = (G; c, π; k)

to denote

I ∖ {v m a x ∗} = (G ∖ {v m a x ∗}; c ′, π; k − | δ ({v m a x ∗}) |)

We introduce a binary variable

x v

for each vertex

v ∈ V

, where

x v = {1, if v is selected to cover some edges, 0, otherwise.

For each subset

R ⊆ E

, we introduce a binary variable

z R

, where

z R = {1, if R is the set of the uncovered edges, 0, otherwise.

We have the following integer linear program for the

k

-PCVCS.

(3)

min ∑ v : v ∈ V c (v) x v + ∑ R : R ⊆ E π (R) z R s . t . ∑ v : v ∈ V (e) x v + ∑ R : R ⊆ E a n d e ∈ R z R ≥ 1, ∀ e ∈ E, ∑ R : R ⊆ E | R | z R ≤ m − k, x v, z R ∈ {0, 1}, ∀ v ∈ V, ∀ R ⊆ E,

where

V (e)

is the set of incident vertices of edge

e

, the first set of constraints of Eq. (3) guarantees that each edge

e ∈ E

is either covered by at least one of its incident vertices or in the set of uncovered edges, and the second constraint of Eq. (3) guarantees that the number of uncovered edges of any feasible solution is at most

m − k

. Relaxing the integrality constraints, we obtain the linear program as follows.

min ∑ v : v ∈ V c (v) x v + ∑ R : R ⊆ E π (R) z R s . t . ∑ v : v ∈ V (e) x v + ∑ R : R ⊆ E a n d e ∈ R z R ≥ 1, ∀ e ∈ E, ∑ R : R ⊆ E | R | z R ≤ m − k, x v ≥ 0 a n d z R ≥ 0, ∀ v ∈ V, ∀ R ⊆ E .

Note that we need not to add the constraints

x v ≤ 1

and

z R ≤ 1

since they are automatically satisfied in an optimal solution. The corresponding dual program is

(4)

max ∑ e : e ∈ E y e − (m − k) γ s . t . ∑ e : e ∈ δ ({v}) y e ≤ c (v), ∀ v ∈ V, ∑ e : e ∈ R y e − | R | γ ≤ π (R), ∀ R ⊆ E . y e ≥ 0 a n d γ ≥ 0, ∀ e ∈ E,

where

δ ({v})

is the set of edges covered by vertex

v

. For a dual feasible solution

(y, γ) = ({y e} e ∈ E, γ)

of Eq. (4), we say that a vertex

v ∈ V

is tight if

∑ e : e ∈ δ ({v}) y e = c (v)

, and an edge set

R ⊆ E

is tight if

∑ e : e ∈ R y e = π (R)

Then, we present the two-phase primal-dual algorithm:

Phase 1 We keep

γ = 0

and find a dual feasible solution of

(y ′, 0)

such that each edge is covered by a tight vertex or is in a tight edge set. To obtain this feasible solution, we start from the trivial dual feasible solution of zero.

Initially,

γ

is frozen, the other dual variables are active, and their dual values are equal to

0

. The dual values for the active dual variables are increased simultaneously until either some vertex or some edge set becomes tight. If a vertex becomes tight, then it is added to the tight vertex set

V t i g h t

, and the dual variables of all active edges covered by this vertex are frozen. Otherwise, if an edge set becomes tight, the dual variables of all active edges in this edge set are frozen. The process is iterated until all dual variables

{y e} e ∈ E

are frozen. For any

e ∈ E

, the dual value for any dual variables will no longer increase after becoming frozen, and let

y e ′

be the dual value when it is frozen. If

| δ (V t i g h t) | ≥ k

, then output pair

(S, R) := (V t i g h t, E ∖ δ (V t i g h t))

and stop the algorithm, where

δ (V t i g h t)

is the set of edges covered by

V t i g h t

; otherwise, go to Phase 2.

Phase 2 Based on increasing dual variable

γ

, we select some vertices added to the tight vertex set

V t i g h t

such that

V t i g h t

covers at least

k

edges. We start from the dual feasible solution

(y ′, 0)

for dual program Eq. (4), which is proven in Lemma 3.3.

Initially, set the dual feasible solution

(y, γ) = (y ′, 0)

; all dual variables covered by

V t i g h t

are frozen, and the other dual variables are active. The dual value of the active dual variables are increased simultaneously until a vertex becomes tight. This vertex is added to the tight vertex set

V t i g h t

, and the dual variables of all active edges covered by this vertex are frozen. This process is iterated until

| δ (V t i g h t) | ≥ k

and output pair

(S, R)

, where

S = V t i g h t

and

R = E ∖ δ (S)

For any

e ∈ E

, let

y e ″

be the dual value for any

e ∈ E

when it is frozen. Since the dual value for any

e ∈ E

will no longer increase after

y e

is frozen,

(y ″, γ)

is a dual feasible solution, which is proven in Lemma 3.5. Since the dual variable

γ

continues increasing until

| δ (V t i g h t) | ≥ k

, we have

γ = max e : e ∈ E (y e ″ − y e ′) .

We propose the detailed primal-dual algorithm in Algorithm 1.

Full size|PPT slide

Next, we use an example to illustrate our algorithm: we are given an instance

I = (G; c, π; k)

(Fig.1), where

G = (V, E)

and

k = 3

. The vertex cost is

c (v 1) = 1

c (v 2) = 8

, and

c (v 3) = c (v 4) = 9

, and the penalty function

Fig.1 For instance $I = (G; c, π; k)$ , we illustrate Phase 1 in a, b and c; and illustrate Phase 2 in d and e

Full size|PPT slide

π ({E ′}) = {2, if E ′ ⊆ E a n d | T | = 1, 3, if E ′ ⊆ E a n d | T | = 2, 4, if E ′ ⊆ E a n d | T | = 3, 5, if E ′ = E .

Lemma 3.1 Algorithm 1 can be implemented in

O (n 16 ⋅ ρ +

n 17)

, where

ρ

is the time for one function evaluation, i.e., the time to determine

π (S)

given

S

Proof It is easy to obtain that the running time of Algorithm 1 is determined by the running time of Phase 1. Let

E a c t (j)

be the set of active edges generated after the

j

th loop (or before the

(j + 1)

th loop) of Phase 1, i.e.,

E a c t (0) = E

before the

1

-loop.

In any loop of Phase 1, without loss of generality, we assume that the number of this loop is

j + 1

, and simultaneously increase the dual variable

{y e} e ∈ E a c t (j)

until either some vertex

v

becomes tight or some edge set

E ′

becomes tight. The tight vertex

v

can be determined by calculating the minimum value

Δ v = min v : E a c t (j) ∩ δ (v) ≠ ∅ c (v) − ∑ e : e ∈ δ (v) ∖ E a c t (j) y e | E a c t (j) ∩ δ (v) | .

Clearly, the value of

Δ v

can be found in

O (n)

{v : E a c t (j) ∩ δ (v) ≠ ∅} ⊆ V

and

| V | = n

The tight edge set

E T

can be determined by calculating the minimum value

Δ E T = min E ′ : E a c t (j) ∩ E ′ ≠ ∅ π (E ′) − ∑ e : e ∈ E ′ ∖ E a c t (j) y e | E a c t (j) ∩ E ′ | = min E 1 ′ ∪ E 2 ′ : E 1 ′ ⊆ E a c t (j) a n d E 2 ′ ⊆ E ∖ E a c t (j) π (E 1 ′ ∪ E 2 ′) − ∑ e : e ∈ E 2 ′ y e | E 1 ′ | = min E 1 ′ : E 1 ′ ⊆ E a c t (j) min E 2 ′ : E 2 ′ ⊆ E ∖ E a c t (j) (π (E 1 ′ ∪ E 2 ′) − ∑ e : e ∈ E 2 ′ y e) | E 1 ′ | = min E 1 ′ : E 1 ′ ⊆ E a c t (j) min E 2 ′ : E 2 ′ ⊆ E ∖ E a c t (j) (w E 1 ′, j (S)) | E 1 ′ | = min E 1 ′ : E 1 ′ ⊆ E a c t (j) w (E 1 ′) | E 1 ′ |,

where the relationship of the sets is shown in Fig.2, we define

w E 1 ′, j (E 1 ′) = π (E 1 ′ ∪ E 2 ′) − ∑ e : e ∈ E 2 ′ y e

for any

E 2 ′ ⊆ E ∖

E a c t (j)

, and

w (E 1 ′) = min E 2 ′ : E 2 ′ ⊆ E ∖ E a c t (j) (π (E 1 ′ ∪ E 2 ′) − ∑ e : e ∈ E 2 ′ y e)

for any

E 1 ′ ⊆ E a c t (j)

. This means, for any

E 1 ′ ⊆ E a c t (j)

w E 1 ′, j (⋅) : 2 E ∖ E a c t (j) → R ≥ 0

is a real-valued set function defined on all subsets of

E ∖ E a c t (j)

, where the non negativity of

w E 1 ′, j (⋅)

comes from the second set of constraints of Eq. (4) and

γ = 0

in Phase 1.

Fig.2 The relationship of $E ′$ , $E 1 ′$ and $E 2 ′$

Full size|PPT slide

Given an edges set

E 1 ′ ⊆ E a c t (j)

, for any two subsets

E 2 ′ (1), E 2 ′ (2) ⊆ E ∖ E a c t (j)

, we have

w E 1 ′, j (E 2 ′ (1)) + w E 1 ′, j (E 2 ′ (2)) = π (E 1 ′ ∪ E 2 ′ (1)) − ∑ e : e ∈ E 2 ′ (1) y e + π (E 1 ′ ∪ E 2 ′ (2)) − ∑ e : e ∈ E 2 ′ (2) y e ≥ π (E 1 ′ ∪ (E 2 ′ (1) ∪ E 2 ′ (2))) − ∑ e : e ∈ E 2 ′ (1) ∪ E 2 ′ (2) y e + π (E 1 ′ ∪ (E 2 ′ (1) ∩ E 2 ′ (2))) − ∑ e : e ∈ E 2 ′ (1) ∩ E 2 ′ (2) y e = w E 1 ′, j (E 2 ′ (1) ∪ E 2 ′ (2)) + w E 1 ′, j (E 2 ′ (1) ∩ E 2 ′ (2)),

where the inequality follows from the submodularity of

π (⋅)

. Thus,

w E 1 ′, j (⋅)

is a submodular function, and by [21]

(5)

w (E 1 ′) c a n b e f o u n d i n O (n 7 ⋅ ρ + n 8),

where

w (E 1 ′) = min E 2 ′ ⊆ E ∖ E a c t (j) (w E 1 ′, j (E 2 ′))

, and

ρ

is the time for one function evaluation, i.e., the time to determine

π (E ′)

given

E ′ ⊆ E

For any two subsets

E 1 ′ (1), E 1 ′ (2) ⊆ E a c t (j)

, let

S 1 ′ = arg ⁡ min S : S ⊆ E ∖ E a c t (j) w E 1 ′ (1), j (S) a n d S 2 ′ = arg ⁡ min S : S ⊆ E ∖ E a c t (j) w E 1 ′ (2), j (S),

we have

w (E 1 ′ (1)) + w (E 1 ′ (2)) = w E 1 ′ (1), j (S 1 ′) + w E 1 ′ (2), j (S 2 ′) = π (E 1 ′ (1) ∪ S 1 ′) − ∑ e : e ∈ S 1 ′ y e + π (E 1 ′ (2) ∪ S 2 ′) − ∑ e : e ∈ S 2 ′ y e ≥ π ((E 1 ′ (1) ∪ S 1 ′) ∪ (E 1 ′ (2) ∪ S 2 ′)) − ∑ e : e ∈ S 1 ′ ∪ S 2 ′ y e + π ((E 1 ′ (1) ∪ S 1 ′) ∩ (E 1 ′ (2) ∪ S 2 ′)) − ∑ e : e ∈ S 1 ′ ∩ S 2 ′ y e ≥ min S : S ⊆ E ∖ E a c t (j) (π ((E 1 ′ (1) ∪ E 1 ′ (2)) ∪ S) − ∑ e : e ∈ S y e) + min S : S ⊆ E ∖ E a c t (j) (π ((E 1 ′ (1) ∩ E 1 ′ (2)) ∪ S) − ∑ e : e ∈ S y e), = w (E 1 ′ (1) ∪ E 1 ′ (2)) + w (E 1 ′ (1) ∩ E 1 ′ (2)),

where the first inequality follows from the submodularity of

π (⋅)

. Therefore,

w (⋅)

is a submodular function.

Thus, similar to the optimization problem of minimizing the ratio of a submodular function and a positive linear function [21], the value of

Δ E T

can be computed in

O (n 15 ⋅ ρ + n 16)

by (5). Since at least one edge is frozen in each loop of Phase 1, and Algorithm 1 can be implemented in

O (n 16 ⋅ ρ + n 17)

.　　□

Let

(S ∗, R ∗)

be the optimal pair for the

k

-PCVCS, let

O P T

be the objective value for

(S ∗, R ∗)

, and let

(S, R)

be the output pair by Algorithm 1 and

O U T

be the objective value for

(S, R)

. Since

(S, R)

can be generated by either Phase 1 or Phase 2, we first consider the case in which

(S, R)

is generated by Phase 1.

Let

(y ′, 0)

be the dual value after Phase 1.

Lemma 3.2

(y ′, 0)

is a feasible solution for dual program Eq. (4).

Proof In Phase 1, when any vertex becomes tight, the dual variable of the edges incident to this vertex will no longer increase, which means

∑ e : e ∈ δ (v) y e ′ ≤ c (v), ∀ v ∈ V .

Similarly, we have

(6)

∑ e : e ∈ E ′ y e ′ ≤ π (E ′), ∀ E ′ ⊆ E .

Therefore,

(y ′, 0)

is a feasible solution for dual program Eq. (4). 　　　　　　　　　　　　　　　　　　　　　□

Lemma 3.3 If the pair

(S, R)

is generated by Phase 1, the cost of the vertex set of

S

c (S) = ∑ v : v ∈ S c (v) ≤ 2 ∑ e : e ∈ δ (S) y e ′ .

Proof Since any vertex

v ∈ S

is tight, we have

c (S) = ∑ v : v ∈ S c (v) = ∑ v : v ∈ S ∑ e : e ∈ δ (v) y e ′ = ∑ e : e ∈ δ (S) y e ′ | V (e) ∩ S | ≤ 2 ∑ e : e ∈ δ (S) y e ′,

where

V (e)

is the set of incident vertices of edge

e

and the first inequality follows from the fact that any edge in

E

is incident to two vertices. 　　　　　　　　　　　　　　　　　　□

In Phase 1, let

T

be the set of tight edge sets, i.e., for any

E T ∈ T

, we have

π (E T) = ∑ e : e ∈ E T y e ′

. Let

E t i g h t = ∪ E T : E T ∈ T E T

Lemma 3.4.

(7)

π (E t i g h t) = ∑ e : e ∈ E t i g h t y e ′ .

Proof Considering any two different tight edge sets

E 1 T

and

E 2 T

T

, we have

π (E 1 T) = ∑ e : e ∈ E 1 T y e ′, and π (E 2 T) = ∑ e : e ∈ E 2 T y e ′ .

Therefore,

∑ e : e ∈ E 1 T ∪ E 2 T y e ′ + ∑ e : e ∈ E 1 T ∩ E 2 T y e ′ = ∑ e : e ∈ E 1 T y e ′ + ∑ e : e ∈ E 2 T y e ′ = π (E 1 T) + π (E 2 T) ≥ π (E 1 T ∪ E 2 T) + π (E 1 T ∩ E 2 T) ≥ π (E 1 T ∪ E 2 T) + ∑ e : e ∈ E 1 T ∩ E 2 T y e ′,

where the first inequality follows from the submodularity of

π (⋅)

and the second inequality follows from inequality Eq. (6), which implies that

∑ e : e ∈ E 1 T ∪ E 2 T y e ′ ≥ π (E 1 T ∪ E 2 T) .

Furthermore, this statement and inequality Eq. (6) imply that

∑ e : e ∈ E 1 T ∪ E 2 T y e = π (E 1 T ∪ E 2 T)

, which means that

E 1 T ∪ E 2 T

is a tight subset. By repeating the merging of the tight subsets in

T

, we obtain that

E t i g h t = ∪ E T ∈ T E T

is a tight subset, and the lemma holds. 　　　　　　　　　　　　　　　　　　　 □

Then, we consider the case in which

(S, R)

is generated by Phase 2 Consistent with the above definition, let

(y ′, 0)

be the value of the dual variables after Phase 1, and let

(y ″, γ)

be the value of the dual variables after Phase 2.

Lemma 3.5

(y ″, γ)

is a feasible solution of dual program Eq. (4).

Proof In Algorithm 1, any dual variable of an edge that is covered by any tight vertex remains unchanged, which means

∑ e : e ∈ δ (v) y e ″ ≤ c (v), ∀ v ∈ V .

Since

γ

keeps increasing in Phase 2, we have

γ = max e ∈ E (y e ″ −

y e ′)

and

y e ″ − γ ≤ y e ′, ∀ e ∈ E .

Thus, for any edge set

E ′ ⊆ E

, we have

∑ e : e ∈ E ′ y e ″ − | E ′ | γ = ∑ e : e ∈ E ′ (y e ″ − γ) ≤ ∑ e : e ∈ E ′ y e ′ ≤ π (E ′),

where the last inequality follows from Lemma 3.2. Therefore, the lemma holds. 　　　　　　　　　　　　　　　　　□

Lemma 3.6 If the pair

(S, R)

is generated by Phase 2, the cost of vertex set

S

c (S) ≤ 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ′ − 2 (m − k) γ + c (v l a s t),

where

v l a s t

is the last vertex added to the tight vertex set in Phase 2.

Proof Since

v l a s t

is the last vertex added to the tight vertex set in Phase 2,

(8)

| δ (S) | ≥ k a n d | δ (S ∖ {v l a s t}) | < k .

Any edge

e ∈ E ∖ δ (S ∖ {v l a s t}))

is not covered by any tight vertex before the last iteration of Phase 2, and dual variables

{y e} e ∈ E ∖ δ (S ∖ {v l a s t})

and

γ

both increase simultaneously until vertex

v l a s t

becomes tight. Thus, we have

y e ″ = y e ′ + γ, ∀ e ∈ E ∖ δ (S ∖ {v l a s t}) .

Since any vertex

v ∈ S

is tight, we have

c (v) = ∑ e : e ∈ δ ({v}) y e ″, ∀ v ∈ S .

Thus, the cost of vertex set

S

c (S) = ∑ v : v ∈ S c (v) = ∑ v : v ∈ S ∖ {v l a s t} ∑ e : e ∈ δ (v) y e ″ + c (v l a s t) = ∑ e : e ∈ δ (S ∖ {v l a s t}) y e ″ | V (e) ∩ (S ∖ {v l a s t}) | + c (v l a s t) ≤ 2 ∑ e : e ∈ δ (S ∖ {v l a s t}) y e ″ + c (v l a s t) = 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ″ + c (v l a s t) = 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) (y e ′ + γ) + c (v l a s t) = 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ′ − 2 | E ∖ δ (S ∖ {v l a s t}) | γ + c (v l a s t) ≤ 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ′ − 2 (m − k) γ + c (v l a s t),

where the first inequality follows from the fact that any edge is incident to two vertices, and the second inequality follows from

| E | = m

, inequality Eq. (8) and

γ ≥ 0

.　　　　　□

Theorem 3.7 If

π (⋅)

is a normalized and nondecreasing function, Algorithm 1 can output a feasible solution

(S, R)

of instance

I

satisfying

c (S) + π (R) ≤ 3 O P T + c (v l a s t),

where

O P T

is the optimal value of instance

I

Proof By Lemma 3.2 and Lemma 3.5, we have

(9)

∑ e ∈ E y e ′ ≤ O P T D P ≤ O P T a n d ∑ e ∈ E y e ″ − (m − k) γ ≤ O P T D P ≤ O P T,

where

O P T D P

is the optimal value of dual program Eq. (4) and

O P T D P ≤ O P T

follows from the famous duality theorem.

Regardless of whether

(S, R)

is generated by Phase 1 or Phase 2, we have

R ⊆ E t i g h t

. By the nondecreasing function

π (⋅)

and Lemma 3.4, we have

π (R) ≤ π (E t i g h t) = ∑ e : e ∈ E t i g h t y e ′ .

(S, R)

is generated by Phase 1, the objective value of

(S, R)

c (S) + π (R) ≤ 2 ∑ e : e ∈ δ (S) y e ′ + ∑ e : e ∈ E t i g h t y e ′ ≤ 3 ∑ e ∈ E y e ′ ≤ 3 O P T,

where the first inequality follows from Lemma 3.3, the second inequality follows from

δ (S) ⊆ E

and

E t i g h t ⊆ E

, and the last inequality follows from inequality Eq. (9).

(S, R)

is generated by Phase 2, the objective value of

(S, R)

c (S) + π (R) ≤ 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ′ − 2 (m − k) γ + c (v l a s t) + ∑ e : e ∈ E t i g h t y e ′ ≤ 2 (∑ e ∈ E y e ″ − (m − k) γ) + ∑ e ∈ E y e ′ + c (v l a s t) ≤ 3 O P T + c (v l a s t),

where the first inequality follows from Lemma 3.6, the second inequality follows from

E t i g h t ⊆ E

and

y e ′ ≥ 0

for any

e ∈ E

, and the last inequality follows from inequality Eq. (9).

Therefore, the theorem holds. 　　　　　　　　　　　　 □

In particular, if

π (⋅)

is a linear function, i.e.,

π (E ′) = ∑ e : e ∈ E ′ π ({e})

for any

E ′ ⊆ E

, we have the following theorem.

Theorem 3.8 If

π (⋅)

is a linear function, Algorithm 1 can output a feasible solution

(S, R)

of instance

I

satisfying

c (S) + π (R) ≤ 2 O P T + c (v l a s t),

where

O P T

is the optimal value of instance

I

Proof Since

R ⊆ E t i g h t

, any edge

e ∈ R

is tight and

π (R) = ∑ e : e ∈ R π ({e}) = ∑ e : e ∈ R y e ′ .

(S, R)

is generated by Phase 1, the objective value of

(S, R)

c (S) + π (R) ≤ 2 ∑ e : e ∈ δ (S) y e ′ + ∑ e : e ∈ R y e ′ ≤ 2 ∑ e ∈ E y e ′ ≤ 2 O P T,

where the first inequality follows from Lemma 3.3, the second inequality follows from

R = E ∖ δ (S)

, and the last inequality follows from inequality (9).

(S, R)

is generated by Phase 2, the objective value of

(S, R)

c (S) + π (R) ≤ 2 ∑ e : e ∈ E y e ″ − 2 ∑ e : e ∈ E ∖ δ (S ∖ {v l a s t}) y e ′ − 2 (m − k) γ + c (v l a s t) + ∑ e : e ∈ R y e ′ ≤ 2 (∑ e ∈ E y e ″ − (m − k) γ) + c (v l a s t) ≤ 2 O P T + c (v l a s t),

where the first inequality follows from Lemma 3.6, the second inequality follows from

R = E ∖ δ (S) ⊆ E ∖ δ (S ∖ {v l a s t})

, and the last inequality follows from inequality Eq. (9). Therefore, the theorem holds.　　　　　　　　　　　　　　　　　　　□

3.2 A combinatorial algorithm of the $k$ -PCVCS

In fact, we cannot know the maximum cost vertex

v m a x ∗

in advance; however, we can guess this vertex by enumerating all the vertices in

V

. Thus, for each

v ∈ V

, the combinatorial algorithm constructs the auxiliary instance

I ∖ {v}

defined in Section 2, and we find a feasible solution

(S v, R v)

of instance

I ∖ {v}

using Algorithm 1. Let

v ′ = arg ⁡ min v ∈ V (c (S v) + π (R v) + c (v)),

and the combinatorial algorithm outputs

(S, R) = (S v ′ ∪ {v ′},

R v ′)

. We propose the detailed algorithm in Algorithm 2.

Full size|PPT slide

Theorem 3.9 If

π (⋅)

is a normalized and nondecreasing function, Algorithm 2 is a combinatorial

3

-approximation algorithm for the

k

-PCVCS. Specifically, if

π (⋅)

is a linear function, Algorithm 2 is a

2

-approximation algorithm.

Proof Let

(S ∗, R ∗)

be the optimal solution for instance

I

and

O P T I

be the objective value of

S ∗, R ∗

. Let

v m a x ∗ = arg ⁡ max v ∈ S ∗

c (v)

and let

I ∖ {v m a x ∗}

be the auxiliary instance defined in Section 2. Let

(S v m a x ∗, R v m a x ∗)

be the output solution of instance

I ∖ {v m a x ∗}

using Algorithm 1. By the definition of

I ∖ {v m a x ∗}

, we have

(10)

c (v l a s t) ≤ c (v m a x ∗),

where

v l a s t

is the last vertex added to

S v m a x ∗

Since

v ′ = arg ⁡ min v ∈ V (c (S v) + π (S v) + c (v))

, the objective of

(S, R)

O U T = c (S v ′ ∪ {v ′}) + π (R v ′) ≤ c (S v m a x ∗ ∪ {v m a x ∗}) + π (R v m a x ∗) = c (S v m a x ∗) + π (R v m a x ∗) + c (v m a x ∗) ≤ 3 O P T I ∖ {v m a x ∗} + c (v l a s t) + c (v m a x ∗) ≤ 3 (O P T I ∖ {v m a x ∗} + c (v m a x ∗)) ≤ 3 O P T I,

where the second inequality follows from Theorem 3.7, the third inequality follows from inequality Eq. (10), and the last inequality follows from Lemma 2.1.

π (⋅)

is a linear function, similar to the proof above, the objective of

(S, R)

O U T ≤ c (S v m a x ∗) + π (R v m a x ∗) + c (v m a x ∗) ≤ 2 O P T I ∖ {v m a x ∗} + c (v l a s t) + c (v m a x ∗) ≤ 2 (O P T I ∖ {v m a x ∗} + c (v m a x ∗)) ≤ 2 O P T I,

where the second inequality follows from Theorem 3.8.

For each

v ∈ V

, Algorithm 2 must run Algorithm 1 once. This statement and Lemma 3.1 imply that Algorithm 2 can be implemented in can be computed in

O (n 17 ⋅ ρ + n 18)

. 　　　　□

4 Conclusions

In this paper, we consider the

k

-prize-collecting minimum vertex cover problem with submodular penalties (

k

-PCVCS), which strictly requires both

δ (S) ∩ R = ∅

and

δ (S) ∪ R = E

to be established, where

(S, R)

is a feasible solution for

k

-PCVCS, and

δ (S)

is the set of edges covered by

S

. When the submodular penalty cost function is normalized and nondecreasing, we propose a combinatorial

3

-approximation algorithm. In many practical situations, we can relax the condition of

k

-PCVCS as

δ (S) ∩ R ≠ ∅

and

δ (S) ∪ R = E

. For this relaxed problem, for the submodular penalty cost function that is not normalized and nondecreasing, the proposed algorithm also has an approximation ratio of

3

. When the submodular penalty cost function is linear, we prove the approximation factor of the proposed algorithm is reduced to 2, which is the best factor if the unique game conjecture holds.

In the real world, the penalty cost function may not be submodular, and the topic could be further studied in the following ways. The version with general

π (⋅)

penalties is worth considering, such as,

π (⋅)

is subadditive or supermodular. Recently, there have been many studies on the minimum vertex cover problem with hard capacities [22-24], in which each vertex

v

is associated with a capacity

c v

. The goal of this problem is to select a minimum cost vertex set

S

such that all edges are covered by

S

and each vertex

v ∈ S

covers at most

c v

edges in

δ (v)

. Thus, the

k

-PCVCS with hard capacities, which can be viewed as a generalization of the

k

-PCVCS, deserves to be explored. It is possible to design a combinatorial

3

-approximation algorithm, but it is a challenge.

Acknowledgements

The work was supported in part by the National Natural Science Foundation of China (Grant No. 12071417).

References

Publishing order | Descend order by publishing year | Descend order by cited within

1	Karp R M. Reducibility among combinatorial problems. In: Miller R E, Thatcher J W, Bohlinger J D, eds. Complexity of Computer Computations. Boston: Springer, 1972, 85–103

2	Vazirani V V. Approximation Algorithms. Berlin, Heidelberg: Springer, 2001

3	Khot S, Regev O . Vertex cover might be hard to approximate to within 2 -ε. Journal of Computer and System Sciences, 2008, 74( 3): 335–349

4	Hochbaum D S . Approximation algorithms for the set covering and vertex cover problems. SIAM Journal on Computing, 1982, 11( 3): 555–556

5	Bar-Yehuda R, Even S . A linear-time approximation algorithm for the weighted vertex cover problem. Journal of Algorithms, 1981, 2( 2): 198–203

6	Bshouty N H, Burroughs L. Massaging a linear programming solution to give a 2-approximation for a generalization of the vertex cover problem. In: Proceedings of the 15th Annual Symposium on Theoretical Aspects of Computer Science. 1998, 298–308

7	Hochbaum D S. The t-vertex cover problem: extending the half integrality framework with budget constraints. In: Proceedings of International Workshop on Approximation Algorithms for Combinatorial Optimization. 1998, 111–122

8	Bar-Yehuda R . Using homogeneous weights for approximating the partial cover problem. Journal of Algorithms, 2001, 39( 2): 137–144

9	Gandhi R, Khuller S, Srinivasan A . Approximation algorithms for partial covering problems. Journal of Algorithms, 2004, 53( 1): 55–84

10	Mestre J . A primal-dual approximation algorithm for partial vertex cover: making educated guesses. Algorithmica, 2009, 55( 1): 227–239

11	Hochbaum D S . Solving integer programs over monotone inequalities in three variables: a framework for half integrality and good approximations. European Journal of Operational Research, 2002, 140( 2): 291–321

12	Bar-Yehuda R, Rawitz D . On the equivalence between the primal-dual schema and the local ratio technique. SIAM Journal on Discrete Mathematics, 2005, 19( 3): 762–797

13	Li Y, Du D, Xiu N, Xu D . Improved approximation algorithms for the facility location problems with linear/submodular penalties. Algorithmica, 2015, 73( 2): 460–482

14	Du D, Lu R, Xu D. A primal-dual approximation algorithm for the facility location problem with submodular penalties. Algorithmica, 2012, 63(1–2): 1–2

15	Liu X, Li W . Approximation algorithms for the multiprocessor scheduling with submodular penalties. Optimization Letters, 2021, 15( 6): 2165–2180

16	Liu X, Li W, Xie R. A primal-dual approximation algorithm for the k-prize-collecting minimum power cover problem. Optimization Letters, 2021, DOI

17	Iwata S, Nagano K. Submodular function minimization under covering constraints. In: Proceedings of the 50th Annual IEEE Symposium on Foundations of Computer Science. 2009, 671–680

18	Xu D, Wang F, Du D, Wu C . Approximation algorithms for submodular vertex cover problems with linear/submodular penalties using primal-dual technique. Theoretical Computer Science, 2016, 630: 117–125

19	Kamiyama N . A note on the submodular vertex cover problem with submodular penalties. Theoretical Computer Science, 2017, 659: 95–97

20	Guo J S, Liu W, Hou B. An approximation algorithm for p-prize-collecting set cover problem. Journal of the Operations Research Society of China, 2021, DOI

21	Fleischer L, Iwata S . A push-relabel framework for submodular function minimization and applications to parametric optimization. Discrete Applied Mathematics, 2003, 131( 2): 311–322

22	Kao M J, Shiau J Y, Lin C C, Lee D T . Tight approximation for partial vertex cover with hard capacities. Theoretical Computer Science, 2019, 778: 61–72

23	Cheung W C, Goemans M X, Wong S C W. Improved algorithms for vertex cover with hard capacities on multigraphs and hypergraphs. In: Proceedings of the 25th Annual ACM-SIAM Symposium on Discrete Algorithms. 2014, 1714–1726

24	Wong S C W. Tight algorithms for vertex cover with hard capacities on multigraphs and hypergraphs. In: Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms. 2017, 2626–2637

Options

Outlines

About the journal

Aims & scope

Description

Editorial board

Abstracting / Indexing

Contact us

Browse

Just accepted

Online first

Latest issue

All volumes and issues

Collections

Featured articles

Most accessed

Most cited

Collections

Multimedia collections

Authors & reviewers

Online submisson

Call for papers

Guidelines for authors

Download templates

Guidelines for reviewers

Abstract

Cite this article

1 Introduction

2 Preliminaries

3 A combinatorial algorithm for $k$ -PCVCS

3.1 A two-phase algorithm for instance $I ∖ {v m a x ∗}$

Fig.1 For instance $I = (G; c, π; k)$ , we illustrate Phase 1 in a, b and c; and illustrate Phase 2 in d and e

Fig.2 The relationship of $E ′$ , $E 1 ′$ and $E 2 ′$

3.2 A combinatorial algorithm of the $k$ -PCVCS

4 Conclusions

Acknowledgements

References

About the journal

Browse

Authors & reviewers

Abstract

Cite this article

1 Introduction

2 Preliminaries

3 A combinatorial algorithm for k-PCVCS

3.1 A two-phase algorithm for instance I∖{vmax∗}

Fig.1 For instance I=(G;c,π;k), we illustrate Phase 1 in a, b and c; and illustrate Phase 2 in d and e

Fig.2 The relationship of E′, E1′ and E2′

3.2 A combinatorial algorithm of the k-PCVCS

4 Conclusions

Acknowledgements

References

3 A combinatorial algorithm for $k$ -PCVCS

3.1 A two-phase algorithm for instance $I ∖ {v m a x ∗}$

Fig.1 For instance $I = (G; c, π; k)$ , we illustrate Phase 1 in a, b and c; and illustrate Phase 2 in d and e

Fig.2 The relationship of $E ′$ , $E 1 ′$ and $E 2 ′$

3.2 A combinatorial algorithm of the $k$ -PCVCS