
Hans-Joachim Böckenhauer: Department of Computer Science, ETH Zürich, https://orcid.org/0000-0001-9164-3674
Fabian Frei: CISPA Helmholtz Center for Information Security, https://orcid.org/0000-0002-1368-3205 (work done in part while at ETH Zürich)
Peter Rossmanith: Department of Computer Science, RWTH Aachen, https://orcid.org/0000-0003-0177-8028

Copyright: Hans-Joachim Böckenhauer, Fabian Frei, Peter Rossmanith

CCS description: Theory of computation → Online algorithms

Removable Online Knapsack and Advice

Hans-Joachim Böckenhauer, Fabian Frei, Peter Rossmanith

Abstract

In the proportional knapsack problem, we are given a knapsack of some capacity and a set of variably sized items. The goal is to pack a selection of these items that fills the knapsack as much as possible. The online version of this problem reveals the items and their sizes not all at once but one by one. For each item, the algorithm has to decide immediately whether to pack it or not. We consider a natural variant of this online knapsack problem, which has been coined removable knapsack. It differs from the classical variant by allowing the removal of any packed item from the knapsack. Repacking is impossible, however: Once an item is removed, it is gone for good. We analyze the advice complexity of this problem. It measures how many advice bits an omniscient oracle needs to provide for an online algorithm to reach any given competitive ratio, which is—understood in its strict sense—just the algorithm’s approximation factor. The online knapsack problem is known for its peculiar advice behavior involving three jumps in competitivity. We show that the advice complexity of the version with removability is quite different but just as interesting: The competitivity starts from the golden ratio when no advice is given. It then drops down to $1+\varepsilon$ for a constant amount of advice already, which requires logarithmic advice in the classical version. Removability comes as no relief to the perfectionist, however: Optimality still requires linear advice as before. These results are particularly noteworthy from a structural viewpoint for the exceptionally slow transition from near-optimality to optimality.

Our most important and demanding result shows that the general knapsack problem, which allows an item’s value to differ from its size, exhibits a similar behavior for removability, but with an even more pronounced jump from an unbounded competitive ratio to near-optimality within just constantly many advice bits. This is a unique behavior among the problems considered in the literature so far.

An advice analysis is interesting in its own right, as it allows us to measure the information content of a problem and leads to structural insights. But it also provides insurmountable lower bounds, applicable to any kind of additional information about the instances, including predictions provided by machine-learning algorithms and artificial intelligence. Unexpectedly, advice algorithms are useful in various real-life situations, too. For example, they provide smart strategies for cooperation in winner-take-all competitions, where several participants pool together to implement different strategies and share the obtained prize. Further illustrating the versatility of our advice-complexity bounds, our results automatically improve some of the best known lower bounds on the competitive ratio for removable knapsack with randomization. The presented advice algorithms also automatically yield deterministic algorithms for established deterministic models such as knapsack with a resource buffer and various problems with more than one knapsack. In their seminal paper introducing removability to the knapsack problem, Iwama and Taketomi have indeed proposed a multiple knapsack problem for which we can establish a one-to-one correspondence with the advice model; this paper therefore even provides a comprehensive analysis for this up until now neglected problem.

keywords:

Removable Online Knapsack, Competitive Ratio, Advice Analysis, Advice Applications, Randomized Algorithms, Machine Learning and AI

1 Introduction

In this first section, we briefly summarize what online algorithms and advice are, then informally present the problem whose advice complexity we will be analyzing, and finally describe several applications of such advice complexity results.

1.1 Online Algorithms and Advice Complexity

Online algorithms receive their input piece by piece and have to determine parts of the solution before knowing the entire instance. This often leaves them unable to compete with offline algorithms, which know the entire input in advance, in a meaningful way. In the advice model, we assume an omniscient oracle that provides the online algorithm with some information on how to solve the upcoming instance best. If the oracle can communicate to the algorithm an unlimited amount of such advice, it will of course be able to lead the algorithm to an optimal solution for every instance. The advice complexity measures the minimum amount of information necessary for the online algorithm to achieve any given approximation ratio, which is commonly called strict competitive ratio or competitivity in this context.

Advice complexity is a well-established tool to gauge the information content of an online problem [3, 9, 15]. For a detailed and careful introduction to the theory, we refer to the textbook by Komm [20]. Another classical textbook on online problems is written by Borodin and Yaniv [5]. The trade-off between a low number of transmitted advice bits on the one hand and achieving a good competitive ratio on the other hand has been examined for a wealth of problems—see the survey by Boyar et al. [6]—but one stands out for its peculiar behavior: the knapsack problem.

1.2 Knapsack and Removability

A knapsack instance presents the online algorithm with a sequence of items of different sizes. Upon the arrival of each item, the algorithm has to decide whether to pack it into a knapsack or discard it. The goal is to fill the knapsack as much as possible without ever exceeding the knapsack’s given capacity. This problem is sometimes also referred to as the proportional or simple knapsack problem, as opposed to the general knapsack problem, in which every item has not only a size but also a value.¹ In the generalized version, the goal is to maximize the total value of all packed items. With no further specification given, we are always referring to the proportional case.

¹ It is also quite common for the proportional and general knapsack problem to be called unweighted and weighted, respectively. The notion weight is ambiguous, however, as some authors [4] use it for what is called size here, while others [18] use it for what is called the value here or profit elsewhere. For the sake of clarity, we are well advised to avoid the term weight altogether.

A variant of the knapsack problem has been proposed by Iwama and Taketomi [17] under the name of removable knapsack. In this model, we can discard an item not only when it is first presented to us; we may also remove a packed item from the knapsack at any point. This is possible only once for each item, however; once removed, an item cannot be repacked. As for the classical problem without removability, the capacity of the knapsack may not be exceeded at any point in time. Recently, Rossmanith has introduced a similar relaxed online setting for graph problems where decisions are taken only when constraints make it inevitable [24].

This model is arguably just as natural a way to translate the knapsack problem into the online setting as the better-studied variant without removability. In many cases, it will not be hard to discard items at regular intervals; only the chance of obtaining specific objects is subject to special circumstances. For a practical example, consider a storage room in which you can store all kinds of objects that you come across over time. In the beginning you can just keep collecting everything, but by doing so you inevitably run out of space before too long. Then you will have to start disposing of some of your possessions to make room for new, potentially more interesting acquisitions. Your goal is to end up with a selection of just the most meaningful and useful items that you could have. This paper analyzes the advice complexity of both the proportional and the general removable knapsack problem. It tells you how much information about upcoming opportunities you need to ensure an outcome that is either optimal or off by at most a given factor.

1.3 Advice Applications

Besides inherently interesting insights into the information complexity of the knapsack problem, our advice algorithms also offer more concrete applications. Any algorithm reading a bounded number of advice bits can be implemented by running a bounded number of deterministic algorithms in parallel and selecting the best result. An advice analysis thus tells us, for example, how to optimally organize a betting pool in a winner-take-all scenario. Our main result in particular provides a smart selection of strategies to be assigned to a mere constant number of actors such that one is guaranteed to be as close to optimality as we desire, no matter how difficult the instances of the general knapsack problem with removability may become.

A further advantage of analyzing the advice complexity of a problem is that the resulting bounds are very versatile. The lower bounds are particularly strong. They show that a certain competitivity cannot be achieved with a given amount of additional information, regardless of the form this advice may take. The oracle is indeed able to convey to the algorithm all kinds of structural information about the adversarial instance; for example, in the case of our knapsack problem, whether items smaller than a given threshold should or must not be ignored, whether replacing packed items by later ones will ever be beneficial, whether the values span more than a certain range, whether an optimal solution fills the knapsack completely, whether there are multiple optimal solutions, and so on. Lower bounds on the competitivity of advice algorithms imply lower bounds for randomized algorithms, and our results indeed improve upon the best bounds known for randomization; Theorem 5.3 even completely closes the remaining gap in the analysis of barely random algorithms for the general knapsack problem.

There are also interesting implications for deterministic algorithms. Consider the multiple knapsack problem in which every item is either rejected or packed into one of $k>1$ knapsacks; the goal is for the algorithm to have in the end one knapsack that is as full as possible. This problem has been analyzed with removability by Iwama and Taketomi in the proportional case. In the conclusion of their paper [17], they pose it as an open problem to analyze this model if we are allowed to copy items and pack them into arbitrarily many of the available knapsacks. It turns out that deterministic algorithms for this problem with different values of $k$ are equivalent to advice algorithms: An advice algorithm restricted to $\log k$ advice bits can read up to $k$ different advice strings. Even if the algorithm reads the entire advice string right at the beginning, before taking any decision, it will thus implement one of $k$ deterministic strategies. Having $k$ knapsacks and being able to pack each item into several of them at the same time means that we can just simulate each possible strategy in one of the knapsacks and see which one leads to the best result in the end. Conversely, the oracle in our model already knows the optimal choice and can communicate to an advice algorithm which of the knapsacks it should be simulating. Knowing the advice complexity of our problem with only one knapsack therefore automatically yields a comprehensive competitive analysis for this deterministic problem for any $k>1$. All of this remains true for the general knapsack problem; thus our results provide a comprehensive picture for the proposed model in both the proportional and non-proportional case. We remark that algorithms for the resource buffer model are generally not applicable here. Algorithm $2$ by Han et al. [13, Thm. 9], for example, keeps regrouping items in every step and thus crucially relies on having a single large buffer instead of multiple standard-sized knapsacks without an option to shuffle items between them.
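The correspondence between $k$ knapsacks with copying and $\log k$ advice bits can be illustrated with a minimal Python sketch. The two strategies `greedy` and `keep_largest` are hypothetical toy examples chosen for this illustration, not algorithms from this paper; the index returned by `run_k_knapsacks` plays the role of the advice string that an oracle would communicate.

```python
from fractions import Fraction as F

def greedy(packed, item):
    """Hypothetical strategy 0: pack whatever still fits, never remove."""
    return packed + [item] if sum(packed, F(0)) + item <= 1 else packed

def keep_largest(packed, item):
    """Hypothetical strategy 1: hold only the single largest item seen so far."""
    return [item] if not packed or item > packed[0] else packed

def run_k_knapsacks(strategies, items):
    """Simulate one deterministic strategy per knapsack; every item is
    offered (copied) to all knapsacks. Returns (best index, best filling)."""
    knapsacks = [[] for _ in strategies]
    for item in items:
        knapsacks = [step(k, item) for step, k in zip(strategies, knapsacks)]
    fillings = [sum(k, F(0)) for k in knapsacks]
    best = max(range(len(strategies)), key=fillings.__getitem__)
    return best, fillings[best]

print(run_k_knapsacks([greedy, keep_largest], [F(3, 10), F(4, 5)]))
```

With $k=2$ strategies, the returned index fits into a single advice bit: an advice algorithm that receives this bit up front can run the winning strategy in a single knapsack.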

2 Preliminaries

Throughout this paper, $\log$ denotes the binary logarithm. We formally define the removable knapsack problem as follows.

Definition 2.1 (Removable Knapsack Problem).

RemKnap is an online maximization problem. An instance is a sequence $(s_{1},v_{1}),\dots,(s_{n},v_{n})$ of $n$ items, each of which is a pair of some real positive size $s_{i}$ and value $v_{i}$. Where useful, we denote size and value of an item $i$ functionally by $s(i)=s_{i}$ and $v(i)=v_{i}$. The domain of this function naturally extends to arbitrary subsets $T\subseteq\{1,\dots,n\}$ of items by defining $s(T)=\sum_{i\in T}s(i)$ and $v(T)=\sum_{i\in T}v(i)$. The knapsack has a maximum size capacity, which we normalize to be $1$.

The instance is presented item by item to an online algorithm $\mathcal{A}$ that has to maintain a packing, a set of packed items. We call the total size of the currently packed items the current filling of the knapsack. The algorithm starts out with an empty knapsack, represented by the empty set $T_{0}=\emptyset$. When presented with item $i$, the algorithm may first remove any of the items $T_{i-1}$ packed so far; then it may pack the new item if this does not exceed the knapsack capacity. In other words, the algorithm selects a subset $T_{i}\subseteq T_{i-1}\cup\{i\}$ with $s(T_{i})\leq 1$ in step $i$. The algorithm learns the size of item $i$ only once it is presented and only learns the total number $n$ of items after selecting $T_{n}$. The final packing computed by $\mathcal{A}$ is denoted by $T=T_{n}$. The gain that we aim to maximize is the total value $v(T)$ of the final packing.

The proportional variant PropRemKnap additionally satisfies $s_{i}=v_{i}$ for each item $i$.
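The constraints of this definition can be made concrete with a small Python sketch. The helper names `is_valid_run` and `gain` are assumptions introduced here for illustration; a run is given as the list of packings $T_{1},\dots,T_{n}$ with items indexed from $1$ as in the definition.

```python
from fractions import Fraction as F

def is_valid_run(sizes, packings):
    """Check T_i ⊆ T_{i-1} ∪ {i} and s(T_i) ≤ 1 for every step i = 1..n."""
    previous = set()                      # T_0 is the empty set
    for i, current in enumerate(packings, start=1):
        if not current <= previous | {i}:
            return False                  # a removed item was repacked
        if sum((sizes[j - 1] for j in current), F(0)) > 1:
            return False                  # capacity exceeded
        previous = set(current)
    return len(packings) == len(sizes)

def gain(values, final_packing):
    """Total value v(T) of the final packing T = T_n."""
    return sum((values[j - 1] for j in final_packing), F(0))

sizes = [F(1, 2), F(3, 10), F(7, 10)]
print(is_valid_run(sizes, [{1}, {1, 2}, {2, 3}]))   # item 1 is removed in step 3
print(is_valid_run(sizes, [{1}, {2}, {1, 2}]))      # item 1 is repacked in step 3
```

In the proportional variant, `gain(sizes, T)` is simply the final filling of the knapsack.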

Definition 2.2 (Competitive Ratio).

Let an online maximization problem with instance set $\mathcal{I}$ be given and let $\mathcal{A}$ be an online algorithm solving it. For any instance $I\in\mathcal{I}$, denote by $\textnormal{alg}(I)$ the gain that $\mathcal{A}$ achieves on $I$ and by $\textnormal{opt}(I)$ the gain of an optimal solution to $I$ computed offline. The competitive performance of $\mathcal{A}$ on an instance $I$ is $\textnormal{opt}(I)/\textnormal{alg}(I)$. For any $\rho\in\mathbb{R}$, the algorithm $\mathcal{A}$ is called strictly $\rho$-competitive if it performs $\rho$-competitively across all instances, that is, if $\forall\,I\in\mathcal{I}\colon\ \textnormal{opt}(I)/\textnormal{alg}(I)\leq\rho$. The infimal competitivity $\inf\{\,\rho\in\mathbb{R}\mid\mathcal{A}\text{ is strictly $\rho$-competitive}\,\}$ is called the strict competitive ratio of $\mathcal{A}$. We can weaken the defining inequality above so that it only needs to hold asymptotically in the sense of $\exists\,\alpha\in\mathbb{R}_{+}\colon\ \forall\,I\in\mathcal{I}\colon\ \textnormal{opt}(I)\leq\rho\cdot\textnormal{alg}(I)+\alpha$. If this condition is met, we call $\mathcal{A}$ nonstrictly $\rho$-competitive.

Note that strict $\rho$ -competitivity implies nonstrict $\rho$ -competitivity but not vice versa, making it harder to prove lower bounds for nonstrict competitivity. For the knapsack problem, however, it makes sense to always analyze competitivity in the strict sense: On the one hand, we obtain a nonstrict lower bound from a strict one by scaling up the knapsack capacity and all item sizes in a hard instance set such that the smallest item is strictly larger than $\alpha$ . If, on the other hand, scaling is impossible due to the problem being defined with a fixed knapsack capacity of $1$ , for example, then choosing $\alpha=1$ shows any online algorithm is $1$ -competitive in the nonstrict sense.

3 Related Work

Knapsack is one of the 21 NP-complete decision problems in Karp’s famous list [19]. An algorithm based on dynamic programming solves both the proportional and the general version in pseudo-polynomial time; see Bellman [1, Section 1.4] for the general technique and Dantzig [8, p. 275] for a concrete description of its application to the knapsack problem. The pseudo-polynomial time algorithm can be adapted to the optimization version, yielding a fully polynomial-time approximation scheme [16]. In the following two subsections, we list the known results on the advice complexity of the proportional knapsack problem, first for the classical version and then for the variant allowing the removal of packed items.

3.1 Knapsack without Removability

Marchetti-Spaccamela and Vercellis were the first to consider the classical online version of the knapsack problem in 1995. They called it the $\{0,1\}$ knapsack problem to distinguish it from the fractional knapsack problem, which allows for packing items partially. They proved that both versions have an unbounded competitive ratio if items are allowed to have sizes different from their values [21, Thm. 2.1]. We denote the classical problem with neither fractional items nor removability by Knap and its proportional variant by PropKnap.

The concept of advice emerged much later. When it did, Knap quickly became one of the prime examples of a problem with an interesting advice complexity.

First, just a single advice bit brings with it a jump from non-competitivity to a $2$-competitive algorithm [4, Thm. 4]. More advice bits do not help, however, as long as the number stays below $\lfloor\log(n-1)\rfloor$ [4, Thm. 5]. Once this threshold is surpassed, logarithmic advice allows for a competitive ratio that is arbitrarily close to 1 [4, Thm. 6]. Achieving optimality, finally, requires at least $n-1$ advice bits [4, Thm. 3].

The situation for the general variant is simpler: Any algorithm reading fewer than $\log n$ advice bits has an unbounded competitive ratio, but $\mathcal{O}(\log n)$ advice suffices for a near-optimal solution [4, Thms. 11 and 12]. A schematic plot of the advice complexity behaviors just described can be found in Figure 1 in light gray.

3.2 Online Knapsack Variants

Iwama and Taketomi [17] proposed the online knapsack model with removability as it is examined in the present paper. They proved that the competitive ratio for the proportional variant of this problem, which we denote by PropRemKnap, is exactly the golden ratio. Iwama and Zhang later considered the problem with resource augmentation, that is, for online algorithms that may use a larger knapsack than the offline algorithm [18].

Later still, Han et al. [12] proved an upper bound of $5/3$ on the competitive ratio for a variant of PropRemKnap where the value $v$ of an item is not necessarily proportional to its size $s$ but not arbitrary either; instead, the value is given by a convex function $v=f(s)$ known to the algorithm. They also proved the golden ratio to be optimal if $f$ has some further technical properties. Han et al. [10] considered online knapsack with removal costs, a variant of PropRemKnap where items can be removed, but not for free.

Noga and Sarbua [22] considered a knapsack variant, where it is possible to split each arriving item in two parts of not necessarily equal size, and combine this with resource augmentation. Han and Makino [14] considered another partially fractional variant of PropRemKnap where each item can be split a constant number of times at any time. Most importantly in our context, Han et al. [11] examined randomized algorithms for PropRemKnap, proving an upper bound of $10/7$ and a lower bound of $5/4$ on the expected competitivity. Cygan et al. [7] extended the study of randomization for PropRemKnap to a variant with multiple knapsacks. Recently, Böckenhauer et al. [2] have introduced a new model for the online proportional knapsack problem in which items can be stored outside of the knapsack until the instance ends after paying a reservation fee that is a fixed fraction $\alpha$ of the item’s value.

4 Results for Proportional Removable Knapsack

In Section 4.1, we consider how much—or rather, how little—removability helps when trying to obtain an optimal solution. In Section 4.2, we prove upper and lower bounds on what is possible with a single advice bit. Finally, we prove in Section 4.3 that a constant amount of advice is sufficient to achieve a competitive ratio of $1+\varepsilon$ , for an arbitrary $\varepsilon>0$ , and a constant depending on $\varepsilon$ . See Figure 1 for a rough representation of these results in dark gray.

Figure 1: A schematic plot of the advice complexity behavior of the classical online knapsack problem in light gray and the relaxed variant with removability in dark gray. For the proportional version without removability there are two large plateaus; removability collapses them into a single vast expanse. For the general version, in which an item’s value may differ from its size, there is only one but a more extreme jump directly from an unbounded competitive ratio to near optimality; with removability, this jump occurs earlier and is even steeper.

4.1 Achieving Optimality

We begin by briefly considering PropKnap, the classical proportional knapsack problem without removability. Solving it optimally is trivial with $n$ advice bits: The algorithm reads one bit per item, telling it whether to accept or reject. Theorem 4.1 proves this to be tight by lifting the best known lower bound from $n-1$ advice bits [4, Thm. 3] to $n$ advice bits.

Theorem 4.1.

Any algorithm for PropKnap reading fewer than $n$ advice bits is suboptimal.

Proof 4.2.

For every $n$, we consider the $2^{n-1}$ instances that all begin with the same $n-1$ items, namely one for each of the sizes $s_{1}=2^{-1},s_{2}=2^{-2},\dots,s_{n-1}=2^{-(n-1)}$. The size of the final item is either $s_{n}=2^{-n}$ or any $s_{n}^{\prime}\in\{\,1-\sum_{i\in I}s_{i}\mid I\subsetneq\{1,2,\dots,n-1\}\,\}$. We remark that this hard instance family is almost identical to the one used by Böckenhauer et al. [4, Thm. 3] for proving the lower bound of $n-1$ advice bits; the only tiny modification is changing the item size $s_{n}^{\prime}=1-\sum_{i\in I}s_{i}$ for $I=\{1,2,\dots,n-1\}$ to $s_{n}=2^{-n}$. There are $2^{n-1}$ options for the final item, and each one requires an optimal algorithm to have packed a different one of the $2^{n-1}$ possible subsets of the previous $n-1$ items: If the last item turns out to have size $s_{n}^{\prime}=1-\sum_{i\in I}s_{i}$, for any given $I\subsetneq\{1,2,\dots,n-1\}$, then the knapsack can be filled completely if and only if exactly the items $I$ have been packed before it. And if the last item has size $s_{n}=2^{-n}$, then the algorithm needs to pack all items for optimality, leaving a gap of $2^{-n}$. Now, an advice string of length $n-1$ suffices to distinguish between the $2^{n-1}$ possible sizes of the last item, allowing the algorithm to select the optimal subset of the first $n-1$ items. An advice algorithm reading fewer than $n-1$ advice bits on instances of length $n$ inevitably packs the same subset of the first $n-1$ items for two different sizes of the last item, leading to a suboptimal performance for one of the two.

This is essentially what led to the previously known lower bound of $n-1$ advice bits for optimality. The key point to note now is that the algorithm has to read these $n-1$ advice bits, proved necessary for optimality on instances of length $n$ , before the last item is presented. Thus it is in fact reading $n-1$ advice bits while processing the first $n-1$ items of the instance, which—thanks to our modification—also constitute a complete instance on their own. Thus $n^{\prime}=n-1$ advice bits are consumed on an instance of length $n^{\prime}$ , concluding the proof.
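The counting argument of this proof can be checked by brute force for small $n$. The following Python sketch, with the hypothetical helper names `hard_family` and `required_prefix_choice`, builds the instance family and verifies that every possible final item forces a different subset of the first $n-1$ items. It runs in exponential time and is only meant as a sanity check for tiny $n$.

```python
from fractions import Fraction as F
from itertools import combinations

def hard_family(n):
    """Common prefix 2^-1, ..., 2^-(n-1) and the 2^(n-1) possible final items."""
    prefix = [F(1, 2**i) for i in range(1, n)]
    finals = [F(1, 2**n)]
    for r in range(n - 1):                         # proper subsets I only
        for I in combinations(range(n - 1), r):
            finals.append(1 - sum((prefix[i] for i in I), F(0)))
    return prefix, finals

def required_prefix_choice(prefix, final):
    """All subsets of the prefix from which the optimum is still reachable
    once the final item arrives (no removal: classical PropKnap)."""
    def gain(P):
        filled = sum((prefix[i] for i in P), F(0))
        return filled + final if filled + final <= 1 else filled
    subsets = [P for r in range(len(prefix) + 1)
               for P in combinations(range(len(prefix)), r)]
    best = max(gain(P) for P in subsets)
    return [P for P in subsets if gain(P) == best]

prefix, finals = hard_family(4)
choices = [required_prefix_choice(prefix, f) for f in finals]
print(len(finals), all(len(c) == 1 for c in choices), len({c[0] for c in choices}))
```

For $n=4$ the sketch reports $8=2^{n-1}$ final items, each with exactly one admissible prefix subset, and all $8$ subsets are distinct.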

Having determined PropKnap’s advice complexity for optimality, we now do the same for PropRemKnap, the variant with removability. It turns out that the option to remove items hardly helps at all in achieving optimality. We begin with the upper bound, which is simple but instructive as to what is possible with removability.

Theorem 4.3.

There is an optimal algorithm for PropRemKnap reading $n-1$ advice bits.

Proof 4.4.

Consider an algorithm that packs the first item without reading any advice bits. For each subsequent item, it reads one advice bit, telling it whether the new item is part of a fixed optimal solution. If so, then the new item is packed; otherwise, it is rejected. The first item, which has been packed without advice, is kept in the knapsack as long as there is enough room for it. If the first item is part of the fixed optimal solution, then it will always fit in beside the other items being packed; otherwise, it will be discarded at some point. Thus the algorithm is able to reproduce the fixed optimal solution exactly.
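The algorithm from this proof can be sketched in a few lines of Python. The brute-force oracle `fixed_optimal_solution` and the function names are assumptions made for this illustration; the sketch uses 0-based item indices and assumes that every item size is at most $1$, so that the first item always fits into the empty knapsack.

```python
from fractions import Fraction as F
from itertools import combinations

def fixed_optimal_solution(sizes):
    """Brute-force oracle: index set of one optimal packing (tiny inputs only)."""
    best, best_set = F(0), frozenset()
    for r in range(len(sizes) + 1):
        for idx in combinations(range(len(sizes)), r):
            total = sum((sizes[i] for i in idx), F(0))
            if best < total <= 1:
                best, best_set = total, frozenset(idx)
    return best_set

def advice_bits(sizes):
    """n-1 bits: one per item except the first, marking membership in OPT."""
    opt = fixed_optimal_solution(sizes)
    return [i in opt for i in range(1, len(sizes))]

def run_with_advice(sizes, bits):
    packed = {0} if sizes else set()       # first item is packed without advice
    for i, bit in zip(range(1, len(sizes)), bits):
        if not bit:
            continue                       # item i is not in the fixed optimum
        if 0 in packed and sum(sizes[j] for j in packed) + sizes[i] > 1:
            packed.discard(0)              # no room left: give up the first item
        packed.add(i)
    return packed

sizes = [F(1, 2), F(3, 10), F(7, 10)]
print(sorted(run_with_advice(sizes, advice_bits(sizes))))
```

On this example the first item is kept until the third item arrives and is then discarded, so the final packing coincides with the fixed optimal solution.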

Theorem 4.5.

Solving PropRemKnap optimally requires more than $n-\log n$ advice bits.

Proof 4.6.

Let any positive $\varepsilon<1/3$ of the form $\varepsilon=1/(2j)$ for an odd integer $j$ be given. Now, we can choose an arbitrarily large odd integer $m$ such that $2\varepsilon m+1$ is a power of two, namely a multiple of $j$ of the form $m=j(2^{t}-1)$, for which $2\varepsilon m+1=m/j+1=2^{t}$. Since $2\varepsilon m+1$ is even and $m$ is odd, we know that $m-2\varepsilon m$ is even and thus $m/2-\varepsilon m$ is an integer. We denote it by $k=m/2-\varepsilon m$ and note that $m-2k=2\varepsilon m$. We consider a family of instances that are all identical, with the exception of the final item. The capacity of the knapsack shall be $m-k+1$. We could of course normalize this to $1$ by scaling down the capacity and all item sizes.

First, for each $i\in\{0,1,\dots,\log(2\varepsilon m+1)-1\}$, an item of size $2^{i}$ is presented. Note that these $\log(2\varepsilon m+1)$ items have a total size of $\sum_{i=0}^{\log(2\varepsilon m+1)-1}2^{i}=2^{\log(2\varepsilon m+1)}-1=2\varepsilon m$. Moreover, for every $j\in\{0,1,\dots,2\varepsilon m\}$, there is exactly one item subset of total size $j$.

The instance then continues with $m$ more items, one of each size in

$$M=\Bigl\{1+\frac{1}{2},\ldots,1+\frac{1}{2^{m}}\Bigr\}=\Bigl\{\,1+\frac{1}{2^{i}}\Bigm|1\leq i\leq m\,\Bigr\}.$$

Finally, a single item of size $1-\sum_{s\in S}(s-1)$ is presented, where $S$ may be any subset of $M$ whose cardinality satisfies $k\leq|S|\leq m-k$ . There are $\sum_{i=k}^{m-k}\binom{m}{i}$ such subsets and we have one instance for each possible choice. A straightforward tail bound, which we derive at the end of this proof, shows that we have $\sum_{i=0}^{k}\binom{m}{i}\leq 2^{mH(1/2-\varepsilon)+\log m-1}$ , where $H(p)=-p\log p-(1-p)\log(1-p)$ is the binary entropy function. We also know that $\sum_{i=0}^{m}\binom{m}{i}=2^{m}$ and therefore obtain

$$\sum_{i=k}^{m-k}\binom{m}{i}\geq 2^{m}-2\cdot 2^{mH(1/2-\varepsilon)+\log m-1}=2^{m}\bigl(1-2^{-m(1-H(1/2-\varepsilon))+\log m}\bigr)$$

as a lower bound on the number of instances.

Consider the instance whose final item has size $1-\sum_{s\in S_{0}}(s-1)$ for any given $S_{0}\subseteq M$ . The knapsack of capacity $m-k+1$ can be filled completely with the items of this instance as follows: Pack the last item, all items of sizes in $S_{0}$ —which brings us to a total size of exactly $|S_{0}|+1$ —and finally the unique subset of the first $\log(2\varepsilon m+1)$ items with a total size of $m-k+1-(|S_{0}|+1)=m-(k+|S_{0}|)$ . This is in fact the only way to fill the knapsack entirely for the following reason. There is exactly one selection of the items with non-integer sizes such that their fractional parts sum up to an integer—namely exactly $1$ —which is necessary to fill the knapsack of integer capacity completely. The remaining integer gap has to be filled by the remaining items with integer sizes, and we have already noted before that there is exactly one way to do this for every integer between $0$ and $m-2k=2\varepsilon m$ . The instances are all identical until the last item is presented. Therefore, any online algorithm must have packed exactly the right selection of items when the final one is presented to realize the optimal solution. The number of advice bits necessary to distinguish the possible instances and guarantee optimality is therefore

$$\log\sum_{i=k}^{m-k}\binom{m}{i}\geq m+\log\bigl(1-2^{-m(1-H(1/2-\varepsilon))+\log m}\bigr).$$

The following straightforward calculations—using the Taylor expansion for the logarithm and Stirling bounds—show that this can be bounded from below by $n-\log(5\varepsilon n)>n-\log(n)+\log(1/\varepsilon)-3$ for sufficiently large $m$ .

Using the Taylor expansion for the natural logarithm around $1$ and the resulting identity $\sum_{i=1}^{\infty}(1/2)^{i}/i=\ln 2=1/\log e$, we have

	$\displaystyle\log(1-x)$	$\displaystyle=-(\log e)\sum_{i=1}^{\infty}\frac{x^{i}}{i}$
		$\displaystyle=-x(\log e)\sum_{i=1}^{\infty}\frac{x^{i-1}}{i}$
		$\displaystyle>-x(\log e)\sum_{i=1}^{\infty}\frac{(1/2)^{i-1}}{i}$
		$\displaystyle=-2x(\log e)\sum_{i=1}^{\infty}\frac{(1/2)^{i}}{i}$
		$\displaystyle=-2x$

for $0<x<1/2$. Applying this to $x=2^{-m(1-H(1/2-\varepsilon))+\log m}$, we can thus further bound the number of required advice bits from below by

$$\log\sum_{i=k}^{m-k}\binom{m}{i}\geq m-2\cdot 2^{-m(1-H(1/2-\varepsilon))+\log m}.$$

Each of the described instances contains $n=\log(2\varepsilon m+1)+m+1$ items. This is greater than the number of required advice bits by at most

$$\log(2\varepsilon m+1)+1+2\cdot 2^{-m(1-H(1/2-\varepsilon))+\log m}.$$

For any fixed, positive $\varepsilon<1/2$, we have $0<H(1/2-\varepsilon)<1$, and thus the exponent $-m(1-H(1/2-\varepsilon))+\log m$ tends to $-\infty$ as $m$ grows. This means that the difference between the number of items $n$ and the number of required advice bits asymptotically coincides with $\log(2\varepsilon m+1)+1=n-m$.

We have $m=n-\log(2\varepsilon m+1)-1>n-\log(2\varepsilon n+1)-1$ and thus $n-m<\log(2\varepsilon n+1)+1<\log(5\varepsilon n)$ for sufficiently large $m$ and thus $n$ . Hence more than $n-\log(5\varepsilon n)>n-\log(n)+\log(1/\varepsilon)-3$ advice bits are necessary for optimality on instances of length $n$ for an arbitrarily small positive $\varepsilon<1/2$ .
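The instance family of this proof can be inspected by brute force for tiny parameters. The Python sketch below uses the hypothetical helper names `instance_family` and `full_packings` and checks, for the smallest admissible choice $\varepsilon=1/6$ and $m=3$ (so $k=1$ and capacity $3$), that every instance admits exactly one complete filling and that these fillings differ on the common prefix. It is a sanity check under these assumed parameters, not part of the proof.

```python
from fractions import Fraction as F
from itertools import combinations

def instance_family(m, eps):
    """Yield (sizes, capacity) for every admissible choice of the subset S."""
    k = F(m, 2) - eps * m
    top = 2 * eps * m + 1                      # must be a power of two
    assert k.denominator == 1 and top.denominator == 1
    k, top = int(k), int(top)
    count = top.bit_length() - 1               # log(2*eps*m + 1)
    assert 2**count == top
    powers = [F(2**i) for i in range(count)]   # items of sizes 1, 2, 4, ...
    M = [1 + F(1, 2**i) for i in range(1, m + 1)]
    capacity = m - k + 1
    for r in range(k, m - k + 1):
        for S in combinations(M, r):
            final = 1 - sum((s - 1 for s in S), F(0))
            yield powers + M + [final], capacity

def full_packings(sizes, capacity):
    """All index sets whose total size equals the capacity exactly."""
    return [P for r in range(len(sizes) + 1)
            for P in combinations(range(len(sizes)), r)
            if sum((sizes[i] for i in P), F(0)) == capacity]

family = list(instance_family(3, F(1, 6)))
solutions = [full_packings(sizes, cap) for sizes, cap in family]
print(len(family), all(len(s) == 1 for s in solutions))
```

For these parameters there are $\binom{3}{1}+\binom{3}{2}=6$ instances, matching the count $\sum_{i=k}^{m-k}\binom{m}{i}$ used above.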

It remains to prove the mentioned tail bound, for which we use the standard Stirling bounds [23], which—in a simple form—are $\sqrt{2\pi n}(n/e)^{n}\leq n!\leq e\sqrt{n}(n/e)^{n}$ .

We obtain

	$\displaystyle\sum_{i=0}^{k}\binom{m}{i}$	$\displaystyle=1+\sum_{i=1}^{k}\binom{m}{i}$
		$\displaystyle\leq 1+k\binom{m}{k}$
		$\displaystyle\leq 1+\frac{m!k}{k!(m-k)!}$
		$\displaystyle\leq 1+\frac{ek\sqrt{m}(m/e)^{m}}{\sqrt{2\pi k}(k/e)^{k}\sqrt{2\pi(m-k)}((m-k)/e)^{m-k}}$
		$\displaystyle=1+\frac{ek}{2\pi}\frac{\sqrt{m}}{\sqrt{k(m-k)}}\frac{m^{m}}{k^{k}(m-k)^{m-k}}\frac{e^{k}e^{m-k}}{e^{m}}$
		$\displaystyle=1+\frac{ek}{2\pi}\sqrt{\frac{m}{k(m-k)}}\left(\frac{m}{k}\right)^{k}\left(\frac{m}{m-k}\right)^{m-k}$
Since $k=m/2-\varepsilon m=m(1/2-\varepsilon)$, we have $m-k=m(1/2+\varepsilon)$. Using this and the binomial identity $(1/2-\varepsilon)(1/2+\varepsilon)=1/4-\varepsilon^{2}$ twice, we obtain the following bound.
	$\displaystyle\sum_{i=0}^{k}\binom{m}{i}$	$\displaystyle\leq 1+\frac{ek}{2\pi}\frac{1}{\sqrt{m(1/4-\varepsilon^{2})}}\left(\frac{1}{1/2-\varepsilon}\right)^{m(1/2-\varepsilon)}\left(\frac{1}{1/2+\varepsilon}\right)^{m(1/2+\varepsilon)}$
Since $H(p)=p\log(1/p)+(1-p)\log(1/(1-p))$, we have $2^{H(p)}=\left(\frac{1}{p}\right)^{p}\left(\frac{1}{1-p}\right)^{1-p}$ and thus
	$\displaystyle\sum_{i=0}^{k}\binom{m}{i}$	$\displaystyle\leq 1+\frac{ek}{2\pi}\frac{1}{\sqrt{m(1/4-\varepsilon^{2})}}2^{mH(1/2-\varepsilon)}.$
Since $e/(2\pi)<3$ and $\varepsilon\leq 1/3$ implies $1/4-\varepsilon^{2}\geq 5/36>1/9$, we can continue as follows.
	$\displaystyle\sum_{i=0}^{k}\binom{m}{i}$	$\displaystyle\leq 1+3k\frac{1}{\sqrt{m}/3}2^{mH(1/2-\varepsilon)}$
		$\displaystyle\leq 1+\frac{9k}{\sqrt{m}}2^{mH(1/2-\varepsilon)}$
		$\displaystyle=1+9\sqrt{m}(1/2-\varepsilon)2^{mH(1/2-\varepsilon)}$
		$\displaystyle\leq 1+(9/2)\sqrt{m}\cdot 2^{mH(1/2-\varepsilon)}-9\varepsilon\sqrt{m}\cdot 2^{mH(1/2-\varepsilon)}$
For any given, positive $\varepsilon<1/2$ , we have $H(1/2-\varepsilon)>0$ . Choosing $m\geq 9^{2}$ sufficiently large yields the desired tail bound:
	$\displaystyle\sum_{i=0}^{k}\binom{m}{i}$	$\displaystyle\leq 1+(m/2)\cdot 2^{mH(1/2-\varepsilon)}-9\varepsilon\sqrt{m}\cdot 2^{mH(1/2-\varepsilon)}$
		$\displaystyle\leq(m/2)\cdot 2^{mH(1/2-\varepsilon)}$
		$\displaystyle=2^{mH(1/2-\varepsilon)+\log m-1}$

This concludes the proof of the theorem.
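The tail bound just derived can be sanity-checked numerically. The following is a minimal sketch, assuming $\varepsilon=1/4$ (so that $k=m/4$ is an integer for the chosen $m$) and base-2 logarithms; the function names are illustrative only.

```python
from math import comb, log2


def binary_entropy(p: float) -> float:
    """H(p) = p*log2(1/p) + (1-p)*log2(1/(1-p)) for 0 < p < 1."""
    return p * log2(1 / p) + (1 - p) * log2(1 / (1 - p))


def tail_bound_holds(m: int, k: int) -> bool:
    """Check sum_{i<=k} C(m,i) <= 2^(m*H(k/m) + log2(m) - 1) for 0 < k < m/2."""
    lhs = sum(comb(m, i) for i in range(k + 1))  # exact integer arithmetic
    rhs_exponent = m * binary_entropy(k / m) + log2(m) - 1
    # Compare exponents instead of the (huge) numbers themselves.
    return log2(lhs) <= rhs_exponent


# Assumed sample values: eps = 1/4, hence k = m(1/2 - eps) = m/4.
for m in (100, 200, 400, 800):
    print(m, tail_bound_holds(m, m // 4))
```

Comparing the exponents rather than the two sides directly avoids floating-point overflow for large $m$.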

4.2 A Single Advice Bit

The previous section covered the upper end of the advice spectrum, showing that, asymptotically, reading one advice bit for each item in the instance is necessary and sufficient for ensuring an optimal solution. We now turn to the other extreme and ask what can be done with the least nonzero amount of advice, one single bit for the entire instance.

First, we describe a very simple $3/2$ -competitive advice algorithm where a single advice bit indicates whether there is an optimal solution containing more than one item from the interval $[1/3,2/3]$ : If the answer is yes, the algorithm maintains the smallest item in this interval until a second item fits in, while ignoring all items outside of the interval. As soon as a second item fits, it is packed and all remaining items are rejected. If the answer is no, the algorithm maintains in the knapsack the largest item of size at least $1/3$ seen so far while packing all items smaller than $1/3$ as long as they fit. If the knapsack capacity is never exceeded, the solution is optimal. If the knapsack capacity is exceeded at some point, all packed items but possibly one are smaller than $1/3$ . Discard these items one by one, in arbitrary order, until we are within the capacity of the knapsack again. The remaining gap is at most $1/3$ .
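The two branches of this simple algorithm can be sketched as follows. This is an illustrative reading of the description above, not a reference implementation: the function name `one_bit_pack`, the use of exact fractions, and the choice to discard the most recently packed small item first are assumptions.

```python
from fractions import Fraction


def one_bit_pack(sizes, advice_yes):
    """Return the sizes kept at the end for a knapsack of capacity 1."""
    lo, hi = Fraction(1, 3), Fraction(2, 3)
    if advice_yes:
        kept = None
        for s in sizes:
            if not (lo <= s <= hi):
                continue              # ignore items outside [1/3, 2/3]
            if kept is None:
                kept = s
            elif kept + s <= 1:
                return [kept, s]      # second item fits: pack it, reject the rest
            else:
                kept = min(kept, s)   # maintain the smallest item seen so far
        return [] if kept is None else [kept]
    big, small = None, []
    for s in sizes:
        if s < lo:
            # Items smaller than 1/3 are packed as long as they fit.
            if (big or 0) + sum(small) + s <= 1:
                small.append(s)
        elif big is None or s > big:
            big = s                   # maintain the largest item of size >= 1/3
            while big + sum(small) > 1:
                small.pop()           # discard small items one by one
    return ([big] if big is not None else []) + small


F = Fraction
print(one_bit_pack([F(1, 2), F(3, 5), F(2, 5), F(1, 10)], True))
print(one_bit_pack([F(1, 2), F(3, 10), F(3, 5), F(1, 5)], False))
```

Both sample instances have an optimal filling of $1$, and the sketch reaches a filling of $9/10$ on each, which is within the claimed factor of $3/2$.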

Han et al. [11, Thm. 6] have presented a randomized algorithm that relies on a partition of the items into six size classes. It is rather involved and hard to analyze, yet yields an expected competitive ratio of $10/7\approx 1.428571$ . Because it uses only a single random bit, it provides an upper bound for our case of one advice bit as well. We undercut this bound with a more manageable $\sqrt{2}$ -competitive algorithm that needs only five classes. We then complement this with a lower bound of $(1+\sqrt{17})/4=4/(\sqrt{17}-1)\approx 1.2808$ .

Theorem 4.7.

There is a $\sqrt{2}$ -competitive algorithm for PropRemKnap reading only one advice bit.

Figure 2: The partition of the interval $(0,1]$ of possible sizes into the five subintervals used in the proof of Theorem 4.7, namely $(0,a]$, $(a,b]$, $(b,c]$, $(c,d]$, and $(d,1]$, plus the corresponding class names. The values are $a=1-1/\sqrt{2}\approx 0.293$, $b=\sqrt{2}-1\approx 0.414$, $c=1/2$, and $d=1/\sqrt{2}\approx 0.707$.

Proof 4.8.

We split the interval $(0,1]$ of possible sizes into subintervals at four points $a<b<c<d$ . We will call the items with sizes in one of these five intervals tiny, small, medium, big, and huge, respectively. Formally, we partition the items into the five classes

	$\displaystyle P_{\textnormal{tiny}}$	$\displaystyle=\{\,i\mid 0<s(i)\leq a\,\},$	$\displaystyle P_{\textnormal{small}}$	$\displaystyle=\{\,i\mid a<s(i)\leq b\,\},$	$\displaystyle P_{\textnormal{medium}}$	$\displaystyle=\{\,i\mid b<s(i)\leq c\,\},$
	$\displaystyle P_{\textnormal{big}}$	$\displaystyle=\{\,i\mid c<s(i)\leq d\,\},$	$\displaystyle P_{\textnormal{huge}}$	$\displaystyle=\{\,i\mid d<s(i)\leq 1\,\},$

where $a=1-1/\sqrt{2}\approx 0.29289$ , $b=\sqrt{2}-1\approx 0.41421$ , $c=1/2$ , and $d=1/\sqrt{2}\approx 0.70711$ .

We will call the small and medium items the little ones collectively and refer to the big and huge items as the large ones. Accordingly, we let $P_{\textnormal{little}}=P_{\textnormal{small}}\cup P_{\textnormal{medium}}$ and $P_{\textnormal{large}}=P_{\textnormal{big}}\cup P_{\textnormal{huge}}$ . See Figure 2 for an illustration of the subintervals and class names.

The oracle uses the one available advice bit to tell the algorithm which of the two strategies described below to apply. For the decision, the oracle picks an arbitrary optimal solution $S$ to the given instance. If $S$ contains a large item, the first strategy will be chosen, with one exception: If the instance contains no huge item but a little and a big item that fit into the knapsack together, then the first strategy is chosen only if a minimal big item appears in the instance before a minimal little item. In all other cases, the second strategy is implemented.

Strategy One: If at any point a huge item appears, the algorithm packs it and keeps it until the end, discarding everything else.

Otherwise, the algorithm operates with two slots, a primary one and a secondary one. In the primary slot, it maintains the minimal big item and in the secondary slot it maintains the minimal little item. The primary slot takes precedence; that is, in case of a conflict where a new minimal item for one slot is presented that does not fit with the minimal item in the other slot, we discard the little item.

While maintaining the slot contents, tiny items are always packed greedily. If at any point a presented tiny item does not fit, the current contents of the knapsack are frozen and kept as they are until the instance has ended. The same happens after a step in which only tiny items have been discarded.
Strategy Two: This strategy manages not only two but three slots, all of which maintain minimal items of some class. In order of precedence, the primary slot maintains up to two medium items, the secondary slot up to three small items, and the tertiary slot one big item. As an exception, if at any point a big item appears that can be packed alongside a currently packed small item by discarding everything else, then this is done and these two items are kept till the end. The tiny items are handled as before: They are packed greedily and if either a presented tiny item does not fit or only tiny items have been discarded in one step, then the current knapsack configuration is kept up to the very end.

We now need to carefully work through a case distinction according to the conditions listed in Table 1 and show that the algorithm’s competitivity is indeed bounded from above by $\max\{1/d,d/c,1/(2b),1/(a+b),1/(1-a),b/a\}=\sqrt{2}$.
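That the maximum of these six expressions equals $\sqrt{2}$ for the chosen split points can be confirmed with a few lines of arithmetic. The sketch below uses floating-point numbers, so equality is checked only up to rounding.

```python
from math import isclose, sqrt

# Split points as defined in the proof of Theorem 4.7.
a = 1 - 1 / sqrt(2)
b = sqrt(2) - 1
c = 1 / 2
d = 1 / sqrt(2)

ratios = {
    "1/d": 1 / d,
    "d/c": d / c,
    "1/(2b)": 1 / (2 * b),
    "1/(a+b)": 1 / (a + b),
    "1/(1-a)": 1 / (1 - a),
    "b/a": b / a,
}
for name, value in ratios.items():
    print(f"{name:8s} = {value:.5f}")

# All terms are at most sqrt(2); only 1/(2b) is strictly smaller.
worst = max(ratios.values())
print(isclose(worst, sqrt(2)))
```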

Table 1: The mutually exclusive cases considered in Theorem 4.7.

Case	Strategy	Competitivity	Case Conditions
A	One	$1/d$	$|P_{\textnormal{huge}}|>0$
B	One/Two	$d/c$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|>0$	$|P_{\textnormal{medium}}|\leq 1$
C	Two	$1/(2b)$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|\geq 0$	$|P_{\textnormal{medium}}|>1$
D	Two	$1/(a+b)$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|=1$	$|P_{\textnormal{small}}|>0$
E	Two	$b/a$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|=0$	$|P_{\textnormal{small}}|>0$
F	Two	$1/(1-a)$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|\leq 1$	$|P_{\textnormal{small}}|=0$

Case A.

This case is trivial: If there are huge items in the instance, the first one will be packed and kept in the knapsack, yielding a competitive performance of $1/d$ or better.

Case B.

The case condition $|S\cap P_{\textnormal{big}}|>0$ tells us that the optimal solution contains at least one big item. Since big items have a size above $c=1/2$ , it contains exactly one. Moreover, there is at most one medium item in the entire instance. We now consider two subcases.

Subcase B1: Assume first that the instance contains no pair of a little and a big item that can be packed at the same time. This means that the first strategy, which gives preference to big items over little ones, is operative. The algorithm will thus discard a possibly packed little item only when the first big item appears and replaces it. The strategy guarantees that one big item of size greater than $c$ is contained in the knapsack in the end. If there are no tiny items, both the online solution and the optimal solution $S$ contain exactly one big item and nothing else, implying a competitive performance of $d/c$ or better. For the case that there are tiny items, recall that they are always packed greedily. Moreover, if any tiny item does not fit or is dismissed in some step, the algorithm conserves the current state of the knapsack, guaranteeing a filling of at least $1-a$. We may therefore assume that every tiny item is packed and kept in the online solution computed here. Whatever tiny items are contained in the optimal solution $S$ are thus in the online solution as well. If they have a total size $t$, the competitive performance is thus bounded from above by $(d+t)/(c+t)\leq d/c$.

Subcase B2: Assume that there are a little and a big item that can be packed alongside each other. Let $i$ be the first minimal little item and $j$ the first minimal big item. Clearly, $i$ and $j$ fit into the knapsack together. If the knapsack contains these two items in the end, the competitive performance is $1/(a+c)\leq 1/(a+b)=d/c$ or better.

Subcase B2a: Assume that $i$ appears after $j$ . In this case, the first strategy is used. Since it maintains the smallest big item seen so far in the primary slot, it will have $j$ packed when $i$ is presented, allowing for $i$ to be packed alongside it.

Subcase B2b: Assume that $i$ appears before $j$ , letting the algorithm implement the second strategy. The chosen strategy maintains up to two minimal medium items in the primary slot and up to three minimal small items in its secondary slot. However, there is by the global assumption of case $B$ at most one medium item in the entire instance and a small item will always fit in beside a medium item because $b+c\leq 1$ . Hence, overall, a minimal little item is maintained in the knapsack, meaning that $i$ is packed already when $j$ appears, allowing for a little and big item to be packed together according to the strategy’s stated exception.

Case C.

This simple case is covered by the second strategy. Maintaining two medium items in the slot with highest precedence guarantees a filled fraction of at least $2b$ .

Case D.

This case is easy as well: The one medium item will be packed into the primary slot when presented, and at least one small item is packed into and stays in the secondary slot because any small item fits in beside any medium item. The total size of a small and a medium item is at least $a+b$, leading to a competitive performance of $1/(a+b)$ or better.

Case E.

Since there are no medium items in this case, the secondary slot will maintain up to three minimal small items. If there are three small items that fit together, they will eventually be packed, yielding a filled fraction of at least $3a>a/b$ . Otherwise, two small items will be packed, or only one if and only if it is the only one. If there were no tiny items, we could in both cases bound the competitivity by $b/a$ , using the maximal and minimal possible size of a small item. However, repeating the argument of subcase B1, we may assume that all tiny items are packed and none discarded, otherwise the packing would freeze instantly with a filling of at least $1-a$ . If the tiny items have a total size of $t$ , our bound would therefore only improve to $(b+t)/(a+t)\leq b/a$ .

Case F.

This case is quite simple again. If there is a medium item, it is always packed and kept to the end. Besides this one potential medium item, there are only tiny ones; the minimization in the slot will therefore not affect the result adversely. If all items of the instance fit into the knapsack together, they are all packed and the solution is optimal. Otherwise, the greedy packing of tiny items leaves a gap of less than $a$, ensuring a competitive factor of $1/(1-a)$ or better.

We now complement the upper bound of Theorem 4.7 with a lower bound of $(1+\sqrt{17})/4=4/(\sqrt{17}-1)\approx 1.2808$ .

Theorem 4.9.

No algorithm for PropRemKnap reading only a single advice bit can have a better competitive ratio than $(1+\sqrt{17})/4$ .

Proof 4.10.

Let $\psi=4/(1+\sqrt{17})$ and choose a positive $\varepsilon<\psi\approx 0.7808$. Let an algorithm for PropRemKnap reading only a single advice bit be given. Consider the three instances $I_{1}$, $I_{2}$, and $I_{3}$ that all start with the same three items of sizes $x_{1}=\psi$, $x_{2}=\psi^{2}$, and $x_{3}=1-\psi^{2}+\varepsilon$. Instance $I_{1}$ ends here, whereas $I_{2}$ continues with a last item of size $y_{2}=1-\psi^{2}$ and $I_{3}$ with a last item of size $y_{3}=\psi^{2}-\varepsilon$.

Table 2: A hard instance family for PropRemKnap reading one advice bit; see the proof of Theorem 4.9.

	$x_{1}$	$x_{2}$	$x_{3}$	$y_{2}$	$y_{3}$	optimal	second best	ratio
$I_{1}$ :	$\psi$	$\psi^{2}$	$1-\psi^{2}+\varepsilon$			$\psi$	$\psi^{2}$	$\psi/\psi^{2}$
$I_{2}$ :	$\psi$	$\psi^{2}$	$1-\psi^{2}+\varepsilon$	$1-\psi^{2}$		$1$	$2(1-\psi^{2})+\varepsilon$	$1/(2(1-\psi^{2})+\varepsilon)$
$I_{3}$ :	$\psi$	$\psi^{2}$	$1-\psi^{2}+\varepsilon$		$\psi^{2}-\varepsilon$	$1$	$\psi$	$1/\psi$

For each $i\in\{1,2,3\}$ , the instance $I_{i}$ has a unique optimal solution; it contains $x_{i}$ and, except for $i=1$ , additionally $y_{i}$ . Table 2 shows the total size for each of these optimal solutions and the second best solution.

Because any two of these three items $x_{1}$, $x_{2}$, and $x_{3}$ sum up to over 1, the advice algorithm can keep at most one of them in the knapsack after the presentation of $x_{3}$. Moreover, since only one advice bit is given and the three instances are indistinguishable until after the decision on the third item has been taken, there are two instances for which the same item, if any, is kept in the knapsack for the presentation of the potential fourth item. This implies that the algorithm is suboptimal for at least one instance. The second best solutions for $I_{1}$, $I_{2}$, and $I_{3}$ fill up a fraction $\psi^{2}$, $2(1-\psi^{2})+\varepsilon$, and $\psi$, respectively. Thus, the competitive ratio of the algorithm cannot be better than the minimum of $\psi/\psi^{2}$, $1/(2(1-\psi^{2})+\varepsilon)$, and $1/\psi$. Since $2(1-\psi^{2})=\psi$, this means the competitivity is $1/(\psi+\varepsilon)$ at best for arbitrarily small $\varepsilon$.
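The optimal and second best fillings listed in Table 2 can be reproduced by brute force over all subsets. The sketch below assumes the sample value $\varepsilon=0.01$ and the final item sizes $y_{2}=1-\psi^{2}$ and $y_{3}=\psi^{2}-\varepsilon$ from Table 2; the tolerance and the helper name are illustrative choices.

```python
from itertools import combinations
from math import isclose, sqrt

TOL = 1e-12  # tolerance for floating-point feasibility checks


def two_best_fillings(sizes):
    """Return the largest and second-largest distinct feasible fillings."""
    totals = sorted(
        {
            round(sum(subset), 12)
            for r in range(1, len(sizes) + 1)
            for subset in combinations(sizes, r)
            if sum(subset) <= 1 + TOL
        },
        reverse=True,
    )
    return totals[0], totals[1]


psi = 4 / (1 + sqrt(17))
eps = 0.01  # assumed sample value
x = [psi, psi**2, 1 - psi**2 + eps]
instances = {
    "I1": x,
    "I2": x + [1 - psi**2],
    "I3": x + [psi**2 - eps],
}

ratios = {}
for name, sizes in instances.items():
    best, second = two_best_fillings(sizes)
    ratios[name] = best / second
    print(name, round(best, 4), round(second, 4), round(ratios[name], 4))

print(isclose(2 * (1 - psi**2), psi))  # the identity used in the proof
```

The smallest of the three ratios is the one for $I_{2}$, namely $1/(\psi+\varepsilon)$, which approaches $1/\psi=(1+\sqrt{17})/4$ as $\varepsilon$ tends to zero.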

Note again that an advice bit is at least as powerful as a random bit, hence Theorem 4.9 also improves the best known lower bound of $5/4$ for one random bit due to Han et al. [11, Thm. 8].

4.3 Near Optimality with Constant Advice

Having seen how much advice is necessary for optimality and what the effect of a single advice bit can be, we now address the entire range in between. For this, we prove the following generalization of Theorem 4.9.

Theorem 4.11.

Let an arbitrary integer $k>1$ be given. No algorithm for PropRemKnap reading at most $\log k$ advice bits can achieve a better competitive ratio than $4/(3-2k+\sqrt{4k(k+1)-7})$ .

Proof 4.12.

We generalize the hard instance family from the proof of Theorem 4.9. Let an arbitrary integer $k>1$ be given and define $\zeta$ as the positive root of $2\zeta^{2}+(2k-3)\zeta-2(k-1)$, namely $\zeta=(3-2k+\sqrt{4k(k+1)-7})/4$. Consider $k+1$ instances that all start with the same $k+1$ items of the following, decreasing sizes: first $x_{1}=\zeta$, then $x_{i}=\zeta^{2}-(i-2)(1-\zeta)$ for every $i\in\{2,\dots,k\}$, and then $x_{k+1}=\zeta^{2}-(k-1)(1-\zeta)+\varepsilon$ for an arbitrary $\varepsilon$ satisfying $0<\varepsilon<1-\zeta$. The instance $I_{1}$ ends immediately after these common items, whereas the instances $I_{i}$, for $i\in\{2,\dots,k+1\}$, present one additional item of size $y_{i}=1-x_{i}$ as the final one. There is a unique optimal solution for each instance: For $I_{1}$, it is to pack the first item of size $x_{1}=\zeta$. For $I_{i}$ with $i>1$, it is to pack the item of size $x_{i}$ and the last one of size $y_{i}=1-x_{i}$, which sum up to the optimal solution value $1$. Since there are only $\log k$ advice bits available to handle the $k+1$ instances, at least two instances $I_{i}$ and $I_{j}$ with $i<j$ are processed with the same advice string and thus the same deterministic algorithm. Consider this algorithm and the moment after seeing and taking decisions on the first $k+1$ items. It is impossible for the algorithm to have more than one of these common items packed since the two smallest of them already have a combined size of $x_{k}+x_{k+1}=2\zeta^{2}+(2k-3)\zeta-2k+3+\varepsilon=1+\varepsilon$. Now, if item $i$ is packed at the considered moment, the algorithm will perform suboptimally on instance $I_{j}$. Analogously, if item $j$ is packed, the performance on instance $I_{i}$ is suboptimal.

Now it suffices to check that the best suboptimal solution has a filling of at most $\zeta^{2}$ for $I_{1}$, at most $\zeta$ for $I_{2}$, …, $I_{k}$, and at most $\zeta+\varepsilon$ for $I_{k+1}$. This leads to a performance ratio that is $\zeta/\zeta^{2}=1/\zeta$ or $1/(\zeta+\varepsilon)$ at best, depending on the concrete algorithm, thus proving the theorem. See Table 3 for an overview of the hard instance family. The best and second best solutions to all instances and their associated performances are listed in Table 4.
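The construction can be checked numerically for concrete values of $k$. The following sketch assumes $\varepsilon=0.01$, which satisfies $0<\varepsilon<1-\zeta$ for the sampled $k$, and uses the sizes of the common items as listed in Table 3. It verifies that the closed form of $\zeta$ is a root of $2\zeta^{2}+(2k-3)\zeta-2(k-1)$, which is the polynomial that makes $x_{k}+x_{k+1}=1+\varepsilon$ hold, and that the sizes are decreasing.

```python
from math import isclose, sqrt


def zeta(k: int) -> float:
    """Closed form of zeta from the proof of Theorem 4.11."""
    return (3 - 2 * k + sqrt(4 * k * (k + 1) - 7)) / 4


def common_items(k: int, eps: float) -> list:
    """Sizes x_1, ..., x_{k+1} of the common prefix, as listed in Table 3."""
    z = zeta(k)
    xs = [z] + [z**2 - (i - 2) * (1 - z) for i in range(2, k + 1)]
    xs.append(z**2 - (k - 1) * (1 - z) + eps)
    return xs


def check_family(k: int, eps: float) -> bool:
    z = zeta(k)
    xs = common_items(k, eps)
    is_root = abs(2 * z**2 + (2 * k - 3) * z - 2 * (k - 1)) < 1e-9
    decreasing = all(u > v for u, v in zip(xs, xs[1:]))
    # The two smallest common items together exceed the capacity by eps.
    overflow = isclose(xs[-2] + xs[-1], 1 + eps)
    return is_root and decreasing and overflow


for k in (2, 4, 8):
    print(k, round(zeta(k), 4), check_family(k, 0.01))
```

For $k=2$ the common items coincide with those of the single-bit family, since $\zeta=\psi$ and $\zeta^{2}-(1-\zeta)=1-\zeta^{2}$.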

Table 3: Hard instance family for PropRemKnap reading at most $\log k$ advice bits, where $\zeta=(3-2k+\sqrt{4k(k+1)-7})/4$; see the proof of Theorem 4.11.

	$x_{1}$	$x_{2}$	$\cdots$	$x_{k}$	$x_{k+1}$	$x_{k+2}=y_{j}$
$I_{1}$ :	$\zeta$	$\zeta^{2}$	$\cdots$	$\zeta^{2}-(k-2)(1-\zeta)$	$\zeta^{2}-(k-1)(1-\zeta)+\varepsilon$	None
$I_{2}$ :	$\zeta$	$\zeta^{2}$	$\cdots$	$\zeta^{2}-(k-2)(1-\zeta)$	$\zeta^{2}-(k-1)(1-\zeta)+\varepsilon$	$1-x_{2}$
$\,\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$
$I_{k+1}$ :	$\zeta$	$\zeta^{2}$	$\cdots$	$\zeta^{2}-(k-2)(1-\zeta)$	$\zeta^{2}-(k-1)(1-\zeta)+\varepsilon$	$1-x_{k+1}$

Table 4: The values of the optimal and best suboptimal solutions for each instance in the family given in Table 3 and the resulting competitive performance; see the proof of Theorem 4.11.

	Optimal value	Best suboptimal value	Best suboptimal performance
$I_{1}$ :	$x_{1}=\zeta$	$x_{2}=\zeta^{2}$	$\zeta/\zeta^{2}$
$I_{2}$ :	$x_{2}+y_{2}=1$	$x_{1}=\zeta$	$1/\zeta$
$I_{3}$ :	$x_{3}+y_{3}=1$	$x_{2}+y_{3}=\zeta$	$1/\zeta$
$\,\vdots$	$\vdots$	$\vdots$	$\vdots$
$I_{k}$ :	$x_{k}+y_{k}=1$	$x_{k-1}+y_{k}=\zeta$	$1/\zeta$
$I_{k+1}$ :	$x_{k+1}+y_{k+1}=1$	$x_{k}+y_{k+1}=\zeta+\varepsilon$	$1/(\zeta+\varepsilon)$

We remark that Theorem 4.11 and its analogue for RemKnap instead of PropRemKnap, Theorem 5.7, improve upon the best known lower bounds implied by Han et al.’s results on the resource buffer model [13, Thms. 17 and 6]. In this model, the online algorithm may use a knapsack of some increased capacity $R>1$ , but only until the instance ends, at which point it has to choose from the reserved items a selection that fits a knapsack of capacity one. A resource buffer of some natural size $R$ allows us to simulate any algorithm using up to $\log R$ advice bits: We think of the resource buffer as split into $R$ knapsacks of capacity $1$ , allowing us to accommodate the items stored by the advice algorithm for every possible advice string simultaneously.

Instantiating Theorem 4.11 with $k=2^{1}$ , $k=2^{2}$ , and $k=2^{3}$ , for example, we obtain the lower bounds

	$\displaystyle 4/(\sqrt{17}-1)\approx{}$	$\displaystyle 1.2808,$	$\displaystyle 4/(\sqrt{73}-5)\approx{}$	$\displaystyle 1.1287,$	and	$\displaystyle 4/(\sqrt{281}-13)\approx{}$	$\displaystyle 1.0630$
for one, two, and three advice bits, respectively, while the lower bounds provided by Han et al. [13, Thm. 6] for a resource buffer of the corresponding size $R=k$ are
	$\displaystyle 6/5={}$	$\displaystyle 1.2,$	$\displaystyle 10/9\approx{}$	$\displaystyle 1.1111,$	and	$\displaystyle 18/17\approx{}$	$\displaystyle 1.0588.$

Clearly, the lower bound of Theorem 4.11 tends to $1$ for increasingly large but still constant advice.
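To illustrate this convergence, the bound of Theorem 4.11 can be evaluated for $k=2^{j}$, that is, for $j$ advice bits. In the sketch below, the three resource buffer values are copied from the text above for comparison; the function name is an illustrative assumption.

```python
from math import sqrt


def advice_lower_bound(k: int) -> float:
    """Lower bound of Theorem 4.11 for at most log2(k) advice bits."""
    return 4 / (3 - 2 * k + sqrt(4 * k * (k + 1) - 7))


# Resource buffer bounds quoted in the text for one, two, and three bits.
han_bounds = {1: 6 / 5, 2: 10 / 9, 3: 18 / 17}

for bits in range(1, 11):
    bound = advice_lower_bound(2**bits)
    reference = han_bounds.get(bits)
    note = f"(resource buffer: {reference:.4f})" if reference else ""
    print(bits, f"{bound:.4f}", note)
```

The printed values decrease toward $1$ as the number of advice bits grows, while staying above the quoted resource buffer bounds for the first three rows.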

With our most surprising result for the proportional knapsack problem, Theorem 4.15, we will prove that the true competitive ratio displays the same general behavior as the lower bound of Theorem 4.11: For any given $\varepsilon>0$ , we can guarantee a competitive ratio of $1+\varepsilon$ with a constant number of advice bits. It is of course also possible to derive more specific upper bounds for very few advice bits such as the following one.

Theorem 4.13.

There is a $4/3$ -competitive algorithm for PropRemKnap reading two advice bits.

Proof 4.14.

The algorithm operates in one of four modes, depending on the given advice.

We make the following case distinction that primarily depends on how many items falling into the size interval $[1/4,3/4]$ —we call them medium items—appear in the optimal solutions.

Strategy One.

This strategy is chosen if there is any item larger than $3/4$ or if there is an optimal solution containing either no or only one medium item. The algorithm maintains one maximal medium item while packing the smaller items greedily. As an exception, when an item larger than $3/4$ appears, it is packed and kept to the end, discarding everything else.

This procedure obviously produces a $(4/3)$-competitive solution if there is an item larger than $3/4$. Otherwise, the knapsack will be filled optimally if all items fit into the knapsack together. In the remaining case, the greedy packing will leave a gap of at most $1/4$ because the algorithm will never displace a medium item in favor of a smaller one, thus removing only items smaller than $1/4$.

Strategy Two.

This strategy can be chosen if there is an optimal solution containing at least three medium items or if the two minimal medium items in the instance have a combined size of $3/4$ or more. The algorithm maintains as long as possible the minimal three medium items from the interval among everything seen so far. If a third medium item does not fit at some point, the algorithm switches to maintaining only the two minimal medium items. This clearly yields a filling of at least $3/4$ .

Strategy Three.

This strategy works if there is an optimal solution containing two items from the size interval $[1/4,1/2]$. It maintains two maximal such items, which is always possible, and packs the smaller items greedily. This either yields an optimal solution or one with a gap of at most $1/4$.

Strategy Four.

This strategy is chosen in the remaining case. Specifically, we may now assume that every optimal solution to the given instance contains exactly two medium items, one of which has a size greater than $1/2$ .

In this case, the algorithm maintains on the one hand a minimal medium item and on the other hand a minimal item larger than $1/2$. Clearly, this fills the knapsack to a total size of more than $1/4+1/2=3/4$.

We now turn to our main result for the proportional knapsack problem, which complements Theorem 4.11 with an upper bound. Theorem 5.9 will generalize this result to the general version where an item’s size may differ from its value, albeit with a far more complicated proof. To make it as easily understandable as possible, we first present here the proof for the simple variant, which introduces the idea of slots that are reserved for items with certain properties. This will serve as a useful foundation for the proof of the general variant, which also makes use of such a slot system, although merely as one of many components.

Theorem 4.15.

For any $\varepsilon>0$ , there is a strictly $(1+\varepsilon)$ -competitive algorithm for PropRemKnap reading a constant number of advice bits.

Proof 4.16.

We describe such an algorithm called PropPack; see Algorithm 1 for a pseudo-code implementation. We begin by describing the advice communicated to PropPack with a constant number of bits, then explain how the algorithm operates on this advice, prove that it is correct and terminates, and finally analyze its competitive ratio.

Notions and Notation.

Without loss of generality, we assume that all items have size at most 1 and that $\varepsilon\leq 1/2$ . We define the constant $K=\lceil\log_{1-\varepsilon/2}\varepsilon/2\rceil$ .

Let an instance with $n$ items be given. Denote the items in the order of their appearance in the instance by $1,2,\dots,n$ and denote the size of item $i$ by $s(i)$ . We divide the $n$ items into small and big ones, with $\delta=(1-\varepsilon/2)^{K}$ serving as the dividing line: $C_{\textnormal{small}}=\{\,i\mid s(i)\leq\delta\,\}$ and $C_{\textnormal{big}}=\{\,i\mid\delta<s(i)\,\}$ . We further partition the big items into the subclasses $C_{k}=\{\,i\mid(1-\varepsilon/2)^{k}<s(i)\leq(1-\varepsilon/2)^{k-1}\,\}\text{ for }k\in\{1,\dots,K\}$ . To alleviate the notation, we will often refer to $C_{k}$ as class $k$ and to $C_{\textnormal{small}}$ as class $0$ . We also use this convention when writing $C(i)$ to indicate the class to which item $i$ belongs: We have $C(i)\in\{0,\dots,K\}$ , with $C(i)=0$ meaning that $i\in C_{\textnormal{small}}$ and $C(i)=k\neq 0$ meaning that $i\in C_{k}$ .
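The class structure can be made concrete with a short sketch. It computes $K$ as the least integer with $(1-\varepsilon/2)^{K}\leq\varepsilon/2$, which agrees with the definition of $K$ above, and uses exact fractions to avoid rounding at class boundaries; the function names are illustrative assumptions.

```python
from fractions import Fraction


def class_parameters(eps: Fraction):
    """Return (K, delta) with delta = (1 - eps/2)^K <= eps/2 and K minimal."""
    q = 1 - eps / 2
    K, delta = 0, Fraction(1)
    while delta > eps / 2:
        delta *= q
        K += 1
    return K, delta


def item_class(size: Fraction, eps: Fraction) -> int:
    """Class C(i) of an item of size at most 1: 0 if small, otherwise the
    index k with (1 - eps/2)^k < size <= (1 - eps/2)^(k-1)."""
    _, delta = class_parameters(eps)
    if size <= delta:
        return 0
    q = 1 - eps / 2
    k, bound = 1, q
    while bound >= size:   # advance until (1 - eps/2)^k < size
        bound *= q
        k += 1
    return k


eps = Fraction(1, 2)
print(class_parameters(eps))
for s in (Fraction(1), Fraction(3, 4), Fraction(3, 10), Fraction(1, 5)):
    print(s, item_class(s, eps))
```

For $\varepsilon=1/2$, this gives $K=5$ and $\delta=(3/4)^{5}=243/1024$; a size exactly on a boundary such as $3/4$ falls into the class whose upper limit it is.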

The oracle chooses an arbitrary but fixed optimal solution $S\subseteq\{1,\dots,n\}$ . We denote the partition classes that are naturally induced by this solution by $S_{\textnormal{small}}=S\cap C_{\textnormal{small}}$ , $S_{\textnormal{big}}=S\cap C_{\textnormal{big}}$ , and $S_{k}=S\cap C_{k}$ for $k\in\{1,\dots,K\}$ . Let $m=|S_{\textnormal{big}}|$ be the number of big items in the optimal solution and denote them by $i_{1}<\ldots<i_{m}$ in order of appearance.

Constant Advice.

The oracle communicates to the algorithm a tuple $(b_{1},\dots,b_{m})$ with the classes of the big items in the chosen optimal solution in order of appearance; that is, we have $b_{j}=C(i_{j})$ for each $j\in\{1,\dots,m\}$ . We remark that this tuple needs to be encoded in a self-delimiting way. A constant number of bits suffices for this because $b_{j}$ is bounded by the constant $K$ for every $j\in\{1,\dots,m\}$ and $m$ is bounded by the constant $1/\delta$ . The latter bound is an immediate consequence of the fact that $s(S_{\textnormal{big}})\leq 1$ and that any big item has a size larger than $\delta$ .
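One possible self-delimiting encoding, given here purely as a hypothetical illustration and not as the encoding used by the authors, writes every class index with a fixed number of bits and appends the unused value $0$ as an end marker. The total length is then bounded by a constant depending only on $K$ and $1/\delta$.

```python
def encode_advice(classes, K):
    """Encode (b_1, ..., b_m) with values in 1..K as a bit string.

    Hypothetical scheme: fixed-width fields, terminated by the value 0.
    """
    width = K.bit_length()        # enough bits for the values 0..K
    fields = list(classes) + [0]  # 0 is not a class of a big item
    return "".join(format(b, f"0{width}b") for b in fields)


def decode_advice(bits, K):
    """Read class indices until the terminating 0 field is found."""
    width = K.bit_length()
    classes = []
    for pos in range(0, len(bits), width):
        value = int(bits[pos:pos + width], 2)
        if value == 0:
            break
        classes.append(value)
    return classes


code = encode_advice([3, 3, 5], K=5)
print(code, decode_advice(code, K=5))
```

Because the decoder stops at the end marker, it works even if further bits follow on the advice tape.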

Algorithm Description.

The algorithm PropPack proceeds in $m$ phases as follows. In every phase, the algorithm opens a new virtual slot within the knapsack that can store exactly one item at a time; multiple items in succession are allowed, however. The slot opened in phase $i$ will accommodate items belonging to class $b_{i}$ exclusively; we say that items from this class match slot $i$ . Slots are never closed, thus there are exactly $m$ of them in the end. Small items are generally packed in a greedy manner and discarded one by one whenever necessary to pack a big item.

In the first phase, the algorithm rejects all big items until one of class $b_{1}$ appears. As soon as this is the case, said item is packed into the first slot, ending the first phase.

In the second phase, the algorithm opens the second slot to pack a matching item, that is, one of class $b_{2}$. It waits for the first item from this class that fits into the knapsack alongside the item in the first slot. As soon as such an item appears, it is packed and the phase ends. In the meantime, whenever an item of class $b_{1}$ appears during the second phase, the algorithm substitutes it for the one stored in the first slot if and only if this reduces the size of the stored item.

In general, phase $i$ begins with the opening of slot $i$, which is reserved for items of class $b_{i}$. The phase continues until an item appears that both matches the newly opened slot and fits in beside the items currently stored in the previously opened and filled slots without exceeding the capacity. Then this item is packed into the new slot, which ends the phase. During the entire phase, the algorithm maintains in all filled slots the smallest matching items seen so far: Whenever the algorithm is presented with a big item that either belongs to a class other than $b_{i}$ or does not fit in alongside the items in the previously opened slots, then the new item replaces a largest item in the matching open slots, unless the new item itself is even larger.

The entire time, even after the last phase has terminated, small items are packed greedily and discarded one by one whenever this is necessary to make room for a big item according to the description above. Moreover, we may assume that, whenever a new item has been packed into the knapsack, the algorithm sorts the items in the matching open slots in increasing order. This sorting is not necessary for the algorithm to fulfill its duty, but it facilitates the proof by induction below.
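The slot mechanism described above can be sketched compactly. The following is an illustrative reading, not the authors' pseudo-code: it assumes that every item arrives as a pair (size, class) with class $0$ for small items, uses integer sizes and an integer capacity to avoid rounding, discards the most recently packed small items first, and omits the optional sorting step. The name `prop_pack` is hypothetical.

```python
def prop_pack(items, advice, capacity):
    """Return (slots, small): the big items kept in the slots and the
    small items kept, for a sequence of (size, class) pairs."""
    slots = []  # slots[j] holds one item of class advice[j]
    small = []  # greedily packed small items
    for size, cls in items:
        if cls == 0:
            if sum(slots) + sum(small) + size <= capacity:
                small.append(size)
            continue
        phase = len(slots)  # index of the currently open, unfilled slot
        fits = sum(slots) + size <= capacity
        if phase < len(advice) and cls == advice[phase] and fits:
            # Make room by discarding small items one by one.
            while sum(slots) + sum(small) + size > capacity:
                small.pop()
            slots.append(size)  # the phase ends
            continue
        # Otherwise, try to improve a filled slot of the same class.
        matching = [j for j in range(phase) if advice[j] == cls]
        if matching:
            largest = max(matching, key=lambda j: slots[j])
            if size < slots[largest]:
                slots[largest] = size
    return slots, small


# Sizes in hundredths of the capacity; classes are given for eps = 1/2.
items = [(20, 0), (50, 3), (45, 3), (50, 3), (10, 0)]
print(prop_pack(items, advice=(3, 3), capacity=100))
```

In this sample run, the optimal solution consists of the two items of size 50, so the advice is $(3,3)$. The sketch fills both slots with the first two items of class 3, discards the small item of size 20 to do so, and ends with a filling of 95.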

Termination of All Phases.

We need to show that PropPack does in fact finish all $m$ phases; that is, all $m$ slots will be filled with a matching item without ever exceeding the knapsack capacity. Consider the big items of the optimal solution, which we denote by $u_{1}<\dots<u_{m}$ in their order of appearance. To ensure the termination of all phases, we prove by induction over $i\leq m$ that, after processing item $u_{i}$ , the first $i$ slots store items with a total size of $s(u_{1})+\dots+s(u_{i})$ or less.

We may start from $i=0$ as the trivial, if degenerate, base case. For the induction step, assume the hypothesis for $i<m$ and observe that no item in a slot is ever replaced by a larger one. Therefore, the items in the first $i$ slots still have a total size of at most $s(u_{1})+\dots+s(u_{i})$ when $u_{i+1}$ is presented.

There are now three possibilities. If slot $i+1$ has remained closed up to this point, it is now opened and filled with $u_{i+1}$ , which fits in because $s(u_{1})+\dots+s(u_{i+1})\leq s(S_{\textnormal{big}})\leq 1$ . Otherwise, slot $i+1$ is already storing an item: If said item is larger than $s(u_{i+1})$ , then $u_{i+1}$ replaces either this item or one that is at least as large. During the subsequent sorting, $u_{i+1}$ is then moved to slot $i+1$ or one of the slots from $1$ to $i$ , which may force some items from slots $1$ through $i$ into higher slots but never beyond slot $i+1$ . The third possibility is that slot $i+1$ contains an item of size at most $s(u_{i+1})$ already. We immediately obtain the induction claim for $i+1$ in all three cases.

Competitive Analysis.

We still denote by $S$ the optimal solution that served as the basis for the given advice, by $T$ the final output of the online algorithm PropPack (Algorithm 1), and the respective partition classes by $S_{\textnormal{small}}$ , $S_{\textnormal{big}}$ , $S_{k}$ and $T_{\textnormal{small}}$ , $T_{\textnormal{big}}$ , and $T_{k}$ .

Since PropPack opens one slot for each big item in the optimal solution $S$ and fills it with an item from the same subclass, as proved above, we have $|S_{k}|=|T_{k}|$ for every $k\in\{1,\dots,K\}$. Moreover, the sizes within a subclass $C_{k}$ vary by a factor of at most $1-\varepsilon/2$; this means that we can bound both $s(S_{\textnormal{big}})$ and $s(T_{\textnormal{big}})$ from below by $L=\sum_{k=1}^{K}|S_{k}|(1-\varepsilon/2)^{k}$ and from above by $L/(1-\varepsilon/2)$. We conclude $s(T_{\textnormal{big}})\geq s(S_{\textnormal{big}})\cdot(1-\varepsilon/2)$.

Furthermore, since small items are packed greedily and only discarded one by one whenever necessary to make room for the big items, we will not lose much from their side either. If the presented small items have a total size of at most $1-L/(1-\varepsilon/2)$, none is ever discarded. In this case, we have $s(T_{\textnormal{small}})\geq s(S_{\textnormal{small}})$ and thus immediately $s(T)\geq s(S)\cdot(1-\varepsilon/2)$. If small items are discarded, however, the worst case is the following type of instance: It starts with only small items of the largest possible size $\delta$, some of which are then discarded to accommodate big items with sizes right at the upper limit for the classes indicated by the advice, leaving a gap of almost $\delta$. It then follows up with slightly smaller big items that are in the optimal solution and would not have led to any discarded small items, and finally presents big items at the lower end of the size span, replacing all previously packed big items.

Even in this worst case, the algorithm remains $(1-\varepsilon/2)$-competitive on the big items, and subtracting the largest possible loss of $\delta$ on the small items yields $s(T)\geq s(S)\cdot(1-\varepsilon/2)-\delta$. By the definition of $\delta$ and $K$ and due to the simple fact that $s(S)$ is at most $1$, we have $\delta=(1-\varepsilon/2)^{K}\leq\varepsilon/2\leq s(S)\cdot\varepsilon/2$. This implies $s(T)/s(S)\geq 1-\varepsilon$, as desired.

Algorithm 1 PropPack

Parameter: Any $\varepsilon\in(0,1/2]$.

Online Input: A sequence $I=(1,\dots,n)$ of $n$ items with sizes $(s_{1},\dots,s_{n})$.

Online Output: A $(1+\varepsilon)$-competitive packing $T=T_{\textnormal{small}}\cup T_{\textnormal{big}}$.

Advice: The sequence $B=(b_{1},\dots,b_{m})$, where $m$ is the number of big items in a fixed optimal solution and $b_{j}$ is the class of the $j$th big item appearing in it.

Algorithm:

1: $k\leftarrow\textbf{next}(B)$ $\triangleright$ Initialize $k$ to class of first big item to be packed.
2: $T_{\textnormal{small}}\leftarrow\emptyset$ $\triangleright$ Initialize $T_{\textnormal{small}}$, set of packed small items, to the empty set.
3: $T_{\textnormal{big}}\leftarrow\emptyset$ $\triangleright$ Initialize $T_{\textnormal{big}}$, set of packed big items, to the empty set.
4: for $i\in I$ do $\triangleright$ For each new item in order of appearance do the following:
5:   if $C(i)=0$ then $\triangleright$ If the new item is small, then check if it …
6:     if $s(T_{\textnormal{small}}\cup T_{\textnormal{big}}\cup\{i\})\leq 1$ then $\triangleright$ … fits in beside everything currently packed; …
7:       $T_{\textnormal{small}}\leftarrow T_{\textnormal{small}}\cup\{i\}$ $\triangleright$ … if it does, then pack it.
8:   else if $C(i)=k$ and $s(T_{\textnormal{big}}\cup\{i\})\leq 1$ then $\triangleright$ If big item of advised class can be fit in, …
9:     while $s(T_{\textnormal{small}}\cup T_{\textnormal{big}}\cup\{i\})>1$ do $\triangleright$ … then, until it actually fits, …
10:       $\textrm{pop}(T_{\textnormal{small}})$ $\triangleright$ … greedily discard small items one by one.
11:     $T_{\textnormal{big}}\leftarrow T_{\textnormal{big}}\cup\{i\}$ $\triangleright$ Now that it actually fits, pack the new big item.
12:     $k\leftarrow\textbf{next}(B)$ $\triangleright$ Update $k$ to class of the next item advised to be packed.
13:   else $\triangleright$ Among big new item and kept ones of same class, remove largest one and …
14:     $T_{\textnormal{big}}\leftarrow(T_{\textnormal{big}}\cup\{i\})\smallsetminus\arg\max\{s(j)\mid j\in\{i\}\cup(C_{C(i)}\cap T_{\textnormal{big}})\}$ $\triangleright$ … pack the rest.
15: return $T_{\textnormal{small}}\cup T_{\textnormal{big}}$ $\triangleright$ Return the current solution after processing the entire input.
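For readers who prefer executable code, the pseudocode of Algorithm 1 can be transcribed almost line by line. The following Python sketch is only an illustration under stated assumptions: the function names `size_class` and `prop_pack` are hypothetical, the class boundaries (class $k$ containing sizes in $[(1-\varepsilon/2)^{k},(1-\varepsilon/2)^{k-1})$, with class $1$ also containing size $1$) are an assumed convention, and `pop` is assumed to discard the most recently packed small item.

```python
from fractions import Fraction


def size_class(size, eps, K):
    """Class of an item: 0 if small, else k with q**k <= size < q**(k-1), q = 1 - eps/2.

    The exact boundary convention is an assumption made for this sketch.
    """
    q = 1 - eps / 2
    if size < q ** K:
        return 0
    k = 1
    while size < q ** k:
        k += 1
    return k


def prop_pack(sizes, advice, eps, K):
    """Follow the pseudocode of Algorithm 1; returns index lists (small, big)."""
    classes = iter(advice)
    k = next(classes, None)            # class of the next advised big item
    small, big = [], []

    def total(indices):
        return sum(sizes[j] for j in indices)

    for i, s in enumerate(sizes):
        c = size_class(s, eps, K)
        if c == 0:
            # Small item: pack it only if it fits beside everything packed.
            if total(small) + total(big) + s <= 1:
                small.append(i)
        elif c == k and total(big) + s <= 1:
            # Advised class: make room by discarding small items, then pack.
            while total(small) + total(big) + s > 1:
                small.pop()            # assumed: drop the most recent small item
            big.append(i)
            k = next(classes, None)
        else:
            # Among the new item and kept items of its class, drop the largest.
            same = [j for j in big if size_class(sizes[j], eps, K) == c]
            largest = max(same + [i], key=lambda j: sizes[j])
            if largest != i:
                big.remove(largest)
                big.append(i)
    return small, big


# Hypothetical toy instance: an optimal solution is {4/5, 1/5}, so the advice is (1).
sizes = [Fraction(9, 10), Fraction(1, 10), Fraction(4, 5), Fraction(1, 5)]
small, big = prop_pack(sizes, advice=[1], eps=Fraction(1, 2), K=5)
```

On this toy instance the sketch ends with the items of sizes $1/10$ and $4/5$, that is, a total size of $9/10$ against an optimum of $1$.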

5 Results for General Removable Knapsack

First, we note that all lower bounds for the proportional removable knapsack problem carry over to the general removable knapsack problem, in particular Theorem 4.5.

Iwama and Zhang [18] have shown that the competitive ratio of RemKnap is unbounded without advice. This can be seen using an interactive instance that starts with an item $(1,1)$ and then presents items $(\varepsilon^{2},\varepsilon)$ repeatedly, up to $1/\varepsilon^{2}$ times, until one is packed, at which point the instance ends.
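The effect of this interactive instance can be illustrated with a few lines of Python. The sketch below is not taken from [18]; it assumes that the algorithm initially packs $(1,1)$, that $1/\varepsilon$ is an integer, and it models a deterministic algorithm only by the hypothetical parameter `swap_round`, the round in which it first packs a small item.

```python
from fractions import Fraction


def forced_ratio(eps, swap_round):
    """Ratio opt/alg on the interactive instance for one deterministic behavior.

    swap_round is the 1-based index of the first small item the algorithm packs
    (it has to drop (1, 1) for it); None means it keeps (1, 1) throughout.
    """
    rounds = int(1 / eps ** 2)
    if swap_round is None or swap_round > rounds:
        alg = Fraction(1)              # still holds the item (1, 1)
        opt = rounds * eps             # all small items: total size 1, value 1/eps
    else:
        alg = eps                      # holds one small item; the instance ends
        opt = max(Fraction(1), swap_round * eps)
    return opt / alg


eps = Fraction(1, 4)
ratios = [forced_ratio(eps, r) for r in [None] + list(range(1, 17))]
```

Under these assumptions every behavior yields a ratio of at least $1/\varepsilon$, which grows without bound as $\varepsilon$ decreases.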

The following two theorems show RemKnap’s competitivity for one advice bit to be exactly $2$ .

The existence of a $2$ -competitive algorithm already follows from a result by Han et al. [11, Thm. 9], who proved the statement even for a single random bit instead of an advice bit. We prove Theorem 5.1 by describing a concrete advice algorithm.

Theorem 5.1.

There is a $2$ -competitive algorithm for RemKnap reading only a single advice bit.

Proof 5.2.

The single advice bit indicates whether the instance contains an item worth at least half of the optimal solution value. If so, the algorithm greedily packs the most valuable item. Otherwise, it packs in a yield-greedy manner while ignoring any items larger than $1/2$ . If some optimal solution contains some item larger than $1/2$ , then the rest of this solution is smaller than $1/2$ and worth at least half of the optimal solution value. The yield-greedy algorithm achieves at least this value: Either it packs all items up to size $1/2$ or, when having to discard such an item, leaves a gap smaller than $1/2$ while the rest is filled with the best possible yield.
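The two strategies selected by the advice bit can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `one_bit_remknap` is hypothetical, items are assumed to be `(size, value)` pairs with positive size, and "yield-greedy" is read here as keeping the items of highest value-to-size ratio and discarding the lowest-yield item whenever the capacity is exceeded.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def one_bit_remknap(items, dominant_item_exists):
    """items: (size, value) pairs in arrival order; knapsack capacity 1."""
    if dominant_item_exists:
        # Advice bit set: always keep only the most valuable item seen so far.
        best = None
        for item in items:
            if best is None or item[1] > best[1]:
                best = item
        return [] if best is None else [best]
    # Advice bit not set: yield-greedy on the items of size at most 1/2.
    kept = []
    for size, value in items:
        if size > HALF:
            continue
        kept.append((size, value))
        kept.sort(key=lambda it: it[1] / it[0], reverse=True)
        while sum(s for s, _ in kept) > 1:
            kept.pop()                 # discard the item of lowest yield
    return kept


# Hypothetical toy input.
items = [(Fraction(2, 5), 2), (Fraction(2, 5), 4), (Fraction(2, 5), 6), (Fraction(1), 5)]
greedy_value = sum(v for _, v in one_bit_remknap(items, False))
single_item = one_bit_remknap(items, True)
```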

We now provide the matching lower bound for Theorem 5.1.

Theorem 5.3.

No algorithm for RemKnap reading only a single advice bit can have a competitive ratio better than $2$ .

Proof 5.4.

Fix an arbitrary positive $\varepsilon<1/2$ such that $k=1/\varepsilon$ is an integer. We describe an adversarial instance family on which no algorithm with a single advice bit can achieve a competitive ratio better than $2(1-\varepsilon)$ .

The instance will present some subset of the following items:

\begin{align*}
x_{i} &= (1-i\varepsilon^{3},\ 2-i\varepsilon) && \text{for } i\in\{0,1,\dots,k\},\\
x_{i}^{\prime} &= (i\varepsilon^{3},\ 2-i\varepsilon+\varepsilon) && \text{for } i\in\{1,2,\dots,k\},\text{ and}\\
y_{j} &= (\varepsilon,\ 4\varepsilon) && \text{for } j\in\{1,2,\dots,k\}.
\end{align*}

The exact subset and order of presentation depends on the operating advice algorithm and will be explained below.

We begin by making some simple observations. The items $x_{0},x_{1},\dots,x_{k}$ decrease in both size and value. The smallest of them still has size $1-\varepsilon^{2}$; thus it is impossible to fit two of them into the knapsack together. The same is true for combining any item $x_{i}$ with any item $y_{j}$; together they exceed the knapsack capacity. Finally, an item $x_{i}^{\prime}$ fits together with $x_{i}$ perfectly, but not with any of the items $x_{0},x_{1},\dots,x_{i-1}$.

We arrange the items along the axes of a $(k+1)\times k$ grid as shown in Figure 3. Every instance in the hard instance family can be represented by a directed path in this grid that starts at the top left corner, only moves down or right from there, and stops at the latest when reaching either the bottom or right border of the grid. Figure 3 shows one possible path.

We can read a given path as follows. All instances start by presenting the two items $x_{0}$ and $y_{1}$ corresponding to the corner $(x_{0},y_{1})$. When a new coordinate along the $x$-axis or $y$-axis is reached, the corresponding item is presented. Thus, the path moving down means presenting the next item in the sequence $x_{0},x_{1},\dots,x_{k}$, and analogously for moving to the right and the sequence $y_{1},y_{2},\dots,y_{k}$. If the path ends at $(x_{i},y_{j})$ with $i<k$ and $j<k$ (that is, before reaching the grid’s bottom or right border), then the instance concludes with $x_{i}^{\prime}$ as an additional final item. If the path reaches the bottom or right end, the instance ends without such an additional item.

Figure 3: The grid described in the proof of Theorem 5.3. The directed path represents the instance that presents the items $x_{0},y_{1},y_{2},x_{1},x_{2},y_{3},y_{4},\dots,y_{j},x_{3},x_{4},\dots,x_{k}$ in this order.

We now describe how the path is determined by the actions of the advice algorithm. Note that an algorithm with a single advice bit can be interpreted as two deterministic algorithms with the advice bit determining which algorithm is executed on any given instance. From $(x_{i},y_{j})$ with $i<k$ and $j<k$ , the path continues as follows.

Case 1.: If both algorithms have discarded $y_{j}$ , then it goes to the right.
Case 2.: If one algorithm has $y_{j}$ in its reserve and the other $x_{i}$ , the path continues downward.
Case 3.: If one algorithm has $y_{j}$ in its reserve, but neither has kept $x_{i}$ , then the path stops.

We make two observations. First, at any point $(x_{i},y_{j})$ , the two algorithms may have $y_{j}$ in their reserve, but no other item from $y_{1},y_{2},\dots,y_{k}$ . This is because the path is moving right—and thus the instance presenting an item in this sequence—only after its predecessor has been discarded by both algorithms.

Second, when arriving at $(x_{i},y_{j})$ , one of the algorithms may still have $x_{i-1}$ in its reserve, but neither algorithm will have kept any of the previous items $x_{0},\dots,x_{i-2}$ . This is because the path was able to move down to the current $x$ -coordinate only if one algorithm has had $x_{i-1}$ in its reserve and the other some item $y_{j^{\prime}}$ , which excludes any third item from $x_{0},\dots,x_{i-2}$ .

We consider the three ways that a path representing an instance can end: at the right border, at the bottom edge, or anywhere else.

Case 1.: If the path ends at the right border, the optimal solution consists of the entire sequence $y_{1},y_{2},\dots,y_{k}$ with total size $k\cdot\varepsilon=1$ and total value $k\cdot 4\varepsilon=4$ . The two deterministic algorithms, in contrast, have discarded all items in this optimal solution but possibly $y_{k}$ . Among $y_{k}$ and $x_{0},x_{1},\dots,x_{k}$ , which might have been presented, no two fit into a knapsack of capacity 1 together, thus the best feasible solution for the advice algorithm is taking the single most valuable item $x_{0}$ with a value of $2$ . The competitive performance of the algorithm is $4/2=2$ in this case.
Case 2.: If the path reaches the bottom, this means that the last move was downward; thus one algorithm has kept $x_{k-1}$ in its reserve and the other $y_{j}$ . Therefore, only the three singleton solutions consisting of $x_{k-1}$ , $x_{k}$ , and $y_{j}$ , respectively, are attainable by the advice algorithm. Among these, the first option is the best with a value of $1+\varepsilon$ . The optimum would have been to keep $x_{0}$ with a value of $2$ , however, resulting in a competitive performance of $2/(1+\varepsilon)$ or worse.
Case 3.: Finally, the path may stop at some point $(x_{i},y_{j})$ with $i<k$ and $j<k$. In this case, both algorithms have discarded $x_{i}$ and, as observed in the beginning, all previously presented items except for $x_{i-1}$ and $y_{j}$. Thus, when the final item $x_{i}^{\prime}$ is presented, the best option for the advice algorithm is the singleton solution with the item $x_{i-1}$ of value $2-(i-1)\varepsilon$. The optimal solution, in contrast, can combine the two items $x_{i}$ and $x_{i}^{\prime}$ of complementing sizes and a total value of $2(2-i\varepsilon)+\varepsilon=2(2-(i-1)\varepsilon)-\varepsilon$. The resulting competitive performance is $2-\varepsilon/(2-(i-1)\varepsilon)\geq 2-\varepsilon/(2-(k-1)\varepsilon)\geq 2-\varepsilon$.

Overall, the advice algorithm’s competitive ratio cannot be better than

\[\min\big\{2,\frac{2}{1+\varepsilon},2-\varepsilon\big\}\geq 2-2\varepsilon,\]

which tends to $2$ for decreasing $\varepsilon$ , thus proving the theorem.
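The three end cases can also be checked numerically with exact rational arithmetic. The following sketch is only a sanity check; the function name `end_case_ratios` is hypothetical, and the interior case evaluates the quotient of the optimal value $2(2-i\varepsilon)+\varepsilon$ and the attainable value $2-(i-1)\varepsilon$ for all $i\in\{1,\dots,k-1\}$.

```python
from fractions import Fraction


def end_case_ratios(k):
    """Competitive performance in the three end cases for eps = 1/k."""
    eps = Fraction(1, k)
    right_border = Fraction(4, 2)                      # optimum 4 versus value 2
    bottom = 2 / (1 + eps)                             # optimum 2 versus 1 + eps
    interior = min(
        (2 * (2 - i * eps) + eps) / (2 - (i - 1) * eps)
        for i in range(1, k)
    )
    return right_border, bottom, interior


# Any integer k >= 3 corresponds to a valid eps = 1/k < 1/2.
checks = {k: min(end_case_ratios(k)) >= 2 - 2 * Fraction(1, k) for k in (3, 10, 100)}
```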

Advice being more powerful than randomness, Theorem 5.3 also closes the remaining gap for barely random algorithms by lifting the previously best known lower bound by Han et al. [11, Thm. 12] from $1+1/e\approx 1.367$ to $2$ .

The analysis of the hard instance presented in the proof of Theorem 5.3 could be adapted to the case of more than one advice bit. Two advice bits would mean that the oracle can provide to the algorithm one out of four advice strings instead of the two advice strings possible with one bit. We may also consider an intermediate advice algorithm limited to three advice strings, which corresponds to $\log 3$ advice bits. Such an algorithm cannot achieve a competitive ratio better than $2/\Phi=4/(1+\sqrt{5})\approx 1.2361$ since it is forced to use one of the three advice strings to keep the most valuable item $x_{0}$ , one to pack the items $y_{1},\dots,y_{k}$ yield-greedily, and one to keep the $x_{j}$ with the value $\Phi\approx 1.618$ , resulting in a competitive ratio of $2/\Phi=2\Phi/(1+\Phi)$ .

This approach deteriorates too quickly, however. Adapting Theorem 4.11 to the case of RemKnap is the better choice; this results in Theorem 5.7, which already yields a better bound in the case of three advice strings, that is, for $k=3$. For increased accessibility, we first provide an instantiation for the case of a single advice bit, namely $k=2$.

Theorem 5.5.

No algorithm for RemKnap reading only a single advice bit can have a better competitive ratio than $(1+\sqrt{3})/2\approx 1.36603$ .

Proof 5.6.

The proof is similar to the one of Theorem 4.9 but takes advantage of the fact that the sizes and values of the items can be chosen independently. Let $\eta=(\sqrt{3}-1)/2\approx 0.366025$ and let an algorithm for general removable knapsack reading only a single advice bit be given. Consider the three instances displayed in Table 5, which all start with the same three items.

Table 5: Hard instance for an algorithm for RemKnap reading one advice bit, where $\eta=(\sqrt{3}-1)/2$ satisfies $1/(2\eta)=1+\eta=(1+\sqrt{3})/2\approx 1.36603$; see the proof of Theorem 5.5.

	$x_{1}$	$x_{2}$	$x_{3}$	$x_{4}$	$x_{4}^{\prime}$	optimal	suboptimal	ratio
$I_{1}$ :	$(1,1)$	$(0.9,2\eta)$	$(0.8,\eta)$			$1$	$2\eta$	$1/(2\eta)$
$I_{2}$ :	$(1,1)$	$(0.9,2\eta)$	$(0.8,\eta)$	$(0.1,1-\eta)$		$1+\eta$	$1$	$1+\eta$
$I_{3}$ :	$(1,1)$	$(0.9,2\eta)$	$(0.8,\eta)$		$(0.2,1)$	$1+\eta$	$1$	$1+\eta$

For each $i\in\{1,2,3\}$, the instance $I_{i}$ has a unique optimal solution; it contains $x_{i}$ plus the fourth item if it exists. Table 5 shows the total value of each of these optimal solutions and of the second-best solution.

Because the sizes of any two of these three items $x_{1}$ , $x_{2}$ , and $x_{3}$ sum up to over 1, the algorithm can have at most one of them in the knapsack after being offered $x_{3}$ . Moreover, since we have only one advice bit but three instances, there are two instances for which the algorithm has packed the same item right before the potential presentation of the fourth item. This implies that the algorithm is suboptimal for at least one instance, thus its competitive ratio cannot be better than $1/(2\eta)=1+\eta=(1+\sqrt{3})/2$ .
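The identity $1/(2\eta)=1+\eta$ used for the ratios in Table 5 follows directly from the definition $\eta=(\sqrt{3}-1)/2$; the short verification below is added for convenience:

\[2\eta(1+\eta)=2\eta+2\eta^{2}=(\sqrt{3}-1)+\frac{(\sqrt{3}-1)^{2}}{2}=(\sqrt{3}-1)+(2-\sqrt{3})=1.\]

Dividing by $2\eta$ gives $1+\eta=1/(2\eta)$.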

Theorem 5.7.

Let an arbitrary integer $k>1$ be given. No algorithm for RemKnap reading at most $\log k$ advice bits can achieve a better competitive ratio than $1/2+\sqrt{1/4+1/k}$ .

Table 6: A hard instance family for RemKnap reading at most $\log k$ advice bits; see the proof of Theorem 5.7. Only the values of the items are given, using $\xi=1/2+\sqrt{1/4+1/k}$; the sizes can be chosen arbitrarily satisfying $1\geq s_{1}>s_{2}>\dots>s_{k}>s_{k+1}>1/2$ and $s_{k+2}=1-s_{j}$ in instance $I_{j}$.

	$v_{1}$	$v_{2}$	$\cdots$	$v_{k}$	$v_{k+1}$	$v_{k+2}$
$I_{1}$ :	$1$	$1/\xi$	$\cdots$	$1/\xi-(k-2)(\xi-1)$	$1/\xi-(k-1)(\xi-1)$	0
$I_{2}$ :	$1$	$1/\xi$	$\cdots$	$1/\xi-(k-2)(\xi-1)$	$1/\xi-(k-1)(\xi-1)$	$\xi-v_{2}$
$\,\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$
$I_{k+1}$ :	$1$	$1/\xi$	$\cdots$	$1/\xi-(k-2)(\xi-1)$	$1/\xi-(k-1)(\xi-1)$	$\xi-v_{k+1}$

Table 7: The values of the optimal and best suboptimal solutions for each instance in the family given in Table 6 and the resulting competitive performance; see the proof of Theorem 5.7.

	Optimal value	Best suboptimal value	Best suboptimal performance
$I_{1}$ :	$v_{1}=1$	$v_{2}=1/\xi$	$\xi$
$I_{2}$ :	$v_{2}+v_{2}^{\prime}=\xi$	$v_{1}=v_{3}+v_{2}^{\prime}=1$	$\xi$
$I_{3}$ :	$v_{3}+v_{3}^{\prime}=\xi$	$v_{1}=v_{4}+v_{3}^{\prime}=1$	$\xi$
$\,\vdots$	$\vdots$	$\vdots$	$\vdots$
$I_{k}$ :	$v_{k}+v_{k}^{\prime}=\xi$	$v_{1}=v_{k+1}+v_{k}^{\prime}=1$	$\xi$
$I_{k+1}$ :	$v_{k+1}+v_{k+1}^{\prime}=\xi$	$v_{1}=v_{k+1}^{\prime}=1$	$\xi$

Proof 5.8.

Let an arbitrary integer $k>1$ be given and define $\xi$ as the unique positive solution of $1/\xi=k(\xi-1)$ , namely $\xi=1/2+\sqrt{1/4+1/k}$ . Consider $k+1$ instances that all start with the same $k+1$ items of the following, decreasing values: $v_{1}=1$ and $v_{i}=1/\xi-(i-2)(\xi-1)$ for every $i\in\{2,\dots,k+1\}$ . Note that $v_{k+1}=\xi-1$ . The sizes can be chosen arbitrarily satisfying $1\geq s_{1}>s_{2}>\dots>s_{k}>s_{k+1}>1/2$ .

The instance $I_{1}$ ends immediately after these common items, whereas the instance $I_{i}$ , for $i\in\{2,\dots,k+1\}$ , presents a complement to item $i$ , namely an item of size $s_{i}^{\prime}=1-s_{i}$ and value $v_{i}^{\prime}=\xi-v_{i}$ as the final one. There is a unique optimal solution for each instance: For $I_{1}$ , it is to pack the first item, which has value $v_{1}=1$ . For $I_{i}$ with $i>1$ , it is to pack item $i$ and its complement, which sum up to the optimal solution value $\xi$ . Since there are only $\log k$ advice bits available to handle the $k+1$ instances, at least two instances $I_{i}$ and $I_{j}$ with $i<j$ are processed with the same advice string and thus by the same deterministic algorithm. Consider this algorithm and the moment after seeing and taking decisions on the first $k+1$ items. It is impossible for the algorithm to have more than one of these common items packed since all of these items are larger than $1/2$ . Now, if item $i$ is packed at the considered moment, the algorithm will perform suboptimally on instance $I_{j}$ . Analogously, if item $j$ is packed, the performance on instance $I_{i}$ is suboptimal.

Now it suffices to check that the best suboptimal solution has a value of at most $1/\xi$ for $I_{1}$ and at most $1$ for the other instances. This leads to a performance ratio of $\xi$, thus proving the theorem. See Table 6 for an overview of the hard instance family. The best and second best solutions to all instances and their associated performances are listed in Table 7.
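The claimed values can be confirmed by brute force for small $k$. The sketch below is a sanity check under stated assumptions: the names `xi`, `instance`, and `best_two` are hypothetical, the concrete sizes $s_{i}=1-(i-1)/(4(k+1))$ are one arbitrary admissible choice, and values are compared in floating point up to a small tolerance.

```python
from fractions import Fraction
from itertools import combinations
from math import sqrt


def xi(k):
    """Positive solution of 1/xi = k * (xi - 1)."""
    return 0.5 + sqrt(0.25 + 1.0 / k)


def instance(k, j):
    """Items (size, value) of instance I_j for j in 1..k+1."""
    x = xi(k)
    values = [1.0] + [1 / x - (i - 2) * (x - 1) for i in range(2, k + 2)]
    # Assumed sizes: strictly decreasing, at most 1, all larger than 1/2.
    sizes = [1 - Fraction(i, 4 * (k + 1)) for i in range(k + 1)]
    items = list(zip(sizes, values))
    if j >= 2:
        # Complement of item j: size 1 - s_j and value xi - v_j.
        items.append((1 - sizes[j - 1], x - values[j - 1]))
    return items


def best_two(items, tol=1e-9):
    """Best and second-best distinct total values over all feasible packings."""
    totals = [
        sum(v for _, v in subset)
        for r in range(1, len(items) + 1)
        for subset in combinations(items, r)
        if sum(s for s, _ in subset) <= 1
    ]
    best = max(totals)
    second = max(t for t in totals if t < best - tol)
    return best, second


ratios = {
    (k, j): best_two(instance(k, j))[0] / best_two(instance(k, j))[1]
    for k in (2, 3, 4, 8)
    for j in range(1, k + 2)
}
```

For $k=2$, the common values are $1$, $2\eta$, and $\eta$, as in Table 5.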

We point out again that Theorem 5.7 slightly improves over the lower bounds known from the resource buffer model by Han et al. [13, Thm. 6]. Choosing $k=2^{1}$, $k=2^{2}$, and $k=2^{3}$, for example, we obtain the lower bounds

\[\frac{1+\sqrt{3}}{2}\approx 1.3660,\quad\frac{1+\sqrt{2}}{2}\approx 1.2071,\quad\text{and}\quad\frac{1+\sqrt{3/2}}{2}\approx 1.1124\]

for one, two, and three advice bits, respectively. The corresponding lower bounds by Han et al. for a resource buffer of size $R=2^{k}$ are

\[4/3\approx 1.3333,\quad 6/5=1.2,\quad\text{and}\quad 10/9\approx 1.1111.\]

We now move on to the core result of this paper, proving that a constant amount of advice bits is sufficient to reach a near-optimal competitive ratio not only for the proportional but even for the general removable knapsack problem. This will complete the picture of the global advice behavior of the online knapsack problem with removability outlined in Figure 1.

We first point out that algorithm PropPack (Algorithm 1) generally does not work on instances where the value of an item can vary independently of its size, as seen by the following counterexample: Assume that $1/2$ lies in the interior of some size class and choose an $\varepsilon>0$ such that $1/2+\varepsilon$ and $1/2-\varepsilon$ are still in the same class. Present two items $(1/2+\varepsilon,2)$ and $(1/2,1)$. If the algorithm picks the first one, $(1/2,2)$ is presented as the last item; otherwise, the instance ends with the item $(1/2-\varepsilon,1)$. In both cases, the algorithm achieves a total value of $2$, whereas the optimum is $3$. The advice does not help us to distinguish the two cases; it only tells us that the optimal solution contains two items from the class. Clearly, we have to adapt the algorithm to take the value of the items into account somehow. A major obstacle is that the online algorithm has no bound on the values of appearing items; thus the algorithm has no way of reconstructing the constantly many value classes used by the oracle just from the parameter $\varepsilon$.
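The two branches of this counterexample are small enough to verify exhaustively. In the sketch below, the helper names `best_value` and `counterexample` are hypothetical; the algorithm is modeled as holding exactly one of the first two items when the last item arrives, which is all it can do since they do not fit together.

```python
from fractions import Fraction
from itertools import combinations


def best_value(items):
    """Maximum total value of a subset of (size, value) items fitting into capacity 1."""
    return max(
        sum(v for _, v in subset)
        for r in range(len(items) + 1)
        for subset in combinations(items, r)
        if sum(s for s, _ in subset) <= 1
    )


def counterexample(eps):
    """Return (algorithm value, optimal value) for both adversarial branches."""
    half = Fraction(1, 2)
    first, second = (half + eps, 2), (half, 1)
    results = []
    for kept, last in ((first, (half, 2)), (second, (half - eps, 1))):
        alg = best_value([kept, last])             # only the kept and the last item remain
        opt = best_value([first, second, last])    # offline optimum over all three items
        results.append((alg, opt))
    return results


outcomes = counterexample(Fraction(1, 100))
```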

Moreover, the proof of Theorem 4.15 cannot be adapted for the general case in any simple way. Using only classes based on size, it is impossible for the algorithm to know, when maintaining an item in a slot, how to balance minimizing the size against maximizing the value. On the one hand, if the size is not minimized, then the excess size may prevent other slots from being filled. On the other hand, if the values are not maximized, the algorithm may incur an arbitrarily high loss because the potential values of items cannot be bounded. This is also the reason why simple value classes do not work either. The algorithm does not know the maximal value occurring in the instance until it has ended and can therefore not use it as a reference point, in contrast to the size classes that can be chosen relative to the known knapsack capacity.

A first step toward solving these issues is the definition of dynamic value classes that are anchored to both the value of the first item appearing in the instance and to the optimal solution value. The latter is of course also unknown to the algorithm until the instance ends. However, we are able to define our classes with some additional properties that enable our algorithm to compute at any point useful provisional bounds on the optimal solution value. These bounds will either turn out to be valid or the algorithm is able to notice that they are off just in time to adjust and take a fresh start before having lost too much due to bad decisions. The adversary may foil the algorithm over and over, forcing it to abandon its plans and adjust the bounds arbitrarily often.

To properly deal with these repeated resets, we develop a level system. One major challenge is to square the level system with some sort of slot system as used by the algorithm for the proportional case. We manage to do this by introducing the concept of a virtual algorithm, which has the special, even though only imagined, capability of keeping one item in a splitting slot and using arbitrary fractions of the item stored in it. We then describe an actual algorithm that tries to equal the idealized performance of the virtual algorithm without making use of the splitting slot. While it cannot quite achieve this, it will fare well enough in the end. Having one algorithm emulating another, we are going to prove the claimed competitivity in two stages, first for the virtual version and then for the actual algorithm.

This split analysis presents several further challenges, for example a desynchronization of the current phase and the number of slots filled by the algorithm, which coincided in the proportional case. The necessary adaptations include a judicious handling of the paltry items, which are worth almost nothing individually, yet may be too numerous to neglect. In fact, the algorithm will need to partition the items not only by their value but simultaneously by their size as well.

Overcoming these and a few other obstacles, we are able to prove our final theorem.

Theorem 5.9.

For any $\varepsilon>0$ , there is a strictly $(1+\varepsilon)$ -competitive algorithm for RemKnap reading a constant number of advice bits.

Proof 5.10.

Proving this theorem requires far more effort than what was necessary for its proportional counterpart Theorem 4.15, where an item’s value is always identical to its size. Having explained already why any straightforward adaptation of the substantially simpler approach for the proportional variant is impossible, we now provide a high-level outline of the proof. Next, we introduce some notions and notation necessary to describe what advice the oracle is communicating to the algorithm, and show that encoding this advice is possible with a constant number of bits. Then, we describe the algorithm and how it is using the advice; see Algorithm 2 for an implementation in pseudo-code. Finally, we use two proofs by induction to show that the algorithm’s online output successfully maintains some properties, which help us to conclude the proof by bounding the competitive ratio.

Outline

As announced, we first provide a high-level outline of the workings of the advice algorithm.

The oracle chooses an arbitrary optimal solution $S$ and, based on its value $v(S)$ , partitions the items of the instance into precious and paltry ones; the paltry ones are those worth less than $\varepsilon_{\textnormal{paltry}}\cdot v(S)$ for a suitable $\varepsilon_{\textnormal{paltry}}>0$ . The precious items are further split into finitely many classes such that the item values within any class are at most some factor $1-\varepsilon_{\textnormal{spread}}$ apart from each other.

The advice will encode exactly the classes of the precious items from the optimal solution $S$ in their order of appearance; the goal is to ensure that the algorithm packs just as many precious items from each class as the optimal solution does, thus achieving a competitive factor of $1/(1-\varepsilon_{\textnormal{spread}})$ on these items. Assuming that the algorithm knows the exact value ranges of each class, it can achieve this using a system of slots, which will be filled with precious items in two stages: first just virtually—assuming the algorithm were able to do certain things that are in fact impossible—and then also actually at some point. Each virtual filling of a slot starts a new phase of the algorithm.

However, the algorithm knows only the target value but nothing about the size of the items belonging in each slot; it is necessary to prove that despite this, the algorithm is able to fill the slots in the right order without blocking important items yet to come.

And there is another problem: It is even impossible for the oracle to communicate to the algorithm the exact value range of each class with a constant amount of advice since there is no bound on the potential values occurring in an instance. Instead, the value ranges will be described in relation to the value of the first item of the instance, and merely modulo some constant factor. The algorithm will then operate under the assumption that the first item is a precious one, in which case all of the above will work out. Since the algorithm cannot know for sure which items are precious, it divides them into presumably precious and provenly paltry ones according to some computations based on the advice and the instance seen so far. If the algorithm’s assumption is mistaken, it is able to recognize this just in time by continually comparing the best solution realizable with the already presented items—whether they have been accepted or not—to a rather intricate estimate for the optimal solution value $v(S)$ . Once the algorithm discovers its mistake, it resets with a revised set of assumptions on the value ranges; we say that the algorithm levels up. This is done such that the algorithm can go through arbitrarily many levels, resetting and taking a fresh start as often as necessary without incurring more than a negligible value loss.

The algorithm also needs to take care of the paltry items, which might constitute a considerable part of the optimal solution if there are sufficiently many of them. The algorithm is packing the paltry items in a somewhat inhibited greedy manner that optimizes the value-to-size ratio, which we also call yield. The volume taken up by the paltry items will be restricted sufficiently to guarantee that the precious items can always fill their slots, but not as severely as to lose too much value on the paltry items. The right volume restrictions in each phase of the algorithm are communicated via a constant amount of advice as well. Again, this cannot happen directly, since the right volume range might be infinitesimally small. Instead, the volume is controlled indirectly, via bounds not on some size but instead on the value that is provided by the paltry items packed before filling the current slot with a precious item.

Communicating the necessary volumes in this way is possible only up to some precision $\varepsilon_{\textnormal{round}}$, and an overestimation could mean the loss of a crucial precious item. If we always round down, then this cannot happen, but the algorithm might reject some paltry item it should have kept for a selection of maximum yield. This is negligible if it happens only once, but we cannot tolerate taking such a loss in every phase, with every new volume bound. The solution is to analyze the situation using a special splitting slot, which can accommodate one paltry item at a time outside of the knapsack. We imagine the splitting slot lending us from the item stored in it any desired fraction at any time, always with the same fraction of the value and of the size. We refer to the item currently stored in the splitting slot as the split item. We will consider an algorithm that maintains a split item of highest yield after the remaining paltry items kept in the knapsack. This imaginary algorithm is an antecedent to our real advice algorithm, which builds on it but cannot actually split any items of course; we refer to the purely hypothetical precursor as the virtual version of our actual algorithm.

The actual algorithm will mimic the virtual version as closely as possible and deliver a result that is only marginally worse. Whenever the virtual version splits an item, the actual algorithm needs to decide whether to discard this item or store it completely. The challenge is to take the right decisions to allow for all slots to be filled in time and also avoid an undue accumulation of losses by passing on too many split items.

The advice helps the actual algorithm by indicating for every phase whether an item stored in the splitting slot by the virtual version is to be packed or discarded. This is done in such a way that in the end, the actual algorithm will have relinquished only the value of a single paltry item, namely the one kept in the splitting slot when the instance ends.

Packing entire items from the splitting slot comes with problems of its own; these paltry items might block, for the actual algorithm, some precious items that are packed by the virtual version. This problem is addressed by further advice to the algorithm on how to prioritize the packing of precious versus paltry items in each phase. This advice, telling the algorithm when to actualize a virtual packing of an item, is based on the solution that the virtual version would eventually produce if it existed. The virtual version does not depend on the actual algorithm and has no need for the part of the advice on actualization, which avoids any circular reasoning.

There are a few more technical issues to be dealt with, for example the special case that the precious items contribute only marginally to the value of the optimal solution. This undermines the estimates for the optimal solution value, is thus flagged by a dedicated advice bit $b_{\textnormal{small}}$ , and handled by switching from the elaborate value-based limits that dampen the general yield-greedy strategy to a simpler size-sensitive strategy.

Figure 4: A schematic illustration showing some of the infinitely many classes $V_{k}$ and comboclasses $W_{k}$ used in the proof of Theorem 5.9. The $x$-axis shows the value ranges of the items contained in each class; it has logarithmic scale. We have $0$ infinitely far to the left and $\infty$ infinitely far to the right. Each class $V_{k}$ covers values that spread across a factor of $1-\varepsilon_{\textnormal{spread}}$, which is the unit length in the logarithmic scale. The comboclasses are $K=4$ units wide, and there are $K=4$ modulo classes. The modulo class $M_{2}$ is shaded in gray; this is the modulo class containing the first item, which has value $v_{1}$. The scale is shifted such that the following three properties are satisfied: The value $v_{1}$ of the first item lies right at the top end of its class (i.e., the right end in this illustration), the class of the first item is part of the comboclass $W_{1}$, and the optimal solution value $v(S)$ lies somewhere in the highest class of its comboclass, namely in the modulo class $M_{0}$. The position of the optimal solution value determines what we call precious and paltry. The highest value achievable with the items seen so far, represented by $v_{i}$ in the example, determines what we call presumably precious and provenly paltry.

Notions and Notation

We make the same assumptions as in the proof of Theorem 4.15 for the proportional problem: The knapsack has capacity $1$ , all items have size at most $1$ , and $\varepsilon\leq 1/2$ . Again, we denote the items in the order of their appearance in the instance by $1,2,\dots,n$ . We write $s_{i}=s(i)$ and $v_{i}=v(i)$ for the size and value of item $i$ , respectively.

To make the proof more understandable, we use the four constants $\varepsilon_{\textnormal{small}}$, $\varepsilon_{\textnormal{paltry}}$, $\varepsilon_{\textnormal{spread}}$, and $\varepsilon_{\textnormal{round}}$, which all depend only on the given parameter $\varepsilon$ but are used in different roles. They need to satisfy several inequalities; a concrete list of possible choices is $\varepsilon_{\textnormal{small}}=\varepsilon/2^{3}$, $\varepsilon_{\textnormal{paltry}}=\varepsilon^{2}/2^{5}$, $\varepsilon_{\textnormal{spread}}=\varepsilon^{4}/2^{14}$, and $\varepsilon_{\textnormal{round}}=\varepsilon^{6}/2^{20}$.

The oracle begins by computing an arbitrary optimal solution $S$ . In contrast to the proof for the proportional analogue of our theorem, not just the advice but also the constructed classes depend on $S$ and the instance as well. Specifically, the oracle uses the value $v_{1}$ of the first item in the instance and the optimal solution value $v(S)$ to partition all items into the bi-infinite sequence of value classes

$$V_{k}=\Big\{\,i\in\{1,\dots,n\}\ \Big|\ \Big\lceil\log_{1-\varepsilon_{\textnormal{spread}}}\frac{v_{1}}{v_{i}}\Big\rceil+K-\Big(\Big\lceil\log_{1-\varepsilon_{\textnormal{spread}}}\frac{v_{1}}{v(S)}\Big\rceil\bmod{K}\Big)=k\,\Big\}$$

for every integer index $k$, with $K=\big\lceil\log_{1-\varepsilon_{\textnormal{spread}}}\varepsilon_{\textnormal{paltry}}\big\rceil+1$ being a constant for any parameter $\varepsilon$.

Claim 1.

These classes are constructed such that they simultaneously satisfy the following four properties, which are straightforward to verify and will prove crucial later on.

1. Each class contains only items whose values lie within an interval spanning a factor of $1-\varepsilon_{\textnormal{spread}}$ from the upper to the lower interval end.
2. The value $v_{1}$ of the first item marks the upper end point of the value interval of the class containing this first item.
3. The first item is contained in one of the classes $V_{1},\dots,V_{K}$.
4. The index of the class whose interval contains the optimal solution value $v(S)$ is a multiple of $K$.

Proof 5.11.

For the first property, we observe that the item value $v_{i}$ occurs only once in the entire definition of $V_{k}$, namely within the first summand $\lceil\log_{1-\varepsilon_{\textnormal{spread}}}(v_{1}/v_{i})\rceil$ as a factor in the argument of the logarithm. Since $\log_{1-\varepsilon_{\textnormal{spread}}}(v_{1}/v_{i})=(\log_{1-\varepsilon_{\textnormal{spread}}}v_{1})-\log_{1-\varepsilon_{\textnormal{spread}}}v_{i}$, this term decreases by $1$ if $v_{i}$ is replaced by $v_{i}^{\prime}=v_{i}(1-\varepsilon_{\textnormal{spread}})$. Thus the values of items in any given class can span a factor of up to $1-\varepsilon_{\textnormal{spread}}$ but never more.

To verify the second property, we observe that $\log_{1-\varepsilon_{\textnormal{spread}}}(v_{1}/v_{i})$ is zero for $v_{i}=v_{1}$ but positive whenever $v_{i}>v_{1}$ and thus $v_{1}/v_{i}<1$ . Note that the logarithm base $1-\varepsilon_{\textnormal{spread}}$ is smaller than $1$ .

The third property follows again due to the first summand vanishing for $v_{i}=v_{1}$ . This leaves the class index $k$ equal to $0+K-j$ , where $j=\lceil\log_{1-\varepsilon_{\textnormal{spread}}}(v_{1}/v(S))\rceil$ is an integer taken modulo $K$ , meaning that we have $j\in\{0,1,\dots,K-1\}$ and thus $k\in\{1,\dots,K\}$ .

Finally, for the fourth and last property, it suffices to set $v_{i}=v(S)$ and see that the subtrahend becomes identical to the first summand, except for being taken modulo $K$, meaning that they combine to a multiple of $K$.
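The class construction and the four properties of Claim 1 can be checked numerically on a toy instance. The following Python sketch is only an illustration under assumed parameters: the function names `log_base`, `modulo_class_count`, and `class_index` are hypothetical, and the toy choices $\varepsilon_{\textnormal{spread}}=0.5$ and $\varepsilon_{\textnormal{paltry}}=0.2$ (which give $K=4$, as in Figure 4) are far larger than the constants the proof actually requires.

```python
import math

def log_base(base, x):
    """Logarithm of x to the given base (a base below 1 is allowed)."""
    return math.log2(x) / math.log2(base)

def modulo_class_count(eps_spread, eps_paltry):
    """K = ceil(log_{1 - eps_spread}(eps_paltry)) + 1."""
    return math.ceil(log_base(1 - eps_spread, eps_paltry)) + 1

def class_index(v_i, v_1, v_opt, eps_spread, K):
    """Index k of the class V_k that contains an item of value v_i."""
    base = 1 - eps_spread
    first = math.ceil(log_base(base, v_1 / v_i))
    shift = math.ceil(log_base(base, v_1 / v_opt)) % K
    return first + K - shift

# Toy parameters, assumed for illustration only.
EPS_SPREAD, EPS_PALTRY = 0.5, 0.2
K = modulo_class_count(EPS_SPREAD, EPS_PALTRY)
V_1, V_OPT = 3.0, 160.0  # value of the first item and optimal solution value

k_first = class_index(V_1, V_1, V_OPT, EPS_SPREAD, K)
k_opt = class_index(V_OPT, V_1, V_OPT, EPS_SPREAD, K)
```

With these toy values, the first item lands in one of the classes $V_{1},\dots,V_{K}$ with $v_{1}$ at the upper end of its interval, and $v(S)$ falls into a class whose index is a multiple of $K$.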

For every $k\in\{0,\dots,K-1\}$ , we define the modulo class $M_{k}=\bigcup_{j=-\infty}^{\infty}V_{jK-k}$ , consisting of every $K$ th class. There are exactly $K$ distinct modulo classes, namely $M_{0},M_{1},\dots,M_{K-1}$ ; each corresponds to one of the possible values of the modulo term in the definition of $V_{k}$ . We could also have defined $M_{k}$ as $\bigcup_{j=-\infty}^{\infty}V_{jK+k}$ , but using $V_{jK-k}$ instead of $V_{jK+k}$ will turn out to be more convenient later on.

Besides the $K$ modulo classes, we define an infinite number of comboclasses. For every integer $k$, comboclass $W_{k}=\bigcup_{j=0}^{K-1}V_{kK-j}$ comprises $K$ consecutive classes. This definition is again optimized for notational convenience in the remainder of this paper, which is also the reason for including the middle summand $K$ in the definition of $V_{k}$. For example, we can now reformulate the third and fourth property more succinctly as follows: Item $1$ is contained in comboclass $W_{1}$, and $v(S)$ belongs to modulo class $M_{0}$.

For any item $i$ , we write $V(i)$ , $M(i)$ , and $W(i)$ to indicate the class, modulo class, and comboclass in which item $i$ lies. Thus, $V(i)$ is the integer $k$ such that $i\in V_{k}$ , analogously $W(i)$ is the integer $k$ satisfying $i\in W_{k}$ , and finally $M(i)$ is the integer $k$ satisfying $i\in M_{k}$ . Note that by their definitions these three terms satisfy the general relation $V(i)=W(i)\cdot K-M(i)$ for any item $i$ .
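The relation $V(i)=W(i)\cdot K-M(i)$ determines the modulo class and the comboclass from the class index alone. A minimal Python sketch, with the hypothetical helper names `modulo_class` and `comboclass` operating directly on a class index $k$, could look as follows.

```python
def modulo_class(k, K):
    """The M in {0, ..., K-1} with k = j*K - M for some integer j."""
    return (-k) % K

def comboclass(k, K):
    """The W with k = W*K - M, which equals ceil(k / K)."""
    return (k + modulo_class(k, K)) // K
```

For $K=4$, class $V_{2}$ is thus assigned to $M_{2}$ and $W_{1}$, whereas class $V_{8}$ is assigned to $M_{0}$ and $W_{2}$.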

An item can be at most as valuable as the entire optimal solution. The comboclass that would contain such a potential item, namely the one with index $W(v(S))$ , is called precious. The $K$ classes comprising the precious comboclass and all the items contained in it are called precious as well. The remaining items are called paltry. In the natural way, we denote the sets of precious and paltry items by $V_{\textnormal{precious}}$ and $V_{\textnormal{paltry}}$ , respectively.

We now refer to Figure 4, which illustrates the value ranges of the classes, comboclasses, and modulo classes and shows which are considered paltry and precious, respectively. The notions of provenly paltry and presumably precious, which also appear in this figure, will only be defined during the discussion of the so-called level system further down.

Using Figure 4, it is also simple to verify the following. The highest class that could contain any paltry item is $V_{(W(v(S))-1)K}$, and it is separated from $V_{W(v(S))K}$, whose value range contains $v(S)$, by $K-1$ classes. Since the value range of each of these classes spans a factor of $1-\varepsilon_{\textnormal{spread}}$, it follows that any paltry item is worth less than $(1-\varepsilon_{\textnormal{spread}})^{K-1}v(S)\leq\varepsilon_{\textnormal{paltry}}v(S)$, where the inequality holds by the definition of $K$. This fact will be used later on.
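The inequality $(1-\varepsilon_{\textnormal{spread}})^{K-1}\leq\varepsilon_{\textnormal{paltry}}$ can be confirmed numerically for the concrete constants listed above. The Python sketch below does so for $\varepsilon=1/2$ in floating-point arithmetic; the helper names `proof_constants` and `modulo_class_count` are hypothetical.

```python
import math

def proof_constants(eps):
    """The concrete choices of the four constants for a given eps."""
    return {
        "small": eps / 2**3,
        "paltry": eps**2 / 2**5,
        "spread": eps**4 / 2**14,
        "round": eps**6 / 2**20,
    }

def modulo_class_count(eps_spread, eps_paltry):
    """K = ceil(log_{1 - eps_spread}(eps_paltry)) + 1."""
    return math.ceil(math.log(eps_paltry) / math.log(1 - eps_spread)) + 1

c = proof_constants(0.5)
K = modulo_class_count(c["spread"], c["paltry"])
# Upper bound on the value of a paltry item, as a fraction of v(S).
paltry_cap = (1 - c["spread"]) ** (K - 1)
```

The constant $K$ exceeds one million already for $\varepsilon=1/2$, which illustrates that the advice is constant only in the asymptotic sense.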

We denote by $u_{1}<\ldots<u_{m}$ the precious items appearing in the optimal solution $S$, in the order in which they are presented during the given instance. These items partition the paltry items $V_{\textnormal{paltry}}$ into $m+1$ subclasses, depending on their relative appearance time: For $j\in\{0,\dots,m\}$, we let $V_{\textnormal{paltry},j}=\{\,i\in V_{\textnormal{paltry}}\mid u_{j}<i<u_{j+1}\,\}$ denote the paltry items appearing after $u_{j}$ and before $u_{j+1}$, where we use the auxiliary definitions $u_{0}=0$ and $u_{m+1}=n+1$ for two valueless items for notational convenience.

Finally, we partition the items, independently of the distinction between paltry and precious, into the big ones of size at least $\varepsilon_{\textnormal{small}}$, denoted by $V_{\textnormal{big}}$, and the small ones below this threshold, denoted by $V_{\textnormal{small}}$.

Advice Content

In analogy to the proof of Theorem 4.15 for the proportional problem variant, where it is sufficient to define a finite number of instance-independent, size-based classes, we would like for the oracle to communicate through its advice the class of each precious item in the optimal solution $S$ in appearance order. This is impossible, however, since the number of classes is unbounded this time around. Instead, we restrict ourselves to conveying the information merely modulo the constant $K$ . We do so by encoding modulo classes—of which there are only finitely many—for all precious items into a tuple $(M(u_{1}),\dots,M(u_{m}))$ .

In addition, the advice tells the algorithm into which modulo class the first item of the instance is falling; that is, it encodes $M(1)$ .

Furthermore, for every $j\in\{0,1,\dots,m\}$, the oracle encodes $\lfloor v(S\cap V_{\textnormal{paltry},j})/v(S)\rfloor_{\varepsilon_{\textnormal{round}}}$ for some sufficiently small $\varepsilon_{\textnormal{round}}>0$; this is the fraction of the optimal solution value that is due to paltry items appearing in the instance after $u_{j}$ but before $u_{j+1}$, rounded down to the nearest multiple of $\varepsilon_{\textnormal{round}}$. We denote this rounded fraction by $f_{j}$.
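The rounding operation $\lfloor\cdot\rfloor_{\varepsilon_{\textnormal{round}}}$ can be illustrated with exact rational arithmetic. In the following Python sketch, the names `round_down` and `paltry_fractions` as well as the toy input are assumptions made for illustration; the list `phase_values` stands for the values $v(S\cap V_{\textnormal{paltry},j})$.

```python
import math
from fractions import Fraction

def round_down(x, eps_round):
    """Round x down to the nearest multiple of eps_round."""
    return math.floor(x / eps_round) * eps_round

def paltry_fractions(phase_values, v_opt, eps_round):
    """Rounded fractions f_0, ..., f_m of v(S) contributed per phase."""
    return [round_down(Fraction(v) / v_opt, eps_round) for v in phase_values]

# Toy example: v(S) = 10 and a precision of 1/100.
fs = paltry_fractions([Fraction(123, 100), 0, Fraction(5, 2)], 10, Fraction(1, 100))
```

Each $f_{j}$ underestimates the true fraction by less than $\varepsilon_{\textnormal{round}}$, which is the property used when bounding the sum of the $f_{j}$ later on.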

There is also one special advice bit $b_{\textnormal{small}}$, telling the algorithm whether packing only paltry items yields a $1/(1-\varepsilon_{\textnormal{small}})$-competitive solution.

The advice described so far is everything needed for the virtual version of our algorithm to work. The actual algorithm makes use of the following additional advice, which depends on the online output of the virtual version: For every $j\in\{0,1,\dots,m\}$, the advice contains a tuple $a_{j}=(a_{j,1},a_{j,2},\dots,a_{j,m_{j}})$ of numbers from $\{0,1,2,\dots,m\}$. Here, the numbers $1,2,\dots,m$ are the indices of the $m$ slots used by the algorithm for storing precious items. The number $0$ is taken as the index of the splitting slot. The numbers of the tuple $a_{j}$ are determined by what the virtual algorithm does with the items appearing in phase $j$. More precisely, it only matters what the algorithm does with those items that will be part of the eventual virtual solution $S^{\prime}$, excluding the one item remaining in the splitting slot in the end. If there are no such items during phase $j$, then the tuple $a_{j}$ is empty. Otherwise, consider the first such item of phase $j$. It will be part of the eventual solution $S^{\prime}$, meaning that it must be packed by the virtual algorithm upon appearance. The first number of the tuple $a_{j}$ indicates the slot into which this item is stored, namely slot $a_{j,1}$. The next number of the tuple is analogously determined by the next item that is packed during phase $j$ and a full part of the eventual virtual solution, and so on. In summary, tuple $a_{j}$ contains the indices of the slots filled during phase $j$ with items that will be a complete part of the eventual virtual solution, and the indices are sorted in the order these slots were filled with said items.

Constant Advice

We prove that a constant amount of advice suffices to encode the information just described.

We assume without loss of generality that the following, separately discussed pieces of information are combined into the advice string in such a way that they can be unambiguously retrieved. This is achieved by means of a suitable self-delimiting encoding that increases the advice length by at most a constant factor.

The number $m$ of precious items in the optimal solution, and thus also in the virtual solution, is bounded from above by the constant $1/\varepsilon_{\textnormal{paltry}}$ since every precious item is worth at least $\varepsilon_{\textnormal{paltry}}\cdot v(S)$ and the total value of packed precious items cannot exceed $v(S)$ .

There are only constantly many modulo classes, namely $K=\big\lceil\log_{1-\varepsilon_{\textnormal{spread}}}\varepsilon_{\textnormal{paltry}}\big\rceil+1$; a constant amount of advice therefore suffices to communicate the modulo classes of the $m$ precious items in the optimal solution and that of the first item of the instance.

For every $j\in\{0,1,\dots,m\}$, we can encode $f_{j}$ via $\lfloor(1/\varepsilon_{\textnormal{round}})v(S\cap\bigcup_{k=0}^{j}V_{\textnormal{paltry},k})/v(S)\rfloor$; this is the fraction to be approximated divided by $\varepsilon_{\textnormal{round}}$ and rounded down to the nearest integer. These integers are clearly bounded by the constant $1/\varepsilon_{\textnormal{round}}$. Since both $1/\varepsilon_{\textnormal{round}}$ and $m$ are constants, this information is also encodable within a constant amount of advice.

Finally, there are the $m+1$ tuples $a_{0},a_{1},\dots,a_{m}$ , each containing at most $m+1$ numbers ranging from $0$ to $m$ . A number of advice bits cubic in $m$ , but still constant for any given parameter $\varepsilon$ , is enough to store this information too.

Algorithm Description

A Single Special Switch.

As a first simple special case, we consider what the algorithm does when the bit $b_{\textnormal{small}}$ is set to $1$, meaning that packing only paltry items in a yield-greedy fashion would lead to a $1/(1-\varepsilon_{\textnormal{small}})$-competitive solution. In this case, the algorithm does not directly implement this strategy, which it cannot do because it is unsure which items are in fact paltry. Instead, it runs a yield-greedy strategy on the small items, discarding any big item without consideration. When analyzing the algorithm later on, we will show that this still secures a solution of sufficient value.

For the remainder of this description, we assume that the switch $b_{\textnormal{small}}$ is set to zero.

Computing the Classes.

Immediately after seeing the first item, the algorithm uses the advice-given information about the modulo class of this item and the fact that its value $v_{1}$ lies right at the upper end of the value interval of its class to reconstruct the precise value intervals of every class $V_{k}$ . This is possible thanks to the four properties of these value classes already proved above:

Once the exact value range covered by the class containing the first item is known to the algorithm, the first property can be used to achieve the same for all other classes. The remaining three properties help the algorithm retrieve all information about the value interval of the class of the first item. The second property fixes the upper end of this interval to $v_{1}$ . The third property tells us that $v_{1}$ is contained in one of the classes $V_{1},V_{2},\dots,V_{K}$ of comboclass $W_{1}$ . The fourth property determines which of these $K$ classes it is by fixing the modulo class of $v(S)$ and thus also the one of $v_{1}$ , which is included in the advice given to the algorithm.

As a consequence, the algorithm is able to compute $V(i)$ , $M(i)$ , and $W(i)$ for any item $i$ of the instance as soon as it appears. We emphasize that, nevertheless, it is still unable to tell for sure whether an item is precious or paltry as long as the instance has not ended. Instead, the algorithm partitions the items seen so far into presumably precious and provenly paltry ones. This partition may change over time and is based on the following level system.
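The reconstruction of the classes from $v_{1}$ and the advised modulo class $M(1)$ can be sketched in a few lines of Python. The function names below are hypothetical, and the toy parameters ($K=4$, $\varepsilon_{\textnormal{spread}}=0.5$) are assumptions chosen to match Figure 4 rather than the constants of the proof. The sketch uses that item $1$ lies in comboclass $W_{1}$, so that $V(1)=K-M(1)$.

```python
import math

def log_base(base, x):
    """Logarithm of x to the given base (a base below 1 is allowed)."""
    return math.log2(x) / math.log2(base)

def online_class_index(v_i, v_1, m_first, eps_spread, K):
    """V(i), computed from v_1 and the advised modulo class M(1) only."""
    k_first = K - m_first  # V(1) = W(1)*K - M(1) with W(1) = 1
    return k_first + math.ceil(log_base(1 - eps_spread, v_1 / v_i))

def modulo_class(k, K):
    """M(i) for an item in class V_k."""
    return (-k) % K

def comboclass(k, K):
    """W(i) for an item in class V_k."""
    return (k + (-k) % K) // K
```

Note that the optimal solution value $v(S)$ is not an input here; the algorithm learns everything it needs about the class boundaries from the first item and the advice.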

The Level System.

At any time, there is exactly one comboclass considered presumably precious; all lower comboclasses are provenly paltry. The designation as presumably precious or provenly paltry is naturally inherited by the classes and items contained in a comboclass. For every integer $\ell$ , we say that the algorithm is at level $\ell$ if $W_{\ell}$ is the presumably precious comboclass at this moment. The algorithm starts at level $1$ , which means that the comboclass containing the first item of the instance is considered presumably precious in the beginning. While processing its input, the algorithm may level up many times, but the level is never decreasing: What has been recognized as presumably precious may become provenly paltry at a later point, but never the other way around. There are exactly two events that can trigger the raise to a new level.

1. The first one is the appearance of an item with a value above the value range of the comboclass that is currently seen as presumably precious. In this case, the comboclass containing this new item becomes the presumably precious one.
2. The algorithm might also be triggered into leveling up by the arrival of a provenly paltry item, but only under certain conditions. Namely, the algorithm is always keeping track of what the optimal solution value achievable with the items presented so far, including the rejected ones, would have been. This only hypothetically realizable value is continually compared against some upper estimate $U_{\ell}$ for the optimal solution value $v(S)$; the exact estimate is explained and examined in the next subsection. As soon as the estimate $U_{\ell}$ for $v(S)$ is exceeded by the hypothetically achievable solution value, the algorithm enters the next level: The presumably precious comboclass becomes provenly paltry and the comboclass immediately above it becomes the new presumably precious one.

We will explain the precise purpose for employing exactly these two triggers when analyzing the competitive ratio of the algorithm. Essentially, the first one protects precious items from being discarded due to procrastinated level changes, while the second trigger’s paramount duty is to prevent too many paltry items from being misjudged as precious, which can also lead to them being unduly discarded.

Each of the two triggers alone would already ensure that the algorithm eventually reaches a level such that the presumably precious comboclass is in fact the precious one. We call the level for which this happens the final level and refer to all previous ones as the lost levels for a reason that will become clear later on.
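The two triggers can be summarized in a small Python sketch. It is a simplification under stated assumptions: the hypothetical function `update_level` receives the comboclass index of the new item, the best solution value achievable with all items seen so far, and the estimate $U_{\ell}$ as a callable; it checks the second trigger after every item and repeats it if necessary, which the description above does not spell out.

```python
def update_level(level, item_comboclass, hypothetical_optimum, upper_estimate):
    """Return the level after processing one item; the level never decreases."""
    # Trigger 1: the item lies above the presumably precious comboclass.
    if item_comboclass > level:
        level = item_comboclass
    # Trigger 2: the hypothetically achievable value exceeds the estimate U_level.
    while hypothetical_optimum > upper_estimate(level):
        level += 1
    return level

# Toy estimate (an assumption): U grows by a constant factor per level.
def toy_estimate(level):
    return 10 * 16**level
```

For instance, `update_level(1, 3, 100, toy_estimate)` jumps to level 3 by the first trigger, while `update_level(1, 1, 200, toy_estimate)` moves to level 2 by the second one.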

Estimating the Optimal Solution Value.

The upper estimate for $v(S)$ mentioned before is

$$U_{\ell}=\frac{v_{1}\sum_{j=1}^{m}(1-\varepsilon_{\textnormal{spread}})^{V(1)+M(u_{j})-\ell K}}{1-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})-\sum_{p=0}^{m}f_{p}},$$

where $\ell$ is the index of the comboclass currently considered presumably precious by the algorithm. The modulo class index $M(u_{j})$, the class index $V(1)$, and the rounded fractions $f_{p}=\lfloor v(S\cap V_{\textnormal{paltry},p})/v(S)\rfloor_{\varepsilon_{\textnormal{round}}}$ of the optimal solution value contributed by paltry items from phase $p$ are all retrieved from the advice. When calculating the estimate above, the algorithm assumes to have reached its final level; that is, it operates under the assumption that the presumably precious comboclass $W_{\ell}$ is in fact the precious one. Under this assumption, $U_{\ell}$ is indeed a valid upper estimate for $v(S)$ as we will show now.

We have $v(S)=v(S\cap V_{\textnormal{precious}})+v(S\cap V_{\textnormal{paltry}})$ and begin by analyzing the first of the two summands. It is the value contributed to the optimal solution by the $m$ precious items contained in it. The algorithm can use the information about the modulo classes of these $m$ items to place the value of each of them into an interval spanning a factor of $1-\varepsilon_{\textnormal{spread}}$. Specifically, we start with the already established general relation $V(i)=W(i)K-M(i)$, apply it to $i=u_{j}$, and use the assumption $W(u_{j})=\ell$ of the current comboclass being the precious one to obtain $V(u_{j})=\ell K-M(u_{j})$. Since $v_{1}$ lies at the upper endpoint of class $V(1)$, we can compute the upper end of the value range of class $V(u_{j})$ precisely as $v_{1}/(1-\varepsilon_{\textnormal{spread}})^{V(u_{j})-V(1)}=v_{1}(1-\varepsilon_{\textnormal{spread}})^{V(1)+M(u_{j})-\ell K}$. Note that the given advice and the value of the first item are together indeed sufficient to calculate this number. It is an upper bound on $v(u_{j})$ such that the true value cannot be smaller than a factor $1-\varepsilon_{\textnormal{spread}}$ of it, provided that the algorithm’s assumption $W(u_{j})=\ell$ holds true. If the algorithm is mistaken, however, that is, if $\ell<W(u_{j})$, then the estimate is far too low; namely, it would be exactly a factor $(1-\varepsilon_{\textnormal{spread}})^{(W(u_{j})-\ell)K}\leq(1-\varepsilon_{\textnormal{spread}})^{K}$ of what it otherwise would have been. If the algorithm has not yet reached the final level, the estimate for $v(u_{j})$ is therefore at most a fraction $(1-\varepsilon_{\textnormal{spread}})^{K-1}\leq\varepsilon_{\textnormal{paltry}}$ of the true value. As mentioned before, the inequality holds by the definition of $K$. All of the assertions above for any single precious item naturally carry over to the sum

$$t=\sum_{j=1}^{m}v_{1}(1-\varepsilon_{\textnormal{spread}})^{V(1)+M(u_{j})-\ell K}.$$

In particular, the total value $v(S\cap V_{\textnormal{precious}})$ of all precious items in the fixed optimal solution $S$ —the first of the two summands to be examined—lies in the interval $[(1-\varepsilon_{\textnormal{spread}})t,t]$ during the final level.

Using the advice information $f_{p}=\lfloor v(S\cap V_{\textnormal{paltry},p})/v(S)\rfloor_{\varepsilon_{\textnormal{round}}}$ on the value of the paltry items in the optimal solution appearing during each phase $p\in\{0,1,\dots,m\}$ (given as fractions of the optimal solution value, rounded down to the nearest multiple of $\varepsilon_{\textnormal{round}}$), the algorithm can narrow down the second summand $v(S\cap V_{\textnormal{paltry}})$ to $g\cdot v(S)$ for some factor $g\in[\sum_{p=0}^{m}f_{p},\sum_{p=0}^{m}(f_{p}+\varepsilon_{\textnormal{round}})]\subseteq[\sum_{p=0}^{m}f_{p},\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})+\sum_{p=0}^{m}f_{p}]$, where we have used once more the bound $m\leq 1/\varepsilon_{\textnormal{paltry}}$, derived via the total value of all precious items. Together with the bounds found for the first summand in the previous paragraph we obtain the following two inequalities, in which the optimal solution value $v(S)$ occurs three times:

$$(1-\varepsilon_{\textnormal{spread}})t+v(S)\sum_{p=0}^{m}f_{p}\leq v(S)\leq t+v(S)\Big(\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})+\sum_{p=0}^{m}f_{p}\Big).$$

From this we obtain for $v(S)$ a lower bound of $L_{\ell}=(1-\varepsilon_{\textnormal{spread}})t/(1-\sum_{p=0}^{m}f_{p})$ and the presumed upper bound $U_{\ell}=t/(1-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})-\sum_{p=0}^{m}f_{p})$, which is an actual upper bound only during the final level. The upper and lower bound lie a factor

$$\begin{aligned}
\frac{U_{\ell}}{L_{\ell}}&=\frac{1-\sum_{p=0}^{m}f_{p}}{(1-\varepsilon_{\textnormal{spread}})(1-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})-\sum_{p=0}^{m}f_{p})}\\
&\leq\frac{1-\sum_{p=0}^{m}f_{p}}{1-\varepsilon_{\textnormal{spread}}-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})-\sum_{p=0}^{m}f_{p}}\\
&=1+\frac{\varepsilon_{\textnormal{spread}}+\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}{1-\varepsilon_{\textnormal{spread}}-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})-\sum_{p=0}^{m}f_{p}}
\end{aligned}$$

apart. Note that this quotient no longer depends on $\ell$. Furthermore, we may assume that $\sum_{p=0}^{m}f_{p}<1-\varepsilon_{\textnormal{small}}+\varepsilon_{\textnormal{paltry}}$. This is because otherwise the algorithm could attain a fraction $\sum_{p=0}^{m}f_{p}-\varepsilon_{\textnormal{paltry}}\geq 1-\varepsilon_{\textnormal{small}}$ of the optimal solution value by packing paltry items exclusively in an otherwise purely yield-greedy manner uninhibited by any value limit; this special case is covered by the dedicated advice bit $b_{\textnormal{small}}$, which, when set to $1$, switches the restrictions applied to the algorithm’s yield-greedy approach from a value limit to a size limit, as already explained at the beginning of this proof. We can therefore use $1-\sum_{p=0}^{m}f_{p}>\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}$ to obtain

$$\begin{aligned}
\frac{U_{\ell}}{L_{\ell}}&\leq 1+\frac{\varepsilon_{\textnormal{spread}}+\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}-\varepsilon_{\textnormal{spread}}-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}\\
&=\frac{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}}{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}-\varepsilon_{\textnormal{spread}}-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}
\end{aligned}$$

and thus

$$\begin{aligned}
\frac{L_{\ell}}{U_{\ell}}&\geq\frac{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}-\varepsilon_{\textnormal{spread}}-\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}}\\
&=1-\frac{\varepsilon_{\textnormal{spread}}+\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})}{\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}}\\
&\geq 1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}},
\end{aligned}$$

where, for the last inequality, we used $\varepsilon_{\textnormal{spread}}\geq\varepsilon_{\textnormal{round}}(1+1/\varepsilon_{\textnormal{paltry}})$, which is satisfied by our choices of $\varepsilon_{\textnormal{spread}}=\varepsilon^{4}/2^{14}$, $\varepsilon_{\textnormal{round}}=\varepsilon^{6}/2^{20}$, and $\varepsilon_{\textnormal{paltry}}=\varepsilon^{2}/2^{5}$. In other words: During the final level the algorithm’s estimate $U_{\ell}$ is indeed an upper bound on $v(S)$ exceeding the real value by at most $U_{\ell}\cdot 2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}}$. The algorithm can thus derive from any hypothesized upper bound $U_{\ell}$ for $v(S)$ an actual lower bound $U_{\ell}(1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})$. We will put this insight to use when analyzing the competitive ratio of the algorithm.
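The bounds $L_{\ell}$ and $U_{\ell}$ and the final estimate on their quotient can be double-checked with exact rational arithmetic. The Python sketch below is only a sanity check for one assumed toy input ($\varepsilon=1/2$, $t=3$, and two rounded fractions of $1/4$ each); the names `proof_constants` and `value_bounds` are hypothetical.

```python
from fractions import Fraction

def proof_constants(eps):
    """The concrete choices of the four constants for a given eps."""
    eps = Fraction(eps)
    return {
        "small": eps / 2**3,
        "paltry": eps**2 / 2**5,
        "spread": eps**4 / 2**14,
        "round": eps**6 / 2**20,
    }

def value_bounds(t, f, c):
    """Return (L, U) for v(S), given the sum t and the rounded fractions f."""
    total_f = sum(f)
    slack = c["round"] * (1 + 1 / c["paltry"])
    lower = (1 - c["spread"]) * t / (1 - total_f)
    upper = t / (1 - slack - total_f)
    return lower, upper

c = proof_constants(Fraction(1, 2))
lower, upper = value_bounds(Fraction(3), [Fraction(1, 4), Fraction(1, 4)], c)
```

For this input, the quotient `lower / upper` stays above $1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}}$, in line with the derivation above.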

The Slot System.

In the precious part of the knapsack, the algorithm uses a slot system to store presumably precious items. There are $m$ slots, each of which can accommodate one such item, and all of them are empty in the beginning. They will be filled one by one, each with exactly one item. The items in filled slots may also be exchanged at some point. In contrast to what happens in the simpler algorithm for the proportional version of the problem, slots might also be emptied again when the algorithm decides to level up. The second difference is that there are now two stages to filling a slot. Any slot first has to be filled virtually before this filling is actualized at some point. The precise meaning of these two notions is explained later on. The order in which the slots are filled virtually is determined by the modulo classes of the precious items $u_{1},\dots,u_{m}$ in the optimal solution: Slot $j$ must be filled virtually after slots $1$ through $j-1$ and before the other slots. As soon as this condition is met, slot $j$ is filled virtually with the first matching item, that is, the first item from the same modulo class as $u_{j}$ . The order in which $m$ slots receive an actualized filling is encoded in the tuples $a_{0},a_{1},\dots,a_{m}$ given by the advice. A filling may be actualized long after or right when it has happened virtually, just never before that.

When exactly a slot is filled virtually, instead of just in which order, is described in the following paragraph; when exactly the fillings are actualized is explained after the discussion on packing the paltry part of the knapsack.

The Phase Progression.

We say that the algorithm is in phase $p$ if slot $p$ has been filled virtually but not yet slot $p+1$. Since all precious items are discarded when the algorithm is leveling up, all slots are empty again at the start of a new level, implying that the algorithm’s phase is reset to $0$ as well. In general, the current phase $p$ ends as soon as the instance ends or presents a presumably precious item $i$ satisfying the following two conditions. On the one hand, it is from the same modulo class as $u_{p+1}$; recall that we can compute $M(i)$ for any given item and know $M(u_{p+1})$ from the advice. On the other hand, the virtual algorithm can fit it into the precious part of the knapsack beside the items already present in the other slots, even if just virtually, and the packed paltry items, including the currently used fraction of any split item. If both of these conditions are met, said item $i$ is virtually packed into slot $p+1$. For any other presumably precious item $i$, it is first checked whether there are matching slots (that is, slots reserved for items from modulo class $M(i)$) filled with items larger than $i$. If this is the case, a largest of these items is evicted, either just virtually or actually, and item $i$ takes its place. Otherwise, the presumably precious item $i$ is discarded.
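The treatment of a single presumably precious item can be sketched as follows. This Python fragment is a simplified illustration, not the full algorithm: the function name `place_precious` is hypothetical, slots are indexed from $0$ instead of $1$, the list `wanted` stands for the advised modulo classes $M(u_{1}),\dots,M(u_{m})$, and the free capacity left beside the other packed items is assumed to be supplied by the caller.

```python
def place_precious(size, mod_class, slots, wanted, free_capacity):
    """Handle one presumably precious item; slots holds item sizes or None."""
    # Slots are filled in order, so the next slot to fill is the first empty one.
    next_empty = next((j for j, s in enumerate(slots) if s is None), None)
    if (next_empty is not None and wanted[next_empty] == mod_class
            and size <= free_capacity):
        slots[next_empty] = size  # the current phase ends here
        return ("filled", next_empty)
    # Otherwise, look for matching filled slots that hold a larger item.
    larger = [j for j, s in enumerate(slots)
              if s is not None and wanted[j] == mod_class and s > size]
    if larger:
        j = max(larger, key=lambda idx: slots[idx])
        slots[j] = size  # evict a largest such item and take its place
        return ("swapped", j)
    return ("discarded", None)
```

Replacing a stored item by a smaller one from the same modulo class costs at most a factor of $1-\varepsilon_{\textnormal{spread}}$ in value within the same comboclass but frees up space, which is the reason for preferring smaller items here.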

Packing the Paltry Part.

The paltry items in the instance are in principle packed in a yield-greedy manner, that is, optimizing the ratio of value to size on the paltry items. If there were no precious items, this would already guarantee a sufficient approximation to the optimal solution value; we would lose at most the value of a single paltry item. If there are precious items, however, then we need to restrict the reservation of paltry items such that they never block a crucial precious item from filling an empty slot. But too much restriction is bad as well; if we always favor the packing of precious items matching an empty slot over accruing paltry items, then we might discard too many paltry items; after all, it is impossible to give exact bounds on the sizes of the optimal precious items, and the paltry items can be infinitesimally small.
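As an illustration of the unrestricted yield-greedy strategy with removals, consider the following Python sketch. It is one possible reading under stated assumptions: the hypothetical function `yield_greedy` keeps the kept items sorted by yield and, whenever the capacity is exceeded, removes items of lowest yield until the rest fits. The value limits discussed next are not modeled.

```python
def yield_greedy(items, capacity):
    """Online yield-greedy packing with removals; items are (size, value) pairs."""
    kept = []
    for size, value in items:
        kept.append((size, value))
        # Highest yield (value per unit of size) first.
        kept.sort(key=lambda item: item[1] / item[0], reverse=True)
        # Remove items of lowest yield until the selection fits again.
        while sum(s for s, _ in kept) > capacity:
            kept.pop()
    return kept

packed = yield_greedy([(0.5, 1.0), (0.5, 0.5), (0.4, 1.2)], 1.0)
```

In this toy run, the third item has the highest yield and displaces the second one, which is removed for good.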

The right type of restriction keeps the total size of the reserved paltry items low enough to allow for filling the next empty slot when the right precious item appears but not more than a constant fraction below the optimal amount. Directly encoding the right volume bound into the advice is impossible, however, since the requisite precision grows indefinitely with decreasing item sizes. Instead, we can limit the value that we should be accumulating in the form of paltry items in each phase. This does not immediately bound the size occupied by these items, since we have no knowledge about the necessary yield, but will always do so just in time to fill all regular slots with precious items.

The value goal for each phase is indicated with a precision of $\varepsilon_{\textnormal{round}}$ and always rounded down. Additionally, this value goal is indicated relative to $v(S)$ , for which the algorithm has a lower bound $L_{\ell}$ that is too small by at most a factor of $2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}}$ as seen above. Since there are at most $1/\varepsilon_{\textnormal{paltry}}+1$ phases, the value lost due to this rounding can be bounded by

$$(1/\varepsilon_{\textnormal{paltry}}+1)(\varepsilon_{\textnormal{round}}+2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})v(S)\leq(2\varepsilon_{\textnormal{round}}+4\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})v(S)/\varepsilon_{\textnormal{paltry}},$$

which we can still tolerate, as shown by the competitive analysis at the end of this proof. However, the situation could be exacerbated by the fact that when discarding a single paltry item to get below the rounded value threshold, we could incur a loss of up to $\varepsilon_{\textnormal{paltry}}v(S)$, the maximum value of a paltry item, instead of only $(\varepsilon_{\textnormal{round}}+2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})v(S)$ in every phase. There are up to $1/\varepsilon_{\textnormal{paltry}}+1$ phases, so this could sum up to a total loss just shy of $v(S)$. In the following, we introduce what might be called a virtual version of our algorithm as a tool to circumvent this problem.
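The simplification in the rounding bound uses nothing more than $\varepsilon_{\textnormal{paltry}}\leq 1$. Spelled out as a worked step, with no symbols beyond those already defined in the text:

```latex
\begin{align*}
\frac{1}{\varepsilon_{\textnormal{paltry}}}+1
  &= \frac{1+\varepsilon_{\textnormal{paltry}}}{\varepsilon_{\textnormal{paltry}}}
   \leq \frac{2}{\varepsilon_{\textnormal{paltry}}}
   \qquad\text{since } \varepsilon_{\textnormal{paltry}}\leq 1,\\
\Bigl(\frac{1}{\varepsilon_{\textnormal{paltry}}}+1\Bigr)
\Bigl(\varepsilon_{\textnormal{round}}+\frac{2\varepsilon_{\textnormal{spread}}}{\varepsilon_{\textnormal{small}}}\Bigr)v(S)
  &\leq \frac{2}{\varepsilon_{\textnormal{paltry}}}
   \Bigl(\varepsilon_{\textnormal{round}}+\frac{2\varepsilon_{\textnormal{spread}}}{\varepsilon_{\textnormal{small}}}\Bigr)v(S)
   = \Bigl(2\varepsilon_{\textnormal{round}}+\frac{4\varepsilon_{\textnormal{spread}}}{\varepsilon_{\textnormal{small}}}\Bigr)\frac{v(S)}{\varepsilon_{\textnormal{paltry}}}.
\end{align*}
```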

The Virtual Algorithm.

Beside the actual advice algorithm producing our online solution, we consider a virtual version of the algorithm that has one huge, if imaginary, advantage: It may use a so-called splitting slot. The splitting slot can store one paltry item at any point, and the virtual algorithm may then split off from the item currently stored there any desired fraction to be used in the packing of the knapsack. The fraction used of the split item is allowed to change at any moment. However, the algorithm must always take the same fraction for the value and the size of an item, meaning that the split-off part retains the yield of the full item.

In other words, the algorithm can, for an arbitrary positive $r\leq 1$, treat an item $i$ of size $s_{i}$ and value $v_{i}$ in the splitting slot as though it were an item of size $r\cdot s_{i}$ and value $r\cdot v_{i}$, and $r$ can be adjusted arbitrarily at any time.

This enables the virtual algorithm to hit the previously described target value precisely and persistently, once it could have exceeded it, by maintaining in the splitting slot a paltry item of highest yield after those actually kept outside the splitting slot, and always taking just the right fraction to stay at the value limit as long as possible. The unused fraction is assumed to neither take up any space nor contribute any value.
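To make the role of the fraction $r$ concrete, the following sketch computes, for a fixed set of paltry items and a value limit, which items are kept whole and which fraction of the split item is used. It is an offline simplification and not the authors' procedure: it ignores the knapsack capacity and the online arrival order, and the function name `split_to_value_limit` and the `(size, value)` tuple encoding are assumptions made here for illustration.

```python
from fractions import Fraction
from typing import List, Optional, Tuple

Item = Tuple[Fraction, Fraction]  # (size, value), both assumed positive


def split_to_value_limit(
    items: List[Item], v_limit: Fraction
) -> Tuple[List[Item], Optional[Item], Fraction]:
    """Keep whole items greedily by yield, then a fraction r of one more item.

    Returns (whole, split_item, r): the items kept entirely, the item placed
    in the splitting slot (None if no split is needed), and the fraction r of
    it that is used so that the total value equals v_limit exactly.
    """
    # Highest yield first; ties are broken in favour of smaller items.
    order = sorted(items, key=lambda it: (-(it[1] / it[0]), it[0]))
    whole: List[Item] = []
    total = Fraction(0)
    for size, value in order:
        if total == v_limit:
            break  # the limit is already met with whole items only
        if total + value <= v_limit:
            whole.append((size, value))
            total += value
        else:
            # Taking the item whole would overshoot: use just the right fraction.
            r = (v_limit - total) / value
            return whole, (size, value), r
    return whole, None, Fraction(0)
```

The split-off part has size $r\cdot s_{i}$ and value $r\cdot v_{i}$, so it keeps the yield of the full item, as required of the splitting slot.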

There are two ways for an item to be forced out of the splitting slot. If items of higher yield—or identical yield but smaller ones—and sufficient total value appear, the present split item will become useless to the virtual algorithm, which then discards it for good. If, however, there is a phase change accompanied by an increase of the value limit, then the algorithm may decide to retrieve the entire item from the splitting slot, pack it in the knapsack regularly, and store an item of lower yield—or identical yield but lower size—in the splitting slot instead.

Having described the behavior of the virtual version of the algorithm, which uses the imagined capabilities of the splitting slot, we now explain how the actual algorithm acts based on the decisions taken by the virtual version.

The Actual Algorithm.

The actual algorithm is essentially trying to trace every action of its virtual version. However, it is not allowed to split any items and thus faced with a binary decision whenever the virtual algorithm stores an item in the splitting slot: either pack said item as a whole or discard it completely. Both choices come with their own set of problems.

It is the simpler option for the algorithm to discard any split item, as it then can still imitate the virtual version in every other respect, by operating as though any fraction from the split item were available at any time. In particular, the precious items would be handled absolutely identically by both versions of the algorithm. However, if too many split items kept by the virtual algorithm are discarded, then their total value could accumulate to a substantial part of the optimal solution value. Thus it will be necessary to store split items under certain circumstances.

This opens up the possibility for the second type of error, entailing even more serious consequences. If the actual algorithm packs an item in its entirety instead of only the fraction used by the virtual version, then there is less space left for precious items needed to fill up all slots in the end. The naïve approach of just following the virtual version in packing a precious item whenever feasible—that is, unless packed paltry items, including the used fraction of the split item, would need to be discarded—does not work here. This could entice the algorithm into filling a slot with a precious item much larger than what this slot will contain in the end, thus preventing the packing of rather large but still important paltry items into the splitting slot.

Instead, the algorithm uses the advice tuples $a_{0},a_{1},\dots,a_{m}$ to navigate the two issues in such a way that its final output is in fact identical to that of the virtual version, except for missing the one item remaining in the splitting slot at the end.

Specifically, the advice tuple $a_{j}=(a_{j,1},a_{j,2},\dots,a_{j,m_{j}})$ is used in phase $j$ as follows. The algorithm does not pack any precious items or split items by default. Once the virtual algorithm packs an item into slot $a_{j,1}$, however, the actual algorithm tries to follow suit. It might be unable to do so if it has stored items of a larger total size due to its inability to split items. If it can fill a new item into slot $a_{j,1}$ alongside the virtual version, however, then from this moment on, the algorithm starts actually packing into slot $a_{j,1}$ whatever the virtual version packs into it; we say that slot $a_{j,1}$ is now actualized. After actualizing slot $a_{j,1}$, the algorithm starts monitoring slot $a_{j,2}$, which might already be filled virtually at this point, but is not yet filled actually. Whenever a new item (assume that it is precious for the moment) is stored into $a_{j,2}$ by the virtual algorithm, the actual algorithm checks whether it could do the same. As soon as such an opportunity arises, it is taken, and slot $a_{j,2}$ is now also actualized. This continues until the tuple is exhausted, which means that no more slots will be actualized in this phase. If the advice tells the algorithm to actualize the splitting slot, which has the number $0$, with a paltry item, this all works identically, with two exceptions: First, if the virtual algorithm updates an already actualized item in the splitting slot, then all slots that were actualized after the splitting slot in the current phase are virtualized again, meaning that the actual algorithm discards the items stored in them. And second, the splitting slot is reverted to be virtual at the beginning of each new phase. Nevertheless, an item already stored in the splitting slot at the beginning of a phase remains there until it is replaced by a future item.
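The order in which slots are actualized within one phase can be pictured as a small state machine. The sketch below is a heavy simplification and not the authors' implementation: the class `PhaseTracker`, its method `on_virtual_pack`, and the Boolean `fits` supplied by the caller are hypothetical. It tracks a single phase only, and it assumes that re-virtualized slots are monitored again in the order given by the advice tuple, which the text does not state explicitly.

```python
from typing import List


class PhaseTracker:
    """Tracks which slots are actualized within one phase, following an advice tuple."""

    def __init__(self, advice_tuple: List[int]):
        self.advice = list(advice_tuple)   # slots to actualize, in this order
        self.actualized: List[int] = []    # slots actualized so far in this phase

    def on_virtual_pack(self, slot: int, fits: bool) -> str:
        """React to the virtual algorithm packing an item into `slot`.

        `fits` states whether the actual algorithm could pack the same item
        without exceeding the knapsack capacity (computed by the caller).
        """
        if slot in self.actualized:
            if slot == 0:
                # Updating an actualized splitting slot re-virtualizes every
                # slot that was actualized after it in the current phase.
                cut = self.actualized.index(0) + 1
                del self.actualized[cut:]
            return "follow"  # actualized slots mirror the virtual version
        q = len(self.actualized)  # position of the slot currently monitored
        if q < len(self.advice) and self.advice[q] == slot and fits:
            self.actualized.append(slot)
            return "actualize"
        return "ignore"  # not monitored yet, or no room: stay virtual
```

With the advice tuple `[2, 0, 3]`, for example, an item packed virtually into slot 3 is ignored until slots 2 and 0 have been actualized.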

We have now fully described our algorithm, whose pseudo-code is found in Algorithm 2. Note that both the value-restricted and the size-based yield-greedy procedure used for packing the paltry items are implemented by the function Greedy.
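As a reading aid for the function Greedy, here is a minimal executable sketch that follows the pseudo-code step by step. The `Item` record with an explicit `index` field, the use of `None` for an empty splitting slot (the pseudo-code uses an imaginary item of size and value zero instead), and the assumption that all sizes are positive are choices made here for illustration only.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Optional, Set, Tuple


@dataclass(frozen=True)
class Item:
    index: int       # position in the input, keeps equal-looking items distinct
    size: Fraction   # assumed positive
    value: Fraction


def greedy(packing: Set[Item], v_limit, s_limit,
           split: Optional[Item]) -> Tuple[Set[Item], Optional[Item]]:
    """Reduce a candidate packing to capacity 1 and value at most v_limit.

    Returns (kept, split): the items kept in full and the item removed last,
    which is the new content of the splitting slot.
    """
    items = set(packing)
    if split is not None:
        items.add(split)                                # include the current split item
    items = {it for it in items if it.size < s_limit}   # drop items of size >= s_limit
    while (sum(it.size for it in items) > 1
           or sum(it.value for it in items) > v_limit):
        lowest = min(it.value / it.size for it in items)           # lowest yield present
        low_yield = [it for it in items if it.value / it.size == lowest]
        split = min(low_yield, key=lambda it: it.size)             # a smallest among them
        items.remove(split)                                        # separate it out
    return items, split
```

Passing `float("inf")` as `v_limit` gives the size-based mode, and passing `float("inf")` as `s_limit` gives the value-restricted mode, mirroring how one function serves both procedures.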

Turning now to the analysis of the algorithm, we will first show that the algorithm performs sufficiently well in the special mode where items are filtered based on their size. Then we prove that the virtual version eventually fills all slots and always attains the value goal on the paltry items, resulting in a total value that surpasses our final goal with more than $\varepsilon_{\textnormal{paltry}}v(S)$ to spare. Finally, we prove that the actual algorithm can successfully re-create the solution by the virtual version up to potentially one single paltry item, which we can forgo without problem due to the leeway afforded by the virtual version.

Success of Size-Based Strategy.

We first consider the rather simple case that the special bit $b_{\textnormal{small}}$ is set to $1$ . The oracle does this if and only if a purely yield-greedy approach on the paltry items, ignoring all precious ones, is guaranteed to achieve a competitive ratio of $1/(1-\varepsilon_{\textnormal{small}})$ on the given instance. However, as mentioned before, the algorithm does not know for sure which items will turn out to be paltry in the end; thus it is implementing a size-based strategy instead in this case, packing only small items and dismissing the big ones.

Since the big items all have size at least $\varepsilon_{\textnormal{small}}$, at most $1/\varepsilon_{\textnormal{small}}$ of them fit into any feasible solution, in particular the $1/(1-\varepsilon_{\textnormal{small}})$-competitive solution that would have been produced by greedily packing only paltry items. Omitting from this solution, which contains only paltry items, the up to $1/\varepsilon_{\textnormal{small}}$ big items decreases its value by less than a fraction $\varepsilon_{\textnormal{paltry}}/\varepsilon_{\textnormal{small}}$ of the optimal solution value, since each paltry item is worth at most $\varepsilon_{\textnormal{paltry}}v(S)$. This means that we now have a solution containing only small items, worth at least a fraction $1-\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}/\varepsilon_{\textnormal{small}}$ of the optimal solution value. Running a yield-greedy strategy on the small items either reproduces this solution perfectly or it leaves, as soon as a packed item is discarded, an unfilled gap smaller than $\varepsilon_{\textnormal{small}}$. Since the yield is optimized for the filled part of the knapsack, this is a solution worth at least a fraction $(1-\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}/\varepsilon_{\textnormal{small}})(1-\varepsilon_{\textnormal{small}})\geq 1-\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{paltry}}/\varepsilon_{\textnormal{small}}-\varepsilon_{\textnormal{small}}$ of the optimal solution value, which for our choices of $\varepsilon_{\textnormal{small}}=\varepsilon/2^{3}$ and $\varepsilon_{\textnormal{paltry}}=\varepsilon^{2}/2^{5}$ is greater than $1-\varepsilon/2\geq 1/(1+\varepsilon)$. We can therefore assume $b_{\textnormal{small}}=0$ from now on.
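For the stated parameter choices, the final estimate of the size-based case can be checked by direct substitution. The following worked computation uses only the definitions $\varepsilon_{\textnormal{small}}=\varepsilon/2^{3}$ and $\varepsilon_{\textnormal{paltry}}=\varepsilon^{2}/2^{5}$ and shows why $1-\varepsilon/2\geq 1/(1+\varepsilon)$:

```latex
\begin{align*}
1-2\varepsilon_{\textnormal{small}}-\frac{\varepsilon_{\textnormal{paltry}}}{\varepsilon_{\textnormal{small}}}
  &= 1-\frac{2\varepsilon}{2^{3}}-\frac{\varepsilon^{2}/2^{5}}{\varepsilon/2^{3}}
   = 1-\frac{\varepsilon}{4}-\frac{\varepsilon}{4}
   = 1-\frac{\varepsilon}{2},\\
\Bigl(1-\frac{\varepsilon}{2}\Bigr)(1+\varepsilon)
  &= 1+\frac{\varepsilon}{2}-\frac{\varepsilon^{2}}{2}
   = 1+\frac{\varepsilon(1-\varepsilon)}{2}
   \geq 1
   \qquad\text{for } 0<\varepsilon\leq 1.
\end{align*}
```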

Success of Virtual Version.

Assuming that the algorithm already starts in its final level—which is proved to be possible without loss of generality in its own section on leveling up—we show that the virtual version completes all phases and hits its value goal on the paltry items precisely in each phase. Recall that after reaching the final level, the notions of paltry and precious coincide with provenly paltry and presumably precious, respectively.

Denote by $v_{\textnormal{paltry},j}=v(S\cap V_{\textnormal{paltry},j})$ and $s_{\textnormal{paltry},j}=s(S\cap V_{\textnormal{paltry},j})$ the total value and size, respectively, of the paltry items in the optimal solution $S$ appearing after $u_{j}$ but before $u_{j+1}$. Recall that the algorithm computes $f_{j}\cdot L_{\ell}$ as a fairly accurate lower bound on $v_{\textnormal{paltry},j}$, where $f_{j}=\lfloor v(S\cap V_{\textnormal{paltry},j})\rfloor_{\varepsilon_{\textnormal{round}}}$ is given directly by the advice and $L_{\ell}$ is the current lower bound on $v(S)$. Denote this lower bound by $v^{\prime}_{\textnormal{paltry},j}=f_{j}\cdot L_{\ell}$, and denote by $s^{\prime}_{\textnormal{paltry},j}$ the proportional lower bound on $s_{\textnormal{paltry},j}$, namely $s^{\prime}_{\textnormal{paltry},j}=s_{\textnormal{paltry},j}\cdot v^{\prime}_{\textnormal{paltry},j}/v_{\textnormal{paltry},j}$. The desired statement follows from a two-part claim by induction over $p\in\{0,1,\dots,m+1\}$, where we use once more our auxiliary definition of two imaginary items $u_{0}=0$ and $u_{m+1}=n+1$.

Induction Claim.

1. After the decision on item $u_{p}$, the first $p$ slots are filled, whether just virtually or already actualized, with items of total size at most $\sum_{j=1}^{p}s(u_{j})$.

2. Consider the paltry items in the knapsack when $u_{p}$ is presented, including the used fraction of any split item. Among these, select items in a manner greedy for high yield and secondarily for low size (allowing for any items to be split, regardless of whether they are in the splitting slot) and stop only if the total value reaches $\sum_{j=0}^{p-1}v^{\prime}_{\textnormal{paltry},j}$. The value of this selection is in fact exactly $\sum_{j=0}^{p-1}v^{\prime}_{\textnormal{paltry},j}$ and its size at most $\sum_{j=0}^{p-1}s^{\prime}_{\textnormal{paltry},j}$.

Base Case.

The base case $p=0$ is trivially true: Phase $0$ begins with the decision on the auxiliary item $u_{0}=0$, which does not fit any of the slots, which thus all remain empty. The second part of the claim is empty as well due to $\sum_{j=0}^{-1}v^{\prime}_{\textnormal{paltry},j}=\sum_{j=0}^{-1}s^{\prime}_{\textnormal{paltry},j}=0$.

Induction Step.

Let an arbitrary $p\in\{0,1,\dots,m\}$ be given. We assume the claim for all $p^{\prime}\in\{0,1,\dots,p\}$ as our induction hypothesis and use it to prove the statement for $p+1$. By the first part of the hypothesis, the algorithm is at least in phase $p$ when the items between $u_{p}$ and $u_{p+1}$ are presented. Therefore, the value limit for packing paltry items is already $\sum_{j=0}^{p}v^{\prime}_{\textnormal{paltry},j}$ or higher at the beginning of this period of the instance. The set $V_{\textnormal{paltry},p}$ of paltry items presented during this period contains a set of items, namely $S\cap V_{\textnormal{paltry},p}$, with total value $v_{\textnormal{paltry},p}$ and total size $s_{\textnormal{paltry},p}$. A yield-greedy online packing choosing from these items with a value limit $v^{\prime}_{\textnormal{paltry},p}$ achieves, if splitting items arbitrarily is allowed, a selection with a value of exactly $v^{\prime}_{\textnormal{paltry},p}$ and, since the selection has the highest possible yield, size at most $s^{\prime}_{\textnormal{paltry},p}$. We know from the second part of the hypothesis that at the beginning of the considered period the knapsack has already contained paltry items allowing for a selection of value $\sum_{j=0}^{p-1}v^{\prime}_{\textnormal{paltry},j}$ and size at most $\sum_{j=0}^{p-1}s^{\prime}_{\textnormal{paltry},j}$. The combination of this selection with the previously described one has value $\sum_{j=0}^{p}v^{\prime}_{\textnormal{paltry},j}$ and some size which is at most $\sum_{j=0}^{p}s^{\prime}_{\textnormal{paltry},j}$. Our algorithm will therefore also achieve at the end of the considered period a selection of this same value and some size that might be even lower, since it has the chance to use some items in $V_{\textnormal{paltry},p}\smallsetminus S$ of potentially greater yield, but never higher. This concludes the proof of the second part of the claim.

For the first part, it suffices to consider the case that the first $p+1$ slots do not yet contain items of total size at most $\sum_{j=1}^{p+1}s(u_{j})$ when item $u_{p+1}$ is presented. By the induction hypothesis, the first $p$ slots already contained items of total size at most $\sum_{j=1}^{p}s(u_{j})$ after the decision on $u_{p}$. This remains true until the instance ends since items in a slot are only ever replaced by smaller ones. If slot $p+1$ is already filled when $u_{p+1}$ is presented, then the item in this slot either has size at most $s(u_{p+1})$, or it is replaced by $u_{p+1}$, both of which mean that the induction claim for $p+1$ is satisfied. Finally, if slot $p+1\leq m$ is still empty upon the presentation of $u_{p+1}$, then we can combine the first part of our induction hypothesis and the already proven second part for $p+1$ to bound the total size of all packed items at this moment by

$$\sum_{j=0}^{p}(s(u_{j})+s^{\prime}_{\textnormal{paltry},j})\leq\sum_{j=0}^{p}(s(u_{j})+s_{\textnormal{paltry},j})\leq\sum_{j=0}^{m}(s(u_{j})+s_{\textnormal{paltry},j})-s(u_{p+1})=s(S)-s(u_{p+1}).$$

Since $S$ is a feasible solution, item $u_{p+1}$ will fill slot $p+1$, which concludes the proof of the first part of the claim, the induction step from $p$ to $p+1$, and thus the entire induction.

Success of Actual Algorithm

Having seen that the virtual version of the algorithm performs sufficiently well (the exact competitive analysis follows later on), we now prove that the actual algorithm successfully mimics this behavior in the following sense: Despite deviating from the virtual online solution by non-negligible values at some times, the actual algorithm produces a final output that is identical to that of the virtual solution $S^{\prime}$, except for the omission of one single paltry item, namely the one remaining in the splitting slot when the instance ends. We will see that the actual algorithm, which uses exactly the same phases and value limits as the virtual version, can use the advice to actualize any item that is part of the eventual solution immediately when it appears. The first key toward this is the following observation. During any given phase, the value limit used by the algorithm remains fixed. Hence, any item that is split at any point during a given phase either remains in the splitting slot until the end of this phase or, if it leaves the splitting slot before the phase has ended, it is discarded in favor of items of greater yield or identical yield but smaller size, as dictated by the yield-greedy approach for paltry items. This means that the paltry items remaining in the knapsack at the end of a phase, excluding the one in the splitting slot at this point, have never been in the splitting slot during the entire phase. By not actualizing the splitting slot, the algorithm will therefore not miss during this phase any paltry item from the eventual virtual solution $S^{\prime}$ but potentially the one item that is in the splitting slot at the end of the phase. Moreover, actualizing the splitting slot only upon appearance of this one item suffices to not miss any paltry item from $S^{\prime}$ in this phase. The algorithm can therefore focus on actualizing the precious items and paltry items that are in the splitting slot at the end of some phase, although it does not know beforehand which of the paltry items will turn out to be of this kind.

The second key, addressing this issue, is to actualize the slots in the right order, which is made possible by the advice. Denote by $w_{1},w_{2},\dots,w_{m^{\prime}}$ those items from the virtual solution $S^{\prime}$, excluding the item remaining in the splitting slot in the end, that are either precious or have been in the splitting slot at the end of some phase, in the order in which they appear during the instance. These are exactly the items that the algorithm might miss by not actualizing them. We inductively prove that the actual algorithm actualizes the items $w_{1},w_{2},\dots,w_{m^{\prime}}$.

Induction Claim.

Immediately after the decision on $w_{i}$ , the algorithm has actualized the items $w_{1},w_{2},\dots,w_{i}$ .

Induction Basis.

Using once more the auxiliary definition of an imaginary item $w_{0}=u_{0}=0$ of zero value and size, the claim for $i=0$ is empty.

Induction Step.

Our induction hypothesis is that the items $w_{1},w_{2},\dots,w_{i}$ are already packed and actualized immediately after the decision on $w_{i}$ . These items will never be replaced by the actual algorithm because the virtual version does not do so either since they are part of its eventual solution $S^{\prime}$ .

The advice tells us into which slot the virtual algorithm will pack $w_{i+1}$ in the current phase. Whenever the virtual version packs an item into this slot, the algorithm will try to actualize it. We only need to prove this to be possible without exceeding the given capacity.

If the oracle advises the algorithm to not actualize any item in the splitting slot in the current phase, this is trivial: The actual algorithm never actualizes anything not also fully packed by the virtual version, which never exceeds the knapsack capacity.

The more difficult case is that the virtual algorithm puts into the splitting slot during this phase some paltry item $i^{\prime}$ that will be part of the eventual virtual solution in its entirety rather than just as a split item. In the moment when this item is put into the splitting slot, the remaining packed paltry items have a total size that will never be undercut during the rest of the instance. This is because the only way to decrease the total size of the packed paltry items is to discard some of them, which would be possible only if the item of lower priority in the splitting slot were discarded beforehand by the virtual algorithm, which it cannot do since $i^{\prime}$ is part of the eventual solution. This means that $S^{\prime}$ contains paltry items of some total size at least as large as the size of paltry items packed when $i^{\prime}$ appears, plus the entire item $i^{\prime}$, plus all the items $w_{1},w_{2},\dots,w_{m^{\prime}}$. Thus the total size of $S^{\prime}$ is at least what the total size in the knapsack becomes for the actual algorithm when packing $i^{\prime}$ fully upon appearance, in addition to those items from $w_{1},w_{2},\dots,w_{m^{\prime}}$ that have appeared earlier and those yet to appear. It might happen that the algorithm has actualized the slot for a precious item $w_{j^{\prime}}$ before it has appeared, which could mean that this slot contains an item larger than $w_{j^{\prime}}$, which in turn could block the replacement of the item in the splitting slot by one that has higher yield, but is also larger. However, recall that in this case the algorithm just re-virtualizes the slots to be actualized after the splitting slot in the current phase, dismissing all precious items in them. Therefore, the algorithm can and will indeed always actualize $w_{i+1}$ upon appearance.

Competitive Analysis

We analyze the competitive ratios achieved on the precious items and the paltry items separately. During this, we assume that the algorithm starts in the final level and thus never discards previously packed items due to leveling up; this assumption is justified afterward.

We still denote by $S$ the optimal solution picked by the oracle to compute the advice, and let $T$ denote the final output of the online algorithm.

Precious Part.

It follows from the termination of all phases in the last level and the actualization of all slots that the precious part of the online solution $T$ contains exactly as many precious items in each value class as $S$ does. Each class spans a value interval whose lower and upper ends differ by a factor of exactly $1-\varepsilon_{\textnormal{spread}}$; thus we know that $v(T\cap V_{\textnormal{precious}})\geq(1-\varepsilon_{\textnormal{spread}})v(S\cap V_{\textnormal{precious}})$. The loss on the precious items is therefore at most a fraction $\varepsilon_{\textnormal{spread}}$ of the optimal solution value.

Paltry Part.

For the paltry items, we have already bounded the lost value to a fraction $(2\varepsilon_{\textnormal{round}}+4\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})/\varepsilon_{\textnormal{paltry}}$ of the optimal solution when discussing how the paltry part of the knapsack is packed using the splitting slot.

Leveling Up.

It remains to show that the potentially unbounded number of resets, which occur whenever the algorithm is leveling up, cause only a negligible value loss.

The first trigger causing a level change was the appearance of an item worth more than the items considered presumably precious so far. This trigger ensures that the algorithm never mistreats a precious item as paltry since doing so could mean missing a precious item that should have filled a slot. If a precious item is not already recognized as such upon arrival, the algorithm immediately levels up to rectify this and pack the item if necessary.

The second trigger for leveling up was that the algorithm can construct, from the items seen so far—whether they have been accepted or not—a valid solution whose value exceeds the upper bound $U_{\ell}$ on the optimal solution value $v(S)$ . The newest item is part of any such hypothetical solution—otherwise the level would have risen earlier—and when excluding it, the remaining possible solutions are all worth less than the current upper bound $U_{\ell}$ .

When a level change is occurring, independent of whether it was caused by the first or second trigger, this means that the estimate $U_{\ell}$ has in fact been far too low: It has been at most a fraction $(1-\varepsilon_{\textnormal{spread}})^{K-1}\leq\varepsilon_{\textnormal{paltry}}$ of what it is becoming now, as already noted in the section on notions and notation. Moreover, we have proved, when explaining how the optimal solution value $v(S)$ is estimated, that we can derive from any upper bound $U_{\ell}$ a guaranteed lower bound $L_{\ell}=U_{\ell}(1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})$.

Combining the three facts from the last two paragraphs, we see that the value of any solution that can be constructed from the items to be discarded with the level change is worth at most $\varepsilon_{\textnormal{paltry}}v(S)/(1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})$. If the final optimal solution contains any of these items, then omitting them means losing at most a fraction $\varepsilon_{\textnormal{paltry}}/(1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})<2\varepsilon_{\textnormal{paltry}}$ of the overall value, where the last inequality is satisfied for our choices of $\varepsilon_{\textnormal{small}}=\varepsilon/2^{3}$, $\varepsilon_{\textnormal{paltry}}=\varepsilon^{2}/2^{5}$, and $\varepsilon_{\textnormal{spread}}=\varepsilon^{4}/2^{14}$. While we might incur such a loss of a factor bounded by $2\varepsilon_{\textnormal{paltry}}$ with every new level change, the aggregate loss of the previous resets is fortunately also diminishing in relation to the estimate for $v(S)$ with every update of this estimate to a higher value. Hence the total fractional value loss caused by the potentially unbounded number of resets of the algorithm can be bounded by $\sum_{k=1}^{\infty}(2\varepsilon_{\textnormal{paltry}})^{k}=2\varepsilon_{\textnormal{paltry}}/(1-2\varepsilon_{\textnormal{paltry}})\leq 4\varepsilon_{\textnormal{paltry}}$. Deducting this fraction from the value of the final online solution, we are justified in assuming without loss of generality that the algorithm already starts at the final level.

Algorithm 2 implements all of this by resetting the input sequence to its initial state upon each level change, and then acting out what it would have done, had it been at the current level from the beginning. For all items encountered at past levels this is a mere simulation since all previously packed items are discarded during a reset. To allow for this simulation, the algorithm maintains a set $J_{\textnormal{seen}}$ of all items seen so far and writes them off as lost whenever the level changes. The lost items are remembered by storing during each reset the current $J_{\textnormal{seen}}$ as $J_{\textnormal{lost}}$. This set is then omitted from the online output so that the output contains only items that are actually packed.

Conclusion.

Combining the three types of potential losses discussed above (the one on the precious part, the one on the paltry part, and the one due to leveling up repeatedly) into a worst-case scenario, we are still left with at least a fraction $1-\varepsilon_{\textnormal{spread}}-(2\varepsilon_{\textnormal{round}}+4\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})/\varepsilon_{\textnormal{paltry}}-4\varepsilon_{\textnormal{paltry}}$ of the optimal solution value, which is greater than $1-\varepsilon/2\geq 1/(1+\varepsilon)$, concluding the proof.
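The final inequality can be checked numerically for concrete values of $\varepsilon$. The sketch below evaluates the guaranteed fraction with exact rational arithmetic, using the constants as initialized in Algorithm 2; the function name `guaranteed_fraction` is an assumption made here, and such a check for sample values illustrates the bound but does not replace the proof.

```python
from fractions import Fraction


def guaranteed_fraction(eps: Fraction) -> Fraction:
    """Fraction of the optimal value left after deducting all three losses."""
    eps_small = eps / 2**3
    eps_paltry = eps**2 / 2**5
    eps_spread = eps**4 / 2**14
    eps_round = eps**6 / 2**20
    precious_loss = eps_spread                                    # loss on the precious part
    paltry_loss = (2 * eps_round + 4 * eps_spread / eps_small) / eps_paltry
    level_loss = 4 * eps_paltry                                   # loss due to leveling up
    return 1 - precious_loss - paltry_loss - level_loss


if __name__ == "__main__":
    for eps in (Fraction(1, 2), Fraction(1, 4), Fraction(1, 100)):
        g = guaranteed_fraction(eps)
        # Each line shows eps, the guaranteed fraction, and the target 1/(1+eps).
        print(eps, float(g), float(1 / (1 + eps)))
```

Substituting the constants shows that the paltry loss simplifies to $\varepsilon/16+\varepsilon^{4}/2^{14}$ and the leveling loss to $\varepsilon^{2}/8$, so the total loss stays well below $\varepsilon/2$ for every $\varepsilon\in(0,1/2]$.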

Algorithm 2 ProPack

Parameter: Any $\varepsilon\in(0,1/2]$ .

Online Input: A sequence $I=((s_{1},v_{1}),\dots,(s_{n},v_{n}))$ with the sizes and values of $n$ items.

Online Output: A packing $T=(T_{\textnormal{paltry}}\cup\bigcup_{j=0}^{m}\{\,t_{j}\mid b_{\textnormal{actual},j}=1\,\})\smallsetminus J_{\textnormal{lost}}$ that never exceeds the knapsack capacity and is $(1+\varepsilon)$-competitive in the end.

Advice:

Algorithm:

\varepsilon_{\textnormal{small}}\leftarrow\varepsilon/2^{3};\quad\varepsilon_{% \textnormal{paltry}}\leftarrow\varepsilon^{2}/2^{5}

\triangleright

Initialize several constants …

\varepsilon_{\textnormal{spread}}\leftarrow\varepsilon^{4}/2^{14};\quad% \varepsilon_{\textnormal{round}}\leftarrow\varepsilon^{6}/2^{20}

\triangleright

… for the given parameter

\varepsilon

for

j\in\{0,1,\dots,m\}

\triangleright

Initialize the slots, where slot

0

is the splitting slot, as empty …

t_{j}\leftarrow 0

\triangleright

… by filling them with an imaginary item

0

of size and value zero, …

b_{\textnormal{actual},j}\leftarrow 0

\triangleright

… and initialize the state of the filling to be virtual only.

I_{\textnormal{copy}}\leftarrow I

\triangleright

Store a copy of the input.

T_{\textnormal{paltry}}\leftarrow\emptyset

\triangleright

Initialize to set of all completely packed paltry items to the empty set.

J_{\textnormal{seen}}\leftarrow\emptyset

;

J_{\textnormal{lost}}\leftarrow\emptyset

\triangleright

Initialize the sets of seen and lost items to the empty set.

p\leftarrow 0

\triangleright

Initialize the phase, equal to the number of virtually filled slots, to zero.

q\leftarrow 0

\triangleright

Initialize counter for the number of slot actualized during current phase to

0

\ell\leftarrow 1

\triangleright

Initialize the level, showing what the algorithm considers precious, to

1

function Greedy(

T^{\prime},v_{\textnormal{limit}},s_{\textnormal{limit}},i_{\textnormal{split}}

)

\triangleright

Greedily reduce the potential packing …

T^{\prime}\leftarrow T^{\prime}\cup\{i_{\textnormal{split}}\}

\triangleright

… passed to the function, including the split item

i_{\textnormal{split}}

, …

T^{\prime}\leftarrow\{i\in T^{\prime}\mid s(i)<s_{\textnormal{limit}}\,\}

\triangleright

… after discarding anything of size at least

s_{\textnormal{limit}}

, …

while

s(T^{\prime})>1

v(T^{\prime})>v_{\textnormal{limit}}

\triangleright

… to a valid solution of value at most

v_{\text{limit}}

T^{\prime}_{\textnormal{LowYield}}\leftarrow\operatorname{arg\,min}\{\,v(j)/s(% j)\mid j\in T^{\prime}\,\}

\triangleright

Find items of lowest yield.

i_{\textnormal{split}}\leftarrow\textrm{pop}(\operatorname{arg\,min}\{\,s(j)% \mid j\in T^{\prime}_{\textnormal{LowYield}}\,\})

\triangleright

Take a smallest among them.

T^{\prime}\leftarrow T^{\prime}\smallsetminus\{i_{\textnormal{split}}\}

\triangleright

Separate out from

T^{\prime}

the item for the splitting slot.

return

(T^{\prime},i_{\textnormal{split}})

\triangleright

Return the reduced solution and the current split item.

function VirtualSize

\triangleright

Returns virtual solution size, including used part of split item.

s_{\textnormal{temp}}\leftarrow s(T_{\textnormal{paltry}}\cup\bigcup_{j=1}^{m}% \{\,t_{j}\})

\triangleright

Size of current virtual solution, excluding split item.

r\leftarrow\min\{1,(1-s_{\textnormal{temp}})/s(t_{0})\}

\triangleright

Calculate the fraction

r

of split item to be used.

return

s(T_{\textnormal{paltry}}\cup\bigcup_{j=1}^{m}\{\,t_{j}\})+r\cdot s(t_{0})

\triangleright

Return the complete virtual solution.

function Actual

\triangleright

Returns online solution of packed paltry items and items in …

return

T_{\textnormal{paltry}}\cup\bigcup_{j=0}^{m}\{\,t_{j}\mid b_{\textnormal{actual},j}=1\,\}

\triangleright

… actualized slots, ignoring level losses.
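The two helper functions Greedy and VirtualSize from the listing above can be sketched in Python as follows. This is a minimal, literal transcription for experimentation, not the authors' implementation. The `Item` record, the use of `None` for an empty splitting slot, the tie-breaking by item name, and all function and parameter names are assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from fractions import Fraction
from math import inf


@dataclass(frozen=True)
class Item:
    """Hypothetical item record; sizes are fractions of the unit capacity."""
    name: str
    size: Fraction
    value: Fraction


def size_of(items):
    """Total size s(.) of a collection of items."""
    return sum(i.size for i in items)


def value_of(items):
    """Total value v(.) of a collection of items."""
    return sum(i.value for i in items)


def greedy(packing, v_limit, s_limit, split):
    """Reduce packing plus split item to size at most 1 and value at most v_limit.

    As in the listing, the split item passed in is returned unchanged (and
    stays in the packing) if nothing has to be removed.
    """
    pool = set(packing)
    if split is not None:
        pool.add(split)
    # Discard anything of size at least s_limit.
    pool = {i for i in pool if i.size < s_limit}
    while pool and (size_of(pool) > 1 or value_of(pool) > v_limit):
        # Find the items of lowest yield (value per size) ...
        lowest = min(i.value / i.size for i in pool)
        low_yield = [i for i in pool if i.value / i.size == lowest]
        # ... and take a smallest among them (ties broken by name, an assumption).
        split = min(low_yield, key=lambda i: (i.size, i.name))
        pool.remove(split)
    return pool, split


def virtual_size(paltry, slots, split):
    """Size of the virtual solution, including the used part of the split item."""
    s_temp = size_of(paltry) + size_of(t for t in slots if t is not None)
    if split is None:
        return s_temp
    # Fraction r of the split item that is used.
    r = min(1, (1 - s_temp) / split.size)
    return s_temp + r * split.size
```

Passing `inf` for `v_limit` or `s_limit` switches off the respective bound, mirroring the two calls of Greedy in the main loop. Exact `Fraction` arithmetic is used so that yield comparisons are not affected by rounding.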

Algorithm 2 ProPack (Continuation)

while

I\neq\emptyset

\triangleright

As long as the instance has not ended, …

i\leftarrow\textrm{pop}(I)

\triangleright

… take the next item in the input sequence.

if

b_{\textnormal{small}}=1

\triangleright

If it is advised to just pack small items of size at most

\varepsilon_{\textnormal{small}}

, …

(T_{\textnormal{paltry}},t_{0})\leftarrow\textsc{Greedy}(T_{\textnormal{paltry}}\cup\{i\},\infty,\varepsilon_{\textnormal{small}},t_{0})

\triangleright

… then do so greedily, …

continue

\triangleright

… and continue this way until the instance ends.

J_{\textnormal{seen}}\leftarrow J_{\textnormal{seen}}\cup\{i\}

\triangleright

Update the set of items seen so far.

b_{\textnormal{levelUp}}\leftarrow 0

\triangleright

Initialize the Boolean checking for a level change to zero.

if

W(i)>\ell

\triangleright

If index of new item’s comboclass is greater than current level, …

\ell\leftarrow W(i)

\triangleright

… then this index becomes the new level.

b_{\textnormal{levelUp}}\leftarrow 1

\triangleright

Flag the level change.

\triangleright

Compute estimated upper bound

U

for

v(S)

based on advice and current level

\ell

U\leftarrow v_{1}\sum_{j=1}^{m}(1-\varepsilon_{\textnormal{spread}})^{V(1)+M(u_{j})-\ell K}/(1-\varepsilon_{\textnormal{round}}(1+/\varepsilon_{\textnormal{paltry}})-\sum_{p^{\prime}=0}^{m}f_{p^{\prime}})

\triangleright

Compute optimal solution value attainable with items seen at the current level:

v_{\max}\leftarrow\max\{\,v(S^{\prime})\mid S^{\prime}\subseteq J,s(S^{\prime})\leq 1\,\}

if

v_{\max}>U

\triangleright

If this value exceeds the upper bound, …

\ell\leftarrow\ell+1

\triangleright

… then increase the level by one …

b_{\textnormal{levelUp}}\leftarrow 1

\triangleright

… and flag the level change.

if

b_{\textnormal{levelUp}}=1

\triangleright

If new item has triggered a level change, then …

p\leftarrow 0

;

q\leftarrow 0

;

T_{\textnormal{paltry}}\leftarrow\emptyset

;

t_{0}\leftarrow 0

\triangleright

… reset the algorithm by re-initializing …

for

j\in\{0,1,\dots,m\}

\triangleright

… almost everything, including …

t_{j}\leftarrow 0;b_{\textnormal{actual},j}\leftarrow 0

\triangleright

… the slots and their actualization states, …

J_{\textnormal{lost}}\leftarrow J_{\textnormal{seen}}

;

\triangleright

… excepting only the level and items seen so far, and, …

I\leftarrow I_{\textnormal{copy}}

\triangleright

… after restoring the input sequence to the original state, …

continue

\triangleright

… re-start the algorithm, remembering the level and all items lost.

if

W(i)<\ell

\triangleright

If the new item is provably paltry, then calculate …

L\leftarrow U(1-2\varepsilon_{\textnormal{spread}}/\varepsilon_{\textnormal{small}})

\triangleright

… a lower bound on

v(S)

close to

U

, and …

v_{\textnormal{valueLimit}}\leftarrow L\cdot\sum_{p^{\prime}=0}^{p}f_{p^{\prime}}

\triangleright

… use it to derive an upper bound on the …

\triangleright

… targeted total value of paltry items to be packed before

u_{p+1}

is presented.

t_{0}^{\prime}\leftarrow t_{0}

\triangleright

Store item in splitting slot to see if it will be changed …

(T_{\textnormal{paltry}},t_{0})\leftarrow\textsc{Greedy}(T_{\textnormal{paltry}}\cup\{i\},v_{\textnormal{valueLimit}},\infty,t_{0})

\triangleright

… by the update.

if

t_{0}^{\prime}\neq t_{0}

and

0\in\{a_{p,q},\dots,a_{p,m_{p}}\}

\triangleright

If so, and splitting slot is to be …

while

a_{p,q}\neq 0

\triangleright

… actualized or has been, then re-virtualize all …

b_{\textnormal{actual},q}\leftarrow 0

;

q\leftarrow q-1

\triangleright

… recent slots down to the splitting slot.

if

a_{p,q}=0\textbf{ and }s(\textsc{Actual})+s(t_{0})\leq 1

\triangleright

If advised and possible, …

b_{\textnormal{actual},0}\leftarrow 1

\triangleright

… then (re-)actualize the virtual slot, and …

q\leftarrow q+1

\triangleright

… increment the counter of slots actualized in the phase.

if

W(i)=\ell

\triangleright

If the newly arriving item is presumably precious …

if

M(i)=M(u_{p+1})

\triangleright

… and also matches the next empty slot …

if

\textsc{VirtualSize}+s(i)\leq 1

\triangleright

… without exceeding the knapsack capacity, …

j\leftarrow p+1

\triangleright

… then store the number of the slot to be filled virtually, …

p\leftarrow p+1

\triangleright

… enter the next phase, …

q\leftarrow 0

\triangleright

… and accordingly reset the actualization counter.

else

\triangleright

If new item does not match next empty slot, find among all …

J_{\textnormal{matches}}\leftarrow\{\,j^{\prime}\in\{1,2,\dots,m\}\mid M(t_{j^{\prime}})=M(i)\,\}

\triangleright

… matching slots …

j\leftarrow\textrm{pop}(\arg\max\{\,s(t_{j^{\prime}})\mid j^{\prime}\in J_{\textnormal{matches}}\,\})

\triangleright

… one with a largest item.

t_{j}\leftarrow i

\triangleright

Fill the new item into the right slot, replacing any present item.

if

a_{p,q}=j\textbf{ and }s(\textsc{Actual})+s(t_{j})\leq 1

\triangleright

If advised and possible, …

b_{\textnormal{actual},j}\leftarrow 1

\triangleright

… then actualize the new filling and …

q\leftarrow q+1

\triangleright

… update the actualization counter.

return

\textsc{Actual}\smallsetminus J_{\textnormal{lost}}

\triangleright

Return the solution, omitting any items from the lost levels.
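The level check in the main loop compares the best value attainable with the items seen so far against the estimated upper bound $U$. A minimal Python sketch of that step follows, assuming items are given as `(size, value)` pairs and that $U$ has already been computed elsewhere. The exhaustive subset search and the names `best_value` and `level_after_check` are illustrative assumptions. The listing only specifies the maximum, not how to compute it, and the search below takes exponential time.

```python
from fractions import Fraction
from itertools import combinations


def best_value(items, capacity=1):
    """Maximum total value of a subset of (size, value) pairs fitting the capacity."""
    best = 0
    for k in range(1, len(items) + 1):
        for subset in combinations(items, k):
            if sum(size for size, _ in subset) <= capacity:
                best = max(best, sum(value for _, value in subset))
    return best


def level_after_check(level, items, upper_bound):
    """Return the (possibly increased) level and the level-change flag."""
    if best_value(items) > upper_bound:
        return level + 1, True
    return level, False


# Hypothetical example data: three items given as (size, value) pairs.
seen = [(Fraction(1, 2), 3), (Fraction(1, 2), 4), (Fraction(3, 4), 6)]
new_level, leveled_up = level_after_check(1, seen, upper_bound=6)
```

When the flag is set, the listing resets the slots and restarts on the original input sequence while remembering the level and the items lost.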

References

[1] Richard Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, USA, 1957.
[2] Hans-Joachim Böckenhauer, Elisabet Burjons, Juraj Hromkovič, Henri Lotze, and Peter Rossmanith. Online simple knapsack with reservation costs. In Markus Bläser and Benjamin Monmege, editors, 38th International Symposium on Theoretical Aspects of Computer Science, STACS 2021, March 16–19, 2021, Saarbrücken, Germany (Virtual Conference), volume 187 of LIPIcs, pages 16:1–16:18, 2021.
[3] Hans-Joachim Böckenhauer, Dennis Komm, Rastislav Královič, Richard Královič, and Tobias Mömke. On the advice complexity of online problems. In ISAAC 2009, number 5878 in LNCS, pages 331–340, 2009.
[4] Hans-Joachim Böckenhauer, Dennis Komm, Richard Královič, and Peter Rossmanith. The online knapsack problem: Advice and randomization. Theoretical Computer Science, 527:61–72, 2014.
[5] Allan Borodin and Ran El-Yaniv. Online computation and competitive analysis. Cambridge University Press, 1998.
[6] Joan Boyar, Lene M. Favrholdt, Christian Kudahl, Kim S. Larsen, and Jesper W. Mikkelsen. Online algorithms with advice: A survey. ACM Comput. Surv., 50(2):19:1–19:34, 2017.
[7] Marek Cygan, Łukasz Jeż, and Jiří Sgall. Online knapsack revisited. Theory Comput. Syst., 58(1):153–190, 2016.
[8] George B. Dantzig. Discrete-variable extremum problems. Operations Research, 5(2):266–277, 1957.
[9] Yuval Emek, Pierre Fraigniaud, Amos Korman, and Adi Rosén. Online computation with advice. Theoretical Computer Science, 412(24):2642–2656, 2011.
[10] Xin Han, Yasushi Kawase, and Kazuhisa Makino. Online unweighted knapsack problem with removal cost. Algorithmica, 70(1):76–91, 2014.
[11] Xin Han, Yasushi Kawase, and Kazuhisa Makino. Randomized algorithms for online knapsack problems. Theoretical Computer Science, 562:395–405, 2015.
[12] Xin Han, Yasushi Kawase, Kazuhisa Makino, and He Guo. Online removable knapsack problem under convex function. Theoretical Computer Science, 540:62–69, 2014.
[13] Xin Han, Yasushi Kawase, Kazuhisa Makino, and Haruki Yokomaku. Online knapsack problems with a resource buffer. In Pinyan Lu and Guochuan Zhang, editors, 30th International Symposium on Algorithms and Computation, ISAAC 2019, December 8–11, 2019, Shanghai University of Finance and Economics, Shanghai, China, volume 149 of LIPIcs, pages 28:1–28:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019.
[14] Xin Han and Kazuhisa Makino. Online removable knapsack with limited cuts. Theoretical Computer Science, 411(44-46):3956–3964, 2010.
[15] Juraj Hromkovič, Rastislav Královič, and Richard Královič. Information complexity of online problems. In MFCS 2010, number 6281 in LNCS, pages 24–36. Springer, 2010.
[16] Oscar H. Ibarra and Chul E. Kim. Fast approximation algorithms for the knapsack and sum of subset problems. Journal of the ACM, 22(4), 1975.
[17] Kazuo Iwama and Shiro Taketomi. Removable online knapsack problems. In ICALP 2002, number 2380 in LNCS, pages 293–305. Springer, 2002.
[18] Kazuo Iwama and Guochuan Zhang. Online knapsack with resource augmentation. Information Processing Letters, 110(22):1016–1020, 2010.
[19] Richard M. Karp. Reducibility among combinatorial problems. In Complexity of computer computations, pages 85–103. Plenum, 1972.
[20] Dennis Komm. An Introduction to Online Computation – Determinism, Randomization, Advice. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2016.
[21] Alberto Marchetti-Spaccamela and Carlo Vercellis. Stochastic on-line knapsack problems. Mathematical Programming, 68:73–104, 1995.
[22] John Noga and Veerawan Sarbua. An online partially fractional knapsack problem. In 8th International Symposium on Parallel Architectures, Algorithms, and Networks, ISPAN 2005, December 7–9, 2005, Las Vegas, Nevada, USA, pages 108–112. IEEE Computer Society, 2005.
[23] Herbert Ellis Robbins. A remark on Stirling’s formula. American Mathematical Monthly, 62:26–29, 1955.
[24] Peter Rossmanith. On the advice complexity of online edge- and node-deletion problems. In Adventures Between Lower Bounds and Higher Altitudes – Essays Dedicated to Juraj Hromkovič on the Occasion of His 60th Birthday, number 11011 in LNCS, pages 449–462. Springer, 2018.

Case	Strategy	Competitivity	Case Conditions
A	One	$1/d$	$|P_{\textnormal{huge}}|>0$
B	One/Two	$d/c$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|>0$	$|P_{\textnormal{medium}}|\leq 1$
C	Two	$1/2b$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|\geq 0$	$|P_{\textnormal{medium}}|>1$
D	Two	$1/(a+b)$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|=1$	$|P_{\textnormal{small}}|>0$
E	Two	$b/a$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|=0$	$|P_{\textnormal{small}}|>0$
F	Two	$1/(1-a)$	$|P_{\textnormal{huge}}|=0$	$|S\cap P_{\textnormal{big}}|=0$	$|P_{\textnormal{medium}}|\leq 1$	$|P_{\textnormal{small}}|=0$