-
A tight lower bound on non-adaptive group testing estimation
Authors:
Nader H. Bshouty,
Tsun-Ming Cheung,
Gergely Harcos,
Hamed Hatami,
Anthony Ostuni
Abstract:
Efficiently counting or detecting defective items is a crucial task in various fields ranging from biological testing to quality control to streaming algorithms. The \emph{group testing estimation problem} concerns estimating the number of defective elements $d$ in a collection of $n$ total within a given factor. We primarily consider the classical query model, in which a query reveals whether the…
▽ More
Efficiently counting or detecting defective items is a crucial task in various fields ranging from biological testing to quality control to streaming algorithms. The \emph{group testing estimation problem} concerns estimating the number of defective elements $d$ in a collection of $n$ total within a given factor. We primarily consider the classical query model, in which a query reveals whether the selected group of elements contains a defective one. We show that any non-adaptive randomized algorithm that estimates the value of $d$ within a constant factor requires $Ω(\log n)$ queries. This confirms that a known $O(\log n)$ upper bound by Bshouty (2019) is tight and resolves a conjecture by Damaschke and Sheikh Muhammad (2010). Additionally, we prove similar matching upper and lower bounds in the threshold query model.
△ Less
Submitted 6 December, 2023; v1 submitted 18 September, 2023;
originally announced September 2023.
-
A Tight Lower Bound of $Ω(\log n)$ for the Estimation of the Number of Defective Items
Authors:
Nader H. Bshouty,
Gergely Harcos
Abstract:
Let $X$ be a set of items of size $n$ , which may contain some defective items denoted by $I$, where $I \subseteq X$. In group testing, a {\it test} refers to a subset of items $Q \subset X$. The test outcome is $1$ (positive) if $Q$ contains at least one defective item, i.e., $Q\cap I \neq \emptyset$, and $0$ (negative) otherwise. We give a novel approach to obtaining tight lower bounds in non-ad…
▽ More
Let $X$ be a set of items of size $n$ , which may contain some defective items denoted by $I$, where $I \subseteq X$. In group testing, a {\it test} refers to a subset of items $Q \subset X$. The test outcome is $1$ (positive) if $Q$ contains at least one defective item, i.e., $Q\cap I \neq \emptyset$, and $0$ (negative) otherwise. We give a novel approach to obtaining tight lower bounds in non-adaptive randomized group testing. Employing this new method, we can prove the following result. Any non-adaptive randomized algorithm that, for any set of defective items $I$, with probability at least $2/3$, returns an estimate of the number of defective items $|I|$ to within a constant factor requires at least $Ω({\log n})$ tests. Our result matches the upper bound of $O(\log n)$ and solves the open problem posed by Damaschke and Sheikh Muhammad.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Improved Lower Bound for Estimating the Number of Defective Items
Authors:
Nader H. Bshouty
Abstract:
Let $X$ be a set of items of size $n$ that contains some defective items, denoted by $I$, where $I \subseteq X$. In group testing, a {\it test} refers to a subset of items $Q \subset X$. The outcome of a test is $1$ if $Q$ contains at least one defective item, i.e., $Q\cap I \neq \emptyset$, and $0$ otherwise.
We give a novel approach to obtaining lower bounds in non-adaptive randomized group te…
▽ More
Let $X$ be a set of items of size $n$ that contains some defective items, denoted by $I$, where $I \subseteq X$. In group testing, a {\it test} refers to a subset of items $Q \subset X$. The outcome of a test is $1$ if $Q$ contains at least one defective item, i.e., $Q\cap I \neq \emptyset$, and $0$ otherwise.
We give a novel approach to obtaining lower bounds in non-adaptive randomized group testing. The technique produced lower bounds that are within a factor of $1/{\log\log\stackrel{k}{\cdots}\log n}$ of the existing upper bounds for any constant~$k$. Employing this new method, we can prove the following result.
For any fixed constants $k$, any non-adaptive randomized algorithm that, for any set of defective items $I$, with probability at least $2/3$, returns an estimate of the number of defective items $|I|$ to within a constant factor requires at least $$Ω\left(\frac{\log n}{\log\log\stackrel{k}{\cdots}\log n}\right)$$ tests.
Our result almost matches the upper bound of $O(\log n)$ and solves the open problem posed by Damaschke and Sheikh Muhammad [COCOA 2010 and Discrete Math., Alg. and Appl., 2010]. Additionally, it improves upon the lower bound of $Ω(\log n/\log\log n)$ previously established by Bshouty [ISAAC 2019].
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
On Detecting Some Defective Items in Group Testing
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon
Abstract:
Group testing is an approach aimed at identifying up to $d$ defective items among a total of $n$ elements. This is accomplished by examining subsets to determine if at least one defective item is present. In our study, we focus on the problem of identifying a subset of $\ell\leq d$ defective items. We develop upper and lower bounds on the number of tests required to detect $\ell$ defective items i…
▽ More
Group testing is an approach aimed at identifying up to $d$ defective items among a total of $n$ elements. This is accomplished by examining subsets to determine if at least one defective item is present. In our study, we focus on the problem of identifying a subset of $\ell\leq d$ defective items. We develop upper and lower bounds on the number of tests required to detect $\ell$ defective items in both the adaptive and non-adaptive settings while considering scenarios where no prior knowledge of $d$ is available, and situations where an estimate of $d$ or at least some non-trivial upper bound on $d$ is available.
When no prior knowledge on $d$ is available, we prove a lower bound of $ Ω(\frac{\ell \log^2n}{\log \ell +\log\log n})$ tests in the randomized non-adaptive settings and an upper bound of $O(\ell \log^2 n)$ for the same settings. Furthermore, we demonstrate that any non-adaptive deterministic algorithm must ask $Θ(n)$ tests, signifying a fundamental limitation in this scenario. For adaptive algorithms, we establish tight bounds in different scenarios. In the deterministic case, we prove a tight bound of $Θ(\ell\log{(n/\ell)})$. Moreover, in the randomized settings, we derive a tight bound of $Θ(\ell\log{(n/d)})$.
When $d$, or at least some non-trivial estimate of $d$, is known, we prove a tight bound of $Θ(d\log (n/d))$ for the deterministic non-adaptive settings, and $Θ(\ell\log(n/d))$ for the randomized non-adaptive settings. In the adaptive case, we present an upper bound of $O(\ell \log (n/\ell))$ for the deterministic settings, and a lower bound of $Ω(\ell\log(n/d)+\log n)$. Additionally, we establish a tight bound of $Θ(\ell \log(n/d))$ for the randomized adaptive settings.
△ Less
Submitted 27 June, 2023;
originally announced July 2023.
-
Superpolynomial Lower Bounds for Learning Monotone Classes
Authors:
Nader H. Bshouty
Abstract:
Koch, Strassle, and Tan [SODA 2023], show that, under the randomized exponential time hypothesis, there is no distribution-free PAC-learning algorithm that runs in time $n^{\tilde O(\log\log s)}$ for the classes of $n$-variable size-$s$ DNF, size-$s$ Decision Tree, and $\log s$-Junta by DNF (that returns a DNF hypothesis). Assuming a natural conjecture on the hardness of set cover, they give the l…
▽ More
Koch, Strassle, and Tan [SODA 2023], show that, under the randomized exponential time hypothesis, there is no distribution-free PAC-learning algorithm that runs in time $n^{\tilde O(\log\log s)}$ for the classes of $n$-variable size-$s$ DNF, size-$s$ Decision Tree, and $\log s$-Junta by DNF (that returns a DNF hypothesis). Assuming a natural conjecture on the hardness of set cover, they give the lower bound $n^{Ω(\log s)}$. This matches the best known upper bound for $n$-variable size-$s$ Decision Tree, and $\log s$-Junta.
In this paper, we give the same lower bounds for PAC-learning of $n$-variable size-$s$ Monotone DNF, size-$s$ Monotone Decision Tree, and Monotone $\log s$-Junta by~DNF. This solves the open problem proposed by Koch, Strassle, and Tan and subsumes the above results.
The lower bound holds, even if the learner knows the distribution, can draw a sample according to the distribution in polynomial time, and can compute the target function on all the points of the support of the distribution in polynomial time.
△ Less
Submitted 30 January, 2023; v1 submitted 20 January, 2023;
originally announced January 2023.
-
A Note on Property Testing of the Binary Rank
Authors:
Nader H. Bshouty
Abstract:
Let $M$ be a $n\times m$ $(0,1)$-matrix. We define the $s$-binary rank, $br_s(M)$, of $M$ to be the minimal integer $d$ such that there are $d$ monochromatic rectangles that cover all the $1$-entries in the matrix, and each $1$-entry is covered by at most $s$ rectangles. When $s=1$, this is the binary rank,~$br(M)$, known from the literature. Let $R(M)$ and $C(M)$ be the set of rows and columns of…
▽ More
Let $M$ be a $n\times m$ $(0,1)$-matrix. We define the $s$-binary rank, $br_s(M)$, of $M$ to be the minimal integer $d$ such that there are $d$ monochromatic rectangles that cover all the $1$-entries in the matrix, and each $1$-entry is covered by at most $s$ rectangles. When $s=1$, this is the binary rank,~$br(M)$, known from the literature. Let $R(M)$ and $C(M)$ be the set of rows and columns of~$M$, respectively. We use the result of Sgall (Comb. 1999) to prove that if $M$ has $s$-binary rank at most~$d$, then $|R(M)|\cdot |C(M)|\le {d\choose \le s}2^{d}$ where ${d\choose \le s}=\sum_{i=0}^s{d\choose i}$. This bound is tight; that is, there exists a matrix $M'$ of $s$-binary rank $d$ such that $|R(M')|\cdot |C(M')|= {d\choose \le s}2^{d}$. Using this result, we give a new one-sided adaptive and non-adaptive testers for $(0,1)$-matrices of $s$-binary rank at most $d$ (and exactly $d$) that makes $\tilde O\left({d\choose \le s}2^d/ε\right)$ and $\tilde O\left({d\choose \le s}2^d/ε^2\right)$ queries, respectively. For a fixed $s$, this improves the query complexity of the tester of Parnas et al. (Theory Comput. Syst. 2021) by a factor of $\tilde Θ(2^d)$.
△ Less
Submitted 11 January, 2023;
originally announced January 2023.
-
Almost Optimal Proper Learning and Testing Polynomials
Authors:
Nader H. Bshouty
Abstract:
We give the first almost optimal polynomial-time proper learning algorithm of Boolean sparse multivariate polynomial under the uniform distribution. For $s$-sparse polynomial over $n$ variables and $ε=1/s^β$, $β>1$, our algorithm makes $$q_U=\left(\frac{s}ε\right)^{\frac{\log β}β+O(\frac{1}β)}+ \tilde O\left(s\right)\left(\log\frac{1}ε\right)\log n$$ queries. Notice that our query complexity is su…
▽ More
We give the first almost optimal polynomial-time proper learning algorithm of Boolean sparse multivariate polynomial under the uniform distribution. For $s$-sparse polynomial over $n$ variables and $ε=1/s^β$, $β>1$, our algorithm makes $$q_U=\left(\frac{s}ε\right)^{\frac{\log β}β+O(\frac{1}β)}+ \tilde O\left(s\right)\left(\log\frac{1}ε\right)\log n$$ queries. Notice that our query complexity is sublinear in $1/ε$ and almost linear in $s$. All previous algorithms have query complexity at least quadratic in $s$ and linear in $1/ε$.
We then prove the almost tight lower bound $$q_L=\left(\frac{s}ε\right)^{\frac{\log β}β+Ω(\frac{1}β)}+ Ω\left(s\right)\left(\log\frac{1}ε\right)\log n,$$
Applying the reduction in~\cite{Bshouty19b} with the above algorithm, we give the first almost optimal polynomial-time tester for $s$-sparse polynomial. Our tester, for $β>3.404$, makes $$\tilde O\left(\frac{s}ε\right)$$ queries.
△ Less
Submitted 7 February, 2022;
originally announced February 2022.
-
On Learning and Testing Decision Tree
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon
Abstract:
In this paper, we study learning and testing decision tree of size and depth that are significantly smaller than the number of attributes $n$.
Our main result addresses the problem of poly$(n,1/ε)$ time algorithms with poly$(s,1/ε)$ query complexity (independent of $n$) that distinguish between functions that are decision trees of size $s$ from functions that are $ε$-far from any decision tree o…
▽ More
In this paper, we study learning and testing decision tree of size and depth that are significantly smaller than the number of attributes $n$.
Our main result addresses the problem of poly$(n,1/ε)$ time algorithms with poly$(s,1/ε)$ query complexity (independent of $n$) that distinguish between functions that are decision trees of size $s$ from functions that are $ε$-far from any decision tree of size $φ(s,1/ε)$, for some function $φ> s$. The best known result is the recent one that follows from Blank, Lange and Tan,~\cite{BlancLT20}, that gives $φ(s,1/ε)=2^{O((\log^3s)/ε^3)}$. In this paper, we give a new algorithm that achieves $φ(s,1/ε)=2^{O(\log^2 (s/ε))}$.
Moreover, we study the testability of depth-$d$ decision tree and give a {\it distribution free} tester that distinguishes between depth-$d$ decision tree and functions that are $ε$-far from depth-$d^2$ decision tree. In particular, for decision trees of size $s$, the above result holds in the distribution-free model when the tree depth is $O(\log(s/ε))$.
We also give other new results in learning and testing of size-$s$ decision trees and depth-$d$ decision trees that follow from results in the literature and some results we prove in this paper.
△ Less
Submitted 10 August, 2021;
originally announced August 2021.
-
Optimal Deterministic Group Testing Algorithms to Estimate the Number of Defectives
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon
Abstract:
We study the problem of estimating the number of defective items $d$ within a pile of $n$ elements up to a multiplicative factor of $Δ>1$, using deterministic group testing algorithms. We bring lower and upper bounds on the number of tests required in both the adaptive and the non-adaptive deterministic settings given an upper bound $D$ on the defectives number. For the adaptive deterministic sett…
▽ More
We study the problem of estimating the number of defective items $d$ within a pile of $n$ elements up to a multiplicative factor of $Δ>1$, using deterministic group testing algorithms. We bring lower and upper bounds on the number of tests required in both the adaptive and the non-adaptive deterministic settings given an upper bound $D$ on the defectives number. For the adaptive deterministic settings, our results show that, any algorithm for estimating the defectives number up to a multiplicative factor of $Δ$ must make at least $Ω\left((D/Δ^2)\log (n/D) \right )$ tests. This extends the same lower bound achieved in \cite{ALA17} for non-adaptive algorithms. Moreover, we give a polynomial time adaptive algorithm that shows that our bound is tight up to a small additive term.
For non-adaptive algorithms, an upper bound of $O((D/Δ^2)$ $(\log (n/D)+\log Δ) )$ is achieved by means of non-constructive proof. This improves the lower bound $O((\log D)/(\logΔ))D\log n)$ from \cite{ALA17} and matches the lower bound up to a small additive term.
In addition, we study polynomial time constructive algorithms. We use existing polynomial time constructible \emph{expander regular bipartite graphs}, \emph{extractors} and \emph{condensers} to construct two polynomial time algorithms. The first algorithm makes $O((D^{1+o(1)}/Δ^2)\cdot \log n)$ tests, and the second makes $(D/Δ^2)\cdot quazipoly$ $(\log n)$ tests. This is the first explicit construction with an almost optimal test complexity.
△ Less
Submitted 5 September, 2020;
originally announced September 2020.
-
An Optimal Tester for $k$-Linear
Authors:
Nader H. Bshouty
Abstract:
A Boolean function $f:\{0,1\}^n\to \{0,1\}$ is $k$-linear if it returns the sum (over the binary field $F_2$) of $k$ coordinates of the input. In this paper, we study property testing of the classes $k$-Linear, the class of all $k$-linear functions, and $k$-Linear$^*$, the class $\cup_{j=0}^kj$-Linear. We give a non-adaptive distribution-free two-sided $ε$-tester for $k$-Linear that makes…
▽ More
A Boolean function $f:\{0,1\}^n\to \{0,1\}$ is $k$-linear if it returns the sum (over the binary field $F_2$) of $k$ coordinates of the input. In this paper, we study property testing of the classes $k$-Linear, the class of all $k$-linear functions, and $k$-Linear$^*$, the class $\cup_{j=0}^kj$-Linear. We give a non-adaptive distribution-free two-sided $ε$-tester for $k$-Linear that makes $$O\left(k\log k+\frac{1}ε\right)$$ queries. This matches the lower bound known from the literature.
We then give a non-adaptive distribution-free one-sided $ε$-tester for $k$-Linear$^*$ that makes the same number of queries and show that any non-adaptive uniform-distribution one-sided $ε$-tester for $k$-Linear must make at least $ \tildeΩ(k)\log n+Ω(1/ε)$ queries. The latter bound, almost matches the upper bound $O(k\log n+1/ε)$ known from the literature. We then show that any adaptive uniform-distribution one-sided $ε$-tester for $k$-Linear must make at least $\tildeΩ(\sqrt{k})\log n+Ω(1/ε)$ queries.
△ Less
Submitted 8 June, 2020;
originally announced June 2020.
-
Lecture Note on LCSSX's Lower Bounds for Non-Adaptive Distribution-free Property Testing
Authors:
Nader H. Bshouty
Abstract:
In this lecture note we give Liu-Chen-Servedio-Sheng-Xie's (LCSSX) lower bound for property testing in the non-adaptive distribution-free.
In this lecture note we give Liu-Chen-Servedio-Sheng-Xie's (LCSSX) lower bound for property testing in the non-adaptive distribution-free.
△ Less
Submitted 9 April, 2020;
originally announced April 2020.
-
Optimal Randomized Group Testing Algorithm to Determine the Number of Defectives
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon,
Raghd Boulos,
Foad Moalem,
Jalal Nada,
Elias Noufi,
Yara Zaknoon
Abstract:
We study the problem of determining exactly the number of defective items in an adaptive Group testing by using a minimum number of tests. We improve the existing algorithm and prove a lower bound that shows that the number of tests in our algorithm is optimal up to small additive terms.
We study the problem of determining exactly the number of defective items in an adaptive Group testing by using a minimum number of tests. We improve the existing algorithm and prove a lower bound that shows that the number of tests in our algorithm is optimal up to small additive terms.
△ Less
Submitted 2 January, 2020;
originally announced January 2020.
-
Bounds for the Number of Tests in Non-Adaptive Randomized Algorithms for Group Testing
Authors:
Nader H. Bshouty,
George Haddad,
Catherine A. Haddad-Zaknoon
Abstract:
We study the group testing problem with non-adaptive randomized algorithms. Several models have been discussed in the literature to determine how to randomly choose the tests. For a model ${\cal M}$, let $m_{\cal M}(n,d)$ be the minimum number of tests required to detect at most $d$ defectives within $n$ items, with success probability at least $1-δ$, for some constant $δ$. In this paper, we study…
▽ More
We study the group testing problem with non-adaptive randomized algorithms. Several models have been discussed in the literature to determine how to randomly choose the tests. For a model ${\cal M}$, let $m_{\cal M}(n,d)$ be the minimum number of tests required to detect at most $d$ defectives within $n$ items, with success probability at least $1-δ$, for some constant $δ$. In this paper, we study the measures $$c_{\cal M}(d)=\lim_{n\to \infty} \frac{m_{\cal M}(n,d)}{\ln n} \mbox{ and } c_{\cal M}=\lim_{d\to \infty} \frac{c_{\cal M}(d)}{d}.$$
In the literature, the analyses of such models only give upper bounds for $c_{\cal M}(d)$ and $c_{\cal M}$, and for some of them, the bounds are not tight. We give new analyses that yield tight bounds for $c_{\cal M}(d)$ and $c_{\cal M}$ for all the known models~${\cal M}$.
△ Less
Submitted 5 November, 2019;
originally announced November 2019.
-
Almost Optimal Testers for Concise Representations
Authors:
Nader H. Bshouty
Abstract:
We give improved and almost optimal testers for several classes of Boolean functions on $n$ inputs that have concise representation in the uniform and distribution-free model. Classes, such as $k$-junta, $k$-linear functions, $s$-term DNF, $s$-term monotone DNF, $r$-DNF, decision list, $r$-decision list, size-$s$ decision tree, size-$s$ Boolean formula, size-$s$ branching programs, $s$-sparse poly…
▽ More
We give improved and almost optimal testers for several classes of Boolean functions on $n$ inputs that have concise representation in the uniform and distribution-free model. Classes, such as $k$-junta, $k$-linear functions, $s$-term DNF, $s$-term monotone DNF, $r$-DNF, decision list, $r$-decision list, size-$s$ decision tree, size-$s$ Boolean formula, size-$s$ branching programs, $s$-sparse polynomials over the binary field and function with Fourier degree at most $d$. The method can be extended to several other classes of functions over any domain that can be approximated by functions that have a small number of relevant variables.
△ Less
Submitted 21 June, 2023; v1 submitted 22 April, 2019;
originally announced April 2019.
-
Adaptive Exact Learning of Decision Trees from Membership Queries
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon
Abstract:
In this paper we study the adaptive learnability of decision trees of depth at most $d$ from membership queries. This has many applications in automated scientific discovery such as drugs development and software update problem. Feldman solves the problem in a randomized polynomial time algorithm that asks $\tilde O(2^{2d})\log n$ queries and Kushilevitz-Mansour in a deterministic polynomial time…
▽ More
In this paper we study the adaptive learnability of decision trees of depth at most $d$ from membership queries. This has many applications in automated scientific discovery such as drugs development and software update problem. Feldman solves the problem in a randomized polynomial time algorithm that asks $\tilde O(2^{2d})\log n$ queries and Kushilevitz-Mansour in a deterministic polynomial time algorithm that asks $ 2^{18d+o(d)}\log n$ queries. We improve the query complexity of both algorithms. We give a randomized polynomial time algorithm that asks $\tilde O(2^{2d}) + 2^{d}\log n$ queries and a deterministic polynomial time algorithm that asks $2^{5.83d}+2^{2d+o(d)}\log n$ queries.
△ Less
Submitted 23 January, 2019;
originally announced January 2019.
-
Almost Optimal Distribution-free Junta Testing
Authors:
Nader H. Bshouty
Abstract:
We consider the problem of testing whether an unknown $n$-variable Boolean function is a $k$-junta in the distribution-free property testing model, where the distance between function is measured with respect to an arbitrary and unknown probability distribution over $\{0,1\}^n$. Chen, Liu, Servedio, Sheng and Xie showed that the distribution-free $k$-junta testing can be performed, with one-sided…
▽ More
We consider the problem of testing whether an unknown $n$-variable Boolean function is a $k$-junta in the distribution-free property testing model, where the distance between function is measured with respect to an arbitrary and unknown probability distribution over $\{0,1\}^n$. Chen, Liu, Servedio, Sheng and Xie showed that the distribution-free $k$-junta testing can be performed, with one-sided error, by an adaptive algorithm that makes $\tilde O(k^2)/ε$ queries. In this paper, we give a simple two-sided error adaptive algorithm that makes $\tilde O(k/ε)$ queries.
△ Less
Submitted 8 June, 2020; v1 submitted 1 January, 2019;
originally announced January 2019.
-
On Learning Graphs with Edge-Detecting Queries
Authors:
Hasan Abasi,
Nader H. Bshouty
Abstract:
We consider the problem of learning a general graph $G=(V,E)$ using edge-detecting queries, where the number of vertices $|V|=n$ is given to the learner. The information theoretic lower bound gives $m\log n$ for the number of queries, where $m=|E|$ is the number of edges. In case the number of edges $m$ is also given to the learner, Angluin-Chen's Las Vegas algorithm \cite{AC08} runs in $4$ rounds…
▽ More
We consider the problem of learning a general graph $G=(V,E)$ using edge-detecting queries, where the number of vertices $|V|=n$ is given to the learner. The information theoretic lower bound gives $m\log n$ for the number of queries, where $m=|E|$ is the number of edges. In case the number of edges $m$ is also given to the learner, Angluin-Chen's Las Vegas algorithm \cite{AC08} runs in $4$ rounds and detects the edges in $O(m\log n)$ queries. In the other harder case where the number of edges $m$ is unknown, their algorithm runs in $5$ rounds and asks $O(m\log n+\sqrt{m}\log^2 n)$ queries. There have been two open problems: \emph{(i)} can the number of queries be reduced to $O(m\log n)$ in the second case, and, \emph{(ii)} can the number of rounds be reduced without substantially increasing the number of queries (in both cases). For the first open problem (when $m$ is unknown) we give two algorithms. The first is an $O(1)$-round Las Vegas algorithm that asks $m\log n+\sqrt{m}(\log^{[k]}n)\log n$ queries for any constant $k$ where $\log^{[k]}n=\log \stackrel{k}{\cdots} \log n$. The second is an $O(\log^*n)$-round Las Vegas algorithm that asks $O(m\log n)$ queries. This solves the first open problem for any practical $n$, for example, $n<2^{65536}$. We also show that no deterministic algorithm can solve this problem in a constant number of rounds. To solve the second problem we study the case when $m$ is known. We first show that any non-adaptive Monte Carlo algorithm (one-round) must ask at least $Ω(m^2\log n)$ queries, and any two-round Las Vegas algorithm must ask at least $m^{4/3-o(1)}\log n$ queries on average. We then give two two-round Monte Carlo algorithms, the first asks $O(m^{4/3}\log n)$ queries for any $n$ and $m$, and the second asks $O(m\log n)$ queries when $n>2^m$. Finally, we give a $3$-round Monte Carlo algorithm that asks $O(m\log n)$ queries for any $n$ and $m$.
△ Less
Submitted 28 March, 2018;
originally announced March 2018.
-
Elementary Proofs of Some Stirling Bounds
Authors:
Nader H. Bshouty,
Vivian E. Bshouty-Hurani,
George Haddad,
Thomas Hashem,
Fadi Khoury,
Omar Sharafy
Abstract:
We give elementary proofs of several Stirling's precise bounds. We first improve all the precise bounds from the literature and give new precise bounds. In particular, we show that for all $n\ge 8$ $$\sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n}-\frac{1}{360n^3+103n}} \ge n!\ge \sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n}-\frac{1}{360n^3+102n}}$$ and for all $n\ge 3$…
▽ More
We give elementary proofs of several Stirling's precise bounds. We first improve all the precise bounds from the literature and give new precise bounds. In particular, we show that for all $n\ge 8$ $$\sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n}-\frac{1}{360n^3+103n}} \ge n!\ge \sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n}-\frac{1}{360n^3+102n}}$$ and for all $n\ge 3$ $$\sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n+\frac{2}{5n}-\frac{1.1}{10n^3}}} \ge n!\ge \sqrt{2πn}\left(\frac{n}{e}\right)^n e^{\frac{1}{12n+\frac{2}{5n}-\frac{0.9}{10n^3}}}.$$
△ Less
Submitted 22 May, 2018; v1 submitted 20 February, 2018;
originally announced February 2018.
-
On Polynomial time Constructions of Minimum Height Decision Tree
Authors:
Nader H. Bshouty,
Waseem Makhoul
Abstract:
In this paper we study a polynomial time algorithms that for an input $A\subseteq {B_m}$ outputs a decision tree for $A$ of minimum depth. This problem has many applications that include, to name a few, computer vision, group testing, exact learning from membership queries and game theory.
Arkin et al. and Moshkov gave a polynomial time $(\ln |A|)$- approximation algorithm (for the depth). The r…
▽ More
In this paper we study a polynomial time algorithms that for an input $A\subseteq {B_m}$ outputs a decision tree for $A$ of minimum depth. This problem has many applications that include, to name a few, computer vision, group testing, exact learning from membership queries and game theory.
Arkin et al. and Moshkov gave a polynomial time $(\ln |A|)$- approximation algorithm (for the depth). The result of Dinur and Steurer for set cover implies that this problem cannot be approximated with ratio $(1-o(1))\cdot \ln |A|$, unless P=NP. Moskov the combinatorial measure of extended teaching dimension of $A$, $ETD(A)$. He showed that $ETD(A)$ is a lower bound for the depth of the decision tree for $A$ and then gave an {\it exponential time} $ETD(A)/\log(ETD(A))$-approximation algorithm.
In this paper we further study the $ETD(A)$ measure and a new combinatorial measure, $DEN(A)$, that we call the density of the set $A$. We show that $DEN(A)\le ETD(A)+1$. We then give two results. The first result is that the lower bound $ETD(A)$ of Moshkov for the depth of the decision tree for $A$ is greater than the bounds that are obtained by the classical technique used in the literature. The second result is a polynomial time $(\ln 2) DEN(A)$-approximation (and therefore $(\ln 2) ETD(A)$-approximation) algorithm for the depth of the decision tree of $A$. We also show that a better approximation ratio implies P=NP.
We then apply the above results to learning the class of disjunctions of predicates from membership queries. We show that the $ETD$ of this class is bounded from above by the degree $d$ of its Hasse diagram. We then show that Moshkov algorithm can be run in polynomial time and is $(d/\log d)$-approximation algorithm. This gives optimal algorithms when the degree is constant. For example, learning axis parallel rays over constant dimension space.
△ Less
Submitted 1 February, 2018;
originally announced February 2018.
-
Adaptive Group Testing Algorithms to Estimate the Number of Defectives
Authors:
Nader H. Bshouty,
Vivian E. Bshouty-Hurani,
George Haddad,
Thomas Hashem,
Fadi Khoury,
Omar Sharafy
Abstract:
We study the problem of estimating the number of defective items in adaptive Group testing by using a minimum number of queries. We improve the existing algorithm and prove a lower bound that show that, for constant estimation, the number of tests in our algorithm is optimal.
We study the problem of estimating the number of defective items in adaptive Group testing by using a minimum number of queries. We improve the existing algorithm and prove a lower bound that show that, for constant estimation, the number of tests in our algorithm is optimal.
△ Less
Submitted 21 December, 2023; v1 submitted 2 December, 2017;
originally announced December 2017.
-
Non-Adaptive Randomized Algorithm for Group Testing
Authors:
Nader H. Bshouty,
Nuha Diab,
Shada R. Kawar,
Robert J. Shahla
Abstract:
We study the problem of group testing with a non-adaptive randomized algorithm in the random incidence design (RID) model where each entry in the test is chosen randomly independently from $\{0,1\}$ with a fixed probability $p$.
The property that is sufficient and necessary for a unique decoding is the separability of the tests, but unfortunately no linear time algorithm is known for such tests.…
▽ More
We study the problem of group testing with a non-adaptive randomized algorithm in the random incidence design (RID) model where each entry in the test is chosen randomly independently from $\{0,1\}$ with a fixed probability $p$.
The property that is sufficient and necessary for a unique decoding is the separability of the tests, but unfortunately no linear time algorithm is known for such tests. In order to achieve linear-time decodable tests, the algorithms in the literature use the disjunction property that gives almost optimal number of tests.
We define a new property for the tests which we call semi-disjunction property. We show that there is a linear time decoding for such test and for $d\to \infty$ the number of tests converges to the number of tests with the separability property and is therefore optimal (in the RID model). Our analysis shows that, in the RID model, the number of tests in our algorithm is better than the one with the disjunction property even for small $d$.
△ Less
Submitted 9 August, 2017;
originally announced August 2017.
-
The Maximum Cosine Framework for Deriving Perceptron Based Linear Classifiers
Authors:
Nader H. Bshouty,
Catherine A. Haddad-Zaknoon
Abstract:
In this work, we introduce a mathematical framework, called the Maximum Cosine Framework or MCF, for deriving new linear classifiers. The method is based on selecting an appropriate bound on the cosine of the angle between the target function and the algorithm's. To justify its correctness, we use the MCF to show how to regenerate the update rule of Aggressive ROMMA. Moreover, we construct a cosin…
▽ More
In this work, we introduce a mathematical framework, called the Maximum Cosine Framework or MCF, for deriving new linear classifiers. The method is based on selecting an appropriate bound on the cosine of the angle between the target function and the algorithm's. To justify its correctness, we use the MCF to show how to regenerate the update rule of Aggressive ROMMA. Moreover, we construct a cosine bound from which we build the Maximum Cosine Perceptron algorithm or, for short, the MCP algorithm. We prove that the MCP shares the same mistake bound like the Perceptron. In addition, we demonstrate the promising performance of the MCP on a real dataset. Our experiments show that, under the restriction of single pass learning, the MCP algorithm outperforms PA and Aggressive ROMMA.
△ Less
Submitted 4 July, 2017;
originally announced July 2017.
-
Exact Learning of Juntas from Membership Queries
Authors:
Nader H. Bshouty,
Areej Costa
Abstract:
In this paper, we study adaptive and non-adaptive exact learning of Juntas from membership queries. We use new techniques to find new bounds, narrow some of the gaps between the lower bounds and upper bounds and find new deterministic and randomized algorithms with small query and time complexities.
Some of the bounds are tight in the sense that finding better ones either gives a breakthrough re…
▽ More
In this paper, we study adaptive and non-adaptive exact learning of Juntas from membership queries. We use new techniques to find new bounds, narrow some of the gaps between the lower bounds and upper bounds and find new deterministic and randomized algorithms with small query and time complexities.
Some of the bounds are tight in the sense that finding better ones either gives a breakthrough result in some long-standing combinatorial open problem or needs a new technique that is beyond the existing ones.
△ Less
Submitted 21 June, 2017;
originally announced June 2017.
-
Learning Disjunctions of Predicates
Authors:
Nader H. Bshouty,
Dana Drachsler-Cohen,
Martin Vechev,
Eran Yahav
Abstract:
Let $F$ be a set of boolean functions. We present an algorithm for learning $F_\vee := \{\vee_{f\in S} f \mid S \subseteq F\}$ from membership queries. Our algorithm asks at most $|F| \cdot OPT(F_\vee)$ membership queries where $OPT(F_\vee)$ is the minimum worst case number of membership queries for learning $F_\vee$. When $F$ is a set of halfspaces over a constant dimension space or a set of vari…
▽ More
Let $F$ be a set of boolean functions. We present an algorithm for learning $F_\vee := \{\vee_{f\in S} f \mid S \subseteq F\}$ from membership queries. Our algorithm asks at most $|F| \cdot OPT(F_\vee)$ membership queries where $OPT(F_\vee)$ is the minimum worst case number of membership queries for learning $F_\vee$. When $F$ is a set of halfspaces over a constant dimension space or a set of variable inequalities, our algorithm runs in polynomial time.
The problem we address has practical importance in the field of program synthesis, where the goal is to synthesize a program that meets some requirements. Program synthesis has become popular especially in settings aiming to help end users. In such settings, the requirements are not provided upfront and the synthesizer can only learn them by posing membership queries to the end user. Our work enables such synthesizers to learn the exact requirements while bounding the number of membership queries.
△ Less
Submitted 15 June, 2017;
originally announced June 2017.
-
Exact Learning from an Honest Teacher That Answers Membership Queries
Authors:
Nader H. Bshouty
Abstract:
Given a teacher that holds a function $f:X\to R$ from some class of functions $C$. The teacher can receive from the learner an element~$d$ in the domain $X$ (a query) and returns the value of the function in $d$, $f(d)\in R$. The learner goal is to find $f$ with a minimum number of queries, optimal time complexity, and optimal resources.
In this survey, we present some of the results known from…
▽ More
Given a teacher that holds a function $f:X\to R$ from some class of functions $C$. The teacher can receive from the learner an element~$d$ in the domain $X$ (a query) and returns the value of the function in $d$, $f(d)\in R$. The learner goal is to find $f$ with a minimum number of queries, optimal time complexity, and optimal resources.
In this survey, we present some of the results known from the literature, different techniques used, some new problems, and open problems.
△ Less
Submitted 13 June, 2017;
originally announced June 2017.
-
Derandomizing Chernoff Bound with Union Bound with an Application to $k$-wise Independent Sets
Authors:
Nader H. Bshouty
Abstract:
Derandomization of Chernoff bound with union bound is already proven in many papers. We here give another explicit version of it that obtains a construction of size that is arbitrary close to the probabilistic nonconstructive size.
We apply this to give a new simple polynomial time constructions of almost $k$-wise independent sets. We also give almost tight lower bounds for the size of $k$-wise…
▽ More
Derandomization of Chernoff bound with union bound is already proven in many papers. We here give another explicit version of it that obtains a construction of size that is arbitrary close to the probabilistic nonconstructive size.
We apply this to give a new simple polynomial time constructions of almost $k$-wise independent sets. We also give almost tight lower bounds for the size of $k$-wise independent sets.
△ Less
Submitted 4 August, 2016;
originally announced August 2016.
-
Enumerating all the Irreducible Polynomials over Finite Field
Authors:
Nader H. Bshouty,
Nuha Diab,
Shada R. Kawar,
Robert J. Shahla
Abstract:
In this paper we give a detailed analysis of deterministic and randomized algorithms that enumerate any number of irreducible polynomials of degree $n$ over a finite field and their roots in the extension field in quasilinear where $N=n^2$ is the size of the output.} time cost per element.
Our algorithm is based on an improved algorithm for enumerating all the Lyndon words of length $n$ in linea…
▽ More
In this paper we give a detailed analysis of deterministic and randomized algorithms that enumerate any number of irreducible polynomials of degree $n$ over a finite field and their roots in the extension field in quasilinear where $N=n^2$ is the size of the output.} time cost per element.
Our algorithm is based on an improved algorithm for enumerating all the Lyndon words of length $n$ in linear delay time and the known reduction of Lyndon words to irreducible polynomials.
△ Less
Submitted 11 August, 2016; v1 submitted 16 February, 2016;
originally announced February 2016.
-
Almost Optimal Cover-Free Families
Authors:
Nader H. Bshouty,
Ariel Gabizon
Abstract:
Roughly speaking, an $(n,(r,s))$-Cover Free Family (CFF) is a small set of $n$-bit strings such that: "in any $d:=r+s$ indices we see all patterns of weight $r$". CFFs have been of interest for a long time both in discrete mathematics as part of block design theory, and in theoretical computer science where they have found a variety of applications, for example, in parametrized algorithms where th…
▽ More
Roughly speaking, an $(n,(r,s))$-Cover Free Family (CFF) is a small set of $n$-bit strings such that: "in any $d:=r+s$ indices we see all patterns of weight $r$". CFFs have been of interest for a long time both in discrete mathematics as part of block design theory, and in theoretical computer science where they have found a variety of applications, for example, in parametrized algorithms where they were introduced in the recent breakthrough work of Fomin, Lokshtanov and Saurabh under the name `lopsided universal sets'.
In this paper we give the first explicit construction of cover-free families of optimal size up to lower order multiplicative terms, {for any $r$ and $s$}. In fact, our construction time is almost linear in the size of the family. Before our work, such a result existed only for $r=d^{o(1)}$. and $r= ω(d/(\log\log d\log\log\log d))$. As a sample application, we improve the running times of parameterized algorithms from the recent work of Gabizon, Lokshtanov and Pilipczuk.
△ Less
Submitted 27 July, 2015;
originally announced July 2015.
-
Non-Adaptive Learning a Hidden Hipergraph
Authors:
Hasan Abasi,
Nader H. Bshouty,
Hanna Mazzawi
Abstract:
We give a new deterministic algorithm that non-adaptively learns a hidden hypergraph from edge-detecting queries. All previous non-adaptive algorithms either run in exponential time or have non-optimal query complexity. We give the first polynomial time non-adaptive learning algorithm for learning hypergraph that asks almost optimal number of queries.
We give a new deterministic algorithm that non-adaptively learns a hidden hypergraph from edge-detecting queries. All previous non-adaptive algorithms either run in exponential time or have non-optimal query complexity. We give the first polynomial time non-adaptive learning algorithm for learning hypergraph that asks almost optimal number of queries.
△ Less
Submitted 13 February, 2015;
originally announced February 2015.
-
Lower Bounds for Cover-Free Families
Authors:
Ali Z. Abdi,
Nader H. Bshouty
Abstract:
Let ${\cal F}$ be a set of blocks of a $t$-set $X$. $(X,{\cal F})$ is called $(w,r)$-cover-free family ($(w,r)-$CFF) provided that, the intersection of any $w$ blocks in ${\cal F}$ is not contained in the union of any other $r$ blocks in ${\cal F}$. We give new asymptotic lower bounds for the number of minimum points $t$ in a $(w,r)$-CFF when $w\le r=|{\cal F}|^ε$ for some constant $ε\ge 1/2$.
Let ${\cal F}$ be a set of blocks of a $t$-set $X$. $(X,{\cal F})$ is called $(w,r)$-cover-free family ($(w,r)-$CFF) provided that, the intersection of any $w$ blocks in ${\cal F}$ is not contained in the union of any other $r$ blocks in ${\cal F}$. We give new asymptotic lower bounds for the number of minimum points $t$ in a $(w,r)$-CFF when $w\le r=|{\cal F}|^ε$ for some constant $ε\ge 1/2$.
△ Less
Submitted 31 March, 2015; v1 submitted 12 February, 2015;
originally announced February 2015.
-
Dense Testers: Almost Linear Time and Locally Explicit Constructions
Authors:
Nader H. Bshouty
Abstract:
We develop a new notion called $(1-ε)$-tester for a set $M$ of functions $f:A\to C$. A $(1-ε)$-tester for $M$ maps each element $a\in A$ to a finite number of elements $B_a=\{b_1,\ldots,b_t\}\subset B$ in a smaller sub-domain $B\subset A$ where for every $f\in M$ if $f(a)\not=0$ then $f(b)\not=0$ for at least $(1-ε)$ fraction of the elements $b$ of $B_a$. I.e., if $f(a)\not=0$ then…
▽ More
We develop a new notion called $(1-ε)$-tester for a set $M$ of functions $f:A\to C$. A $(1-ε)$-tester for $M$ maps each element $a\in A$ to a finite number of elements $B_a=\{b_1,\ldots,b_t\}\subset B$ in a smaller sub-domain $B\subset A$ where for every $f\in M$ if $f(a)\not=0$ then $f(b)\not=0$ for at least $(1-ε)$ fraction of the elements $b$ of $B_a$. I.e., if $f(a)\not=0$ then $\Pr_{b\in B_a}[f(b)\not=0]\ge 1-ε$. The {\it size} of the $(1-ε)$-tester is $\max_{a\in A}|B_a|$ and the goal is to minimize this size, construct $B_a$ in deterministic almost linear time and access and compute each map in poly-log time.
We use tools from elementary algebra and algebraic function fields to build $(1-ε)$-testers of small size in deterministic almost linear time. We also show that our constructions are locally explicit, i.e., one can find any entry in the construction in time poly-log in the size of the construction and the field size. We also prove lower bounds that show that the sizes of our testers and the densities are almost optimal.
Testers were used in [Bshouty, Testers and its application, ITCS 2014] to construct almost optimal perfect hash families, universal sets, cover-free families, separating hash functions, black box identity testing and hitting sets. The dense testers in this paper shows that such constructions can be done in almost linear time, are locally explicit and can be made to be dense.
△ Less
Submitted 18 December, 2014;
originally announced December 2014.
-
Linear time Constructions of some $d$-Restriction Problems
Authors:
Nader H. Bshouty
Abstract:
We give new linear time globally explicit constructions for perfect hash families, cover-free families and separating hash functions.
We give new linear time globally explicit constructions for perfect hash families, cover-free families and separating hash functions.
△ Less
Submitted 11 June, 2014; v1 submitted 9 June, 2014;
originally announced June 2014.
-
Learning Boolean Halfspaces with Small Weights from Membership Queries
Authors:
Hasan Abasi,
Ali Z. Abdi,
Nader H. Bshouty
Abstract:
We consider the problem of proper learning a Boolean Halfspace with integer weights $\{0,1,\ldots,t\}$ from membership queries only. The best known algorithm for this problem is an adaptive algorithm that asks $n^{O(t^5)}$ membership queries where the best lower bound for the number of membership queries is $n^t$ [Learning Threshold Functions with Small Weights Using Membership Queries. COLT 1999]…
▽ More
We consider the problem of proper learning a Boolean Halfspace with integer weights $\{0,1,\ldots,t\}$ from membership queries only. The best known algorithm for this problem is an adaptive algorithm that asks $n^{O(t^5)}$ membership queries where the best lower bound for the number of membership queries is $n^t$ [Learning Threshold Functions with Small Weights Using Membership Queries. COLT 1999]
In this paper we close this gap and give an adaptive proper learning algorithm with two rounds that asks $n^{O(t)}$ membership queries. We also give a non-adaptive proper learning algorithm that asks $n^{O(t^3)}$ membership queries.
△ Less
Submitted 7 May, 2014;
originally announced May 2014.
-
On Exact Learning Monotone DNF from Membership Queries
Authors:
Hasan Abasi,
Nader H. Bshouty,
Hanna Mazzawi
Abstract:
In this paper, we study the problem of learning a monotone DNF with at most $s$ terms of size (number of variables in each term) at most $r$ ($s$ term $r$-MDNF) from membership queries. This problem is equivalent to the problem of learning a general hypergraph using hyperedge-detecting queries, a problem motivated by applications arising in chemical reactions and genome sequencing.
We first pres…
▽ More
In this paper, we study the problem of learning a monotone DNF with at most $s$ terms of size (number of variables in each term) at most $r$ ($s$ term $r$-MDNF) from membership queries. This problem is equivalent to the problem of learning a general hypergraph using hyperedge-detecting queries, a problem motivated by applications arising in chemical reactions and genome sequencing.
We first present new lower bounds for this problem and then present deterministic and randomized adaptive algorithms with query complexities that are almost optimal. All the algorithms we present in this paper run in time linear in the query complexity and the number of variables $n$. In addition, all of the algorithms we present in this paper are asymptotically tight for fixed $r$ and/or $s$.
△ Less
Submitted 5 May, 2014;
originally announced May 2014.
-
A Simple Algorithm for Hamiltonicity
Authors:
Hasan Abasi,
Nader H. Bshouty
Abstract:
We develop a new algebraic technique that solves the following problem: Given a black box that contains an arithmetic circuit $f$ over a field of characteristic $2$ of degree~$d$. Decide whether $f$, expressed as an equivalent multivariate polynomial, contains a multilinear monomial of degree $d$.
This problem was solved by Williams \cite{W} and Björklund et. al. \cite{BHKK} for a white box (the…
▽ More
We develop a new algebraic technique that solves the following problem: Given a black box that contains an arithmetic circuit $f$ over a field of characteristic $2$ of degree~$d$. Decide whether $f$, expressed as an equivalent multivariate polynomial, contains a multilinear monomial of degree $d$.
This problem was solved by Williams \cite{W} and Björklund et. al. \cite{BHKK} for a white box (the circuit is given as an input) that contains arithmetic circuit. We show a simple black box algorithm that solves the problem with the same time complexity.
This gives a simple randomized algorithm for the simple $k$-path problem for directed graphs of the same time complexity\footnote{$O^*(f(k))$ is $O(poly(n)\cdot f(k))$} $O^*(2^k)$ as in \cite{W} and with reusing the same ideas from \cite{BHKK} with the above gives another algorithm (probably not simpler) for undirected graphs of the same time complexity $O^*(1.657^k)$ as in \cite{B10,BHKK}.
△ Less
Submitted 10 April, 2014;
originally announced April 2014.
-
On $r$-Simple $k$-Path
Authors:
Hasan Abasi,
Nader H. Bshouty,
Ariel Gabizon,
Elad Haramaty
Abstract:
An $r$-simple $k$-path is a {path} in the graph of length $k$ that passes through each vertex at most $r$ times. The $r$-SIMPLE $k$-PATH problem, given a graph $G$ as input, asks whether there exists an $r$-simple $k$-path in $G$. We first show that this problem is NP-Complete. We then show that there is a graph $G$ that contains an $r$-simple $k$-path and no simple path of length greater than…
▽ More
An $r$-simple $k$-path is a {path} in the graph of length $k$ that passes through each vertex at most $r$ times. The $r$-SIMPLE $k$-PATH problem, given a graph $G$ as input, asks whether there exists an $r$-simple $k$-path in $G$. We first show that this problem is NP-Complete. We then show that there is a graph $G$ that contains an $r$-simple $k$-path and no simple path of length greater than $4\log k/\log r$. So this, in a sense, motivates this problem especially when one's goal is to find a short path that visits many vertices in the graph while bounding the number of visits at each vertex.
We then give a randomized algorithm that runs in time $$\mathrm{poly}(n)\cdot 2^{O( k\cdot \log r/r)}$$ that solves the $r$-SIMPLE $k$-PATH on a graph with $n$ vertices with one-sided error. We also show that a randomized algorithm with running time $\mathrm{poly}(n)\cdot 2^{(c/2)k/ r}$ with $c<1$ gives a randomized algorithm with running time $\poly(n)\cdot 2^{cn}$ for the Hamiltonian path problem in a directed graph - an outstanding open problem. So in a sense our algorithm is optimal up to an $O(\log r)$ factor.
△ Less
Submitted 22 January, 2014; v1 submitted 17 December, 2013;
originally announced December 2013.
-
Optimal Query Complexity for Reconstructing Hypergraphs
Authors:
Nader H. Bshouty,
Hanna Mazzawi
Abstract:
In this paper we consider the problem of reconstructing a hidden weighted hypergraph of constant rank using additive queries. We prove the following: Let $G$ be a weighted hidden hypergraph of constant rank with n vertices and $m$ hyperedges. For any $m$ there exists a non-adaptive algorithm that finds the edges of the graph and their weights using $$ O(\frac{m\log n}{\log m}) $$ additive querie…
▽ More
In this paper we consider the problem of reconstructing a hidden weighted hypergraph of constant rank using additive queries. We prove the following: Let $G$ be a weighted hidden hypergraph of constant rank with n vertices and $m$ hyperedges. For any $m$ there exists a non-adaptive algorithm that finds the edges of the graph and their weights using $$ O(\frac{m\log n}{\log m}) $$ additive queries. This solves the open problem in [S. Choi, J. H. Kim. Optimal Query Complexity Bounds for Finding Graphs. {\em STOC}, 749--758,~2008].
When the weights of the hypergraph are integers that are less than $O(poly(n^d/m))$ where $d$ is the rank of the hypergraph (and therefore for unweighted hypergraphs) there exists a non-adaptive algorithm that finds the edges of the graph and their weights using $$ O(\frac{m\log \frac{n^d}{m}}{\log m}). $$ additive queries.
Using the information theoretic bound the above query complexities are tight.
△ Less
Submitted 3 January, 2010;
originally announced January 2010.