-
Three-dimensional quantum Hall states as a chiral electromagnetic filter
Authors:
Nandagopal Manoj,
Valerio Peri
Abstract:
Extensive research has explored the optical properties of topological insulating materials, driven by their inherent stability and potential applications. In this study, we unveil a novel functionality of three-dimensional integer quantum Hall (3D IQH) states as broad-band filters for circularly polarized light, particularly effective in the terahertz (THz) frequency range under realistic system p…
▽ More
Extensive research has explored the optical properties of topological insulating materials, driven by their inherent stability and potential applications. In this study, we unveil a novel functionality of three-dimensional integer quantum Hall (3D IQH) states as broad-band filters for circularly polarized light, particularly effective in the terahertz (THz) frequency range under realistic system parameters. We also investigate the impact of practical imperfections, demonstrating the resilience of this filtering effect. Our findings reveal that this phenomenon is independent of the microscopic origin of the 3D IQH state, prompting discussions on its feasibility across diverse candidate materials. These results contribute to our understanding of fundamental optical properties and hold promise for practical applications in optical technologies.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Dueling Optimization with a Monotone Adversary
Authors:
Avrim Blum,
Meghal Gupta,
Gene Li,
Naren Sarayu Manoj,
Aadirupa Saha,
Yuanyuan Yang
Abstract:
We introduce and study the problem of dueling optimization with a monotone adversary, which is a generalization of (noiseless) dueling convex optimization. The goal is to design an online algorithm to find a minimizer $\mathbf{x}^{*}$ for a function $f\colon X \to \mathbb{R}$, where $X \subseteq \mathbb{R}^d$. In each round, the algorithm submits a pair of guesses, i.e., $\mathbf{x}^{(1)}$ and…
▽ More
We introduce and study the problem of dueling optimization with a monotone adversary, which is a generalization of (noiseless) dueling convex optimization. The goal is to design an online algorithm to find a minimizer $\mathbf{x}^{*}$ for a function $f\colon X \to \mathbb{R}$, where $X \subseteq \mathbb{R}^d$. In each round, the algorithm submits a pair of guesses, i.e., $\mathbf{x}^{(1)}$ and $\mathbf{x}^{(2)}$, and the adversary responds with any point in the space that is at least as good as both guesses. The cost of each query is the suboptimality of the worse of the two guesses; i.e., ${\max} \left( f(\mathbf{x}^{(1)}), f(\mathbf{x}^{(2)}) \right) - f(\mathbf{x}^{*})$. The goal is to minimize the number of iterations required to find an $\varepsilon$-optimal point and to minimize the total cost (regret) of the guesses over many rounds. Our main result is an efficient randomized algorithm for several natural choices of the function $f$ and set $X$ that incurs cost $O(d)$ and iteration complexity $O(d\log(1/\varepsilon)^2)$. Moreover, our dependence on $d$ is asymptotically optimal, as we show examples in which any randomized algorithm for this problem must incur $Ω(d)$ cost and iteration complexity.
△ Less
Submitted 18 November, 2023;
originally announced November 2023.
-
The Change-of-Measure Method, Block Lewis Weights, and Approximating Matrix Block Norms
Authors:
Naren Sarayu Manoj,
Max Ovsiankin
Abstract:
Given a matrix $\mathbf{A} \in \mathbb{R}^{k \times n}$, a partitioning of $[k]$ into groups $S_1,\dots,S_m$, an outer norm $p$, and a collection of inner norms such that either $p \ge 1$ and $p_1,\dots,p_m \ge 2$ or $p_1=\dots=p_m=p \ge 1/\log n$, we prove that there is a sparse weight vector $\mathbfβ \in \mathbb{R}^{m}$ such that…
▽ More
Given a matrix $\mathbf{A} \in \mathbb{R}^{k \times n}$, a partitioning of $[k]$ into groups $S_1,\dots,S_m$, an outer norm $p$, and a collection of inner norms such that either $p \ge 1$ and $p_1,\dots,p_m \ge 2$ or $p_1=\dots=p_m=p \ge 1/\log n$, we prove that there is a sparse weight vector $\mathbfβ \in \mathbb{R}^{m}$ such that $\sum_{i=1}^m β_i \cdot \|\mathbf{A}_{S_i}\mathbf{x}\|_{p_i}^p \approx_{1\pm\varepsilon} \sum_{i=1}^m \|\mathbf{A}_{S_i}\mathbf{x}\|_{p_i}^p$, where the number of nonzero entries of $\mathbfβ$ is at most $O_{p,p_i}(\varepsilon^{-2}n^{\max(1,p/2)}(\log n)^2(\log(n/\varepsilon)))$. When $p_1\dots,p_m \ge 2$, this weight vector arises from an importance sampling procedure based on the block Lewis weights, a recently proposed generalization of Lewis weights. Additionally, we prove that there exist efficient algorithms to find the sparse weight vector $\mathbfβ$ in several important regimes of $p$ and $p_1,\dots,p_m$.
Our main technical contribution is a substantial generalization of the change-of-measure method that Bourgain, Lindenstrauss, and Milman used to obtain the analogous result when every group has size $1$. Our generalization allows one to analyze change of measures beyond those implied by D. Lewis's original construction, including the measure implied by the block Lewis weights and natural approximations of this measure.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Near-Optimal Streaming Ellipsoidal Rounding for General Convex Polytopes
Authors:
Yury Makarychev,
Naren Sarayu Manoj,
Max Ovsiankin
Abstract:
We give near-optimal algorithms for computing an ellipsoidal rounding of a convex polytope whose vertices are given in a stream. The approximation factor is linear in the dimension (as in John's theorem) and only loses an excess logarithmic factor in the aspect ratio of the polytope. Our algorithms are nearly optimal in two senses: first, their runtimes nearly match those of the most efficient kno…
▽ More
We give near-optimal algorithms for computing an ellipsoidal rounding of a convex polytope whose vertices are given in a stream. The approximation factor is linear in the dimension (as in John's theorem) and only loses an excess logarithmic factor in the aspect ratio of the polytope. Our algorithms are nearly optimal in two senses: first, their runtimes nearly match those of the most efficient known algorithms for the offline version of the problem. Second, their approximation factors nearly match a lower bound we show against a natural class of geometric streaming algorithms. In contrast to existing works in the streaming setting that compute ellipsoidal roundings only for centrally symmetric convex polytopes, our algorithms apply to general convex polytopes. We also show how to use our algorithms to construct coresets from a stream of points that approximately preserve both the ellipsoidal rounding and the convex hull of the original set of points.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Interpolation Learning With Minimum Description Length
Authors:
Naren Sarayu Manoj,
Nathan Srebro
Abstract:
We prove that the Minimum Description Length learning rule exhibits tempered overfitting. We obtain tempered agnostic finite sample learning guarantees and characterize the asymptotic behavior in the presence of random label noise.
We prove that the Minimum Description Length learning rule exhibits tempered overfitting. We obtain tempered agnostic finite sample learning guarantees and characterize the asymptotic behavior in the presence of random label noise.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Streaming Algorithms for Ellipsoidal Approximation of Convex Polytopes
Authors:
Yury Makarychev,
Naren Sarayu Manoj,
Max Ovsiankin
Abstract:
We give efficient deterministic one-pass streaming algorithms for finding an ellipsoidal approximation of a symmetric convex polytope. The algorithms are near-optimal in that their approximation factors differ from that of the optimal offline solution only by a factor sub-logarithmic in the aspect ratio of the polytope.
We give efficient deterministic one-pass streaming algorithms for finding an ellipsoidal approximation of a symmetric convex polytope. The algorithms are near-optimal in that their approximation factors differ from that of the optimal offline solution only by a factor sub-logarithmic in the aspect ratio of the polytope.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
An Optimal Algorithm for Certifying Monotone Functions
Authors:
Meghal Gupta,
Naren Sarayu Manoj
Abstract:
Given query access to a monotone function $f\colon\{0,1\}^n\to\{0,1\}$ with certificate complexity $C(f)$ and an input $x^{\star}$, we design an algorithm that outputs a size-$C(f)$ subset of $x^{\star}$ certifying the value of $f(x^{\star})$. Our algorithm makes $O(C(f) \cdot \log n)$ queries to $f$, which matches the information-theoretic lower bound for this problem and resolves the concrete op…
▽ More
Given query access to a monotone function $f\colon\{0,1\}^n\to\{0,1\}$ with certificate complexity $C(f)$ and an input $x^{\star}$, we design an algorithm that outputs a size-$C(f)$ subset of $x^{\star}$ certifying the value of $f(x^{\star})$. Our algorithm makes $O(C(f) \cdot \log n)$ queries to $f$, which matches the information-theoretic lower bound for this problem and resolves the concrete open question posed in the STOC '22 paper of Blanc, Koch, Lange, and Tan [BKLT22].
We extend this result to an algorithm that finds a size-$2C(f)$ certificate for a real-valued monotone function with $O(C(f) \cdot \log n)$ queries. We also complement our algorithms with a hardness result, in which we show that finding the shortest possible certificate in $x^{\star}$ may require $Ω\left(\binom{n}{C(f)}\right)$ queries in the worst case.
△ Less
Submitted 3 April, 2022;
originally announced April 2022.
-
Arboreal Topological and Fracton Phases
Authors:
Nandagopal Manoj,
Vijay B. Shenoy
Abstract:
We describe topologically ordered and fracton ordered states on novel geometries which do not have an underlying manifold structure. Using tree graphs such as the $k$-coordinated Bethe lattice ${\cal B}(k)$ and a hypertree called the $(k,n)$-hyper-Bethe lattice ${\cal HB}(k,n)$ consisting of $k$-coordinated hyperlinks (defined by $n$ sites), we construct multidimensional arboreal arenas such as…
▽ More
We describe topologically ordered and fracton ordered states on novel geometries which do not have an underlying manifold structure. Using tree graphs such as the $k$-coordinated Bethe lattice ${\cal B}(k)$ and a hypertree called the $(k,n)$-hyper-Bethe lattice ${\cal HB}(k,n)$ consisting of $k$-coordinated hyperlinks (defined by $n$ sites), we construct multidimensional arboreal arenas such as ${\cal B}(k_1) \square {\cal B}(k_2)$ by the notion of a graph Cartesian product $\square$. We study various quantum systems such as the ${\mathbb Z}_2$ gauge theory, generalized quantum Ising models (GQIM), the fractonic X-cube model, and related X-cube gauge theory defined on these arenas. Even the simplest ${\mathbb Z}_2$ gauge theory on a 2d arboreal arena is fractonic -- the monopole excitation is immobile. The X-cube model on a 3d arboreal arena is fully fractonic, all multipoles are rendered immobile. We obtain variational ground state phase diagrams of these gauge theories. Further, we find an intriguing class of dualities in arboreal arenas as illustrated by the ${\mathbb Z}_2$ gauge theory defined on ${\cal B}(k_1) \square {\cal B}(k_2)$ being dual to a GQIM defined on ${\cal HB}(2,k_1) \square {\cal HB}(2,k_2)$. Finally, we discuss different classes of topological and fracton orders on arboreal arenas. We find three distinct classes of arboreal toric code orders on 2d arboreal arenas, those that occur on ${\cal B}(2) \square {\cal B}(2)$, ${\cal B}(k) \square {\cal B}(2), k >2$, and ${\cal B}(k_1) \square {\cal B}(k_2)$, $k_1,k_2>2$. Likewise, four classes of X-cube fracton orders are found in 3d arboreal arenas -- those on ${\cal B}(2)\square{\cal B}(2)\square {\cal B}(2)$, ${\cal B}(k) \square {\cal B}(2)\square {\cal B}(2), k>2$, ${\cal B}(k_1) \square {\cal B}(k_2) \square {\cal B}(2), k_1,k_2 >2$, and ${\cal B}(k_1) \square {\cal B}(k_2) \square {\cal B}(k_3), k_1,k_2,k_3 >2$.
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
Excess Capacity and Backdoor Poisoning
Authors:
Naren Sarayu Manoj,
Avrim Blum
Abstract:
A backdoor data poisoning attack is an adversarial attack wherein the attacker injects several watermarked, mislabeled training examples into a training set. The watermark does not impact the test-time performance of the model on typical data; however, the model reliably errs on watermarked examples.
To gain a better foundational understanding of backdoor data poisoning attacks, we present a for…
▽ More
A backdoor data poisoning attack is an adversarial attack wherein the attacker injects several watermarked, mislabeled training examples into a training set. The watermark does not impact the test-time performance of the model on typical data; however, the model reliably errs on watermarked examples.
To gain a better foundational understanding of backdoor data poisoning attacks, we present a formal theoretical framework within which one can discuss backdoor data poisoning attacks for classification problems. We then use this to analyze important statistical and computational issues surrounding these attacks.
On the statistical front, we identify a parameter we call the memorization capacity that captures the intrinsic vulnerability of a learning problem to a backdoor attack. This allows us to argue about the robustness of several natural learning problems to backdoor attacks. Our results favoring the attacker involve presenting explicit constructions of backdoor attacks, and our robustness results show that some natural problem settings cannot yield successful backdoor attacks.
From a computational standpoint, we show that under certain assumptions, adversarial training can detect the presence of backdoors in a training set. We then show that under similar assumptions, two closely related problems we call backdoor filtering and robust generalization are nearly equivalent. This implies that it is both asymptotically necessary and sufficient to design algorithms that can identify watermarked examples in the training set in order to obtain a learning algorithm that both generalizes well to unseen data and is robust to backdoors.
△ Less
Submitted 3 November, 2021; v1 submitted 1 September, 2021;
originally announced September 2021.
-
Screw dislocations in the X-cube fracton model
Authors:
Nandagopal Manoj,
Kevin Slagle,
Wilbur Shirley,
Xie Chen
Abstract:
The X-cube model, a prototypical gapped fracton model, has been shown to have a foliation structure. That is, inside the 3+1D model, there are hidden layers of 2+1D gapped topological states. A screw dislocation in a 3+1D lattice can often reveal nontrivial features associated with a layered structure. In this paper, we study the X-cube model on lattices with screw dislocations. In particular, we…
▽ More
The X-cube model, a prototypical gapped fracton model, has been shown to have a foliation structure. That is, inside the 3+1D model, there are hidden layers of 2+1D gapped topological states. A screw dislocation in a 3+1D lattice can often reveal nontrivial features associated with a layered structure. In this paper, we study the X-cube model on lattices with screw dislocations. In particular, we find that a screw dislocation results in a finite change in the logarithm of the ground state degeneracy of the model. Part of the change can be traced back to the effect of screw dislocations in a simple stack of 2+1D topological states, hence corroborating the foliation structure in the model. The other part of the change comes from the induced motion of fractons or sub-dimensional excitations along the dislocation, a feature absent in the stack of 2+1D layers.
△ Less
Submitted 14 April, 2021; v1 submitted 14 December, 2020;
originally announced December 2020.
-
Tearing Fractons
Authors:
Nandagopal Manoj,
Roderich Moessner,
Vijay B. Shenoy
Abstract:
We offer a fractonic perspective on a familiar observation -- a flat sheet of paper can be folded only along a straight line if one wants to avoid the creation of additional creases or tears. Our core underlying technical result is the establishment of a duality between the theory of elastic plates and a fractonic gauge theory with a second rank symmetric electric field tensor, a scalar magnetic f…
▽ More
We offer a fractonic perspective on a familiar observation -- a flat sheet of paper can be folded only along a straight line if one wants to avoid the creation of additional creases or tears. Our core underlying technical result is the establishment of a duality between the theory of elastic plates and a fractonic gauge theory with a second rank symmetric electric field tensor, a scalar magnetic field, a vector charge, and a symmetric tensor current. Bending moment and momentum of the plate are dual to the electric and magnetic fields, respectively. While the flexural waves correspond to the quadratically dispersing photon of the gauge theory, a fold defect is dual to its vector charge. Crucially, the fractonic condition constrains the latter to move only along its direction, i.e., the fold's growth direction. By contrast, fracton motion in the perpendicular direction amounts to tearing the paper.
△ Less
Submitted 23 November, 2020;
originally announced November 2020.
-
Random Smoothing Might be Unable to Certify $\ell_\infty$ Robustness for High-Dimensional Images
Authors:
Avrim Blum,
Travis Dick,
Naren Manoj,
Hongyang Zhang
Abstract:
We show a hardness result for random smoothing to achieve certified adversarial robustness against attacks in the $\ell_p$ ball of radius $ε$ when $p>2$. Although random smoothing has been well understood for the $\ell_2$ case using the Gaussian distribution, much remains unknown concerning the existence of a noise distribution that works for the case of $p>2$. This has been posed as an open probl…
▽ More
We show a hardness result for random smoothing to achieve certified adversarial robustness against attacks in the $\ell_p$ ball of radius $ε$ when $p>2$. Although random smoothing has been well understood for the $\ell_2$ case using the Gaussian distribution, much remains unknown concerning the existence of a noise distribution that works for the case of $p>2$. This has been posed as an open problem by Cohen et al. (2019) and includes many significant paradigms such as the $\ell_\infty$ threat model. In this work, we show that any noise distribution $\mathcal{D}$ over $\mathbb{R}^d$ that provides $\ell_p$ robustness for all base classifiers with $p>2$ must satisfy $\mathbb{E}η_i^2=Ω(d^{1-2/p}ε^2(1-δ)/δ^2)$ for 99% of the features (pixels) of vector $η\sim\mathcal{D}$, where $ε$ is the robust radius and $δ$ is the score gap between the highest-scored class and the runner-up. Therefore, for high-dimensional images with pixel values bounded in $[0,255]$, the required noise will eventually dominate the useful information in the images, leading to trivial smoothed classifiers.
△ Less
Submitted 5 March, 2020; v1 submitted 9 February, 2020;
originally announced February 2020.
-
Quantifying Perceptual Distortion of Adversarial Examples
Authors:
Matt Jordan,
Naren Manoj,
Surbhi Goel,
Alexandros G. Dimakis
Abstract:
Recent work has shown that additive threat models, which only permit the addition of bounded noise to the pixels of an image, are insufficient for fully capturing the space of imperceivable adversarial examples. For example, small rotations and spatial transformations can fool classifiers, remain imperceivable to humans, but have large additive distance from the original images. In this work, we l…
▽ More
Recent work has shown that additive threat models, which only permit the addition of bounded noise to the pixels of an image, are insufficient for fully capturing the space of imperceivable adversarial examples. For example, small rotations and spatial transformations can fool classifiers, remain imperceivable to humans, but have large additive distance from the original images. In this work, we leverage quantitative perceptual metrics like LPIPS and SSIM to define a novel threat model for adversarial attacks.
To demonstrate the value of quantifying the perceptual distortion of adversarial examples, we present and employ a unifying framework fusing different attack styles. We first prove that our framework results in images that are unattainable by attack styles in isolation. We then perform adversarial training using attacks generated by our framework to demonstrate that networks are only robust to classes of adversarial perturbations they have been trained against, and combination attacks are stronger than any of their individual components. Finally, we experimentally demonstrate that our combined attacks retain the same perceptual distortion but induce far higher misclassification rates when compared against individual attacks.
△ Less
Submitted 21 February, 2019;
originally announced February 2019.