Search | arXiv e-print repository

Asymptotics of quantized barycenters of lattice polytopes with applications to algebraic geometry

Abstract: This article addresses a combinatorial problem with applications to algebraic geometry. To a convex lattice polytope $P$ and each of its integer dilations $kP$ one may associate the barycenter of its lattice points. This sequence of $k$-quantized barycenters converge to the (classical) barycenter of the polytope considered as a convex body. A basic question arises: is there a complete asymptotic e… ▽ More This article addresses a combinatorial problem with applications to algebraic geometry. To a convex lattice polytope $P$ and each of its integer dilations $kP$ one may associate the barycenter of its lattice points. This sequence of $k$-quantized barycenters converge to the (classical) barycenter of the polytope considered as a convex body. A basic question arises: is there a complete asymptotic expansion for this sequence? If so, what are its terms? This article initiates the study of this question. First, we establish the existence of such an expansion as well as determine the first two terms. Second, for Delzant lattice polytopes we use toric algebra to determine all terms using mixed volumes of virtual rooftop polytopes, or alternatively in terms of higher Donaldson--Futaki invariants. Third, for reflexive polytopes we show the quantized barycenters are colinear to first order, and actually colinear in the case of polygons. The proofs use Ehrhart theory, convexity arguments, and toric algebra. As applications we derive the complete asymptotic expansion of the Fujita--Odaka stability thresholds $δ_k$ on arbitrary polarizations on (possibly singular) toric varieties. In fact, we show they are rational functions of $k$ for sufficiently large $k$. This gives the first general result on Tian's stabilization problem for $δ_k$-invariants for (possibly singular) toric Fanos: $δ_k$ stabilize in $k$ if and only if they are all equal to $1$, and when smooth if and only if asymptotically Chow semistable. We also relate the asymptotic expansions to higher Donaldson--Futaki invariants of test configurations motivated by Ehrhart theory, and unify in passing previous results of Donaldson, Ono, Futaki, and Rubinstein--Tian--Zhang. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: with an appendix by Yaxiong Liu

arXiv:2406.08595 [pdf, other]

Approximating Maximum Matching Requires Almost Quadratic Time

Authors: Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein

Abstract: We study algorithms for estimating the size of maximum matching. This problem has been subject to extensive research. For $n$-vertex graphs, Bhattacharya, Kiss, and Saranurak [FOCS'23] (BKS) showed that an estimate that is within $\varepsilon n$ of the optimal solution can be achieved in $n^{2-Ω_\varepsilon(1)}$ time, where $n$ is the number of vertices. While this is subquadratic in $n$ for any f… ▽ More We study algorithms for estimating the size of maximum matching. This problem has been subject to extensive research. For $n$-vertex graphs, Bhattacharya, Kiss, and Saranurak [FOCS'23] (BKS) showed that an estimate that is within $\varepsilon n$ of the optimal solution can be achieved in $n^{2-Ω_\varepsilon(1)}$ time, where $n$ is the number of vertices. While this is subquadratic in $n$ for any fixed $\varepsilon > 0$, it gets closer and closer to the trivial $Θ(n^2)$ time algorithm that reads the entire input as $\varepsilon$ is made smaller and smaller. In this work, we close this gap and show that the algorithm of BKS is close to optimal. In particular, we prove that for any fixed $δ> 0$, there is another fixed $\varepsilon = \varepsilon(δ) > 0$ such that estimating the size of maximum matching within an additive error of $\varepsilon n$ requires $Ω(n^{2-δ})$ time in the adjacency list model. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.02357 [pdf, ps, other]

The complexity of approximate (coarse) correlated equilibrium for incomplete information games

Authors: Binghui Peng, Aviad Rubinstein

Abstract: We study the iteration complexity of decentralized learning of approximate correlated equilibria in incomplete information games. On the negative side, we prove that in $\mathit{extensive}$-$\mathit{form}$ $\mathit{games}$, assuming $\mathsf{PPAD} \not\subset \mathsf{TIME}(n^{\mathsf{polylog}(n)})$, any polynomial-time learning algorithms must take at least $2^{\log_2^{1-o(1)}(|\mathcal{I}|)}$ i… ▽ More We study the iteration complexity of decentralized learning of approximate correlated equilibria in incomplete information games. On the negative side, we prove that in $\mathit{extensive}$-$\mathit{form}$ $\mathit{games}$, assuming $\mathsf{PPAD} \not\subset \mathsf{TIME}(n^{\mathsf{polylog}(n)})$, any polynomial-time learning algorithms must take at least $2^{\log_2^{1-o(1)}(|\mathcal{I}|)}$ iterations to converge to the set of $ε$-approximate correlated equilibrium, where $|\mathcal{I}|$ is the number of nodes in the game and $ε> 0$ is an absolute constant. This nearly matches, up to the $o(1)$ term, the algorithms of [PR'24, DDFG'24] for learning $ε$-approximate correlated equilibrium, and resolves an open question of Anagnostides, Kalavasis, Sandholm, and Zampetakis [AKSZ'24]. Our lower bound holds even for the easier solution concept of $ε$-approximate $\mathit{coarse}$ correlated equilibrium On the positive side, we give uncoupled dynamics that reach $ε$-approximate correlated equilibria of a $\mathit{Bayesian}$ $\mathit{game}$ in polylogarithmic iterations, without any dependence of the number of types. This demonstrates a separation between Bayesian games and extensive-form games. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2404.16032 [pdf, other]

Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts

Authors: Evgenii Kortukov, Alexander Rubinstein, Elisa Nguyen, Seong Joon Oh

Abstract: Retrieval-augmented generation (RAG) mitigates many problems of fully parametric language models, such as temporal degradation, hallucinations, and lack of grounding. In RAG, the model's knowledge can be updated from documents provided in context. This leads to cases of conflict between the model's parametric knowledge and the contextual information, where the model may not always update its knowl… ▽ More Retrieval-augmented generation (RAG) mitigates many problems of fully parametric language models, such as temporal degradation, hallucinations, and lack of grounding. In RAG, the model's knowledge can be updated from documents provided in context. This leads to cases of conflict between the model's parametric knowledge and the contextual information, where the model may not always update its knowledge. Previous work studied knowledge conflicts by creating synthetic documents that contradict the model's correct parametric answers. We present a framework for studying knowledge conflicts in a realistic setup. We update incorrect parametric knowledge using real conflicting documents. This reflects how knowledge conflicts arise in practice. In this realistic scenario, we find that knowledge updates fail less often than previously reported. In cases where the models still fail to update their answers, we find a parametric bias: the incorrect parametric answer appearing in context makes the knowledge update likelier to fail. These results suggest that the factual parametric knowledge of LLMs can negatively influence their reading abilities and behaviors. Our code is available at https://github.com/kortukov/realistic_knowledge_conflicts/. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.15399 [pdf, other]

JWST detections of amorphous and crystalline HDO ice toward massive protostars

Authors: Katerina Slavicinska, Ewine F. van Dishoeck, Łukasz Tychoniec, Pooneh Nazari, Adam E. Rubinstein, Robert Gutermuth, Himanshu Tyagi, Yuan Chen, Nashanty G. C. Brunken, Will R. M. Rocha, P. Manoj, Mayank Narang, S. Thomas Megeath, Yao-Lun Yang, Leslie W. Looney, John J. Tobin, Henrik Beuther, Tyler L. Bourke, Harold Linnartz, Samuel Federman, Dan M. Watson, Hendrik Linz

Abstract: This work aims to utilize the increased sensitivity and resolution of the JWST to quantify the HDO/H$_{2}$O ratio in ices toward young stellar objects (YSOs) and to determine if the HDO/H$_{2}$O ratios measured in the gas phase toward massive YSOs (MYSOs) are representative of the ratios in their ice envelopes. Two protostars observed in the Investigating Protostellar Accretion (IPA) program using… ▽ More This work aims to utilize the increased sensitivity and resolution of the JWST to quantify the HDO/H$_{2}$O ratio in ices toward young stellar objects (YSOs) and to determine if the HDO/H$_{2}$O ratios measured in the gas phase toward massive YSOs (MYSOs) are representative of the ratios in their ice envelopes. Two protostars observed in the Investigating Protostellar Accretion (IPA) program using JWST NIRSpec were analyzed: HOPS 370, an intermediate-mass YSO (IMYSO), and IRAS 20126+4104, a MYSO. The HDO ice toward these sources was detected above the 3$σ$ level and quantified via its 4.1 $μ$m band. The contributions from the CH$_{3}$OH combination modes to the observed optical depth in this spectral region were constrained via the CH$_{3}$OH 3.53 $μ$m band to ensure that the integrated optical depth of the HDO feature was not overestimated. H$_{2}$O ice was quantified via its 3 $μ$m band. From these fits, ice HDO/H$_{2}$O abundance ratios of 4.6$\pm$1.8$\times$10$^{-3}$ and 2.6$\pm$1.2$\times$10$^{-3}$ are obtained for HOPS 370 and IRAS 20126+4104, respectively. The simultaneous detections of both crystalline HDO and crystalline H$_{2}$O corroborate the assignment of the observed feature at 4.1 $μ$m to HDO ice. The ice HDO/H$_{2}$O ratios are similar to the highest reported gas HDO/H$_{2}$O ratios measured toward MYSOs as well as the hot inner regions of isolated low-mass protostars, suggesting that at least some of the gas HDO/H$_{2}$O ratios measured toward massive hot cores are representative of the HDO/H$_{2}$O ratios in ices. The need for an H$_{2}$O-rich CH$_{3}$OH component in the CH$_{3}$OH ice analysis supports recent experimental and observational results that indicate that some CH$_{3}$OH ice may form prior to the CO freeze-out stage in H$_{2}$O-rich ice layers. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: Accepted for publication in A&A. 23 pages, 17 figures, 10 tables

arXiv:2404.07299 [pdf, ps, other]

JWST/MIRI detection of suprathermal OH rotational emissions: probing the dissociation of the water by Lyman alpha photons near the protostar HOPS 370

Authors: David A. Neufeld, P. Manoj, Himanshu Tyagi, Mayank Narang, Dan M. Watson, S. Thomas Megeath, Ewine F. Van Dishoeck, Robert A. Gutermuth, Thomas Stanke, Yao-Lun Yang, Adam E. Rubinstein, Guillem Anglada, Henrik Beuther, Alessio Caratti o Garatti, Neal J. Evans II, Samuel Federman, William J. Fischer, Joel Green, Pamela Klaassen, Leslie W. Looney, Mayra Osorio, Pooneh Nazari, John J. Tobin, Lukasz Tychoniec, Scott Wolk

Abstract: Using the MIRI/MRS spectrometer on JWST, we have detected pure rotational, suprathermal OH emissions from the vicinity of the intermediate-mass protostar HOPS 370 (OMC2/FIR3). These emissions are observed from shocked knots in a jet/outflow, and originate in states of rotational quantum number as high as 46 that possess excitation energies as large as $E_U/k = 4.65 \times 10^4$ K. The relative str… ▽ More Using the MIRI/MRS spectrometer on JWST, we have detected pure rotational, suprathermal OH emissions from the vicinity of the intermediate-mass protostar HOPS 370 (OMC2/FIR3). These emissions are observed from shocked knots in a jet/outflow, and originate in states of rotational quantum number as high as 46 that possess excitation energies as large as $E_U/k = 4.65 \times 10^4$ K. The relative strengths of the observed OH lines provide a powerful diagnostic of the ultraviolet radiation field in a heavily-extinguished region ($A_V \sim 10 - 20$) where direct UV observations are impossible. To high precision, the OH line strengths are consistent with a picture in which the suprathermal OH states are populated following the photodissociation of water in its $\tilde B - X$ band by ultraviolet radiation produced by fast ($\sim 80\,\rm km\,s^{-1}$) shocks along the jet. The observed dominance of emission from symmetric ($A^\prime$) OH states over that from antisymmetric ($A^{\prime\prime}$) states provides a distinctive signature of this particular population mechanism. Moreover, the variation of intensity with rotational quantum number suggests specifically that Ly$α$ radiation is responsible for the photodissociation of water, an alternative model with photodissociation by a 10$^4$ K blackbody being disfavored at a high level of significance. Using measurements of the Br$α$ flux to estimate the Ly$α$ production rate, we find that $\sim 4\%$ of the Ly$α$ photons are absorbed by water. Combined with direct measurements of water emissions in the $ν_2 = 1 -0$ band, the OH observations promise to provide key constraints on future models for the diffusion of Ly$α$ photons in the vicinity of a shock front. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 30 pages, 7 figures. Accepted for publication in ApJ Letters

arXiv:2403.17262 [pdf, ps, other]

Tian's stabilization problem for toric Fanos

Authors: Chenzi **, Yanir A. Rubinstein

Abstract: In 1988, Tian posed the stabilization problem for equivariant global log canonical thresholds. We solve it in the case of toric Fano manifolds. This is the first general result on Tian's problem. A key new estimate involves expressing complex singularity exponents associated to orbits of a group action in terms of support and gauge functions from convex geometry. These techniques also yield a reso… ▽ More In 1988, Tian posed the stabilization problem for equivariant global log canonical thresholds. We solve it in the case of toric Fano manifolds. This is the first general result on Tian's problem. A key new estimate involves expressing complex singularity exponents associated to orbits of a group action in terms of support and gauge functions from convex geometry. These techniques also yield a resolution of another conjecture of Tian from 2012 on more general thresholds associated to Grassmannians of plurianticanonical series. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.09794 [pdf, ps, other]

The Query Complexity of Contracts

Authors: Paul Dütting, Michal Feldman, Yoav Gal-Tzur, Aviad Rubinstein

Abstract: Algorithmic contract design is a new frontier in the intersection of economics and computation, with combinatorial contracts being a core problem in this domain. A central model within combinatorial contracts explores a setting where a principal delegates the execution of a task, which can either succeed or fail, to an agent. The agent can choose any subset among a given set of costly actions, whe… ▽ More Algorithmic contract design is a new frontier in the intersection of economics and computation, with combinatorial contracts being a core problem in this domain. A central model within combinatorial contracts explores a setting where a principal delegates the execution of a task, which can either succeed or fail, to an agent. The agent can choose any subset among a given set of costly actions, where every subset is associated with a success probability. The principal incentivizes the agent through a contract that specifies the payment upon success of the task. A natural setting of interest is one with submodular success probabilities. It is known that finding the optimal contract for the principal is $\mathsf{NP}$-hard, but the hardness result is derived from the hardness of demand queries. A major open problem is whether the hardness arises solely from the hardness of demand queries, or if the complexity lies within the optimal contract problem itself. In other words: does the problem retain its hardness, even when provided access to a demand oracle? We resolve this question in the affirmative, showing that any algorithm that computes the optimal contract for submodular success probabilities requires an exponential number of demand queries, thus settling the query complexity problem. △ Less

Submitted 14 March, 2024; originally announced March 2024.

arXiv:2403.07968 [pdf, other]

Do Deep Neural Network Solutions Form a Star Domain?

Authors: Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad, Seong Joon Oh

Abstract: It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means that a linear path can connect two independent solutions with low loss, given the weights of one of the models are appropriately permuted. However, current methods to test this theory often require ver… ▽ More It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means that a linear path can connect two independent solutions with low loss, given the weights of one of the models are appropriately permuted. However, current methods to test this theory often require very wide networks to succeed. In this work, we conjecture that more generally, the SGD solution set is a "star domain" that contains a "star model" that is linearly connected to all the other solutions via paths with low loss values, modulo permutations. We propose the Starlight algorithm that finds a star model of a given learning task. We validate our claim by showing that this star model is linearly connected with other independently found solutions. As an additional benefit of our study, we demonstrate better uncertainty estimates on the Bayesian Model Averaging over the obtained star domain. Further, we demonstrate star models as potential substitutes for model ensembles. Our code is available at https://github.com/aktsonthalia/starlight. △ Less

Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

arXiv:2402.08637 [pdf, other]

Strategizing against No-Regret Learners in First-Price Auctions

Authors: Aviad Rubinstein, Junyao Zhao

Abstract: We study repeated first-price auctions and general repeated Bayesian games between two players, where one player, the learner, employs a no-regret learning algorithm, and the other player, the optimizer, knowing the learner's algorithm, strategizes to maximize its own utility. For a commonly used class of no-regret learning algorithms called mean-based algorithms, we show that (i) in standard (i.e… ▽ More We study repeated first-price auctions and general repeated Bayesian games between two players, where one player, the learner, employs a no-regret learning algorithm, and the other player, the optimizer, knowing the learner's algorithm, strategizes to maximize its own utility. For a commonly used class of no-regret learning algorithms called mean-based algorithms, we show that (i) in standard (i.e., full-information) first-price auctions, the optimizer cannot get more than the Stackelberg utility -- a standard benchmark in the literature, but (ii) in Bayesian first-price auctions, there are instances where the optimizer can achieve much higher than the Stackelberg utility. On the other hand, Mansour et al. (2022) showed that a more sophisticated class of algorithms called no-polytope-swap-regret algorithms are sufficient to cap the optimizer's utility at the Stackelberg utility in any repeated Bayesian game (including Bayesian first-price auctions), and they pose the open question whether no-polytope-swap-regret algorithms are necessary to cap the optimizer's utility. For general Bayesian games, under a reasonable and necessary condition, we prove that no-polytope-swap-regret algorithms are indeed necessary to cap the optimizer's utility and thus answer their open question. For Bayesian first-price auctions, we give a simple improvement of the standard algorithm for minimizing the polytope swap regret by exploiting the structure of Bayesian first-price auctions. △ Less

Submitted 13 February, 2024; originally announced February 2024.

arXiv:2402.04314 [pdf, other]

doi 10.1051/0004-6361/202348718

JWST observations of $^{13}$CO$_{2}$ ice: Tracing the chemical environment and thermal history of ices in protostellar envelopes

Authors: Nashanty G. C. Brunken, Will R. M. Rocha, Ewine F. van Dishoeck, Robert Gutermuth, Himanshu Tyagi, Katerina Slavicinska, Pooneh Nazari, S. Thomas Megeath, Neal J. Evans II, Mayank Narang, P. Manoj, Adam E. Rubinstein, Dan M. Watson, Leslie W. Looney, Harold Linnartz, Alessio Caratti o Garatti, Henrik Beuther, Hendrik Linz, Pamela Klaassen, Charles A. Poteet, Samuel Federman, Guillem Anglada, Prabhani Atnagulov, Tyler L. Bourke, William J. Fischer , et al. (16 additional authors not shown)

Abstract: The structure and composition of simple ices can be modified during stellar evolution by protostellar heating. Key to understanding the involved processes are thermal and chemical tracers that can diagnose the history and environment of the ice. The 15.2 $μ$m bending mode of $^{12}$CO$_2$ has proven to be a valuable tracer of ice heating events but suffers from grain shape and size effects. A viab… ▽ More The structure and composition of simple ices can be modified during stellar evolution by protostellar heating. Key to understanding the involved processes are thermal and chemical tracers that can diagnose the history and environment of the ice. The 15.2 $μ$m bending mode of $^{12}$CO$_2$ has proven to be a valuable tracer of ice heating events but suffers from grain shape and size effects. A viable alternative tracer is the weaker $^{13}$CO$_2$ isotopologue band at 4.39 $μ$m which has now become accessible at high S/N with the $\textit{James Webb}$ Space Telescope (JWST). We present JWST NIRSpec observations of $^{13}$CO$_2$ ice in five deeply embedded Class 0 sources spanning a wide range in luminosities (0.2 - 10$^4$ L$_{\odot}$ ) taken as part of the Investigating Protostellar Accretion Across the Mass Spectrum (IPA) program. The band profiles vary significantly, with the most luminous sources showing a distinct narrow peak at 4.38 $μ$m. We first apply a phenomenological approach and show that a minimum of 3-4 Gaussian profiles are needed to fit the $^{13}$CO$_2$ absorption feature. We then combine these findings with laboratory data and show that a 15.2 $μ$m $^{12}$CO$_2$ band inspired five-component decomposition can be applied for the isotopologue band where each component is representative of CO$_2$ ice in a specific molecular environment. The final solution consists of cold mixtures of CO$_2$ with CH$_3$OH, H$_2$O and CO as well as segregated heated pure CO$_2$ ice. Our results are in agreement with previous studies of the $^{12}$CO$_2$ ice band, further confirming that $^{13}$CO$_{2}$ is a useful alternative tracer of protostellar heating events. We also propose an alternative solution consisting only of heated CO$_2$:CH$_3$OH and CO$_2$:H$_2$O ices and warm pure CO$_2$ ice for decomposing the ice profiles of the two most luminous sources in our sample. △ Less

Submitted 7 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Journal ref: A&A 685, A27 (2024)

arXiv:2401.10992 [pdf, ps, other]

Two-dimensional Błocki, $L^p$-Mahler, and Bourgain conjectures

Authors: Vlassis Mastrantonis, Yanir A. Rubinstein

Abstract: We confirm, in dimension two, Blocki's conjectures on sharp lower bounds for Bergman kernels of tube domains. To that end, we verify a broader class of $L^p$-Mahler conjectures due to Berndtsson and the authors, where $p=1$ are Blocki's conjecture, and $p=\infty$ are Mahler's conjectures. The proofs are technically challenging as the $L^p$-Mahler volume is considerably harder to deal with analytic… ▽ More We confirm, in dimension two, Blocki's conjectures on sharp lower bounds for Bergman kernels of tube domains. To that end, we verify a broader class of $L^p$-Mahler conjectures due to Berndtsson and the authors, where $p=1$ are Blocki's conjecture, and $p=\infty$ are Mahler's conjectures. The proofs are technically challenging as the $L^p$-Mahler volume is considerably harder to deal with analytically compared to Mahler's volume, and furthermore duality is lost. In addition, unlike in the classical Mahler setting, the non-symmetric setting is considerably more involved than the symmetric one. The proofs involve studying the effect of Mahler's classical sliding of vertices on two-dimensional polytopes on the $L^p$-polar body (no longer a polytope). Some arguments are inspired by works of Campi--Gronchi and Meyer--Reisner on volumes of classical polar bodies of shadow systems. In passing, we also explore how Mahler's sliding affects the isotropic constant and offer a direct proof of Bourgain's strong hyperplane conjectures in dimension two, which was previously proven indirectly through the work of Bisztriczky--Böröczky and Meckes. △ Less

Submitted 19 January, 2024; originally announced January 2024.

arXiv:2401.07901 [pdf, other]

doi 10.1051/0004-6361/202348695

Hunt for complex cyanides in protostellar ices with JWST: Tentative detection of CH$_3$CN and C$_2$H$_5$CN

Authors: P. Nazari, W. R. M. Rocha, A. E. Rubinstein, K. Slavicinska, M. G. Rachid, E. F. van Dishoeck, S. T. Megeath, R. Gutermuth, H. Tyagi, N. Brunken, M. Narang, P. Manoj, D. M. Watson, N. J. Evans II, S. Federman, J. Muzerolle Page, G. Anglada, H. Beuther, P. Klaassen, L. W. Looney, M. Osorio, T. Stanke, Y. -L. Yang

Abstract: Nitrogen-bearing complex organic molecules have been commonly detected in the gas phase but not yet in interstellar ices. This has led to the long-standing question of whether these molecules form in the gas phase or in ices. $\textit{James Webb}$ Space Telescope ($\textit{JWST}$) offers the sensitivity, spectral resolution, and wavelength coverage needed to detect them in ices and investigate whe… ▽ More Nitrogen-bearing complex organic molecules have been commonly detected in the gas phase but not yet in interstellar ices. This has led to the long-standing question of whether these molecules form in the gas phase or in ices. $\textit{James Webb}$ Space Telescope ($\textit{JWST}$) offers the sensitivity, spectral resolution, and wavelength coverage needed to detect them in ices and investigate whether their abundance ratios are similar in gas and ice. We report the first tentative detection of CH$_3$CN, C$_2$H$_5$CN, and the simple molecule, N$_2$O, based on the CN-stretch band in interstellar ices toward three (HOPS 153, HOPS 370, and IRAS 20126+4104) out of the five protostellar systems observed as part of the Investigating Protostellar Accretion (IPA) GO program with $\textit{JWST}$-NIRSpec. We also provide upper limits for the two other sources with smaller luminosities in the sample. We detect OCN$^-$ in the ices of all sources with typical CH$_3$CN/OCN$^-$ ratios of around 1. Ice and gas column density ratios of the nitrogen-bearing species with respect to each other are better matched than those with respect to methanol, which are a factor of ${\sim}5$ larger in the ices than the gas. We attribute the elevated ice column densities with respect to methanol to the difference in snowline locations of nitrogen-bearing molecules and of methanol, biasing the gas-phase observations toward fewer nitrogen-bearing molecules. Moreover, we find tentative evidence for enhancement of OCN$^-$, CH$_3$CN, and C$_2$H$_5$CN in warmer ices, although formation of these molecules likely starts along with methanol in the cold prestellar phase. Future surveys combining NIRSpec and MIRI, and additional laboratory spectroscopic measurements of C$_2$H$_5$CN ice, are necessary for robust detection and conclusions on the formation history of complex cyanides. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: Accepted for publication in A&A

Journal ref: A&A 686, A71 (2024)

arXiv:2312.07807 [pdf, other]

IPA. Class 0 Protostars Viewed in CO Emission Using JWST/NIRSpec

Authors: Adam E. Rubinstein, Himanshu Tyagi, Pooneh Nazari, Robert Gutermuth, Samuel Federman, Mayank Narang, Will R. M. Rocha, Nashanty Brunken, Katie Slavicinska, Neal J. Evans II, Joel D. Green, Dan M. Watson, Henrik Beuther, Tyler Bourke, Alessio Caratti o Garatti, Lee Hartmann, Pamela Klaassen, Hendrik Linz, Leslie W. Looney, Puravankara Manoj, S. Thomas Megeath, James Muzerolle Page, Thomas Stanke, John J. Tobin, Ewine F. van Dishoeck , et al. (2 additional authors not shown)

Abstract: We investigate the bright CO fundamental emission in the central regions of five Class 0 protostars using the JWST's Near-Infrared Spectrograph (NIRSpec) and provide clues to what processes excite the gas. CO line emission images are extracted for a forest of $\sim$150 ro-vibrational transitions from two vibrational bands, $v=1-0$ and $v=2-1$. However, ${}^{13}$CO is not detected, and thus we can… ▽ More We investigate the bright CO fundamental emission in the central regions of five Class 0 protostars using the JWST's Near-Infrared Spectrograph (NIRSpec) and provide clues to what processes excite the gas. CO line emission images are extracted for a forest of $\sim$150 ro-vibrational transitions from two vibrational bands, $v=1-0$ and $v=2-1$. However, ${}^{13}$CO is not detected, and thus we can only statistically constrain the ${}^{12}$CO optical depth. Using noise measurements to determine upper limits to the ${}^{13}$CO emission, the flux ratio of ${}^{12}$CO/${}^{13}$CO indicates that the ${}^{12}$CO emission itself is not optically thick for ro-vibrational transitions with upper state rotational quantum number $J_u \geq 15$. We construct population diagrams to estimate the rotational temperature and number of molecules from extinction-corrected CO line fluxes assuming CO emission is optically thin. Two different temperature components are required for $v=1$ ($\sim600-1000$ K and $\sim1500-3500$ K), while one hotter component is required for $v=2$ ($\sim2000-6000$ K). The vibrational temperature is $\sim 900$ K among our sources and shows no trend with luminosity. Using vibrational temperatures and the inferred total amount of CO molecules for our sources, the total warm gas mass correlates strongly with luminosity ranging from $\sim$0.1 $\rm M_{Earth}$ for the low-mass protostars to $\sim$1 M$_{\rm sun}$ for the high-mass protostars. Interpreting the distribution of gas column densities and temperatures depends on radiative and chemical processes affecting CO. The presence of a $v=2$ population may indicate CO gas radiatively excited. Selective UV photodissociation of CO isotopologues around our high-mass sources may explain their depletion of ${}^{13}$CO. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: 30 pages, 6 figures, 5 tables, submitted to ApJ

arXiv:2311.16176 [pdf, other]

Mitigating Biases with Diverse Ensembles and Diffusion Models

Authors: Luca Scimeca, Alexander Rubinstein, Damien Teney, Seong Joon Oh, Armand Mihai Nicolicioiu, Yoshua Bengio

Abstract: Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to a phenomenon known as shortcut learning, where a model relies on erroneous, easy-to-learn cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs) to mitigate this form of bias. We show that at particular… ▽ More Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to a phenomenon known as shortcut learning, where a model relies on erroneous, easy-to-learn cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs) to mitigate this form of bias. We show that at particular training intervals, DPMs can generate images with novel feature combinations, even when trained on samples displaying correlated input features. We leverage this crucial property to generate synthetic counterfactuals to increase model diversity via ensemble disagreement. We show that DPM-guided diversification is sufficient to remove dependence on primary shortcut cues, without a need for additional supervised signals. We further empirically quantify its efficacy on several diversification objectives, and finally show improved generalization and diversification performance on par with prior work that relies on auxiliary data collection. △ Less

Submitted 6 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

Comments: arXiv admin note: text overlap with arXiv:2310.02230

arXiv:2311.12444 [pdf, ps, other]

Quantum Communication Complexity of Classical Auctions

Authors: Aviad Rubinstein, Zixin Zhou

Abstract: We study the fundamental, classical mechanism design problem of single-buyer multi-item Bayesian revenue-maximizing auctions under the lens of communication complexity between the buyer and the seller. Specifically, we ask whether using quantum communication can be more efficient than classical communication. We have two sets of results, revealing a surprisingly rich landscape - which looks quite… ▽ More We study the fundamental, classical mechanism design problem of single-buyer multi-item Bayesian revenue-maximizing auctions under the lens of communication complexity between the buyer and the seller. Specifically, we ask whether using quantum communication can be more efficient than classical communication. We have two sets of results, revealing a surprisingly rich landscape - which looks quite different from both quantum communication in non-strategic parties, and classical communication in mechanism design. We first study the expected communication complexity of approximately optimal auctions. We give quantum auction protocols for buyers with unit-demand or combinatorial valuations that obtain an arbitrarily good approximation of the optimal revenue while running in exponentially more efficient communication compared to classical approximately optimal auctions. However, these auctions come with the caveat that they may require the seller to charge exponentially large payments from a deviating buyer. We show that this caveat is necessary - we give an exponential lower bound on the product of the expected quantum communication and the maximum payment. We then study the worst-case communication complexity of exactly optimal auctions in an extremely simple setting: additive buyer's valuations over two items. We show the following separations: 1. There exists a prior where the optimal classical auction protocol requires infinitely many bits, but a one-way message of 1 qubit and 2 classical bits suffices. 2. There exists a prior where no finite one-way quantum auction protocol can obtain the optimal revenue. However, there is a barely-interactive revenue-optimal quantum auction protocol. 3. There exists a prior where no multi-round quantum auction protocol with a finite bound on communication complexity can obtain the optimal revenue. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.09359 [pdf, other]

Local Computation Algorithms for Maximum Matching: New Lower Bounds

Authors: Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein

Abstract: We study local computation algorithms (LCA) for maximum matching. An LCA does not return its output entirely, but reveals parts of it upon query. For matchings, each query is a vertex $v$; the LCA should return whether $v$ is matched -- and if so to which neighbor -- while spending a small time per query. In this paper, we prove that any LCA that computes a matching that is at most an additive o… ▽ More We study local computation algorithms (LCA) for maximum matching. An LCA does not return its output entirely, but reveals parts of it upon query. For matchings, each query is a vertex $v$; the LCA should return whether $v$ is matched -- and if so to which neighbor -- while spending a small time per query. In this paper, we prove that any LCA that computes a matching that is at most an additive of $εn$ smaller than the maximum matching in $n$-vertex graphs of maximum degree $Δ$ must take at least $Δ^{Ω(1/\varepsilon)}$ time. This comes close to the existing upper bounds that take $(Δ/ε)^{O(1/ε^2)} polylog(n)$ time. In terms of sublinear time algorithms, our techniques imply that any algorithm that estimates the size of maximum matching up to an additive error of $εn$ must take $Δ^{Ω(1/ε)}$ time. This negatively resolves a decade old open problem of the area (see Open Problem 39 of sublinear.info) on whether such estimates can be achieved in $poly(Δ/ε)$ time. △ Less

Submitted 15 November, 2023; originally announced November 2023.

arXiv:2311.02075 [pdf, other]

Envy-Free Cake-Cutting for Four Agents

Authors: Alexandros Hollender, Aviad Rubinstein

Abstract: In the envy-free cake-cutting problem we are given a resource, usually called a cake and represented as the $[0,1]$ interval, and a set of $n$ agents with heterogeneous preferences over pieces of the cake. The goal is to divide the cake among the $n$ agents such that no agent is envious of any other agent. Even under a very general preferences model, this fundamental fair division problem is known… ▽ More In the envy-free cake-cutting problem we are given a resource, usually called a cake and represented as the $[0,1]$ interval, and a set of $n$ agents with heterogeneous preferences over pieces of the cake. The goal is to divide the cake among the $n$ agents such that no agent is envious of any other agent. Even under a very general preferences model, this fundamental fair division problem is known to always admit an exact solution where each agent obtains a connected piece of the cake; we study the complexity of finding an approximate solution, i.e., a connected $\varepsilon$-envy-free allocation. For monotone valuations of cake pieces, Deng, Qi, and Saberi (2012) gave an efficient ($\textsf{poly}(\log(1/\varepsilon))$ queries) algorithm for three agents and posed the open problem of four (or more) monotone agents. Even for the special case of additive valuations, Brânzei and Nisan (2022) conjectured an $Ω(1/\varepsilon)$ lower bound on the number of queries for four agents. We provide the first efficient algorithm for finding a connected $\varepsilon$-envy-free allocation with four monotone agents. We also prove that as soon as valuations are allowed to be non-monotone, the problem becomes hard: it becomes PPAD-hard, requires $\textsf{poly}(1/\varepsilon)$ queries in the black-box model, and even $\textsf{poly}(1/\varepsilon)$ communication complexity. This constitutes, to the best of our knowledge, the first intractability result for any version of the cake-cutting problem in the communication complexity model. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.19647 [pdf, ps, other]

Fast swap regret minimization and applications to approximate correlated equilibria

Authors: Binghui Peng, Aviad Rubinstein

Abstract: We give a simple and computationally efficient algorithm that, for any constant $\varepsilon>0$, obtains $\varepsilon T$-swap regret within only $T = \mathsf{polylog}(n)$ rounds; this is an exponential improvement compared to the super-linear number of rounds required by the state-of-the-art algorithm, and resolves the main open problem of [Blum and Mansour 2007]. Our algorithm has an exponential… ▽ More We give a simple and computationally efficient algorithm that, for any constant $\varepsilon>0$, obtains $\varepsilon T$-swap regret within only $T = \mathsf{polylog}(n)$ rounds; this is an exponential improvement compared to the super-linear number of rounds required by the state-of-the-art algorithm, and resolves the main open problem of [Blum and Mansour 2007]. Our algorithm has an exponential dependence on $\varepsilon$, but we prove a new, matching lower bound. Our algorithm for swap regret implies faster convergence to $\varepsilon$-Correlated Equilibrium ($\varepsilon$-CE) in several regimes: For normal form two-player games with $n$ actions, it implies the first uncoupled dynamics that converges to the set of $\varepsilon$-CE in polylogarithmic rounds; a $\mathsf{polylog}(n)$-bit communication protocol for $\varepsilon$-CE in two-player games (resolving an open problem mentioned by [Babichenko-Rubinstein'2017, Goos-Rubinstein'2018, Ganor-CS'2018]); and an $\tilde{O}(n)$-query algorithm for $\varepsilon$-CE (resolving an open problem of [Babichenko'2020] and obtaining the first separation between $\varepsilon$-CE and $\varepsilon$-Nash equilibrium in the query complexity model). For extensive-form games, our algorithm implies a PTAS for $\mathit{normal}$ $\mathit{form}$ $\mathit{correlated}$ $\mathit{equilibria}$, a solution concept often conjectured to be computationally intractable (e.g. [Stengel-Forges'08, Fujii'23]). △ Less

Submitted 14 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.19514 [pdf, other]

Approximate Earth Mover's Distance in Truly-Subquadratic Time

Authors: Lorenzo Beretta, Aviad Rubinstein

Abstract: We design an additive approximation scheme for estimating the cost of the min-weight bipartite matching problem: given a bipartite graph with non-negative edge costs and $\varepsilon > 0$, our algorithm estimates the cost of matching all but $O(\varepsilon)$-fraction of the vertices in truly subquadratic time $O(n^{2-δ(\varepsilon)})$. Our algorithm has a natural interpretation for computing the… ▽ More We design an additive approximation scheme for estimating the cost of the min-weight bipartite matching problem: given a bipartite graph with non-negative edge costs and $\varepsilon > 0$, our algorithm estimates the cost of matching all but $O(\varepsilon)$-fraction of the vertices in truly subquadratic time $O(n^{2-δ(\varepsilon)})$. Our algorithm has a natural interpretation for computing the Earth Mover's Distance (EMD), up to a $\varepsilon$-additive approximation. Notably, we make no assumptions about the underlying metric (more generally, the costs do not have to satisfy triangle inequality). Note that compared to the size of the instance (an arbitrary $n \times n$ cost matrix), our algorithm runs in {\em sublinear} time. Our algorithm can approximate a slightly more general problem: max-cardinality bipartite matching with a knapsack constraint, where the goal is to maximize the number of vertices that can be matched up to a total cost $B$. △ Less

Submitted 10 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.14061 [pdf, other]

Discovery of a collimated jet from the low luminosity protostar IRAS 16253$-$2429 in a quiescent accretion phase with the JWST

Authors: Mayank Narang, Manoj P., Himanshu Tyagi, Dan M. Watson, S. Thomas Megeath, Samuel Federman, Adam E. Rubinstein, Robert Gutermuth, Alessio Caratti o Garatti, Henrik Beuther, Tyler L. Bourke, Ewine F. Van Dishoeck, Neal J. Evans II, Guillem Anglada, Mayra Osorio, Thomas Stanke, James Muzerolle, Leslie W. Looney, Yao-Lun Yang, John J. Tobin, Pamela Klaassen, Nicole Karnath, Prabhani Atnagulov, Nashanty Brunken, William J. Fischer , et al. (14 additional authors not shown)

Abstract: Investigating Protostellar Accretion (IPA) is a JWST Cycle~1 GO program that uses NIRSpec IFU and MIRI MRS to obtain 2.9--28~$μ$m spectral cubes of young, deeply embedded protostars with luminosities of 0.2 to 10,000~L$_{\odot}$ and central masses of 0.15 to 12~M$_{\odot}$. In this Letter, we report the discovery of a highly collimated atomic jet from the Class~0 protostar IRAS~16253$-$2429, the l… ▽ More Investigating Protostellar Accretion (IPA) is a JWST Cycle~1 GO program that uses NIRSpec IFU and MIRI MRS to obtain 2.9--28~$μ$m spectral cubes of young, deeply embedded protostars with luminosities of 0.2 to 10,000~L$_{\odot}$ and central masses of 0.15 to 12~M$_{\odot}$. In this Letter, we report the discovery of a highly collimated atomic jet from the Class~0 protostar IRAS~16253$-$2429, the lowest luminosity source ($L_\mathrm{bol}$ = 0.2 $L_\odot$) in the IPA program. The collimated jet is detected in multiple [Fe~II] lines, [Ne~II], [Ni~II], and H~I lines, but not in molecular emission. The atomic jet has a velocity of about 169~$\pm$~15~km\,s$^{-1}$, after correcting for inclination. The width of the jet increases with distance from the central protostar from 23 to~60 au, corresponding to an opening angle of 2.6~$\pm$~0.5\arcdeg. By comparing the measured flux ratios of various fine structure lines to those predicted by simple shock models, we derive a shock {speed} of 54~km\,s$^{-1}$ and a preshock density of 2.0$\times10^{3}$~cm$^{-3}$ at the base of the jet. {From these quantities and using a suite of jet models and extinction laws we compute a mass loss rate between $0.4 -1.1\times10^{-10}~M_{\odot}$~yr~$^{-1}$.} The low mass loss rate is consistent with simultaneous measurements of low mass accretion rate ($2.4~\pm~0.8~\times~10^{-9}~M_{\odot}$~yr$^{-1}$) for IRAS~16253$-$2429 from JWST observations (Watson et al. in prep), indicating that the protostar is in a quiescent accretion phase. Our results demonstrate that very low-mass protostars can drive highly collimated, atomic jets, even during the quiescent phase. △ Less

Submitted 11 January, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

Comments: Accepted to ApJL. Comments and feedback welcome

arXiv:2310.09572 [pdf, other]

Wind erosion and transport on planetesimals

Authors: Alice C. Quillen, Stephen Luniewski, Adam E. Rubinstein, Jeremy Couturier, Rachel Glade, Miki Nakajima

Abstract: We consider the possibility that aeolian (wind blown) processes occur on small, 1 to 100~km diameter, planetesimals when they were embedded in the protosolar nebula. Drag from a headwind within a protostellar disk is sufficiently large to loft cm and smaller sized particles off the surface of a 10 km diameter asteroid in the inner solar system (at a few AU), and micron sized particles off the surf… ▽ More We consider the possibility that aeolian (wind blown) processes occur on small, 1 to 100~km diameter, planetesimals when they were embedded in the protosolar nebula. Drag from a headwind within a protostellar disk is sufficiently large to loft cm and smaller sized particles off the surface of a 10 km diameter asteroid in the inner solar system (at a few AU), and micron sized particles off the surface of a 10 km diameter object in the Transneptunian region. The headwind is sufficiently strong to overcome surface cohesion in the inner solar system, but not in the outer solar system. However, in the outer solar system, surface particles can be redistributed or escape due to impacts from particles that are in the protosolar disk's wind. Based on scaling crater ejecta, we estimate that impacts from particles in the headwind will lead to erosion of mass rather than accretion for planetesimals below about 6 km in diameter. The erosion limit is independent of material strength but proportional to the wind velocity. We explore the sensitivity of splash particle trajectories to particle size, headwind velocity and Reynolds number. Winds from a protostellar disk could account for Kuiper Belt Object (486958) Arrokoth's smooth undulating terrain but only during an epoch of high particle flux and low wind velocity. These conditions could have been present during and just after coalescence of Arrokoth's building blocks. △ Less

Submitted 10 January, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

arXiv:2310.08215 [pdf, other]

Trustworthy Machine Learning

Authors: Bálint Mucsányi, Michael Kirchhof, Elisa Nguyen, Alexander Rubinstein, Seong Joon Oh

Abstract: As machine learning technology gets applied to actual products and solutions, new challenges have emerged. Models unexpectedly fail to generalize to small changes in the distribution, tend to be confident on novel data they have never seen, or cannot communicate the rationale behind their decisions effectively with the end users. Collectively, we face a trustworthiness issue with the current machi… ▽ More As machine learning technology gets applied to actual products and solutions, new challenges have emerged. Models unexpectedly fail to generalize to small changes in the distribution, tend to be confident on novel data they have never seen, or cannot communicate the rationale behind their decisions effectively with the end users. Collectively, we face a trustworthiness issue with the current machine learning technology. This textbook on Trustworthy Machine Learning (TML) covers a theoretical and technical background of four key topics in TML: Out-of-Distribution Generalization, Explainability, Uncertainty Quantification, and Evaluation of Trustworthiness. We discuss important classical and contemporary research papers of the aforementioned fields and uncover and connect their underlying intuitions. The book evolved from the homonymous course at the University of Tübingen, first offered in the Winter Semester of 2022/23. It is meant to be a stand-alone product accompanied by code snippets and various pointers to further sources on topics of TML. The dedicated website of the book is https://trustworthyml.io/. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: 373 pages, textbook at the University of Tübingen

ACM Class: I.2.0

arXiv:2310.03803 [pdf]

doi 10.3847/1538-4357/ad2fa0

Investigating Protostellar Accretion-Driven Outflows Across the Mass Spectrum: JWST NIRSpec IFU 3-5~$μ$m Spectral Map** of Five Young Protostars

Authors: Samuel Federman, S. Thomas Megeath, Adam E. Rubinstein, Robert Gutermuth, Mayank Narang, Himanshu Tyagi, P. Manoj, Guillem Anglada, Prabhani Atnagulov, Henrik Beuther, Tyler L. Bourke, Nashanty Brunken, Alessio Caratti o Garatti, Neal J. Evans II, William J. Fischer, Elise Furlan, Joel Green, Nolan Habel, Lee Hartmann, Nicole Karnath, Pamela Klaassen, Hendrik Linz, Leslie W. Looney, Mayra Osorio, James Muzerolle Page , et al. (13 additional authors not shown)

Abstract: Investigating Protostellar Accretion is a Cycle 1 JWST program using the NIRSpec+MIRI integral field units to obtain 2.9--28 $μ$m spectral cubes of five young protostars with luminosities of 0.2-10,000 L$_{\odot}$ in their primary accretion phase. This paper introduces the NIRSpec 2.9--5.3 $μ$m data of the inner 840-9000 au with spatial resolutions from 28-300 au. The spectra show rising continuum… ▽ More Investigating Protostellar Accretion is a Cycle 1 JWST program using the NIRSpec+MIRI integral field units to obtain 2.9--28 $μ$m spectral cubes of five young protostars with luminosities of 0.2-10,000 L$_{\odot}$ in their primary accretion phase. This paper introduces the NIRSpec 2.9--5.3 $μ$m data of the inner 840-9000 au with spatial resolutions from 28-300 au. The spectra show rising continuum emission; deep ice absorption; emission from H$_{2}$, H~I, and [Fe~II]; and the CO fundamental series in emission and absorption. Maps of the continuum emission show scattered light cavities for all five protostars. In the cavities, collimated jets are detected in [Fe~II] for the four $< 320$~L$_{\odot}$ protostars, two of which are additionally traced in Br-$α$. Knots of [Fe~II] emission are detected toward the most luminous protostar, and knots of [FeII] emission with dynamical times of $< 30$~yrs are found in the jets of the others. While only one jet is traced in H$_2$, knots of H$_2$ and CO are detected in the jets of four protostars. H$_2$ is seen extending through the cavities, showing that they are filled by warm molecular gas. Bright H$_2$ emission is seen along the walls of a single cavity, while in three cavities narrow shells of H$_2$ emission are found, one of which has an [Fe~II] knot at its apex. These data show cavities containing collimated jets traced in atomic/ionic gas surrounded by warm molecular gas in a wide-angle wind and/or gas accelerated by bow shocks in the jets. △ Less

Submitted 24 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

Comments: 26 pages, 12 figures

Journal ref: ApJ 966 41 (2024)

arXiv:2310.02230 [pdf, other]

Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks

Authors: Luca Scimeca, Alexander Rubinstein, Armand Mihai Nicolicioiu, Damien Teney, Yoshua Bengio

Abstract: Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to shortcut learning phenomena, where a model may rely on erroneous, easy-to-learn, cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting the generation of synthetic counterfactuals using Diffusion Probabilistic Models (DPMs). We discover tha… ▽ More Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to shortcut learning phenomena, where a model may rely on erroneous, easy-to-learn, cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting the generation of synthetic counterfactuals using Diffusion Probabilistic Models (DPMs). We discover that DPMs have the inherent capability to represent multiple visual cues independently, even when they are largely correlated in the training data. We leverage this characteristic to encourage model diversity and empirically show the efficacy of the approach with respect to several diversification objectives. We show that diffusion-guided diversification can lead models to avert attention from shortcut cues, achieving ensemble diversity performance comparable to previous methods requiring additional data collection. △ Less

Submitted 18 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

Comments: Accepted at Neural Information Processing Systems(NeurIPS) 2023 - Workshop on Diffusion Models

arXiv:2309.04656 [pdf, ps, other]

A constant factor approximation for Nash social welfare with subadditive valuations

Authors: Shahar Dobzinski, Wenzheng Li, Aviad Rubinstein, Jan Vondrak

Abstract: We present a constant-factor approximation algorithm for the Nash social welfare maximization problem with subadditive valuations accessible via demand queries. More generally, we propose a template for NSW optimization by solving a configuration-type LP and using a rounding procedure for (utilitarian) social welfare as a blackbox, which could be applicable to other variants of the problem. We present a constant-factor approximation algorithm for the Nash social welfare maximization problem with subadditive valuations accessible via demand queries. More generally, we propose a template for NSW optimization by solving a configuration-type LP and using a rounding procedure for (utilitarian) social welfare as a blackbox, which could be applicable to other variants of the problem. △ Less

Submitted 8 September, 2023; originally announced September 2023.

ACM Class: F.2.2

arXiv:2305.11406 [pdf, other]

Practical algorithms and experimentally validated incentives for equilibrium-based fair division (A-CEEI)

Authors: Eric Budish, Ruiquan Gao, Abraham Othman, Aviad Rubinstein, Qianfan Zhang

Abstract: Approximate Competitive Equilibrium from Equal Incomes (A-CEEI) is an equilibrium-based solution concept for fair division of discrete items to agents with combinatorial demands. In theory, it is known that in asymptotically large markets: 1. For incentives, the A-CEEI mechanism is Envy-Free-but-for-Tie-Breaking (EF-TB), which implies that it is Strategyproof-in-the-Large (SP-L). 2. From a com… ▽ More Approximate Competitive Equilibrium from Equal Incomes (A-CEEI) is an equilibrium-based solution concept for fair division of discrete items to agents with combinatorial demands. In theory, it is known that in asymptotically large markets: 1. For incentives, the A-CEEI mechanism is Envy-Free-but-for-Tie-Breaking (EF-TB), which implies that it is Strategyproof-in-the-Large (SP-L). 2. From a computational perspective, computing the equilibrium solution is unfortunately a computationally intractable problem (in the worst-case, assuming $\textsf{PPAD}\ne \textsf{FP}$). We develop a new heuristic algorithm that outperforms the previous state-of-the-art by multiple orders of magnitude. This new, faster algorithm lets us perform experiments on real-world inputs for the first time. We discover that with real-world preferences, even in a realistic implementation that satisfies the EF-TB and SP-L properties, agents may have surprisingly simple and plausible deviations from truthful reporting of preferences. To this end, we propose a novel strengthening of EF-TB, which dramatically reduces the potential for strategic deviations from truthful reporting in our experiments. A (variant of) our algorithm is now in production: on real course allocation problems it is much faster, has zero clearing error, and has stronger incentive properties than the prior state-of-the-art implementation. △ Less

Submitted 30 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

Comments: To appear in EC 2023

arXiv:2304.14363 [pdf, other]

$L^p$-polarity, Mahler volumes, and the isotropic constant

Authors: Bo Berndtsson, Vlassis Mastrantonis, Yanir A. Rubinstein

Abstract: This article introduces $L^p$ versions of the support function of a convex body $K$ and associates to these canonical $L^p$-polar bodies $K^{\circ, p}$ and Mahler volumes $\mathcal{M}_p(K)$. Classical polarity is then seen as $L^\infty$-polarity. This one-parameter generalization of polarity leads to a generalization of the Mahler conjectures, with a subtle advantage over the original conjecture:… ▽ More This article introduces $L^p$ versions of the support function of a convex body $K$ and associates to these canonical $L^p$-polar bodies $K^{\circ, p}$ and Mahler volumes $\mathcal{M}_p(K)$. Classical polarity is then seen as $L^\infty$-polarity. This one-parameter generalization of polarity leads to a generalization of the Mahler conjectures, with a subtle advantage over the original conjecture: conjectural uniqueness of extremizers for each $p\in(0,\infty)$. We settle the upper bound by demonstrating the existence and uniqueness of an $L^p$-Santaló point and an $L^p$-Santaló inequality for symmetric convex bodies. The proof uses Ball's Brunn--Minkowski inequality for harmonic means, the classical Brunn--Minkowski inequality, symmetrization, and a systematic study of the $\mathcal{M}_p$ functionals. Using our results on the $L^p$-Santaló point and a new observation motivated by complex geometry, we show how Bourgain's slicing conjecture can be reduced to lower bounds on the $L^p$-Mahler volume coupled with a certain conjectural convexity property of the logarithm of the Monge--Ampère measure of the $L^p$-support function. We derive a suboptimal version of this convexity using Kobayashi's theorem on the Ricci curvature of Bergman metrics to illustrate this approach to slicing. Finally, we explain how Nazarov's complex analytic approach to the classical Mahler conjecture is instead precisely an approach to the $L^1$-Mahler conjecture. △ Less

Submitted 27 April, 2023; originally announced April 2023.

arXiv:2303.01673 [pdf, other]

Near Optimal Memory-Regret Tradeoff for Online Learning

Authors: Binghui Peng, Aviad Rubinstein

Abstract: In the experts problem, on each of $T$ days, an agent needs to follow the advice of one of $n$ ``experts''. After each day, the loss associated with each expert's advice is revealed. A fundamental result in learning theory says that the agent can achieve vanishing regret, i.e. their cumulative loss is within $o(T)$ of the cumulative loss of the best-in-hindsight expert. Can the agent perform wel… ▽ More In the experts problem, on each of $T$ days, an agent needs to follow the advice of one of $n$ ``experts''. After each day, the loss associated with each expert's advice is revealed. A fundamental result in learning theory says that the agent can achieve vanishing regret, i.e. their cumulative loss is within $o(T)$ of the cumulative loss of the best-in-hindsight expert. Can the agent perform well without sufficient space to remember all the experts? We extend a nascent line of research on this question in two directions: $\bullet$ We give a new algorithm against the oblivious adversary, improving over the memory-regret tradeoff obtained by [PZ23], and nearly matching the lower bound of [SWXZ22]. $\bullet$ We also consider an adaptive adversary who can observe past experts chosen by the agent. In this setting we give both a new algorithm and a novel lower bound, proving that roughly $\sqrt{n}$ memory is both necessary and sufficient for obtaining $o(T)$ regret. △ Less

Submitted 8 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

arXiv:2301.05350 [pdf, other]

Sublinear Algorithms for TSP via Path Covers

Authors: Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein, Amin Saberi

Abstract: We study sublinear time algorithms for the traveling salesman problem (TSP). First, we focus on the closely related {\em maximum path cover} problem, which asks for a collection of vertex disjoint paths that include the maximum number of edges. We show that for any fixed $ε> 0$, there is an algorithm that $(1/2 - ε)$-approximates the maximum path cover size of an $n$-vertex graph in… ▽ More We study sublinear time algorithms for the traveling salesman problem (TSP). First, we focus on the closely related {\em maximum path cover} problem, which asks for a collection of vertex disjoint paths that include the maximum number of edges. We show that for any fixed $ε> 0$, there is an algorithm that $(1/2 - ε)$-approximates the maximum path cover size of an $n$-vertex graph in $\widetilde{O}(n)$ time. This improves upon a $(3/8-ε)$-approximate $\widetilde{O}(n \sqrt{n})$-time algorithm of Chen, Kannan, and Khanna [ICALP'20]. Equipped with our path cover algorithm, we give an $\widetilde{O}(n)$ time algorithm that estimates the cost of $(1,2)$-TSP within a factor of $(1.5+ε)$ which is an improvement over a folklore $(1.75 + ε)$-approximate $\widetilde{O}(n)$-time algorithm, as well as a $(1.625+ε)$-approximate $\widetilde{O}(n\sqrt{n})$-time algorithm of [CHK ICALP'20]. For graphic TSP, we present an $\widetilde{O}(n)$ algorithm that estimates the cost of graphic TSP within a factor of $1.83$ which is an improvement over a $1.92$-approximate $\widetilde{O}(n)$ time algorithm due to [CHK ICALP'20, Behnezhad FOCS'21]. We show that the approximation can be further improved to $1.66$ using $n^{2-Ω(1)}$ time. All of our $\widetilde{O}(n)$ time algorithms are information-theoretically time-optimal up to poly log n factors. Additionally, we show that our approximation guarantees for path cover and $(1,2)$-TSP hit a natural barrier: We show better approximations require better sublinear time algorithms for the well-studied maximum matching problem. △ Less

Submitted 28 April, 2024; v1 submitted 12 January, 2023; originally announced January 2023.

arXiv:2211.15843 [pdf, other]

Sublinear Time Algorithms and Complexity of Approximate Maximum Matching

Authors: Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein

Abstract: Sublinear time algorithms for approximating maximum matching size have long been studied. Much of the progress over the last two decades on this problem has been on the algorithmic side. For instance, an algorithm of Behnezhad [FOCS'21] obtains a 1/2-approximation in $\tilde{O}(n)$ time for $n$-vertex graphs. A more recent algorithm by Behnezhad, Roghani, Rubinstein, and Saberi [SODA'23] obtains a… ▽ More Sublinear time algorithms for approximating maximum matching size have long been studied. Much of the progress over the last two decades on this problem has been on the algorithmic side. For instance, an algorithm of Behnezhad [FOCS'21] obtains a 1/2-approximation in $\tilde{O}(n)$ time for $n$-vertex graphs. A more recent algorithm by Behnezhad, Roghani, Rubinstein, and Saberi [SODA'23] obtains a slightly-better-than-1/2 approximation in $O(n^{1+ε})$ time. On the lower bound side, Parnas and Ron [TCS'07] showed 15 years ago that obtaining any constant approximation of maximum matching size requires $Ω(n)$ time. Proving any super-linear in $n$ lower bound, even for $(1-ε)$-approximations, has remained elusive since then. In this paper, we prove the first super-linear in $n$ lower bound for this problem. We show that at least $n^{1.2 - o(1)}$ queries in the adjacency list model are needed for obtaining a $(\frac{2}{3} + Ω(1))$-approximation of maximum matching size. This holds even if the graph is bipartite and is promised to have a matching of size $Θ(n)$. Our lower bound argument builds on techniques such as correlation decay that to our knowledge have not been used before in proving sublinear time lower bounds. We complement our lower bound by presenting two algorithms that run in strongly sublinear time of $n^{2-Ω(1)}$. The first algorithm achieves a $(\frac{2}{3}-ε)$-approximation; this significantly improves prior close-to-1/2 approximations. Our second algorithm obtains an even better approximation factor of $(\frac{2}{3}+Ω(1))$ for bipartite graphs. This breaks the prevalent $2/3$-approximation barrier and importantly shows that our $n^{1.2-o(1)}$ time lower bound for $(\frac{2}{3}+Ω(1))$-approximations cannot be improved all the way to $n^{2-o(1)}$. △ Less

Submitted 28 November, 2022; originally announced November 2022.

arXiv:2211.12599 [pdf, other]

doi 10.3847/1538-4357/acc401

HOPS 361-C's Jet Decelerating and Precessing Through NGC 2071 IR

Authors: Adam E. Rubinstein, Nicole Karnath, Alice C. Quillen, Samuel Federman, Joel D. Green, Edward T. Chambers, Dan M. Watson, S. Thomas Megeath

Abstract: We present a two-epoch Hubble Space Telescope (HST) study of NGC 2071 IR highlighting HOPS 361-C, a protostar producing an arced 0.2 parsec-scale jet. Proper motions for the brightest knots decrease from 350 to 100 km/s with increasing distance from the source. The [Fe II] and Pa$β$ emission line intensity ratio gives a velocity jump through each knot of 40--50 km/s. A new [O I] 63 \mic\ spectrum,… ▽ More We present a two-epoch Hubble Space Telescope (HST) study of NGC 2071 IR highlighting HOPS 361-C, a protostar producing an arced 0.2 parsec-scale jet. Proper motions for the brightest knots decrease from 350 to 100 km/s with increasing distance from the source. The [Fe II] and Pa$β$ emission line intensity ratio gives a velocity jump through each knot of 40--50 km/s. A new [O I] 63 \mic\ spectrum, taken with the German REciever for Astronomy at Terahertz frequencies (GREAT) instrument aboard Stratospheric Observatory for Infrared Astronomy (SOFIA), shows a low line-of-sight velocity indicative of high jet inclination. Proper motions and jump velocities then estimate 3D flow speed for knots. Subsequently, we model knot positions and speeds with a precessing jet that decelerates. Measurements are matched with a precession period of 1,000--3,000 years and half opening angle of $15^\circ$. The [Fe II] 1.26-to-1.64 \mic\ line intensity ratio determines visual extinction to each knot from 5--30 mag. Relative to $\sim$14 mag of extinction through the cloud from $\rm{C}^{18}$O emission maps, the jet is embedded at a 1/5 to 4/5 fractional cloud depth. Our model suggests the jet is dissipated over a 0.2 pc arc. This short distance may result from the jet swee** through a wide angle, allowing the cloud time to fill cavities opened by the jet. Precessing jets contrast with nearly unidirectional protostellar jets that puncture host clouds and can propagate significantly further. △ Less

Submitted 24 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

Comments: 25 pages, 10 figures, submitted to ApJ on 22-11-2022, revised until 8-3-2023, accepted 9-3-2023

arXiv:2211.08711 [pdf, other]

Beyond Worst-Case Budget-Feasible Mechanism Design

Authors: Aviad Rubinstein, Junyao Zhao

Abstract: Motivated by large-market applications such as crowdsourcing, we revisit the problem of budget-feasible mechanism design under a "small-bidder assumption". Anari, Goel, and Nikzad (2018) gave a mechanism that has optimal competitive ratio $1-1/e$ on worst-case instances. However, we observe that on many realistic instances, their mechanism is significantly outperformed by a simpler open clock auct… ▽ More Motivated by large-market applications such as crowdsourcing, we revisit the problem of budget-feasible mechanism design under a "small-bidder assumption". Anari, Goel, and Nikzad (2018) gave a mechanism that has optimal competitive ratio $1-1/e$ on worst-case instances. However, we observe that on many realistic instances, their mechanism is significantly outperformed by a simpler open clock auction by Ensthaler and Giebe (2014), although the open clock auction only achieves competitive ratio $1/2$ in the worst case. Is there a mechanism that gets the best of both worlds, i.e., a mechanism that is worst-case optimal and performs favorably on realistic instances? Our first main result is the design and the analysis of a natural mechanism that gives an affirmative answer to our question above: (i) We prove that on every instance, our mechanism performs at least as good as all uniform mechanisms, including Anari, Goel, and Nikzad's and Ensthaler and Giebe's mechanisms. (ii) Moreover, we empirically evaluate our mechanism on various realistic instances and observe that it beats the worst-case $1-1/e$ competitive ratio by a large margin and compares favorably to both mechanisms mentioned above. Our second main result is more interesting in theory: We show that in the semi-adversarial model of budget-smoothed analysis, where the adversary designs a single worst-case market for a distribution of budgets, our mechanism is optimal among all (including non-uniform) mechanisms; furthermore our mechanism guarantees a strictly better-than-$(1-1/e)$ expected competitive ratio for any non-trivial budget distribution regardless of the market. We complement the positive result with a characterization of the worst-case markets for any given budget distribution and prove a fairly robust hardness result that holds against any budget distribution and any mechanism. △ Less

Submitted 16 November, 2022; originally announced November 2022.

Comments: ITCS 2023

arXiv:2211.05178 [pdf, ps, other]

Fully-dynamic-to-incremental reductions with known deletion order (e.g. sliding window)

Authors: Binghui Peng, Aviad Rubinstein

Abstract: Dynamic algorithms come in three main flavors: $\mathit{incremental}$ (insertions-only), $\mathit{decremental}$ (deletions-only), or $\mathit{fully}$ $\mathit{dynamic}$ (both insertions and deletions). Fully dynamic is the holy grail of dynamic algorithm design; it is obviously more general than the other two, but is it strictly harder? Several works managed to reduce fully dynamic to the incremen… ▽ More Dynamic algorithms come in three main flavors: $\mathit{incremental}$ (insertions-only), $\mathit{decremental}$ (deletions-only), or $\mathit{fully}$ $\mathit{dynamic}$ (both insertions and deletions). Fully dynamic is the holy grail of dynamic algorithm design; it is obviously more general than the other two, but is it strictly harder? Several works managed to reduce fully dynamic to the incremental or decremental models by taking advantage of either specific structure of the incremental/decremental algorithms (e.g. [HK99, HLT01, BKS12, ADKKP16, BS80, OL81, OvL81]), or specific order of insertions/deletions (e.g. [AW14,HKNS15,KPP16]). Our goal in this work is to get a black-box fully-to-incremental reduction that is as general as possible. We find that the following conditions are necessary: $\bullet$ The incremental algorithm must have a worst-case (rather than amortized) running time guarantee. $\bullet$ The reduction must work in what we call the $\mathit{deletions}$-$\mathit{look}$-$\mathit{ahead}$ $\mathit{model}$, where the order of deletions among current elements is known in advance. A notable practical example is the "sliding window" (FIFO) order of updates. Under those conditions, we design: $\bullet$ A simple, practical, amortized-fully-dynamic to worst-case-incremental reduction with a $\log(T)$-factor overhead on the running time, where $T$ is the total number of updates. $\bullet$ A theoretical worst-case-fully-dynamic to worst-case-incremental reduction with a $\mathsf{polylog}(T)$-factor overhead on the running time. △ Less

Submitted 16 November, 2022; v1 submitted 9 November, 2022; originally announced November 2022.

arXiv:2210.13802 [pdf, ps, other]

doi 10.1112/blms.12971

Chebyshev potentials, Fubini--Study metrics, and geometry of the space of Kähler metrics

Authors: Chenzi **, Yanir A. Rubinstein

Abstract: The Chebyshev potential of a Kähler potential on a projective variety, introduced by Witt Nyström, is a convex function defined on the Okounkov body. It is a generalization of the symplectic potential of a torus-invariant Kähler potential on a toric variety, introduced by Guillemin, that is a convex function on the Delzant polytope. A folklore conjecture asserts that a curve of Chebyshev potential… ▽ More The Chebyshev potential of a Kähler potential on a projective variety, introduced by Witt Nyström, is a convex function defined on the Okounkov body. It is a generalization of the symplectic potential of a torus-invariant Kähler potential on a toric variety, introduced by Guillemin, that is a convex function on the Delzant polytope. A folklore conjecture asserts that a curve of Chebyshev potentials associated to a curve in the space of Kähler potentials is linear in the time variable if and only if the latter curve is a geodesic in the Mabuchi metric. This is classically true in the special toric setting, and in general Witt Nyström established the sufficiency. The goal of this article is to disprove this conjecture. More generally, we characterize the Fubini--Study geodesics for which the conjecture is true on projective space. The proof involves explicitly solving the Monge--Ampère equation describing geodesics on the subspace of Fubini--Study metrics and computing their Chebyshev potentials. △ Less

Submitted 25 October, 2022; originally announced October 2022.

arXiv:2206.13057 [pdf, other]

Beating Greedy Matching in Sublinear Time

Authors: Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein, Amin Saberi

Abstract: We study sublinear time algorithms for estimating the size of maximum matching in graphs. Our main result is a $(\frac{1}{2}+Ω(1))$-approximation algorithm which can be implemented in $O(n^{1+ε})$ time, where $n$ is the number of vertices and the constant $ε> 0$ can be made arbitrarily small. The best known lower bound for the problem is $Ω(n)$, which holds for any constant approximation. Existi… ▽ More We study sublinear time algorithms for estimating the size of maximum matching in graphs. Our main result is a $(\frac{1}{2}+Ω(1))$-approximation algorithm which can be implemented in $O(n^{1+ε})$ time, where $n$ is the number of vertices and the constant $ε> 0$ can be made arbitrarily small. The best known lower bound for the problem is $Ω(n)$, which holds for any constant approximation. Existing algorithms either obtain the greedy bound of $\frac{1}{2}$-approximation [Behnezhad FOCS'21], or require some assumption on the maximum degree to run in $o(n^2)$-time [Yoshida, Yamamoto, and Ito STOC'09]. We improve over these by designing a less "adaptive" augmentation algorithm for maximum matching that might be of independent interest. △ Less

Submitted 27 June, 2022; originally announced June 2022.

arXiv:2206.06188 [pdf, ps, other]

The Nazarov proof of the non-symmetric Bourgain--Milman inequality

Authors: Vlassis Mastrantonis, Yanir A. Rubinstein

Abstract: In 2012, Nazarov used Bergman kernels and Hormander's $L^2$ estimates for the $\bar\partial$-equation to give a new proof of the Bourgain--Milman theorem for symmetric convex bodies and made some suggestions on how his proof should extend to general convex bodies. This article achieves this extension and serves simultaneously as an exposition to Nazarov's work. A key new ingredient is an affine in… ▽ More In 2012, Nazarov used Bergman kernels and Hormander's $L^2$ estimates for the $\bar\partial$-equation to give a new proof of the Bourgain--Milman theorem for symmetric convex bodies and made some suggestions on how his proof should extend to general convex bodies. This article achieves this extension and serves simultaneously as an exposition to Nazarov's work. A key new ingredient is an affine invariant associated to the Bergman kernel of a tube domain. This gives the first `complex' proof of the Bourgain--Milman theorem for general convex bodies, specifically, without using symmetrization. △ Less

Submitted 13 June, 2022; originally announced June 2022.

arXiv:2206.04638 [pdf, ps, other]

On large deviation principles and the Monge--Ampère equation (following Berman, Hultgren)

Authors: Yanir A. Rubinstein

Abstract: This is mostly an exposition, aimed to be accessible to geometers, analysts, and probabilists, of a fundamental recent theorem of R. Berman with recent developments by J. Hultgren, that asserts that the second boundary value problem for the real Monge--Ampère equation admits a probabilistic interpretation, in terms of many particle limit of permanental point processes satisfying a large deviation… ▽ More This is mostly an exposition, aimed to be accessible to geometers, analysts, and probabilists, of a fundamental recent theorem of R. Berman with recent developments by J. Hultgren, that asserts that the second boundary value problem for the real Monge--Ampère equation admits a probabilistic interpretation, in terms of many particle limit of permanental point processes satisfying a large deviation principle with a rate function given explicitly using optimal transport. An alternative proof of a step in the Berman--Hultgren Theorem is presented allowing to to deal with all "tempratures" simultaneously instead of first reducing to the zero-temperature case. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: To appear in "Birational Geometry, Kahler-Einstein Metrics and Degenerations," Springer Proceedings in Mathematics & Statistics

arXiv:2204.11149 [pdf, other]

Maximizing Non-Monotone Submodular Functions over Small Subsets: Beyond $1/2$-Approximation

Authors: Aviad Rubinstein, Junyao Zhao

Abstract: In this work we give two new algorithms that use similar techniques for (non-monotone) submodular function maximization subject to a cardinality constraint. The first is an offline fixed parameter tractable algorithm that guarantees a $0.539$-approximation for all non-negative submodular functions. The second algorithm works in the random-order streaming model. It guarantees a $(1/2+c)$-approx… ▽ More In this work we give two new algorithms that use similar techniques for (non-monotone) submodular function maximization subject to a cardinality constraint. The first is an offline fixed parameter tractable algorithm that guarantees a $0.539$-approximation for all non-negative submodular functions. The second algorithm works in the random-order streaming model. It guarantees a $(1/2+c)$-approximation for symmetric functions, and we complement it by showing that no space-efficient algorithm can beat $1/2$ for asymmetric functions. To the best of our knowledge this is the first provable separation between symmetric and asymmetric submodular function maximization. △ Less

Submitted 23 April, 2022; originally announced April 2022.

Comments: ICALP 2022

arXiv:2111.10538 [pdf, other]

Approximation Algorithms for LCS and LIS with Truly Improved Running Times

Authors: Aviad Rubinstein, Saeed Seddighin, Zhao Song, Xiaorui Sun

Abstract: Longest common subsequence ($\mathsf{LCS}$) is a classic and central problem in combinatorial optimization. While $\mathsf{LCS}$ admits a quadratic time solution, recent evidence suggests that solving the problem may be impossible in truly subquadratic time. A special case of $\mathsf{LCS}$ wherein each character appears at most once in every string is equivalent to the longest increasing subseque… ▽ More Longest common subsequence ($\mathsf{LCS}$) is a classic and central problem in combinatorial optimization. While $\mathsf{LCS}$ admits a quadratic time solution, recent evidence suggests that solving the problem may be impossible in truly subquadratic time. A special case of $\mathsf{LCS}$ wherein each character appears at most once in every string is equivalent to the longest increasing subsequence problem ($\mathsf{LIS}$) which can be solved in quasilinear time. In this work, we present novel algorithms for approximating $\mathsf{LCS}$ in truly subquadratic time and $\mathsf{LIS}$ in truly sublinear time. Our approximation factors depend on the ratio of the optimal solution size over the input size. We denote this ratio by $λ$ and obtain the following results for $\mathsf{LCS}$ and $\mathsf{LIS}$ without any prior knowledge of $λ$. $\bullet$ A truly subquadratic time algorithm for $\mathsf{LCS}$ with approximation factor $Ω(λ^3)$. $\bullet$A truly sublinear time algorithm for $\mathsf{LIS}$ with approximation factor $Ω(λ^3)$. Triangle inequality was recently used by [Boroujeni, Ehsani, Ghodsi, HajiAghayi and Seddighin SODA 2018] and [Charkraborty, Das, Goldenberg, Koucky and Saks FOCS 2018] to present new approximation algorithms for edit distance. Our techniques for $\mathsf{LCS}$ extend the notion of triangle inequality to non-metric settings. △ Less

Submitted 20 November, 2021; originally announced November 2021.

Comments: FOCS 2019

arXiv:2111.07217 [pdf, other]

Cardinality constrained submodular maximization for random streams

Authors: Paul Liu, Aviad Rubinstein, Jan Vondrak, Junyao Zhao

Abstract: We consider the problem of maximizing submodular functions in single-pass streaming and secretaries-with-shortlists models, both with random arrival order. For cardinality constrained monotone functions, Agrawal, Shadravan, and Stein gave a single-pass $(1-1/e-\varepsilon)$-approximation algorithm using only linear memory, but their exponential dependence on $\varepsilon$ makes it impractical even… ▽ More We consider the problem of maximizing submodular functions in single-pass streaming and secretaries-with-shortlists models, both with random arrival order. For cardinality constrained monotone functions, Agrawal, Shadravan, and Stein gave a single-pass $(1-1/e-\varepsilon)$-approximation algorithm using only linear memory, but their exponential dependence on $\varepsilon$ makes it impractical even for $\varepsilon=0.1$. We simplify both the algorithm and the analysis, obtaining an exponential improvement in the $\varepsilon$-dependence (in particular, $O(k/\varepsilon)$ memory). Extending these techniques, we also give a simple $(1/e-\varepsilon)$-approximation for non-monotone functions in $O(k/\varepsilon)$ memory. For the monotone case, we also give a corresponding unconditional hardness barrier of $1-1/e+\varepsilon$ for single-pass algorithms in randomly ordered streams, even assuming unlimited computation. Finally, we show that the algorithms are simple to implement and work well on real world datasets. △ Less

Submitted 13 November, 2021; originally announced November 2021.

Comments: To appear in NeurIPS 2021

arXiv:2111.00652 [pdf, other]

Eguchi--Hanson metrics arising from Kahler--Einstein edge metrics

Authors: Yuxiang Ji, Yanir A. Rubinstein, Kewei Zhang

Abstract: Calabi--Hirzebruch manifolds are higher-dimensional generalizations of both the football and Hirzebruch surfaces. We construct a family of Kahler--Einstein edge metrics singular along two disjoint divisors on the Calabi--Hirzebruch manifolds and study their Gromov--Hausdorff limits when either cone angle tends to its extreme value. As a very special case, we show that the celebrated Eguchi--Hanson… ▽ More Calabi--Hirzebruch manifolds are higher-dimensional generalizations of both the football and Hirzebruch surfaces. We construct a family of Kahler--Einstein edge metrics singular along two disjoint divisors on the Calabi--Hirzebruch manifolds and study their Gromov--Hausdorff limits when either cone angle tends to its extreme value. As a very special case, we show that the celebrated Eguchi--Hanson metric arises in this way naturally as a Gromov--Hausdorff limit. We also completely describe all other (possibly rescaled) Gromov--Hausdorff limits which exhibit a wide range of behaviors, resolving in this setting a conjecture of Cheltsov--Rubinstein. This gives a new interpretation of both the Eguchi--Hanson space and Calabi's Ricci flat spaces as limits of compact singular Einstein spaces. △ Less

Submitted 27 February, 2024; v1 submitted 31 October, 2021; originally announced November 2021.

Comments: Accepted by J. Topology & Analysis

arXiv:2108.09115 [pdf, ps, other]

Does Preprocessing help in Fast Sequence Comparisons?

Authors: Elazar Goldenberg, Aviad Rubinstein, Barna Saha

Abstract: We study edit distance computation with preprocessing: the preprocessing algorithm acts on each string separately, and then the query algorithm takes as input the two preprocessed strings. This model is inspired by scenarios where we would like to compute edit distance between many pairs in the same pool of strings. Our results include: Permutation-LCS: If the LCS between two permutations has… ▽ More We study edit distance computation with preprocessing: the preprocessing algorithm acts on each string separately, and then the query algorithm takes as input the two preprocessed strings. This model is inspired by scenarios where we would like to compute edit distance between many pairs in the same pool of strings. Our results include: Permutation-LCS: If the LCS between two permutations has length $n-k$, we can compute it \textit{ exactly} with $O(n \log(n))$ preprocessing and $O(k \log(n))$ query time. Small edit distance: For general strings, if their edit distance is at most $k$, we can compute it \textit{ exactly} with $O(n\log(n))$ preprocessing and $O(k^2 \log(n))$ query time. Approximate edit distance: For the most general input, we can approximate the edit distance to within factor $(7+o(1))$ with preprocessing time $\tilde{O}(n^2)$ and query time $\tilde{O}(n^{1.5+o(1)})$. All of these results significantly improve over the state of the art in edit distance computation without preprocessing. Interestingly, by combining ideas from our algorithms with preprocessing, we provide new improved results for approximating edit distance without preprocessing in subquadratic time. △ Less

Submitted 20 August, 2021; originally announced August 2021.

ACM Class: F.2.2

arXiv:2106.08037 [pdf, other]

The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing

Authors: Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner, Reut Tsarfaty

Abstract: Modality is the linguistic ability to describe events with added information such as how desirable, plausible, or feasible they are. Modality is important for many NLP downstream tasks such as the detection of hedging, uncertainty, speculation, and more. Previous studies that address modality detection in NLP often restrict modal expressions to a closed syntactic class, and the modal sense labels… ▽ More Modality is the linguistic ability to describe events with added information such as how desirable, plausible, or feasible they are. Modality is important for many NLP downstream tasks such as the detection of hedging, uncertainty, speculation, and more. Previous studies that address modality detection in NLP often restrict modal expressions to a closed syntactic class, and the modal sense labels are vastly different across different studies, lacking an accepted standard. Furthermore, these senses are often analyzed independently of the events that they modify. This work builds on the theoretical foundations of the Georgetown Gradable Modal Expressions (GME) work by Rubinstein et al. (2013) to propose an event-based modality detection task where modal expressions can be words of any syntactic class and sense labels are drawn from a comprehensive taxonomy which harmonizes the modal concepts contributed by the different studies. We present experiments on the GME corpus aiming to detect and classify fine-grained modal concepts and associate them with their modified events. We show that detecting and classifying modal expressions is not only feasible, but also improves the detection of modal events in their own right. △ Less

Submitted 15 June, 2021; originally announced June 2021.

Comments: ACL 2021

arXiv:2104.11275 [pdf, other]

The Randomized Communication Complexity of Randomized Auctions

Authors: Aviad Rubinstein, Junyao Zhao

Abstract: We study the communication complexity of incentive compatible auction-protocols between a monopolist seller and a single buyer with a combinatorial valuation function over $n$ items. Motivated by the fact that revenue-optimal auctions are randomized [Tha04,MV10,BCKW10,Pav11,HR15] (as well as by an open problem of Babaioff, Gonczarowski, and Nisan [BGN17]),we focus on the randomized communication c… ▽ More We study the communication complexity of incentive compatible auction-protocols between a monopolist seller and a single buyer with a combinatorial valuation function over $n$ items. Motivated by the fact that revenue-optimal auctions are randomized [Tha04,MV10,BCKW10,Pav11,HR15] (as well as by an open problem of Babaioff, Gonczarowski, and Nisan [BGN17]),we focus on the randomized communication complexity of this problem (in contrast to most prior work on deterministic communication). We design simple, incentive compatible, and revenue-optimal auction-protocols whose expected communication complexity is much (in fact infinitely) more efficient than their deterministic counterparts. We also give nearly matching lower bounds on the expected communication complexity of approximately-revenue-optimal auctions. These results follow from a simple characterization of incentive compatible auction-protocols that allows us to prove lower bounds against randomized auction-protocols. In particular, our lower bounds give the first approximation-resistant, exponential separation between communication complexity of incentivizing vs implementing a Bayesian incentive compatible social choice rule, settling an open question of Fadel and Segal [FS09]. △ Less

Submitted 22 April, 2021; originally announced April 2021.

arXiv:2102.05782 [pdf, other]

Budget-Smoothed Analysis for Submodular Maximization

Authors: Aviad Rubinstein, Junyao Zhao

Abstract: The greedy algorithm for monotone submodular function maximization subject to cardinality constraint is guaranteed to approximate the optimal solution to within a $1-1/e$ factor. Although it is well known that this guarantee is essentially tight in the worst case -- for greedy and in fact any efficient algorithm, experiments show that greedy performs better in practice. We observe that for many ap… ▽ More The greedy algorithm for monotone submodular function maximization subject to cardinality constraint is guaranteed to approximate the optimal solution to within a $1-1/e$ factor. Although it is well known that this guarantee is essentially tight in the worst case -- for greedy and in fact any efficient algorithm, experiments show that greedy performs better in practice. We observe that for many applications in practice, the empirical distribution of the budgets (i.e., cardinality constraints) is supported on a wide range, and moreover, all the existing hardness results in theory break under a large perturbation of the budget. To understand the effect of the budget from both algorithmic and hardness perspectives, we introduce a new notion of budget smoothed analysis. We prove that greedy is optimal for every budget distribution, and we give a characterization for the worst-case submodular functions. Based on these results, we show that on the algorithmic side, under realistic budget distributions, greedy and related algorithms enjoy provably better approximation guarantees, that hold even for worst-case functions, and on the hardness side, there exist hard functions that are fairly robust to all the budget distributions. △ Less

Submitted 11 February, 2022; v1 submitted 10 February, 2021; originally announced February 2021.

Comments: ITCS 2022

arXiv:2012.14898 [pdf, ps, other]

Exponential Communication Separations between Notions of Selfishness

Authors: Aviad Rubinstein, Raghuvansh R. Saxena, Clayton Thomas, S. Mathew Weinberg, Junyao Zhao

Abstract: We consider the problem of implementing a fixed social choice function between multiple players (which takes as input a type $t_i$ from each player $i$ and outputs an outcome $f(t_1,\ldots, t_n)$), in which each player must be incentivized to follow the protocol. In particular, we study the communication requirements of a protocol which: (a) implements $f$, (b) implements $f$ and computes payments… ▽ More We consider the problem of implementing a fixed social choice function between multiple players (which takes as input a type $t_i$ from each player $i$ and outputs an outcome $f(t_1,\ldots, t_n)$), in which each player must be incentivized to follow the protocol. In particular, we study the communication requirements of a protocol which: (a) implements $f$, (b) implements $f$ and computes payments that make it ex-post incentive compatible (EPIC) to follow the protocol, and (c) implements $f$ and computes payments in a way that makes it dominant-strategy incentive compatible (DSIC) to follow the protocol. We show exponential separations between all three of these quantities, already for just two players. That is, we first construct an $f$ such that $f$ can be implemented in communication $c$, but any EPIC implementation of $f$ (with any choice of payments) requires communication $\exp(c)$. This answers an open question of [FS09, BBS13]. Second, we construct an $f$ such that an EPIC protocol implements $f$ with communication $C$, but all DSIC implementations of $f$ require communication $\exp(C)$. △ Less

Submitted 2 June, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

arXiv:2012.07935 [pdf, other]

Hitting the High Notes: Subset Selection for Maximizing Expected Order Statistics

Authors: Aranyak Mehta, Uri Nadav, Alexandros Psomas, Aviad Rubinstein

Abstract: We consider the fundamental problem of selecting $k$ out of $n$ random variables in a way that the expected highest or second-highest value is maximized. This question captures several applications where we have uncertainty about the quality of candidates (e.g. auction bids, search results) and have the capacity to explore only a small subset due to an exogenous constraint. For example, consider a… ▽ More We consider the fundamental problem of selecting $k$ out of $n$ random variables in a way that the expected highest or second-highest value is maximized. This question captures several applications where we have uncertainty about the quality of candidates (e.g. auction bids, search results) and have the capacity to explore only a small subset due to an exogenous constraint. For example, consider a second price auction where system constraints (e.g., costly retrieval or model computation) allow the participation of only $k$ out of $n$ bidders, and the goal is to optimize the expected efficiency (highest bid) or expected revenue (second highest bid). We study the case where we are given an explicit description of each random variable. We give a PTAS for the problem of maximizing the expected highest value. For the second-highest value, we prove a hardness result: assuming the Planted Clique Hypothesis, there is no constant factor approximation algorithm that runs in polynomial time. Surprisingly, under the assumption that each random variable has monotone hazard rate (MHR), a simple score-based algorithm, namely picking the $k$ random variables with the largest $1/\sqrt{k}$ top quantile value, is a constant approximation to the expected highest and second highest value, \emph{simultaneously}. △ Less

Submitted 14 December, 2020; originally announced December 2020.

arXiv:2012.04327 [pdf, ps, other]

Settling the complexity of Nash equilibrium in congestion games

Authors: Yakov Babichenko, Aviad Rubinstein

Abstract: We consider (i) the problem of finding a (possibly mixed) Nash equilibrium in congestion games, and (ii) the problem of finding an (exponential precision) fixed point of the gradient descent dynamics of a smooth function $f:[0,1]^n \rightarrow \mathbb{R}$. We prove that these problems are equivalent. Our result holds for various explicit descriptions of $f$, ranging from (almost general) arithmeti… ▽ More We consider (i) the problem of finding a (possibly mixed) Nash equilibrium in congestion games, and (ii) the problem of finding an (exponential precision) fixed point of the gradient descent dynamics of a smooth function $f:[0,1]^n \rightarrow \mathbb{R}$. We prove that these problems are equivalent. Our result holds for various explicit descriptions of $f$, ranging from (almost general) arithmetic circuits, to degree-$5$ polynomials. By a very recent result of [Fearnley, Goldberg, Hollender, Savani '20] this implies that these problems are PPAD$\cap$PLS-complete. As a corollary, we also obtain the following equivalence of complexity classes: CCLS = PPAD$\cap$PLS. △ Less

Submitted 23 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

arXiv:2011.12666 [pdf, ps, other]

Angle deformation of Kähler-Einstein edge metrics on Hirzebruch surfaces

Authors: Yanir A. Rubinstein, Kewei Zhang

Abstract: We construct a family of Kähler-Einstein edge metrics on all Hirzebruch surfaces using the Calabi ansatz and study their angle deformation. This allows us to verify in some special cases a conjecture of Cheltsov-Rubinstein that predicts convergence towards a non-compact Calabi-Yau fibration in the small angle limit. We also give an example of a Kähler-Einstein edge metric whose edge singularity is… ▽ More We construct a family of Kähler-Einstein edge metrics on all Hirzebruch surfaces using the Calabi ansatz and study their angle deformation. This allows us to verify in some special cases a conjecture of Cheltsov-Rubinstein that predicts convergence towards a non-compact Calabi-Yau fibration in the small angle limit. We also give an example of a Kähler-Einstein edge metric whose edge singularity is rigid, answering a question posed by Cheltsov. △ Less

Submitted 24 February, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

Comments: Final version, to appear in special issue in honor of Bernie Shiffman, Pure Appl. Math. Quart

Showing 1–50 of 138 results for author: Rubinstein, A