-
Improved bounds for calibration via stronger sign preservation games
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Maxwell Fishelson,
Noah Golowich,
Robert Kleinberg,
Princewill Okoroafor
Abstract:
A set of probabilistic forecasts is calibrated if each prediction of the forecaster closely approximates the empirical distribution of outcomes on the subset of timesteps where that prediction was made. We study the fundamental problem of online calibrated forecasting of binary sequences, which was initially studied by Foster & Vohra (1998). They derived an algorithm with $O(T^{2/3})$ calibration…
▽ More
A set of probabilistic forecasts is calibrated if each prediction of the forecaster closely approximates the empirical distribution of outcomes on the subset of timesteps where that prediction was made. We study the fundamental problem of online calibrated forecasting of binary sequences, which was initially studied by Foster & Vohra (1998). They derived an algorithm with $O(T^{2/3})$ calibration error after $T$ time steps, and showed a lower bound of $Ω(T^{1/2})$. These bounds remained stagnant for two decades, until Qiao & Valiant (2021) improved the lower bound to $Ω(T^{0.528})$ by introducing a combinatorial game called sign preservation and showing that lower bounds for this game imply lower bounds for calibration.
We introduce a strengthening of Qiao & Valiant's game that we call sign preservation with reuse (SPR). We prove that the relationship between SPR and calibrated forecasting is bidirectional: not only do lower bounds for SPR translate into lower bounds for calibration, but algorithms for SPR also translate into new algorithms for calibrated forecasting. In particular, any strategy that improves the trivial upper bound for the value of the SPR game would imply a forecasting algorithm with calibration error exponent less than 2/3, improving Foster & Vohra's upper bound for the first time. Using similar ideas, we then prove a slightly stronger lower bound than that of Qiao & Valiant, namely $Ω(T^{0.54389})$. Our lower bound is obtained by an oblivious adversary, marking the first $ω(T^{1/2})$ calibration lower bound for oblivious adversaries.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As
Authors:
Eden Avnat,
Michal Levy,
Daniel Herstain,
Elia Yanko,
Daniel Ben Joya,
Michal Tzuchman Katz,
Dafna Eshel,
Sahar Laros,
Yael Dagan,
Shahar Barami,
Joseph Mermelstein,
Shahar Ovadia,
Noam Shomron,
Varda Shalev,
Raja-Elie E. Abdulnour
Abstract:
Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited…
▽ More
Clinical problem-solving requires processing of semantic medical knowledge such as illness scripts and numerical medical knowledge of diagnostic tests for evidence-based decision-making. As large language models (LLMs) show promising results in many aspects of language-based clinical practice, their ability to generate non-language evidence-based answers to clinical questions is inherently limited by tokenization. Therefore, we evaluated LLMs' performance on two question types: numeric (correlating findings) and semantic (differentiating entities) while examining differences within and between LLMs in medical aspects and comparing their performance to humans. To generate straightforward multi-choice questions and answers (QAs) based on evidence-based medicine (EBM), we used a comprehensive medical knowledge graph (encompassed data from more than 50,00 peer-reviewed articles) and created the "EBMQA". EBMQA contains 105,000 QAs labeled with medical and non-medical topics and classified into numerical or semantic questions. We benchmarked this dataset using more than 24,500 QAs on two state-of-the-art LLMs: Chat-GPT4 and Claude3-Opus. We evaluated the LLMs accuracy on semantic and numerical question types and according to sub-labeled topics. For validation, six medical experts were tested on 100 numerical EBMQA questions. We found that both LLMs excelled more in semantic than numerical QAs, with Claude3 surpassing GPT4 in numerical QAs. However, both LLMs showed inter and intra gaps in different medical aspects and remained inferior to humans. Thus, their medical advice should be addressed carefully.
△ Less
Submitted 1 July, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Enhanced Superconductivity in SrTiO$_3$-based Interfaces via Amorphous Al2O3 Cap**
Authors:
I. Silber,
A. Azulay,
A. Basha,
D. Ketchker,
M. Baskin,
A. Yagoda,
L. Kornblum,
A. Kohn,
Y. Dagan
Abstract:
Oxide interfaces feature unique two-dimensional (2D) electronic systems with diverse electronic properties such as tunable spin-orbit interaction and superconductivity. Conductivity emerges in these interfaces when the thickness of an epitaxial polar layer surpasses a critical value, leading to charge transfer to the interface. Here, we show that depositing amorphous alumina on top of the polar ox…
▽ More
Oxide interfaces feature unique two-dimensional (2D) electronic systems with diverse electronic properties such as tunable spin-orbit interaction and superconductivity. Conductivity emerges in these interfaces when the thickness of an epitaxial polar layer surpasses a critical value, leading to charge transfer to the interface. Here, we show that depositing amorphous alumina on top of the polar oxide can reduce the critical thickness and enhance the superconducting properties for the (111) and the (100) SrTiO$_3$-based interfaces. A detailed transmission electron microscopy analysis reveals that the enhancement of the superconducting properties is linked to the expansion of the LaAlO$_3$ lattice in a direction perpendicular to the interface. We propose that the increase in the superconducting critical temperature, Tc, is a result of epitaxial strain
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
On the early stages of vapor bubble growth: From the surface-tension to the inertial regime
Authors:
Orr Avni,
Eran Sher,
Yuval Dagan
Abstract:
This paper presents a new analytical model for the early stages of vapor bubble growth in superheated liquids. The model bridges a gap in current knowledge by focusing on the surface tension-controlled, near-equilibrium growth regime and its transition to an inertia-controlled regime. A unified analytical model is derived by combining a perturbation method for the initial growth regime with a comp…
▽ More
This paper presents a new analytical model for the early stages of vapor bubble growth in superheated liquids. The model bridges a gap in current knowledge by focusing on the surface tension-controlled, near-equilibrium growth regime and its transition to an inertia-controlled regime. A unified analytical model is derived by combining a perturbation method for the initial growth regime with a complementary outer solution to model the subsequent bubble growth rate. The model successfully predicts the initial delay in bubble growth due to surface tension effects. Two non-dimensional parameters govern this delay period: the initial perturbation from the equilibrium radius and the Ohnesorge number at the onset of nucleation. The Ohnesorge number encapsulates the interplay between viscous dam** and surface tension forces acting on the bubble during its early growth stages. The analytical solutions presented here allow for quantifying the surface-tension, inertial, and transitional regimes while establishing a simple criterion for estimating the influence of thermal effects on early-stage growth. Our findings emphasize the significance of considering the surface tension delay, particularly for short timescales. The derived analytical solutions and the obtained correlation for surface-tension-induced delay may prove a practical tool and could be integrated into existing models of vapor bubble growth.
△ Less
Submitted 26 May, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
From External to Swap Regret 2.0: An Efficient Reduction and Oblivious Adversary for Large Action Spaces
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Maxwell Fishelson,
Noah Golowich
Abstract:
We provide a novel reduction from swap-regret minimization to external-regret minimization, which improves upon the classical reductions of Blum-Mansour [BM07] and Stolz-Lugosi [SL05] in that it does not require finiteness of the space of actions. We show that, whenever there exists a no-external-regret algorithm for some hypothesis class, there must also exist a no-swap-regret algorithm for that…
▽ More
We provide a novel reduction from swap-regret minimization to external-regret minimization, which improves upon the classical reductions of Blum-Mansour [BM07] and Stolz-Lugosi [SL05] in that it does not require finiteness of the space of actions. We show that, whenever there exists a no-external-regret algorithm for some hypothesis class, there must also exist a no-swap-regret algorithm for that same class. For the problem of learning with expert advice, our result implies that it is possible to guarantee that the swap regret is bounded by ε after $\log(N)^{O(1/ε)}$ rounds and with $O(N)$ per iteration complexity, where $N$ is the number of experts, while the classical reductions of Blum-Mansour and Stolz-Lugosi require $O(N/ε^2)$ rounds and at least $Ω(N^2)$ per iteration complexity. Our result comes with an associated lower bound, which -- in contrast to that in [BM07] -- holds for oblivious and $\ell_1$-constrained adversaries and learners that can employ distributions over experts, showing that the number of rounds must be $\tildeΩ(N/ε^2)$ or exponential in $1/ε$.
Our reduction implies that, if no-regret learning is possible in some game, then this game must have approximate correlated equilibria, of arbitrarily good approximation. This strengthens the folklore implication of no-regret learning that approximate coarse correlated equilibria exist. Importantly, it provides a sufficient condition for the existence of correlated equilibrium which vastly extends the requirement that the action set is finite, thus answering a question left open by [DG22; Ass+23]. Moreover, it answers several outstanding questions about equilibrium computation and learning in games.
△ Less
Submitted 6 December, 2023; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Enhanced Non-linear Response by Manipulating the Dirac Point in the (111) LaTiO$_3$/SrTiO$_3$ Interface
Authors:
G. Tuvia,
A. Burshtein,
I. Silber,
A. Aharony,
O. Entin-Wohlman,
M. Goldstein,
Y. Dagan
Abstract:
Tunable spin-orbit interaction (SOI) is an important feature for future spin-based devices. In the presence of a magnetic field, SOI induces an asymmetry in the energy bands, which can produce non-linear transport effects ($V\sim I^2$). Here, we focus on such effects to study the role of SOI in the (111) LaTiO$_3$/SrTiO$_3$ interface. This system is a convenient platform for understanding the role…
▽ More
Tunable spin-orbit interaction (SOI) is an important feature for future spin-based devices. In the presence of a magnetic field, SOI induces an asymmetry in the energy bands, which can produce non-linear transport effects ($V\sim I^2$). Here, we focus on such effects to study the role of SOI in the (111) LaTiO$_3$/SrTiO$_3$ interface. This system is a convenient platform for understanding the role of SOI since it exhibits a single-band Hall-response through the entire gate-voltage range studied. We report a pronounced rise in the non-linear resistance at a critical in-plane field $H_{cr}$. This rise disappears with a small out-of-plane field. We explain these results by considering the location of the Dirac point formed at the crossing of the spin-split energy bands. An in-plane magnetic field pushes this point outside of the Fermi surface, and consequently changes the symmetry of the Fermi contours and intensifies the non-linear transport. An out-of-plane magnetic field opens a gap at the Dirac point, thereby significantly diminishing the non-linear effects. We propose that magnetoresistance effects previously reported in interfaces with SOI could be comprehended within our suggested scenario.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Hydrodynamically Inspired Pilot-Wave Theory: An Ensemble Interpretation
Authors:
Yuval Dagan
Abstract:
This chapter explores a deterministic hydrodynamically-inspired ensemble interpretation for free relativistic particles, following the original pilot wave theory conceptualized by de Broglie in 1924 and recent advances in hydrodynamic quantum analogs. We couple a one-dimensional periodically forced Klein-Gordon wave equation and a relativistic particle equation of motion, and simulate an ensemble…
▽ More
This chapter explores a deterministic hydrodynamically-inspired ensemble interpretation for free relativistic particles, following the original pilot wave theory conceptualized by de Broglie in 1924 and recent advances in hydrodynamic quantum analogs. We couple a one-dimensional periodically forced Klein-Gordon wave equation and a relativistic particle equation of motion, and simulate an ensemble of multiple uncorrelated particle trajectories. The simulations reveal a chaotic particle dynamic behavior, highly sensitive to the initial random condition. Although particles in the simulated ensemble seem to fill out the entire spatiotemporal domain, we find coherent spatiotemporal structures in which particles are less likely to cross. These structures are characterized by de Broglie's wavelength and the relativistic modulation frequency kc. Markedly, the probability density function of the particle ensemble correlates to the square of the absolute wave field, solved here analytically, suggesting a classical deterministic interpretation of de Broglie's matter waves and Born's rule.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Online Learning and Solving Infinite Games with an ERM Oracle
Authors:
Angelos Assos,
Idan Attias,
Yuval Dagan,
Constantinos Daskalakis,
Maxwell Fishelson
Abstract:
While ERM suffices to attain near-optimal generalization error in the stochastic learning setting, this is not known to be the case in the online learning setting, where algorithms for general concept classes rely on computationally inefficient oracles such as the Standard Optimal Algorithm (SOA). In this work, we propose an algorithm for online binary classification setting that relies solely on…
▽ More
While ERM suffices to attain near-optimal generalization error in the stochastic learning setting, this is not known to be the case in the online learning setting, where algorithms for general concept classes rely on computationally inefficient oracles such as the Standard Optimal Algorithm (SOA). In this work, we propose an algorithm for online binary classification setting that relies solely on ERM oracle calls, and show that it has finite regret in the realizable setting and sublinearly growing regret in the agnostic setting. We bound the regret in terms of the Littlestone and threshold dimensions of the underlying concept class.
We obtain similar results for nonparametric games, where the ERM oracle can be interpreted as a best response oracle, finding the best response of a player to a given history of play of the other players. In this setting, we provide learning algorithms that only rely on best response oracles and converge to approximate-minimax equilibria in two-player zero-sum games and approximate coarse correlated equilibria in multi-player general-sum games, as long as the game has a bounded fat-threshold dimension. Our algorithms apply to both binary-valued and real-valued games and can be viewed as providing justification for the wide use of double oracle and multiple oracle algorithms in the practice of solving large games.
△ Less
Submitted 10 July, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Authors:
Giannis Daras,
Kulin Shah,
Yuval Dagan,
Aravind Gollakota,
Alexandros G. Dimakis,
Adam Klivans
Abstract:
We present the first diffusion-based framework that can learn an unknown distribution using only highly-corrupted samples. This problem arises in scientific applications where access to uncorrupted samples is impossible or expensive to acquire. Another benefit of our approach is the ability to train generative models that are less likely to memorize individual training samples since they never obs…
▽ More
We present the first diffusion-based framework that can learn an unknown distribution using only highly-corrupted samples. This problem arises in scientific applications where access to uncorrupted samples is impossible or expensive to acquire. Another benefit of our approach is the ability to train generative models that are less likely to memorize individual training samples since they never observe clean training data. Our main idea is to introduce additional measurement distortion during the diffusion process and require the model to predict the original corrupted image from the further corrupted image. We prove that our method leads to models that learn the conditional expectation of the full uncorrupted image given this additional measurement corruption. This holds for any corruption process that satisfies some technical conditions (and in particular includes inpainting and compressed sensing). We train models on standard benchmarks (CelebA, CIFAR-10 and AFHQ) and show that we can learn the distribution even when all the training samples have $90\%$ of their pixels missing. We also show that we can finetune foundation models on small corrupted datasets (e.g. MRI scans with block corruptions) and learn the clean distribution without memorizing the training set.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent
Authors:
Giannis Daras,
Yuval Dagan,
Alexandros G. Dimakis,
Constantinos Daskalakis
Abstract:
Imperfect score-matching leads to a shift between the training and the sampling distribution of diffusion models. Due to the recursive nature of the generation process, errors in previous steps yield sampling iterates that drift away from the training distribution. Yet, the standard training objective via Denoising Score Matching (DSM) is only designed to optimize over non-drifted data. To train o…
▽ More
Imperfect score-matching leads to a shift between the training and the sampling distribution of diffusion models. Due to the recursive nature of the generation process, errors in previous steps yield sampling iterates that drift away from the training distribution. Yet, the standard training objective via Denoising Score Matching (DSM) is only designed to optimize over non-drifted data. To train on drifted data, we propose to enforce a \emph{consistency} property which states that predictions of the model on its own generated data are consistent across time. Theoretically, we show that if the score is learned perfectly on some non-drifted points (via DSM) and if the consistency property is enforced everywhere, then the score is learned accurately everywhere. Empirically we show that our novel training objective yields state-of-the-art results for conditional and unconditional generation in CIFAR-10 and baseline improvements in AFHQ and FFHQ. We open-source our code and models: https://github.com/giannisdaras/cdm
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Learning and Testing Latent-Tree Ising Models Efficiently
Authors:
Davin Choo,
Yuval Dagan,
Constantinos Daskalakis,
Anthimos Vardis Kandiros
Abstract:
We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide…
▽ More
We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide an efficient algorithm with fewer samples for testing whether two latent-tree Ising models have leaf-node distributions that are close or far in Total Variation distance. We obtain our algorithms by showing novel localization results for the total variation distance between the leaf-node distributions of tree-structured Ising models, in terms of their marginals on pairs of leaves.
△ Less
Submitted 10 July, 2023; v1 submitted 23 November, 2022;
originally announced November 2022.
-
EM's Convergence in Gaussian Latent Tree Models
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Anthimos Vardis Kandiros
Abstract:
We study the optimization landscape of the log-likelihood function and the convergence of the Expectation-Maximization (EM) algorithm in latent Gaussian tree models, i.e. tree-structured Gaussian graphical models whose leaf nodes are observable and non-leaf nodes are unobservable. We show that the unique non-trivial stationary point of the population log-likelihood is its global maximum, and estab…
▽ More
We study the optimization landscape of the log-likelihood function and the convergence of the Expectation-Maximization (EM) algorithm in latent Gaussian tree models, i.e. tree-structured Gaussian graphical models whose leaf nodes are observable and non-leaf nodes are unobservable. We show that the unique non-trivial stationary point of the population log-likelihood is its global maximum, and establish that the expectation-maximization algorithm is guaranteed to converge to it in the single latent variable case. Our results for the landscape of the log-likelihood function in general latent tree models provide support for the extensive practical use of maximum likelihood based-methods in this setting. Our results for the EM algorithm extend an emerging line of work on obtaining global convergence guarantees for this celebrated algorithm. We show our results for the non-trivial stationary points of the log-likelihood by arguing that a certain system of polynomial equations obtained from the EM updates has a unique non-trivial solution. The global convergence of the EM algorithm follows by arguing that all trivial fixed points are higher-order saddle points.
△ Less
Submitted 23 November, 2022; v1 submitted 21 November, 2022;
originally announced November 2022.
-
A generalized theory of Brownian particle diffusion in shear flows
Authors:
Nan Wang,
Yuval Dagan
Abstract:
This study presents a generalized theory for the diffusion of Brownian particles in shear flows. By solving the Langevin equations using stochastic instead of classical calculus, we propose a new mathematical formulation that resolves the particle MSD at all time scales for any two-dimensional parallel shear flow described by a polynomial velocity profile. We show that at long-time scales, the pol…
▽ More
This study presents a generalized theory for the diffusion of Brownian particles in shear flows. By solving the Langevin equations using stochastic instead of classical calculus, we propose a new mathematical formulation that resolves the particle MSD at all time scales for any two-dimensional parallel shear flow described by a polynomial velocity profile. We show that at long-time scales, the polynomial order of time in the particle MSD is n+2, where n is the polynomial order of the transverse coordinate of the velocity profile. We generalize the theory to resolve particle diffusion in any polynomial shear flow at all time scales, including the order of particle relaxation time scale, which is unresolved in current theories. Particle diffusion at all time scales is then studied for the cases of Couette and plane-Poiseuille flows and a polynomial expansion of a hyperbolic tangent flow while neglecting the boundary effects. We observe three main stages of particle diffusion along the timeline for which the particle MSD is distinctly different due to different dominated physical mechanisms. Thus, higher temporal and spatial resolution for diffusion processes in shear flows may be realized, suggesting a more accurate analytical approach for the diffusion of Brownian particles.
△ Less
Submitted 18 September, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Chiral to Nematic Crossover in the Superconducting State of 4Hb-TaS$_2$
Authors:
I. Silber,
S. Mathimalar,
I. Mangel,
O. Green,
N. Avraham,
H. Beidenkopf,
I. Feldman,
A. Kanigel,
A. Klein,
M. Goldstein,
A. Banerjee,
E. Sela,
Y. Dagan
Abstract:
Most superconductors have an isotropic, single component order parameter, and are well described by the BCS theory for superconductivity. Unconventional, multiple components superconductors are exceptionally rare and are much less understood. Here, we combine scanning tunneling microscopy and angle-resolved macroscopic transport to study the candidate chiral superconductor, 4Hb-TaS$_2$. We reveal…
▽ More
Most superconductors have an isotropic, single component order parameter, and are well described by the BCS theory for superconductivity. Unconventional, multiple components superconductors are exceptionally rare and are much less understood. Here, we combine scanning tunneling microscopy and angle-resolved macroscopic transport to study the candidate chiral superconductor, 4Hb-TaS$_2$. We reveal quasi-periodic one-dimensional modulations in the tunneling conductance accompanied by two-fold symmetric superconducting critical field. The strong modulation of the in-plane critical field, points to a nematic, unconventional order parameter. However, the imaged vortex core is nearly circular symmetric, suggesting an isotropic order parameter. We reconcile this apparent discrepancy by modeling a competition between a dominating chiral superconducting order parameter and a nematic one, the latter emerges close to the normal phase. Our results strongly support the existence of two-component superconductivity in 4Hb-TaS$_2$ and can provide useful insights to other systems with coexistent charge order and superconductivity.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Authors:
Giannis Daras,
Yuval Dagan,
Alexandros G. Dimakis,
Constantinos Daskalakis
Abstract:
We prove fast mixing and characterize the stationary distribution of the Langevin Algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve…
▽ More
We prove fast mixing and characterize the stationary distribution of the Langevin Algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve that, we train a score-based model in the latent space of a StyleGAN-2 and we use it to solve inverse problems. Our framework, Score-Guided Intermediate Layer Optimization (SGILO), extends prior work by replacing the sparsity regularization with a generative prior in the intermediate layer. Experimentally, we obtain significant improvements over the previous state-of-the-art, especially in the low measurement regime.
△ Less
Submitted 22 June, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Generalized stability theory of polydisperse particle-laden flows. Part1. Channel flow
Authors:
Zhixuan Liu,
Yuval Dagan
Abstract:
We present a generalized hydrodynamic stability theory for interacting particles in polydisperse particle-laden flows. The addition of dispersed particulate matter to a clean flow can either stabilize or destabilize the flow, depending on the particles' relaxation time-scale relative to the carrier flow time scales and the particle loading. To study the effects of polydispersity and particle inter…
▽ More
We present a generalized hydrodynamic stability theory for interacting particles in polydisperse particle-laden flows. The addition of dispersed particulate matter to a clean flow can either stabilize or destabilize the flow, depending on the particles' relaxation time-scale relative to the carrier flow time scales and the particle loading. To study the effects of polydispersity and particle interactions on the hydrodynamic stability of shear flows, we propose a new mathematical framework by combining a linear stability analysis and a discrete Eulerian sectional formulation to describe the flow and the dispersed particulate matter. In this formulation, multiple momentum and transport equations are written for each size-section of the dispersed phase, where interphase and inter-particle mass and momentum transfer are modelled as source terms in the governing equations. A new modal linear stability framework is derived by linearizing the coupled equations. Using this approach, particle-flow interactions, such as polydispersity, droplet vaporization, condensation, and coalescence, may be modelled. The method is validated with linear stability analyses of clean and monodisperse particle-laden flows. We show that the stability characteristics of a channel flow laden with particles drastically change due to polydispersity. While relatively large monodisperse particles tend to stabilize the flow, adding a second size section of a very small mass fraction of low-to-moderate Stokes number particles may significantly increase the growth rates, and for high-Reynolds numbers may destabilize flows that might have been regarded as linearly stable in the monodisperse case. These findings may apply to a vast number of fluid mechanics applications involving particle-laden flows such as atmospheric flows, environmental flows, medical applications, propulsion, and energy systems.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
Tunable Magnetic Scattering and Ferroelectric Switching at the LaAlO$_3$/EuTiO$_3$/Sr$_{0.99}$Ca$_{0.01}$TiO$_3$ Interface
Authors:
Gal Tuvia,
Sapir Weitz Sobelman,
Shay Sandik,
Beena Kalisky,
Yoram Dagan
Abstract:
Ferroelectric and ferromagnetic orders rarely coexist, and magnetoelectric coupling is even more scarce. A possible avenue for combining these orders is by interface design, where orders formed at the constituent materials can overlap and interact. Using a combination of magneto-transport and scanning SQUID measurements, we explore the interactions between ferroelectricity, magnetism, and the 2D e…
▽ More
Ferroelectric and ferromagnetic orders rarely coexist, and magnetoelectric coupling is even more scarce. A possible avenue for combining these orders is by interface design, where orders formed at the constituent materials can overlap and interact. Using a combination of magneto-transport and scanning SQUID measurements, we explore the interactions between ferroelectricity, magnetism, and the 2D electron system (2DES) formed at the novel LaAlO$_3$/EuTiO$_3$/Sr$_{0.99}$Ca$_{0.01}$TiO$_3$ heterostructure. We find that the electrons at the interface experience magnetic scattering appearing along with a diverging Curie-Weiss-type behaviour in the EuTiO$_3$ layer. The 2DES is also affected by the switchable ferroelectric polarization at the Sr$_{0.99}$Ca$_{0.01}$TiO$_3$ bulk. While the 2DES interacts with both magnetism and ferroelectricity, we show that the presence of the conducting electrons has no effect on magnetization in the EuTiO$_3$ layer. Our results provide a first step towards realizing a new multiferroic system where magnetism and ferroelectricity can interact via an intermediate conducting layer.
△ Less
Submitted 10 April, 2022;
originally announced April 2022.
-
Smoothed Online Learning is as Easy as Statistical Learning
Authors:
Adam Block,
Yuval Dagan,
Noah Golowich,
Alexander Rakhlin
Abstract:
Much of modern learning theory has been split between two regimes: the classical offline setting, where data arrive independently, and the online setting, where data arrive adversarially. While the former model is often both computationally and statistically tractable, the latter requires no distributional assumptions. In an attempt to achieve the best of both worlds, previous work proposed the sm…
▽ More
Much of modern learning theory has been split between two regimes: the classical offline setting, where data arrive independently, and the online setting, where data arrive adversarially. While the former model is often both computationally and statistically tractable, the latter requires no distributional assumptions. In an attempt to achieve the best of both worlds, previous work proposed the smooth online setting where each sample is drawn from an adversarially chosen distribution, which is smooth, i.e., it has a bounded density with respect to a fixed dominating measure. We provide tight bounds on the minimax regret of learning a nonparametric function class, with nearly optimal dependence on both the horizon and smoothness parameters. Furthermore, we provide the first oracle-efficient, no-regret algorithms in this setting. In particular, we propose an oracle-efficient improper algorithm whose regret achieves optimal dependence on the horizon and a proper algorithm requiring only a single oracle call per round whose regret has the optimal horizon dependence in the classification setting and is sublinear in general. Both algorithms have exponentially worse dependence on the smoothness parameter of the adversary than the minimax rate. We then prove a lower bound on the oracle complexity of any proper learning algorithm, which matches the oracle-efficient upper bounds up to a polynomial factor, thus demonstrating the existence of a statistical-computational gap in smooth online learning. Finally, we apply our results to the contextual bandit setting to show that if a function class is learnable in the classical setting, then there is an oracle-efficient, no-regret algorithm for contextual bandits in the case that contexts arrive in a smooth manner.
△ Less
Submitted 31 May, 2022; v1 submitted 9 February, 2022;
originally announced February 2022.
-
Dispersion of Free-Falling Saliva Droplets by Two-Dimensional Vortical Flows
Authors:
Orr Avni,
Yuval Dagan
Abstract:
The dispersion of respiratory saliva droplets by indoor wake structures may enhance the transmission of various infectious diseases, as the wake spreads virus-laden droplets across the room. Thus, this study analyses the interaction between vortical wake structures and exhaled multi-component saliva droplets. A self-propelling analytically-described dipolar vortex is chosen as a model wake flow, p…
▽ More
The dispersion of respiratory saliva droplets by indoor wake structures may enhance the transmission of various infectious diseases, as the wake spreads virus-laden droplets across the room. Thus, this study analyses the interaction between vortical wake structures and exhaled multi-component saliva droplets. A self-propelling analytically-described dipolar vortex is chosen as a model wake flow, passing through a cloud of micron-sized evaporating saliva droplets. The droplets' spatial location, velocity, diameter, and temperature are traced and coupled to their local flow field. For the first time, the wake structure decay is incorporated and analyzed, which is proved essential for accurately predicting the settling distances of the dispersed droplets. The model also considers the non-volatile saliva components, adequately capturing the essence of droplet-aerosol transition and predicting the equilibrium diameter of the residual aerosols. Our analytic model reveals non-intuitive interactions between wake flows, droplet relaxation time, gravity, and transport phenomena. We reveal that given the right conditions, a virus-laden saliva droplet might translate to distances two orders of magnitude larger than the carrier-flow characteristic size. Moreover, accounting for the non-volatile contents inside the droplet may lead to fundamentally different dispersion and settling behavior compared to non-evaporating particles or pure water droplets. Ergo, we suggest that the implementation of more complex evaporation models might be critical in high-fidelity simulations aspiring to assess the spread of airborne respiratory droplets.
△ Less
Submitted 25 December, 2022; v1 submitted 2 January, 2022;
originally announced January 2022.
-
Dynamics of Evaporating Respiratory Droplets in the Vicinity of Vortex Dipoles
Authors:
Orr Avni,
Yuval Dagan
Abstract:
A new mathematical analysis of exhaled respiratory droplet dynamics and settling distances in the vicinity of vortical environments is presented. Recent experimental and theoretical studies suggest that vortical flow structures may enhance the settling distances of exhaled respiratory droplets beyond the two-meter distancing rule recommended by health authorities lately. We propose a mathematical…
▽ More
A new mathematical analysis of exhaled respiratory droplet dynamics and settling distances in the vicinity of vortical environments is presented. Recent experimental and theoretical studies suggest that vortical flow structures may enhance the settling distances of exhaled respiratory droplets beyond the two-meter distancing rule recommended by health authorities lately. We propose a mathematical framework to study the underlying physical mechanism responsible for the entrapment and subsequently delayed settling times of evaporating droplets and solid particles. A dipolar vortex is considered self-propelling through a cloud of micron-sized evaporating droplets. This configuration might be utilized to approximate an indoor environment in which similar unsteady vortical flow structures interact with exhaled respiratory droplets. We demonstrate the vortex dipole effect on droplet and solid particles settling distances, depending on the evaporation rate, the vorticity of the dipole, and the droplet's initial diameter and location relative to the vortex core. Our theoretical analysis reveals non-intuitive interactions between the vortex dipole, droplet relaxation time, gravity, and mass transfer. The existence of optimal conditions for maximum displacement is suggested, where the droplet entrainment reaches up to an order of magnitude larger than the vortex core length scale. We present a basic model that may be applied for evaluating the spread of exhaled respiratory droplets in vortical environments. Our theoretical study suggests that exhaled respiratory droplets initially at rest can translate to significant distances, hence implying that vortical flow might enhance the transmission of airborne pathogens.
△ Less
Submitted 11 January, 2022; v1 submitted 16 August, 2021;
originally announced August 2021.
-
Statistical Estimation from Dependent Data
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Nishanth Dikkala,
Surbhi Goel,
Anthimos Vardis Kandiros
Abstract:
We consider a general statistical estimation problem wherein binary labels across different observations are not independent conditioned on their feature vectors, but dependent, capturing settings where e.g. these observations are collected on a spatial domain, a temporal domain, or a social network, which induce dependencies. We model these dependencies in the language of Markov Random Fields and…
▽ More
We consider a general statistical estimation problem wherein binary labels across different observations are not independent conditioned on their feature vectors, but dependent, capturing settings where e.g. these observations are collected on a spatial domain, a temporal domain, or a social network, which induce dependencies. We model these dependencies in the language of Markov Random Fields and, importantly, allow these dependencies to be substantial, i.e do not assume that the Markov Random Field capturing these dependencies is in high temperature. As our main contribution we provide algorithms and statistically efficient estimation rates for this model, giving several instantiations of our bounds in logistic regression, sparse logistic regression, and neural network settings with dependent data. Our estimation guarantees follow from novel results for estimating the parameters (i.e. external fields and interaction strengths) of Ising models from a {\em single} sample. {We evaluate our estimation approach on real networked data, showing that it outperforms standard regression approaches that ignore dependencies, across three text classification datasets: Cora, Citeseer and Pubmed.}
△ Less
Submitted 20 July, 2021;
originally announced July 2021.
-
Concomitant appearance of conductivity and superconductivity in (111)LaAlO3/SrTiO3 interface with metal cap**
Authors:
R. S. Bisht,
M. Mograbi,
P. K. Rout,
G. Tuvia,
Y. Dagan,
Hyeok Yoon,
A. G. Swartz,
H. Y. Hwang,
L. L. Li,
R. Pentcheva
Abstract:
In polar-oxide interfaces, a certain number of monolayers (ML) is needed for conductivity to appear. This threshold for conductivity is explained by accumulating sufficient electric potential to initiate charge transfer to the interface. Here we study experimentally and theoretically the (111) SrTiO3/LaAlO3 interface where a critical thickness, tc, of nine epitaxial LaAlO3 ML is required to turn t…
▽ More
In polar-oxide interfaces, a certain number of monolayers (ML) is needed for conductivity to appear. This threshold for conductivity is explained by accumulating sufficient electric potential to initiate charge transfer to the interface. Here we study experimentally and theoretically the (111) SrTiO3/LaAlO3 interface where a critical thickness, tc, of nine epitaxial LaAlO3 ML is required to turn the interface from insulating to conducting and even superconducting. We show that tc decreases to 3ML when depositing a cobalt over-layer (cap**) and 6ML for platinum cap**. The latter result contrasts with the (100) interface, where platinum cap** increases tc beyond the bare interface. The observed threshold for conductivity for the bare and the metal-capped interfaces is confirmed by our density functional theory calculations. Interestingly, for (111) SrTiO3/LaAlO3/Metal interfaces, conductivity appears concomitantly with superconductivity in contrast with the (100) SrTiO3/LaAlO3/Metal interfaces where tc is smaller than the critical thickness for superconductivity. We attribute this dissimilarity to the different orbital polarization of e'g for the (111) versus dxy for the (001) interface.
△ Less
Submitted 19 July, 2021; v1 submitted 14 February, 2021;
originally announced February 2021.
-
Majorizing Measures, Sequential Complexities, and Online Learning
Authors:
Adam Block,
Yuval Dagan,
Sasha Rakhlin
Abstract:
We introduce the technique of generic chaining and majorizing measures for controlling sequential Rademacher complexity. We relate majorizing measures to the notion of fractional covering numbers, which we show to be dominated in terms of sequential scale-sensitive dimensions in a horizon-independent way, and, under additional complexity assumptions establish a tight control on worst-case sequenti…
▽ More
We introduce the technique of generic chaining and majorizing measures for controlling sequential Rademacher complexity. We relate majorizing measures to the notion of fractional covering numbers, which we show to be dominated in terms of sequential scale-sensitive dimensions in a horizon-independent way, and, under additional complexity assumptions establish a tight control on worst-case sequential Rademacher complexity in terms of the integral of sequential scale-sensitive dimension. Finally, we establish a tight contraction inequality for worst-case sequential Rademacher complexity. The above constitutes the resolution of a number of outstanding open problems in extending the classical theory of empirical processes to the sequential case, and, in turn, establishes sharp results for online learning.
△ Less
Submitted 2 February, 2021;
originally announced February 2021.
-
Adversarial Laws of Large Numbers and Optimal Regret in Online Classification
Authors:
Noga Alon,
Omri Ben-Eliezer,
Yuval Dagan,
Shay Moran,
Moni Naor,
Eylon Yogev
Abstract:
Laws of large numbers guarantee that given a large enough sample from some population, the measure of any fixed sub-population is well-estimated by its frequency in the sample. We study laws of large numbers in sampling processes that can affect the environment they are acting upon and interact with it. Specifically, we consider the sequential sampling model proposed by Ben-Eliezer and Yogev (2020…
▽ More
Laws of large numbers guarantee that given a large enough sample from some population, the measure of any fixed sub-population is well-estimated by its frequency in the sample. We study laws of large numbers in sampling processes that can affect the environment they are acting upon and interact with it. Specifically, we consider the sequential sampling model proposed by Ben-Eliezer and Yogev (2020), and characterize the classes which admit a uniform law of large numbers in this model: these are exactly the classes that are \emph{online learnable}. Our characterization may be interpreted as an online analogue to the equivalence between learnability and uniform convergence in statistical (PAC) learning.
The sample-complexity bounds we obtain are tight for many parameter regimes, and as an application, we determine the optimal regret bounds in online learning, stated in terms of \emph{Littlestone's dimension}, thus resolving the main open question from Ben-David, Pál, and Shalev-Shwartz (2009), which was also posed by Rakhlin, Sridharan, and Tewari (2015).
△ Less
Submitted 22 January, 2021;
originally announced January 2021.
-
Link between superconductivity and a Lifshitz transition in intercalated Bi$_2$Se$_3$
Authors:
A. Almoalem,
I. Silber,
S. Sandik,
M. Lotem,
A. Ribak,
Y. Nitzav,
A. Yu. Kuntsevich,
O. A. Sobolevskiy,
Yu. G. Selivanov,
V. A. Prudkoglyad,
M. Shi,
L. Petaccia,
M. Goldstein,
Y. Dagan,
A. Kanigel
Abstract:
Topological superconductivity is an exotic phase of matter in which the fully gapped superconducting bulk hosts gapless Majorana surface states protected by topology. Intercalation of copper, strontium or niobium between the quintuple layers of the topological insulator Bi$_2$Se$_3$ increases the carrier density and leads to superconductivity that is suggested to be topological. Here we study the…
▽ More
Topological superconductivity is an exotic phase of matter in which the fully gapped superconducting bulk hosts gapless Majorana surface states protected by topology. Intercalation of copper, strontium or niobium between the quintuple layers of the topological insulator Bi$_2$Se$_3$ increases the carrier density and leads to superconductivity that is suggested to be topological. Here we study the electronic structure of strontium-intercalated Bi$_2$Se$_3$ using angle resolved photoemission spectroscopy (ARPES) and Shubnikov-de Haas (SdH) oscillations. Despite the apparent low Hall number of $\sim2 \times 10 ^{19}$cm$^{-3}$, we show that the Fermi surface is shaped as an open cylinder with a larger carrier density of $\sim 10 ^{20}$cm$^{-3}$. We suggest that superconductivity in intercalated Bi$_2$Se$_3$ emerges with the appearance of a quasi-2D open Fermi surface.
△ Less
Submitted 19 December, 2020; v1 submitted 12 December, 2020;
originally announced December 2020.
-
A bounded-noise mechanism for differential privacy
Authors:
Yuval Dagan,
Gil Kur
Abstract:
We present an asymptotically optimal $(ε,δ)$ differentially private mechanism for answering multiple, adaptively asked, $Δ$-sensitive queries, settling the conjecture of Steinke and Ullman [2020]. Our algorithm has a significant advantage that it adds independent bounded noise to each query, thus providing an absolute error bound. Additionally, we apply our algorithm in adaptive data analysis, obt…
▽ More
We present an asymptotically optimal $(ε,δ)$ differentially private mechanism for answering multiple, adaptively asked, $Δ$-sensitive queries, settling the conjecture of Steinke and Ullman [2020]. Our algorithm has a significant advantage that it adds independent bounded noise to each query, thus providing an absolute error bound. Additionally, we apply our algorithm in adaptive data analysis, obtaining an improved guarantee for answering multiple queries regarding some underlying distribution using a finite sample. Numerical computations show that the bounded-noise mechanism outperforms the Gaussian mechanism in many standard settings.
△ Less
Submitted 6 November, 2021; v1 submitted 7 December, 2020;
originally announced December 2020.
-
Ferroelectric Exchange Bias Affects Interfacial Electronic States
Authors:
Gal Tuvia,
Yiftach Frenkel,
Prasanna K. Rout,
Itai Silber,
Beena Kalisky,
Yoram Dagan
Abstract:
In polar oxide interfaces phenomena such as conductivity, superconductivity, magnetism, one-dimensional conductivity and Quantum Hall states can emerge at the polar discontinuity. Combining controllable ferroelectricity at such interfaces can affect the superconducting properties and shed light on the mutual effects between the polar oxide and the ferroelectric oxide. Here we study the interface b…
▽ More
In polar oxide interfaces phenomena such as conductivity, superconductivity, magnetism, one-dimensional conductivity and Quantum Hall states can emerge at the polar discontinuity. Combining controllable ferroelectricity at such interfaces can affect the superconducting properties and shed light on the mutual effects between the polar oxide and the ferroelectric oxide. Here we study the interface between the polar oxide LaAlO3 and the ferroelectric Ca-doped SrTiO3 by means of electrical transport combined with local imaging of the current flow with the use of scanning Superconducting Quantum Interference Device (SQUID). Anomalous behavior of the interface resistivity is observed at low temperatures. The scanning SQUID maps of the current flow suggest that this behavior originates from an intrinsic bias induced by the polar LaAlO3 layer. Our data imply that the intrinsic bias combined with ferroelectricity constrain the possible structural domain tiling near the interface. We recommend the use of this intrinsic bias as a method of controlling and tuning the initial state of ferroelectric materials by design of the polar structure. The hysteretic dependence of the normal and the superconducting state properties on gate voltage can be utilized in multifaceted controllable memory devices.
△ Less
Submitted 3 July, 2020;
originally announced July 2020.
-
Learning Ising models from one or multiple samples
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Nishanth Dikkala,
Anthimos Vardis Kandiros
Abstract:
There have been two separate lines of work on estimating Ising models: (1) estimating them from multiple independent samples under minimal assumptions about the model's interaction matrix; and (2) estimating them from one sample in restrictive settings. We propose a unified framework that smoothly interpolates between these two settings, enabling significantly richer estimation guarantees from one…
▽ More
There have been two separate lines of work on estimating Ising models: (1) estimating them from multiple independent samples under minimal assumptions about the model's interaction matrix; and (2) estimating them from one sample in restrictive settings. We propose a unified framework that smoothly interpolates between these two settings, enabling significantly richer estimation guarantees from one, a few, or many samples.
Our main theorem provides guarantees for one-sample estimation, quantifying the estimation error in terms of the metric entropy of a family of interaction matrices. As corollaries of our main theorem, we derive bounds when the model's interaction matrix is a (sparse) linear combination of known matrices, or it belongs to a finite set, or to a high-dimensional manifold. In fact, our main result handles multiple independent samples by viewing them as one sample from a larger model, and can be used to derive estimation bounds that are qualitatively similar to those obtained in the afore-described multiple-sample literature. Our technical approach benefits from sparsifying a model's interaction network, conditioning on subsets of variables that make the dependencies in the resulting conditional distribution sufficiently weak. We use this sparsification technique to prove strong concentration and anti-concentration results for the Ising model, which we believe have applications beyond the scope of this paper.
△ Less
Submitted 10 December, 2020; v1 submitted 20 April, 2020;
originally announced April 2020.
-
PAC learning with stable and private predictions
Authors:
Yuval Dagan,
Vitaly Feldman
Abstract:
We study binary classification algorithms for which the prediction on any point is not too sensitive to individual examples in the dataset. Specifically, we consider the notions of uniform stability (Bousquet and Elisseeff, 2001) and prediction privacy (Dwork and Feldman, 2018). Previous work on these notions shows how they can be achieved in the standard PAC model via simple aggregation of models…
▽ More
We study binary classification algorithms for which the prediction on any point is not too sensitive to individual examples in the dataset. Specifically, we consider the notions of uniform stability (Bousquet and Elisseeff, 2001) and prediction privacy (Dwork and Feldman, 2018). Previous work on these notions shows how they can be achieved in the standard PAC model via simple aggregation of models trained on disjoint subsets of data. Unfortunately, this approach leads to a significant overhead in terms of sample complexity. Here we demonstrate several general approaches to stable and private prediction that either eliminate or significantly reduce the overhead. Specifically, we demonstrate that for any class $C$ of VC dimension $d$ there exists a $γ$-uniformly stable algorithm for learning $C$ with excess error $α$ using $\tilde O(d/(αγ) + d/α^2)$ samples. We also show that this bound is nearly tight. For $ε$-differentially private prediction we give two new algorithms: one using $\tilde O(d/(α^2ε))$ samples and another one using $\tilde O(d^2/(αε) + d/α^2)$ samples. The best previously known bounds for these problems are $O(d/(α^2γ))$ and $O(d/(α^3ε))$, respectively.
△ Less
Submitted 23 September, 2020; v1 submitted 24 November, 2019;
originally announced November 2019.
-
Interaction is necessary for distributed learning with privacy or communication constraints
Authors:
Yuval Dagan,
Vitaly Feldman
Abstract:
Local differential privacy (LDP) is a model where users send privatized data to an untrusted central server whose goal it to solve some data analysis task. In the non-interactive version of this model the protocol consists of a single round in which a server sends requests to all users then receives their responses. This version is deployed in industry due to its practical advantages and has attra…
▽ More
Local differential privacy (LDP) is a model where users send privatized data to an untrusted central server whose goal it to solve some data analysis task. In the non-interactive version of this model the protocol consists of a single round in which a server sends requests to all users then receives their responses. This version is deployed in industry due to its practical advantages and has attracted significant research interest. Our main result is an exponential lower bound on the number of samples necessary to solve the standard task of learning a large-margin linear separator in the non-interactive LDP model. Via a standard reduction this lower bound implies an exponential lower bound for stochastic convex optimization and specifically, for learning linear models with a convex, Lipschitz and smooth loss. These results answer the questions posed in \citep{SmithTU17,DanielyF18}. Our lower bound relies on a new technique for constructing pairs of distributions with nearly matching moments but whose supports can be nearly separated by a large margin hyperplane. These lower bounds also hold in the model where communication from each user is limited and follow from a lower bound on learning using non-adaptive \emph{statistical queries}.
△ Less
Submitted 23 September, 2020; v1 submitted 10 November, 2019;
originally announced November 2019.
-
Learning from weakly dependent data under Dobrushin's condition
Authors:
Yuval Dagan,
Constantinos Daskalakis,
Nishanth Dikkala,
Siddhartha Jayanti
Abstract:
Statistical learning theory has largely focused on learning and generalization given independent and identically distributed (i.i.d.) samples. Motivated by applications involving time-series data, there has been a growing literature on learning and generalization in settings where data is sampled from an ergodic process. This work has also developed complexity measures, which appropriately extend…
▽ More
Statistical learning theory has largely focused on learning and generalization given independent and identically distributed (i.i.d.) samples. Motivated by applications involving time-series data, there has been a growing literature on learning and generalization in settings where data is sampled from an ergodic process. This work has also developed complexity measures, which appropriately extend the notion of Rademacher complexity to bound the generalization error and learning rates of hypothesis classes in this setting. Rather than time-series data, our work is motivated by settings where data is sampled on a network or a spatial domain, and thus do not fit well within the framework of prior work. We provide learning and generalization bounds for data that are complexly dependent, yet their distribution satisfies the standard Dobrushin's condition. Indeed, we show that the standard complexity measures of Gaussian and Rademacher complexities and VC dimension are sufficient measures of complexity for the purposes of bounding the generalization error and learning rates of hypothesis classes in our setting. Moreover, our generalization bounds only degrade by constant factors compared to their i.i.d. analogs, and our learnability bounds degrade by log factors in the size of the training set.
△ Less
Submitted 21 June, 2019;
originally announced June 2019.
-
Chiral superconductivity in the alternate stacking compound 4Hb-TaS$_2$
Authors:
A. Ribak,
R. Majlin Skiff,
M. Mograbi,
P. K. Rout,
M. H. Fischer,
J. Ruhman,
K. Chashka,
Y. Dagan,
A. Kanigel
Abstract:
Layered van der Waals (vdW) materials are emerging as one of the most versatile directions in the field of quantum condensed matter physics. They allow an unprecedented control of electronic properties via stacking of different types of two-dimensional (2D) materials. A fascinating frontier, largely unexplored, is the stacking of strongly-correlated phases of matter in vdW materials. Here, we stud…
▽ More
Layered van der Waals (vdW) materials are emerging as one of the most versatile directions in the field of quantum condensed matter physics. They allow an unprecedented control of electronic properties via stacking of different types of two-dimensional (2D) materials. A fascinating frontier, largely unexplored, is the stacking of strongly-correlated phases of matter in vdW materials. Here, we study 4Hb-TaS$_2$, which naturally realizes an alternating stacking of a Mott insulator, recently reported as a gapless spin-liquid candidate(1T-TaS$_2$), and a 2D superconductor (1H-TaS$_2$). This raises the question of how these two components affect each other. We find a superconducting ground state with a transition temperature of 2.7K, which is significantly elevated compared to the 2H polytype (Tc=0.7K). Strikingly, the superconducting state exhibits signatures of time-reversal-symmetry breaking abruptly appearing at the superconducting transition, which can be naturally explained by a chiral superconducting state.
△ Less
Submitted 13 May, 2019; v1 submitted 6 May, 2019;
originally announced May 2019.
-
Optimality of Maximum Likelihood for Log-Concave Density Estimation and Bounded Convex Regression
Authors:
Gil Kur,
Yuval Dagan,
Alexander Rakhlin
Abstract:
In this paper, we study two problems: (1) estimation of a $d$-dimensional log-concave distribution and (2) bounded multivariate convex regression with random design with an underlying log-concave density or a compactly supported distribution with a continuous density.
First, we show that for all $d \ge 4$ the maximum likelihood estimators of both problems achieve an optimal risk of…
▽ More
In this paper, we study two problems: (1) estimation of a $d$-dimensional log-concave distribution and (2) bounded multivariate convex regression with random design with an underlying log-concave density or a compactly supported distribution with a continuous density.
First, we show that for all $d \ge 4$ the maximum likelihood estimators of both problems achieve an optimal risk of $Θ_d(n^{-2/(d+1)})$ (up to a logarithmic factor) in terms of squared Hellinger distance and $L_2$ squared distance, respectively. Previously, the optimality of both these estimators was known only for $d\le 3$. We also prove that the $ε$-entropy numbers of the two aforementioned families are equal up to logarithmic factors. We complement these results by proving a sharp bound $Θ_d(n^{-2/(d+4)})$ on the minimax rate (up to logarithmic factors) with respect to the total variation distance.
Finally, we prove that estimating a log-concave density - even a uniform distribution on a convex set - up to a fixed accuracy requires the number of samples \emph{at least} exponential in the dimension. We do that by improving the dimensional constant in the best known lower bound for the minimax rate from $2^{-d}\cdot n^{-2/(d+1)}$ to $c\cdot n^{-2/(d+1)}$ (when $d\geq 2$).
△ Less
Submitted 20 February, 2020; v1 submitted 13 March, 2019;
originally announced March 2019.
-
Space lower bounds for linear prediction in the streaming model
Authors:
Yuval Dagan,
Gil Kur,
Ohad Shamir
Abstract:
We show that fundamental learning tasks, such as finding an approximate linear separator or linear regression, require memory at least \emph{quadratic} in the dimension, in a natural streaming setting. This implies that such problems cannot be solved (at least in this setting) by scalable memory-efficient streaming algorithms. Our results build on a memory lower bound for a simple linear-algebraic…
▽ More
We show that fundamental learning tasks, such as finding an approximate linear separator or linear regression, require memory at least \emph{quadratic} in the dimension, in a natural streaming setting. This implies that such problems cannot be solved (at least in this setting) by scalable memory-efficient streaming algorithms. Our results build on a memory lower bound for a simple linear-algebraic problem -- finding orthogonal vectors -- and utilize the estimates on the packing of the Grassmannian, the manifold of all linear subspaces of fixed dimension.
△ Less
Submitted 11 June, 2019; v1 submitted 9 February, 2019;
originally announced February 2019.
-
Symmetry and correlation effects on band structure explain the anomalous transport properties of (111) LaAlO$_3$/SrTiO$_3$
Authors:
Udit Khanna,
Prasanna K. Rout,
Michael Mograbi,
Gal Tuvia,
Inge Leermakers,
Uli Zeitler,
Yoram Dagan,
Moshe Goldstein
Abstract:
The interface between the two insulating oxides SrTiO$_3$ and LaAlO$_3$ gives rise to a two-dimensional electron system with intriguing transport phenomena, including superconductivity, which are controllable by a gate. Previous measurements on the (001) interface have shown that the superconducting critical temperature, the Hall density, and the frequency of quantum oscillations, vary nonmonotoni…
▽ More
The interface between the two insulating oxides SrTiO$_3$ and LaAlO$_3$ gives rise to a two-dimensional electron system with intriguing transport phenomena, including superconductivity, which are controllable by a gate. Previous measurements on the (001) interface have shown that the superconducting critical temperature, the Hall density, and the frequency of quantum oscillations, vary nonmonotonically and in a correlated fashion with the gate voltage. In this paper we experimentally demonstrate that the (111) interface features a qualitatively distinct behavior, in which the frequency of Shubnikov-de Haas oscillations changes monotonically, while the variation of other properties is nonmonotonic albeit uncorrelated. We develop a theoretical model, incorporating the different symmetries of these interfaces as well as electronic-correlation-induced band competition. We show that the latter dominates at (001), leading to similar nonmonotonicity in all observables, while the former is more important at (111), giving rise to highly curved Fermi contours, and accounting for all its anomalous transport measurements.
△ Less
Submitted 29 July, 2019; v1 submitted 30 January, 2019;
originally announced January 2019.
-
The entropy of lies: playing twenty questions with a liar
Authors:
Yuval Dagan,
Yuval Filmus,
Daniel Kane,
Shay Moran
Abstract:
`Twenty questions' is a guessing game played by two players: Bob thinks of an integer between $1$ and $n$, and Alice's goal is to recover it using a minimal number of Yes/No questions. Shannon's entropy has a natural interpretation in this context. It characterizes the average number of questions used by an optimal strategy in the distributional variant of the game: let $μ$ be a distribution over…
▽ More
`Twenty questions' is a guessing game played by two players: Bob thinks of an integer between $1$ and $n$, and Alice's goal is to recover it using a minimal number of Yes/No questions. Shannon's entropy has a natural interpretation in this context. It characterizes the average number of questions used by an optimal strategy in the distributional variant of the game: let $μ$ be a distribution over $[n]$, then the average number of questions used by an optimal strategy that recovers $x\sim μ$ is between $H(μ)$ and $H(μ)+1$. We consider an extension of this game where at most $k$ questions can be answered falsely. We extend the classical result by showing that an optimal strategy uses roughly $H(μ) + k H_2(μ)$ questions, where $H_2(μ) = \sum_x μ(x)\log\log\frac{1}{μ(x)}$. This also generalizes a result by Rivest et al. for the uniform distribution. Moreover, we design near optimal strategies that only use comparison queries of the form `$x \leq c$?' for $c\in[n]$. The usage of comparison queries lends itself naturally to the context of sorting, where we derive sorting algorithms in the presence of adversarial noise.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
Vortex excitations in the Insulating State of an Oxide Interface
Authors:
M. Mograbi,
E. Maniv,
P. K. Rout,
D. Graf,
J. -H Park,
Y. Dagan
Abstract:
In two-dimensional (2D) superconductors an insulating state can be induced either by applying a magnetic field, $H$, or by increasing disorder. Many scenarios have been put forth to explain the superconductor to insulator transition (SIT): dominating fermionic physics after the breaking of Cooper pairs, loss of phase coherence between superconducting islands embedded in a metallic or insulating ma…
▽ More
In two-dimensional (2D) superconductors an insulating state can be induced either by applying a magnetic field, $H$, or by increasing disorder. Many scenarios have been put forth to explain the superconductor to insulator transition (SIT): dominating fermionic physics after the breaking of Cooper pairs, loss of phase coherence between superconducting islands embedded in a metallic or insulating matrix and localization of Cooper pairs with concomitant condensation of vortex-type excitations. The difficulty in characterizing the insulating state and its origin stems from the lack of a continuous map** of the superconducting to insulating phase diagram in a single sample. Here we use the two-dimensional (2D) electron liquid formed at the interface between the two insulators (111) SrTiO$_3$ and LaAlO$_3$ to study the superconductor to insulator transition. This crystalline interface surprisingly exhibits very strong features previously observed only in amorphous systems. By use of electrostatic gating and magnetic fields, the sample is tuned from the metallic region, where supeconductivity is fully manifested, deep into the insulating state. Through examination of the field dependence of the sheet resistance and comparison of the response to fields in different orientations we identify a new magnetic field scale, H$_{pairing}$, where superconducting fluctuations are muted. Our findings show that vortex fluctuations excitations and Cooper pair localization are responsible for the observed SIT and that these excitations surprisingly persist deep into the insulating state.
△ Less
Submitted 24 May, 2018;
originally announced May 2018.
-
A Better Resource Allocation Algorithm with Semi-Bandit Feedback
Authors:
Yuval Dagan,
Koby Crammer
Abstract:
We study a sequential resource allocation problem between a fixed number of arms. On each iteration the algorithm distributes a resource among the arms in order to maximize the expected success rate. Allocating more of the resource to a given arm increases the probability that it succeeds, yet with a cut-off. We follow Lattimore et al. (2014) and assume that the probability increases linearly unti…
▽ More
We study a sequential resource allocation problem between a fixed number of arms. On each iteration the algorithm distributes a resource among the arms in order to maximize the expected success rate. Allocating more of the resource to a given arm increases the probability that it succeeds, yet with a cut-off. We follow Lattimore et al. (2014) and assume that the probability increases linearly until it equals one, after which allocating more of the resource is wasteful. These cut-off values are fixed and unknown to the learner. We present an algorithm for this problem and prove a regret upper bound of $O(\log n)$ improving over the best known bound of $O(\log^2 n)$. Lower bounds we prove show that our upper bound is tight. Simulations demonstrate the superiority of our algorithm.
△ Less
Submitted 28 March, 2018;
originally announced March 2018.
-
Detecting Correlations with Little Memory and Communication
Authors:
Yuval Dagan,
Ohad Shamir
Abstract:
We study the problem of identifying correlations in multivariate data, under information constraints: Either on the amount of memory that can be used by the algorithm, or the amount of communication when the data is distributed across several machines. We prove a tight trade-off between the memory/communication complexity and the sample complexity, implying (for example) that to detect pairwise co…
▽ More
We study the problem of identifying correlations in multivariate data, under information constraints: Either on the amount of memory that can be used by the algorithm, or the amount of communication when the data is distributed across several machines. We prove a tight trade-off between the memory/communication complexity and the sample complexity, implying (for example) that to detect pairwise correlations with optimal sample complexity, the number of required memory/communication bits is at least quadratic in the dimension. Our results substantially improve those of Shamir [2014], which studied a similar question in a much more restricted setting. To the best of our knowledge, these are the first provable sample/memory/communication trade-offs for a practical estimation problem, using standard distributions, and in the natural regime where the memory/communication budget is larger than the size of a single data point. To derive our theorems, we prove a new information-theoretic result, which may be relevant for studying other information-constrained learning problems.
△ Less
Submitted 6 June, 2018; v1 submitted 4 March, 2018;
originally announced March 2018.
-
Solution Monolayer Epitaxy for Tunable Atomically Sharp Oxide Interfaces
Authors:
A. Ron,
A. Hevroni,
E. Maniv,
M. Mograbi,
L. **,
C. -L. Jia,
K. W. Urban,
G. Markovich,
Y. Dagan
Abstract:
Epitaxial growth of atomically-sharp interfaces serves as one of the main building blocks of nanofabrication. Such interfaces are crucial for the operation of various devices including transistors, photo-voltaic cells, and memory components. In order to avoid charge traps that may hamper the operation of such devices, it is critical for the layers to be atomically-sharp. Fabrication of atomically…
▽ More
Epitaxial growth of atomically-sharp interfaces serves as one of the main building blocks of nanofabrication. Such interfaces are crucial for the operation of various devices including transistors, photo-voltaic cells, and memory components. In order to avoid charge traps that may hamper the operation of such devices, it is critical for the layers to be atomically-sharp. Fabrication of atomically sharp interfaces normally requires ultra-high vacuum techniques and high substrate temperatures. We present here a new self-limiting wet chemical process for deposition of epitaxial layers from alkoxide precursors. This method is fast, cheap, and yields perfect interfaces as we validate by various analysis techniques. It allows the design of heterostructures with half-unit cell resolution. We demonstrate our method by designing hole-type oxide interfaces SrTiO3/BaO/LaAlO3. We show that transport through this interface exhibits properties of mixed electron-hole contributions with hole mobility exceeding that of electrons. Our method and results are an important step forward towards a controllable design of a p-type oxide interface.
△ Less
Submitted 11 October, 2017;
originally announced October 2017.
-
Gapless excitations in the ground state of 1T-TaS$_2$
Authors:
A. Ribak,
I. Silber,
C. Baines,
K. Chashka,
Z. Salman,
Y. Dagan,
A. Kanigel
Abstract:
1T-TaS$_2$ is a layered transition metal dichalgeonide with a very rich phase diagram. At T=180K it undergoes a metal to Mott insulator transition. Mott insulators usually display anti-ferromagnetic ordering in the insulating phase but 1T-TaS$_2$ was never shown to order magnetically. In this letter we show that 1T-TaS$_2$ has a large paramagnetic contribution to the magnetic susceptibility but it…
▽ More
1T-TaS$_2$ is a layered transition metal dichalgeonide with a very rich phase diagram. At T=180K it undergoes a metal to Mott insulator transition. Mott insulators usually display anti-ferromagnetic ordering in the insulating phase but 1T-TaS$_2$ was never shown to order magnetically. In this letter we show that 1T-TaS$_2$ has a large paramagnetic contribution to the magnetic susceptibility but it does not show any sign of magnetic ordering or freezing down to 20mK, as probed by $μ$SR, possibly indicating a quantum spin liquid ground state. Although 1T-TaS$_2$ exhibits a strong resistive behavior both in and out-of plane at low temperatures we find a linear term in the heat capacity suggesting the existence of a Fermi-surface, which has an anomalously strong magnetic field dependence.
△ Less
Submitted 27 September, 2017;
originally announced September 2017.
-
Superconductor-insulator transition in fcc-GeSb2Te4 at elevated pressures
Authors:
Bar Hen,
Samar Layek,
Moshe Goldstein,
Victor Shelukhin,
Mark Shulman,
Michael Karpovski,
Eran Greenberg,
Eran Sterer,
Yoram Dagan,
Gregory Kh. Rozenberg,
Alexander Palevski
Abstract:
We show that polycrystalline GeSb2Te4 in the fcc phase (f-GST), which is an insulator at low temperature at ambient pressure, becomes a superconductor at elevated pressures. Our study of the superconductor to insulator transition versus pressure at low temperatures reveals a second order quantum phase transition with linear scaling (critical exponent close to unity) of the transition temperature w…
▽ More
We show that polycrystalline GeSb2Te4 in the fcc phase (f-GST), which is an insulator at low temperature at ambient pressure, becomes a superconductor at elevated pressures. Our study of the superconductor to insulator transition versus pressure at low temperatures reveals a second order quantum phase transition with linear scaling (critical exponent close to unity) of the transition temperature with the pressure above the critical zero-temperature pressure. In addition, we demonstrate that at higher pressures the f-GST goes through a structural phase transition via amorphization to bcc GST (b-GST), which also become superconducting. We also find that the pressure regime where an inhomogeneous mixture of amorphous and b-GST exists, there is an anomalous peak in magnetoresistance, and suggest an explanation for this anomaly.
△ Less
Submitted 6 September, 2017;
originally announced September 2017.
-
Link between the Superconducting Dome and Spin-Orbit Interaction in the (111) LaAlO$_3$/SrTiO$_3$ Interface
Authors:
P. K. Rout,
E. Maniv,
Y. Dagan
Abstract:
We measure the gate voltage ($V_g$) dependence of the superconducting properties and the spin-orbit interaction in the (111)-oriented LaAlO$_3$/SrTiO$_3$ interface. Superconductivity is observed in a dome-shaped region in the carrier density-temperature phase diagram with the maxima of superconducting transition temperature $T_c$ and the upper critical fields lying at the same $V_g$. The spin-orbi…
▽ More
We measure the gate voltage ($V_g$) dependence of the superconducting properties and the spin-orbit interaction in the (111)-oriented LaAlO$_3$/SrTiO$_3$ interface. Superconductivity is observed in a dome-shaped region in the carrier density-temperature phase diagram with the maxima of superconducting transition temperature $T_c$ and the upper critical fields lying at the same $V_g$. The spin-orbit interaction determined from the superconducting parameters and confirmed by weak-antilocalization measurements follows the same gate voltage dependence as $T_c$. The correlation between the superconductivity and spin-orbit interaction as well as the enhancement of the parallel upper critical field, well beyond the Chandrasekhar-Clogston limit suggest that superconductivity and the spin-orbit interaction are linked in a nontrivial fashion. We propose possible scenarios to explain this unconventional behavior.
△ Less
Submitted 4 December, 2017; v1 submitted 6 June, 2017;
originally announced June 2017.
-
Six-fold crystalline anisotropic magnetoresistance in the (111) LaAlO$_3$/SrTiO$_3$ oxide interface
Authors:
P. K. Rout,
I. Agireen,
E. Maniv,
M. Goldstein,
Y. Dagan
Abstract:
We measured the magnetoresistance of the 2D electron liquid formed at the (111) LaAlO$_3$/SrTiO$_3$ interface. The hexagonal symmetry of the interface is manifested in a six-fold crystalline component appearing in the anisotropic magnetoresistance (AMR) and planar Hall data, which agree well with symmetry analysis we performed. The six-fold component increases with carrier concentration, reaching…
▽ More
We measured the magnetoresistance of the 2D electron liquid formed at the (111) LaAlO$_3$/SrTiO$_3$ interface. The hexagonal symmetry of the interface is manifested in a six-fold crystalline component appearing in the anisotropic magnetoresistance (AMR) and planar Hall data, which agree well with symmetry analysis we performed. The six-fold component increases with carrier concentration, reaching 15% of the total AMR signal. Our results suggest the coupling between higher itinerant electronic bands and the crystal as the origin of this effect and demonstrate that the (111) oxide interface is a unique hexagonal system with tunable magnetocrystalline effects.
△ Less
Submitted 1 June, 2017; v1 submitted 9 January, 2017;
originally announced January 2017.
-
Correlation-Induced Band Competition in SrTiO3/LaAlO3
Authors:
Eran Maniv,
Yoram Dagan,
Moshe Goldstein
Abstract:
The oxide interface SrTiO3/LaAlO3 supports a 2D electron liquid displaying superconductivity and magnetism, while allowing for a continuous control of the electron density using a gate. Our recent measurements have shown a similar surprising nonmonotonic behavior as function of the gate voltage (carrier density) of three quantities: the superconducting critical temperature and field, the inverse H…
▽ More
The oxide interface SrTiO3/LaAlO3 supports a 2D electron liquid displaying superconductivity and magnetism, while allowing for a continuous control of the electron density using a gate. Our recent measurements have shown a similar surprising nonmonotonic behavior as function of the gate voltage (carrier density) of three quantities: the superconducting critical temperature and field, the inverse Hall coefficient, and the frequency of quantum oscillations. While the total density has to be monotonic as function of gate, the last result indicates that one of the involved bands has a nonmontonic occupancy as function of the chemical potential. We show how electronic interactions can lead to such an effect, by creating a competition between the involved bands and making their structure non-rigid, and thus account for all these effects. Adding Fock terms to our previous Hartree treatment makes this scenario even more generic.
△ Less
Submitted 21 February, 2017; v1 submitted 15 December, 2016;
originally announced December 2016.
-
Fermi Surface Reconstruction in the Electron-doped Cuprate Pr(2-x)CexCuO4
Authors:
Yoram Dagan,
Richard L. Greene
Abstract:
We report extensive resistivity, Hall, and magnetoresistance measurements on thin films of the electron-doped cuprate \PCCO~(PCCO), as a function of do**, temperature and magnetic field. The do** dependence of the resistivity and Hall number at low temperatures are characteristic of a system near a quantum phase transition or a Fermi Surface Reconstruction (FSR) point. The spin magnetoresistan…
▽ More
We report extensive resistivity, Hall, and magnetoresistance measurements on thin films of the electron-doped cuprate \PCCO~(PCCO), as a function of do**, temperature and magnetic field. The do** dependence of the resistivity and Hall number at low temperatures are characteristic of a system near a quantum phase transition or a Fermi Surface Reconstruction (FSR) point. The spin magnetoresistance drops to zero near the critical point. The data presented in this paper were compiled during the 2004-2007 period but were never published in this comprehensive form. Because of the recent interest in very similar results now being found in the normal state of hole-doped cuprates, we believe the results of our older, mostly unpublished, work will be of interest to the present community of cuprate researchers. In particular, Fig.11 shows the large change in Hall number at the FSR point in \PCCO, similar to that found recently in YBCO and LSCO (See ref[1] and [2]). Also, Fig.6 illustrates how the resistivity upturn is affected by the FSR. The cause of the resistivity upturn has been attributed to the loss of carriers at do** below the FSR in the hole-doped cuprates (see Ref. 3), however, this scenario does not explain the data for PCCO. The upturn in n-doped cuprates is more-likely due to a combination of carrier decrease and a change in the scattering rate below the FSR (see also Ref. 4). The change in spin scattering below the FSR is illustrated by Fig.18 in this paper. Chen et al. [5] have developed a model based on spin scattering that is able to explain qualitatively the resistivity upturn in all the cuprates.
△ Less
Submitted 6 December, 2016;
originally announced December 2016.
-
Signature of surface state coupling in thin films of the topological Kondo insulator SmB$_6$ from anisotropic magnetoresistance
Authors:
M. Shaviv Petrushevsky,
P. K. Rout,
G. Levi,
A. Kohn,
Y. Dagan
Abstract:
The temperature and thickness dependencies of the in-plane anisotropic magnetoresistance (AMR) of SmB$_6$ thin films are reported. We find that the AMR changes sign from negative ($ρ_{||}<ρ_{\perp}$) at high temperatures to positive ($ρ_{||}>ρ_{\perp}$) at low temperatures. The temperature, T$_s$, at which this sign change occurs, decreases with increasing film thickness $t$ and T$_s$ vanishes for…
▽ More
The temperature and thickness dependencies of the in-plane anisotropic magnetoresistance (AMR) of SmB$_6$ thin films are reported. We find that the AMR changes sign from negative ($ρ_{||}<ρ_{\perp}$) at high temperatures to positive ($ρ_{||}>ρ_{\perp}$) at low temperatures. The temperature, T$_s$, at which this sign change occurs, decreases with increasing film thickness $t$ and T$_s$ vanishes for $t$ $>$ 30 nm. We interpret our results in the framework of a competition between two components: a negative bulk contribution and a positive surface AMR.
△ Less
Submitted 29 March, 2017; v1 submitted 1 December, 2016;
originally announced December 2016.
-
Trading information complexity for error
Authors:
Yuval Dagan,
Yuval Filmus,
Hamed Hatami,
Yaqiao Li
Abstract:
We consider the standard two-party communication model. The central problem studied in this article is how much one can save in information complexity by allowing an error of $ε$.
For arbitrary functions, we obtain lower bounds and upper bounds indicating a gain that is of order $Ω(h(ε))$ and $O(h(\sqrtε))$. Here $h$ denotes the binary entropy function. We analyze the case of the two-bit AND fun…
▽ More
We consider the standard two-party communication model. The central problem studied in this article is how much one can save in information complexity by allowing an error of $ε$.
For arbitrary functions, we obtain lower bounds and upper bounds indicating a gain that is of order $Ω(h(ε))$ and $O(h(\sqrtε))$. Here $h$ denotes the binary entropy function. We analyze the case of the two-bit AND function in detail to show that for this function the gain is $Θ(h(ε))$. This answers a question of [M. Braverman, A. Garg, D. Pankratov, and O. Weinstein, From information to exact communication (extended abstract), STOC'13].
We obtain sharp bounds for the set disjointness function of order $n$. For the case of the distributional error, we introduce a new protocol that achieves a gain of $Θ(\sqrt{h(ε)})$ provided that $n$ is sufficiently large. We apply these results to answer another of question of Braverman et al. regarding the randomized communication complexity of the set disjointness function.
Answering a question of [Mark Braverman, Interactive information complexity, STOC'12], we apply our analysis of the set disjointness function to establish a gap between the two different notions of the prior-free information cost. This implies that amortized randomized communication complexity is not necessarily equal to the amortized distributional communication complexity with respect to the hardest distribution.
△ Less
Submitted 21 November, 2016;
originally announced November 2016.
-
Twenty (simple) questions
Authors:
Yuval Dagan,
Yuval Filmus,
Ariel Gabizon,
Shay Moran
Abstract:
A basic combinatorial interpretation of Shannon's entropy function is via the "20 questions" game. This cooperative game is played by two players, Alice and Bob: Alice picks a distribution $π$ over the numbers $\{1,\ldots,n\}$, and announces it to Bob. She then chooses a number $x$ according to $π$, and Bob attempts to identify $x$ using as few Yes/No queries as possible, on average.
An optimal…
▽ More
A basic combinatorial interpretation of Shannon's entropy function is via the "20 questions" game. This cooperative game is played by two players, Alice and Bob: Alice picks a distribution $π$ over the numbers $\{1,\ldots,n\}$, and announces it to Bob. She then chooses a number $x$ according to $π$, and Bob attempts to identify $x$ using as few Yes/No queries as possible, on average.
An optimal strategy for the "20 questions" game is given by a Huffman code for $π$: Bob's questions reveal the codeword for $x$ bit by bit. This strategy finds $x$ using fewer than $H(π)+1$ questions on average. However, the questions asked by Bob could be arbitrary. In this paper, we investigate the following question: Are there restricted sets of questions that match the performance of Huffman codes, either exactly or approximately?
Our first main result shows that for every distribution $π$, Bob has a strategy that uses only questions of the form "$x < c$?" and "$x = c$?", and uncovers $x$ using at most $H(π)+1$ questions on average, matching the performance of Huffman codes in this sense. We also give a natural set of $O(rn^{1/r})$ questions that achieve a performance of at most $H(π)+r$, and show that $Ω(rn^{1/r})$ questions are required to achieve such a guarantee.
Our second main result gives a set $\mathcal{Q}$ of $1.25^{n+o(n)}$ questions such that for every distribution $π$, Bob can implement an optimal strategy for $π$ using only questions from $\mathcal{Q}$. We also show that $1.25^{n-o(n)}$ questions are needed, for infinitely many $n$. If we allow a small slack of $r$ over the optimal strategy, then roughly $(rn)^{Θ(1/r)}$ questions are necessary and sufficient.
△ Less
Submitted 25 April, 2017; v1 submitted 5 November, 2016;
originally announced November 2016.
-
Tunneling into a quantum confinement created by a single-step nano-lithography of conducting oxide interfaces
Authors:
E. Maniv,
A. Ron,
M. Goldstein,
A. Palevski,
Y. Dagan
Abstract:
A new nano-lithography technique compatible with conducting oxide interfaces, which requires a single lithographic step with no additional amorphous layer deposition or etching, is presented. It is demonstrated on SrTiO3/LaAlO3 interface where a constriction is patterned in the electron liquid. We find that an additional back-gating can further confine the electron liquid into an isolated island.…
▽ More
A new nano-lithography technique compatible with conducting oxide interfaces, which requires a single lithographic step with no additional amorphous layer deposition or etching, is presented. It is demonstrated on SrTiO3/LaAlO3 interface where a constriction is patterned in the electron liquid. We find that an additional back-gating can further confine the electron liquid into an isolated island. Conductance and differential conductance measurements show resonant tunneling through the island. The data at various temperatures and magnetic fields are analyzed and the effective island size is found to be of the order of 10nm. The magnetic field dependence suggests absence of spin degeneracy in the island. Our method is suitable for creating superconducting and oxide-interface based electronic devices.
△ Less
Submitted 30 June, 2016;
originally announced June 2016.