Search | arXiv e-print repository

Designing high-fidelity two-qubit gates between fluxonium qubits

Authors: Emma L. Rosenfeld, Connor T. Hann, David I. Schuster, Matthew H. Matheny, Aashish A. Clerk

Abstract: We take a bottom-up, first-principles approach to design a two-qubit gate between fluxonium qubits for minimal error, speed, and control simplicity. Our proposed architecture consists of two fluxoniums coupled via a linear resonator. Using a linear coupler introduces the possibility of material optimization for suppressing its loss, enables efficient driving of state-selective transitions through… ▽ More We take a bottom-up, first-principles approach to design a two-qubit gate between fluxonium qubits for minimal error, speed, and control simplicity. Our proposed architecture consists of two fluxoniums coupled via a linear resonator. Using a linear coupler introduces the possibility of material optimization for suppressing its loss, enables efficient driving of state-selective transitions through its large charge zero point fluctuation, reduces sensitivity to junction aging, and partially mitigates coherent coupling to two-level systems. Crucially, a resonator-as-coupler approach also suggests a clear path to increased connectivity between fluxonium qubits, by reducing capacitive loading when the coupler has a high impedance. After performing analytic and numeric analyses of the circuit Hamiltonian and gate dynamics, we tune circuit parameters to destructively interfere sources of coherent error, revealing an efficient, fourth-order scaling of coherent error with gate duration. For component properties from the literature, we predict an open-system average CZ gate infidelity of $1.86 \times 10^{-4}$ in 70ns. △ Less

Submitted 19 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: Minor wording revisions, changed color of arrows in Fig.1a, added section 4.3 to the supplementary information

arXiv:2311.04163 [pdf, other]

Outliers with Opposing Signals Have an Outsized Effect on Neural Network Optimization

Authors: Elan Rosenfeld, Andrej Risteski

Abstract: We identify a new phenomenon in neural network optimization which arises from the interaction of depth and a particular heavy-tailed structure in natural data. Our result offers intuitive explanations for several previously reported observations about network training dynamics. In particular, it implies a conceptually new cause for progressive sharpening and the edge of stability; we also highligh… ▽ More We identify a new phenomenon in neural network optimization which arises from the interaction of depth and a particular heavy-tailed structure in natural data. Our result offers intuitive explanations for several previously reported observations about network training dynamics. In particular, it implies a conceptually new cause for progressive sharpening and the edge of stability; we also highlight connections to other concepts in optimization and generalization including grokking, simplicity bias, and Sharpness-Aware Minimization. Experimentally, we demonstrate the significant influence of paired groups of outliers in the training data with strong opposing signals: consistent, large magnitude features which dominate the network output throughout training and provide gradients which point in opposite directions. Due to these outliers, early optimization enters a narrow valley which carefully balances the opposing groups; subsequent sharpening causes their loss to rise rapidly, oscillating between high on one group and then the other, until the overall loss spikes. We describe how to identify these groups, explore what sets them apart, and carefully study their effect on the network's optimization and behavior. We complement these experiments with a mechanistic explanation on a toy example of opposing signals and a theoretical analysis of a two-layer linear network on a simple model. Our finding enables new qualitative predictions of training behavior which we confirm experimentally. It also provides a new lens through which to study and improve modern training practices for stochastic optimization, which we highlight via a case study of Adam versus SGD. △ Less

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2311.02761 [pdf, other]

One-Shot Strategic Classification Under Unknown Costs

Authors: Elan Rosenfeld, Nir Rosenfeld

Abstract: The goal of strategic classification is to learn decision rules which are robust to strategic input manipulation. Earlier works assume that these responses are known; while some recent works handle unknown responses, they exclusively study online settings with repeated model deployments. But there are many domains$\unicode{x2014}$particularly in public policy, a common motivating use case… ▽ More The goal of strategic classification is to learn decision rules which are robust to strategic input manipulation. Earlier works assume that these responses are known; while some recent works handle unknown responses, they exclusively study online settings with repeated model deployments. But there are many domains$\unicode{x2014}$particularly in public policy, a common motivating use case$\unicode{x2014}$where multiple deployments are infeasible, or where even one bad round is unacceptable. To address this gap, we initiate the formal study of one-shot strategic classification under unknown responses, which requires committing to a single classifier once. Focusing on uncertainty in the users' cost function, we begin by proving that for a broad class of costs, even a small mis-estimation of the true cost can entail trivial accuracy in the worst case. In light of this, we frame the task as a minimax problem, aiming to minimize worst-case risk over an uncertainty set of costs. We design efficient algorithms for both the full-batch and stochastic settings, which we prove converge (offline) to the minimax solution at the rate of $\tilde{\mathcal{O}}(T^{-\frac{1}{2}})$. Our analysis reveals important structure stemming from strategic responses, particularly the value of dual norm regularization with respect to the cost function. △ Less

Submitted 20 June, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

Comments: Accepted to ICML 2024

arXiv:2310.04295 [pdf, other]

Identifying Representations for Intervention Extrapolation

Authors: Sorawit Saengkyongam, Elan Rosenfeld, Pradeep Ravikumar, Niklas Pfister, Jonas Peters

Abstract: The premise of identifiable and causal representation learning is to improve the current representation learning paradigm in terms of generalizability or robustness. Despite recent progress in questions of identifiability, more theoretical results demonstrating concrete advantages of these methods for downstream tasks are needed. In this paper, we consider the task of intervention extrapolation: p… ▽ More The premise of identifiable and causal representation learning is to improve the current representation learning paradigm in terms of generalizability or robustness. Despite recent progress in questions of identifiability, more theoretical results demonstrating concrete advantages of these methods for downstream tasks are needed. In this paper, we consider the task of intervention extrapolation: predicting how interventions affect an outcome, even when those interventions are not observed at training time, and show that identifiable representations can provide an effective solution to this task even if the interventions affect the outcome non-linearly. Our setup includes an outcome Y, observed features X, which are generated as a non-linear transformation of latent features Z, and exogenous action variables A, which influence Z. The objective of intervention extrapolation is to predict how interventions on A that lie outside the training support of A affect Y. Here, extrapolation becomes possible if the effect of A on Z is linear and the residual when regressing Z on A has full support. As Z is latent, we combine the task of intervention extrapolation with identifiable representation learning, which we call Rep4Ex: we aim to map the observed features X into a subspace that allows for non-linear extrapolation in A. We show that the hidden representation is identifiable up to an affine transformation in Z-space, which is sufficient for intervention extrapolation. The identifiability is characterized by a novel constraint describing the linearity assumption of A on Z. Based on this insight, we propose a method that enforces the linear invariance constraint and can be combined with any type of autoencoder. We validate our theoretical findings through synthetic experiments and show that our approach succeeds in predicting the effects of unseen interventions. △ Less

Submitted 5 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: Accepted at the International Conference on Learning Representations (ICLR) 2024

arXiv:2307.12193 [pdf, other]

Programmable Quantum Processors based on Spin Qubits with Mechanically-Mediated Interactions and Transport

Authors: F. Fung, E. Rosenfeld, J. D. Schaefer, A. Kabcenell, J. Gieseler, T. X. Zhou, T. Madhavan, N. Aslam, A. Yacoby, M. D. Lukin

Abstract: Solid state spin qubits are promising candidates for quantum information processing, but controlled interactions and entanglement in large, multi-qubit systems are currently difficult to achieve. We describe a method for programmable control of multi-qubit spin systems, in which individual nitrogen-vacancy (NV) centers in diamond nanopillars are coupled to magnetically functionalized silicon nitri… ▽ More Solid state spin qubits are promising candidates for quantum information processing, but controlled interactions and entanglement in large, multi-qubit systems are currently difficult to achieve. We describe a method for programmable control of multi-qubit spin systems, in which individual nitrogen-vacancy (NV) centers in diamond nanopillars are coupled to magnetically functionalized silicon nitride mechanical resonators in a scanning probe configuration. Qubits can be entangled via interactions with nanomechanical resonators while programmable connectivity is realized via mechanical transport of qubits in nanopillars. To demonstrate the feasibility of this approach, we characterize both the mechanical properties and the magnetic field gradients around the micromagnet placed on the nanobeam resonator. Furthermore, we show coherent manipulation and mechanical transport of a proximal spin qubit by utilizing nuclear spin memory, and use the NV center to detect the time-varying magnetic field from the oscillating micromagnet, extracting a spin-mechanical coupling of 7.7(9) Hz. With realistic improvements the high-cooperativity regime can be reached, offering a new avenue towards scalable quantum information processing with spin qubits. △ Less

Submitted 22 July, 2023; originally announced July 2023.

Comments: 7 pages, 4 figures

arXiv:2306.02235 [pdf, other]

Learning Linear Causal Representations from Interventions under General Nonlinear Mixing

Authors: Simon Buchholz, Goutham Rajendran, Elan Rosenfeld, Bryon Aragam, Bernhard Schölkopf, Pradeep Ravikumar

Abstract: We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability results given unknown single-node interventions, i.e., without having access to the intervention targets. This generalizes prior works which have focused on weaker cl… ▽ More We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability results given unknown single-node interventions, i.e., without having access to the intervention targets. This generalizes prior works which have focused on weaker classes, such as linear maps or paired counterfactual data. This is also the first instance of causal identifiability from non-paired interventions for deep neural network embeddings. Our proof relies on carefully uncovering the high-dimensional geometric structure present in the data distribution after a non-linear density transformation, which we capture by analyzing quadratic forms of precision matrices of the latent distributions. Finally, we propose a contrastive algorithm to identify the latent variables in practice and evaluate its performance on various tasks. △ Less

Submitted 18 December, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

Comments: Accepted as Oral paper at NeurIPS 2023

arXiv:2306.00312 [pdf, other]

(Almost) Provable Error Bounds Under Distribution Shift via Disagreement Discrepancy

Authors: Elan Rosenfeld, Saurabh Garg

Abstract: We derive an (almost) guaranteed upper bound on the error of deep neural networks under distribution shift using unlabeled test data. Prior methods either give bounds that are vacuous in practice or give estimates that are accurate on average but heavily underestimate error for a sizeable fraction of shifts. In particular, the latter only give guarantees based on complex continuous measures such a… ▽ More We derive an (almost) guaranteed upper bound on the error of deep neural networks under distribution shift using unlabeled test data. Prior methods either give bounds that are vacuous in practice or give estimates that are accurate on average but heavily underestimate error for a sizeable fraction of shifts. In particular, the latter only give guarantees based on complex continuous measures such as test calibration -- which cannot be identified without labels -- and are therefore unreliable. Instead, our bound requires a simple, intuitive condition which is well justified by prior empirical works and holds in practice effectively 100% of the time. The bound is inspired by $\mathcal{H}Δ\mathcal{H}$-divergence but is easier to evaluate and substantially tighter, consistently providing non-vacuous guarantees. Estimating the bound requires optimizing one multiclass classifier to disagree with another, for which some prior works have used sub-optimal proxy losses; we devise a "disagreement loss" which is theoretically justified and performs better in practice. We expect this loss can serve as a drop-in replacement for future methods which require maximizing multiclass disagreement. Across a wide range of benchmarks, our method gives valid error bounds while achieving average accuracy comparable to competitive estimation baselines. Code is publicly available at https://github.com/erosenfeld/disagree_discrep . △ Less

Submitted 31 May, 2023; originally announced June 2023.

arXiv:2210.03927 [pdf, other]

APE: Aligning Pretrained Encoders to Quickly Learn Aligned Multimodal Representations

Authors: Elan Rosenfeld, Preetum Nakkiran, Hadi Pouransari, Oncel Tuzel, Fartash Faghri

Abstract: Recent advances in learning aligned multimodal representations have been primarily driven by training large neural networks on massive, noisy paired-modality datasets. In this work, we ask whether it is possible to achieve similar results with substantially less training time and data. We achieve this by taking advantage of existing pretrained unimodal encoders and careful curation of alignment da… ▽ More Recent advances in learning aligned multimodal representations have been primarily driven by training large neural networks on massive, noisy paired-modality datasets. In this work, we ask whether it is possible to achieve similar results with substantially less training time and data. We achieve this by taking advantage of existing pretrained unimodal encoders and careful curation of alignment data relevant to the downstream task of interest. We study a natural approach to aligning existing encoders via small auxiliary functions, and we find that this method is competitive with (or outperforms) state of the art in many settings while being less prone to overfitting, less costly to train, and more robust to distribution shift. With a properly chosen alignment distribution, our method surpasses prior state of the art for ImageNet zero-shot classification on public data while using two orders of magnitude less time and data and training 77% fewer parameters. △ Less

Submitted 8 October, 2022; originally announced October 2022.

arXiv:2202.06856 [pdf, other]

Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient for Out-of-Distribution Generalization

Authors: Elan Rosenfeld, Pradeep Ravikumar, Andrej Risteski

Abstract: A common explanation for the failure of deep networks to generalize out-of-distribution is that they fail to recover the "correct" features. We challenge this notion with a simple experiment which suggests that ERM already learns sufficient features and that the current bottleneck is not feature learning, but robust regression. Our findings also imply that given a small amount of data from the tar… ▽ More A common explanation for the failure of deep networks to generalize out-of-distribution is that they fail to recover the "correct" features. We challenge this notion with a simple experiment which suggests that ERM already learns sufficient features and that the current bottleneck is not feature learning, but robust regression. Our findings also imply that given a small amount of data from the target distribution, retraining only the last linear layer will give excellent performance. We therefore argue that devising simpler methods for learning predictors on existing features is a promising direction for future research. Towards this end, we introduce Domain-Adjusted Regression (DARE), a convex objective for learning a linear predictor that is provably robust under a new model of distribution shift. Rather than learning one function, DARE performs a domain-specific adjustment to unify the domains in a canonical latent space and learns to predict in this space. Under a natural model, we prove that the DARE solution is the minimax-optimal predictor for a constrained set of test distributions. Further, we provide the first finite-environment convergence guarantee to the minimax risk, improving over existing analyses which only yield minimax predictors after an environment threshold. Evaluated on finetuned features, we find that DARE compares favorably to prior methods, consistently achieving equal or better performance. △ Less

Submitted 27 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

arXiv:2110.11271 [pdf, other]

Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation

Authors: Bingbin Liu, Elan Rosenfeld, Pradeep Ravikumar, Andrej Risteski

Abstract: Noise-contrastive estimation (NCE) is a statistically consistent method for learning unnormalized probabilistic models. It has been empirically observed that the choice of the noise distribution is crucial for NCE's performance. However, such observations have never been made formal or quantitative. In fact, it is not even clear whether the difficulties arising from a poorly chosen noise distribut… ▽ More Noise-contrastive estimation (NCE) is a statistically consistent method for learning unnormalized probabilistic models. It has been empirically observed that the choice of the noise distribution is crucial for NCE's performance. However, such observations have never been made formal or quantitative. In fact, it is not even clear whether the difficulties arising from a poorly chosen noise distribution are statistical or algorithmic in nature. In this work, we formally pinpoint reasons for NCE's poor performance when an inappropriate noise distribution is used. Namely, we prove these challenges arise due to an ill-behaved (more precisely, flat) loss landscape. To address this, we introduce a variant of NCE called "eNCE" which uses an exponential loss and for which normalized gradient descent addresses the landscape issues provably when the target and noise distributions are in a given exponential family. △ Less

Submitted 21 October, 2021; originally announced October 2021.

arXiv:2106.09913 [pdf, other]

Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments

Authors: Yining Chen, Elan Rosenfeld, Mark Sellke, Tengyu Ma, Andrej Risteski

Abstract: Domain generalization aims at performing well on unseen test environments with data from a limited number of training environments. Despite a proliferation of proposal algorithms for this task, assessing their performance both theoretically and empirically is still very challenging. Distributional matching algorithms such as (Conditional) Domain Adversarial Networks [Ganin et al., 2016, Long et al… ▽ More Domain generalization aims at performing well on unseen test environments with data from a limited number of training environments. Despite a proliferation of proposal algorithms for this task, assessing their performance both theoretically and empirically is still very challenging. Distributional matching algorithms such as (Conditional) Domain Adversarial Networks [Ganin et al., 2016, Long et al., 2018] are popular and enjoy empirical success, but they lack formal guarantees. Other approaches such as Invariant Risk Minimization (IRM) require a prohibitively large number of training environments -- linear in the dimension of the spurious feature space $d_s$ -- even on simple data models like the one proposed by [Rosenfeld et al., 2021]. Under a variant of this model, we show that both ERM and IRM cannot generalize with $o(d_s)$ environments. We then present an iterative feature matching algorithm that is guaranteed with high probability to yield a predictor that generalizes after seeing only $O(\log d_s)$ environments. Our results provide the first theoretical justification for a family of distribution-matching algorithms widely used in practice under a concrete nontrivial data model. △ Less

Submitted 22 November, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

Comments: We acknowledge that the previous version of this paper (v1) contained an error - Theorem 3.2 was incorrect. We removed this theorem and updated the rest of the paper in v2

arXiv:2102.13128 [pdf, other]

An Online Learning Approach to Interpolation and Extrapolation in Domain Generalization

Authors: Elan Rosenfeld, Pradeep Ravikumar, Andrej Risteski

Abstract: A popular assumption for out-of-distribution generalization is that the training data comprises sub-datasets, each drawn from a distinct distribution; the goal is then to "interpolate" these distributions and "extrapolate" beyond them -- this objective is broadly known as domain generalization. A common belief is that ERM can interpolate but not extrapolate and that the latter is considerably more… ▽ More A popular assumption for out-of-distribution generalization is that the training data comprises sub-datasets, each drawn from a distinct distribution; the goal is then to "interpolate" these distributions and "extrapolate" beyond them -- this objective is broadly known as domain generalization. A common belief is that ERM can interpolate but not extrapolate and that the latter is considerably more difficult, but these claims are vague and lack formal justification. In this work, we recast generalization over sub-groups as an online game between a player minimizing risk and an adversary presenting new test distributions. Under an existing notion of inter- and extrapolation based on reweighting of sub-group likelihoods, we rigorously demonstrate that extrapolation is computationally much harder than interpolation, though their statistical complexity is not significantly different. Furthermore, we show that ERM -- or a noisy variant -- is provably minimax-optimal for both tasks. Our framework presents a new avenue for the formal analysis of domain generalization algorithms which may be of independent interest. △ Less

Submitted 18 November, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

arXiv:2011.02623 [pdf, other]

doi 10.1103/PhysRevLett.126.250505

Efficient entanglement of spin qubits mediated by a hot mechanical oscillator

Authors: Emma Rosenfeld, Ralf Riedinger, Jan Gieseler, Martin Schuetz, Mikhail D. Lukin

Abstract: Localized electronic and nuclear spin qubits in the solid state constitute a promising platform for storage and manipulation of quantum information, even at room temperature. However, the development of scalable systems requires the ability to entangle distant spins, which remains a challenge today. We propose and analyze an efficient, heralded scheme that employs a parity measurement in a decoher… ▽ More Localized electronic and nuclear spin qubits in the solid state constitute a promising platform for storage and manipulation of quantum information, even at room temperature. However, the development of scalable systems requires the ability to entangle distant spins, which remains a challenge today. We propose and analyze an efficient, heralded scheme that employs a parity measurement in a decoherence free subspace to enable fast and robust entanglement generation between distant spin qubits mediated by a hot mechanical oscillator. We find that high-fidelity entanglement at cryogenic and even ambient temperatures is feasible with realistic parameters, and show that the entangled pair can be subsequently leveraged for deterministic controlled-NOT operations between nuclear spins. Our results open the door for novel quantum processing architectures for a wide variety of solid-state spin qubits. △ Less

Submitted 4 November, 2020; originally announced November 2020.

Journal ref: Phys. Rev. Lett. 126, 250505 (2021)

arXiv:2010.05761 [pdf, other]

The Risks of Invariant Risk Minimization

Authors: Elan Rosenfeld, Pradeep Ravikumar, Andrej Risteski

Abstract: Invariant Causal Prediction (Peters et al., 2016) is a technique for out-of-distribution generalization which assumes that some aspects of the data distribution vary across the training set but that the underlying causal mechanisms remain constant. Recently, Arjovsky et al. (2019) proposed Invariant Risk Minimization (IRM), an objective based on this idea for learning deep, invariant features of d… ▽ More Invariant Causal Prediction (Peters et al., 2016) is a technique for out-of-distribution generalization which assumes that some aspects of the data distribution vary across the training set but that the underlying causal mechanisms remain constant. Recently, Arjovsky et al. (2019) proposed Invariant Risk Minimization (IRM), an objective based on this idea for learning deep, invariant features of data which are a complex function of latent variables; many alternatives have subsequently been suggested. However, formal guarantees for all of these works are severely lacking. In this paper, we present the first analysis of classification under the IRM objective--as well as these recently proposed alternatives--under a fairly natural and general model. In the linear case, we show simple conditions under which the optimal solution succeeds or, more often, fails to recover the optimal invariant predictor. We furthermore present the very first results in the non-linear regime: we demonstrate that IRM can fail catastrophically unless the test data are sufficiently similar to the training distribution--this is precisely the issue that it was intended to solve. Thus, in this setting we find that IRM and its alternatives fundamentally do not improve over standard Empirical Risk Minimization. △ Less

Submitted 27 March, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

Comments: ICLR 2021 Camera-Ready

arXiv:2007.05166 [pdf, other]

Self-Reflective Variational Autoencoder

Authors: Ifigeneia Apostolopoulou, Elan Rosenfeld, Artur Dubrawski

Abstract: The Variational Autoencoder (VAE) is a powerful framework for learning probabilistic latent variable generative models. However, typical assumptions on the approximate posterior distribution of the encoder and/or the prior, seriously restrict its capacity for inference and generative modeling. Variational inference based on neural autoregressive models respects the conditional dependencies of the… ▽ More The Variational Autoencoder (VAE) is a powerful framework for learning probabilistic latent variable generative models. However, typical assumptions on the approximate posterior distribution of the encoder and/or the prior, seriously restrict its capacity for inference and generative modeling. Variational inference based on neural autoregressive models respects the conditional dependencies of the exact posterior, but this flexibility comes at a cost: such models are expensive to train in high-dimensional regimes and can be slow to produce samples. In this work, we introduce an orthogonal solution, which we call self-reflective inference. By redesigning the hierarchical structure of existing VAE architectures, self-reflection ensures that the stochastic flow preserves the factorization of the exact posterior, sequentially updating the latent codes in a recurrent manner consistent with the generative model. We empirically demonstrate the clear advantages of matching the variational posterior to the exact posterior - on binarized MNIST, self-reflective inference achieves state-of-the art performance without resorting to complex, computationally expensive components such as autoregressive layers. Moreover, we design a variational normalizing flow that employs the proposed architecture, yielding predictive benefits compared to its purely generative counterpart. Our proposed modification is quite general and complements the existing literature; self-reflective inference can naturally leverage advances in distribution estimation and generative modeling to improve the capacity of each layer in the hierarchy. △ Less

Submitted 10 July, 2020; originally announced July 2020.

arXiv:2002.03018 [pdf, other]

Certified Robustness to Label-Flip** Attacks via Randomized Smoothing

Authors: Elan Rosenfeld, Ezra Winston, Pradeep Ravikumar, J. Zico Kolter

Abstract: Machine learning algorithms are known to be susceptible to data poisoning attacks, where an adversary manipulates the training data to degrade performance of the resulting classifier. In this work, we present a unifying view of randomized smoothing over arbitrary functions, and we leverage this novel characterization to propose a new strategy for building classifiers that are pointwise-certifiably… ▽ More Machine learning algorithms are known to be susceptible to data poisoning attacks, where an adversary manipulates the training data to degrade performance of the resulting classifier. In this work, we present a unifying view of randomized smoothing over arbitrary functions, and we leverage this novel characterization to propose a new strategy for building classifiers that are pointwise-certifiably robust to general data poisoning attacks. As a specific instantiation, we utilize our framework to build linear classifiers that are robust to a strong variant of label flip**, where each test example is targeted independently. In other words, for each test point, our classifier includes a certification that its prediction would be the same had some number of training labels been changed adversarially. Randomized smoothing has previously been used to guarantee---with high probability---test-time robustness to adversarial manipulation of the input to a classifier; we derive a variant which provides a deterministic, analytical bound, sidestep** the probabilistic certificates that traditionally result from the sampling subprocedure. Further, we obtain these certified bounds with minimal additional runtime complexity over standard classification and no assumptions on the train or test distributions. We generalize our results to the multi-class case, providing the first multi-class classification algorithm that is certifiably robust to label-flip** attacks. △ Less

Submitted 11 August, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

Comments: ICML 2020

arXiv:1912.10397 [pdf, other]

doi 10.1103/PhysRevLett.124.163604

Single-Spin Magnetomechanics with Levitated Micromagnets

Authors: Jan Gieseler, Aaron Kabcenell, Emma Rosenfeld, J. D. Schaefer, Arthur Safira, Martin J. A. Schuetz, Carlos Gonzalez-Ballestero, Cosimo C. Rusconi, Oriol Romero-Isart, Mikhail D. Lukin

Abstract: We demonstrate a new mechanical transduction platform for individual spin qubits. In our approach, single micro-magnets are trapped using a type-II superconductor in proximity of spin qubits, enabling direct magnetic coupling between the two systems. Controlling the distance between the magnet and the superconductor during cooldown, we demonstrate three dimensional trap** with quality factors ar… ▽ More We demonstrate a new mechanical transduction platform for individual spin qubits. In our approach, single micro-magnets are trapped using a type-II superconductor in proximity of spin qubits, enabling direct magnetic coupling between the two systems. Controlling the distance between the magnet and the superconductor during cooldown, we demonstrate three dimensional trap** with quality factors around one million and kHz trap** frequencies. We further exploit the large magnetic moment to mass ratio of this mechanical oscillator to couple its motion to the spin degree of freedom of an individual nitrogen vacancy center in diamond. Our approach provides a new path towards interfacing individual spin qubits with mechanical motion for testing quantum mechanics with mesoscopic objects, realization of quantum networks, and ultra-sensitive metrology. △ Less

Submitted 22 December, 2019; originally announced December 2019.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. Lett. 124, 163604 (2020)

arXiv:1906.00029 [pdf, other]

Human-Usable Password Schemas: Beyond Information-Theoretic Security

Authors: Elan Rosenfeld, Santosh Vempala, Manuel Blum

Abstract: Password users frequently employ passwords that are too simple, or they just reuse passwords for multiple websites. A common complaint is that utilizing secure passwords is too difficult. One possible solution to this problem is to use a password schema. Password schemas are deterministic functions which map challenges (typically the website name) to responses (passwords). Previous work has been d… ▽ More Password users frequently employ passwords that are too simple, or they just reuse passwords for multiple websites. A common complaint is that utilizing secure passwords is too difficult. One possible solution to this problem is to use a password schema. Password schemas are deterministic functions which map challenges (typically the website name) to responses (passwords). Previous work has been done on develo** and analyzing publishable schemas, but these analyses have been information-theoretic, not complexity-theoretic; they consider an adversary with infinite computing power. We perform an analysis with respect to adversaries having currently achievable computing capabilities, assessing the realistic practical security of such schemas. We prove for several specific schemas that a computer is no worse off than an infinite adversary and that it can successfully extract all information from leaked challenges and their respective responses, known as challenge-response pairs. We also show that any schema that hopes to be secure against adversaries with bounded computation should obscure information in a very specific way, by introducing many possible constraints with each challenge-response pair. These surprising results put the analyses of password schemas on a more solid and practical footing. △ Less

Submitted 31 May, 2019; originally announced June 2019.

arXiv:1902.02918 [pdf, other]

Certified Adversarial Robustness via Randomized Smoothing

Authors: Jeremy M Cohen, Elan Rosenfeld, J. Zico Kolter

Abstract: We show how to turn any classifier that classifies well under Gaussian noise into a new classifier that is certifiably robust to adversarial perturbations under the $\ell_2$ norm. This "randomized smoothing" technique has been proposed recently in the literature, but existing guarantees are loose. We prove a tight robustness guarantee in $\ell_2$ norm for smoothing with Gaussian noise. We use rand… ▽ More We show how to turn any classifier that classifies well under Gaussian noise into a new classifier that is certifiably robust to adversarial perturbations under the $\ell_2$ norm. This "randomized smoothing" technique has been proposed recently in the literature, but existing guarantees are loose. We prove a tight robustness guarantee in $\ell_2$ norm for smoothing with Gaussian noise. We use randomized smoothing to obtain an ImageNet classifier with e.g. a certified top-1 accuracy of 49% under adversarial perturbations with $\ell_2$ norm less than 0.5 (=127/255). No certified defense has been shown feasible on ImageNet except for smoothing. On smaller-scale datasets where competing approaches to certified $\ell_2$ robustness are viable, smoothing delivers higher certified accuracies. Our strong empirical results suggest that randomized smoothing is a promising direction for future research into adversarially robust classification. Code and models are available at http://github.com/locuslab/smoothing. △ Less

Submitted 15 June, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

Comments: ICML 2019

arXiv:1801.00198 [pdf, other]

doi 10.1103/PhysRevLett.120.243604

Sensing coherent dynamics of electronic spin clusters in solids

Authors: Emma L. Rosenfeld, Linh M. Pham, Mikhail D. Lukin, Ronald L. Walsworth

Abstract: We present experimental observations and a study of quantum dynamics of strongly interacting electronic spins, at room temperature in the solid state. In a diamond substrate, a single nitrogen vacancy (NV) center coherently interacts with two adjacent S = 1/2 dark electron spins. We quantify NV-electron and electron-electron couplings via detailed spectroscopy, with good agreement to a model of st… ▽ More We present experimental observations and a study of quantum dynamics of strongly interacting electronic spins, at room temperature in the solid state. In a diamond substrate, a single nitrogen vacancy (NV) center coherently interacts with two adjacent S = 1/2 dark electron spins. We quantify NV-electron and electron-electron couplings via detailed spectroscopy, with good agreement to a model of strongly interacting spins. The electron-electron coupling enables an observation of coherent flip-flop dynamics between electronic spins in the solid state, which occur conditionally on the state of the NV. Finally, as a demonstration of coherent control, we selectively couple and transfer polarization between the NV and the pair of electron spins. These results demonstrate a key step towards full quantum control of electronic spin registers in room temperature solids. △ Less

Submitted 14 June, 2018; v1 submitted 30 December, 2017; originally announced January 2018.

Journal ref: Phys. Rev. Lett. 120, 243604 (2018)

arXiv:1706.09664 [pdf]

Dust levitation above the lunar surface: role of charge fluctuations

Authors: E V Rosenfeld, A V Zakharov

Abstract: The most likely cause of levitation of dust above the surface of atmosphereless planets is the electrostatic mechanism. However, the crucial problem in the explanation of this effect is a determination of the reason why a large electric charge (units or even dozens of elementary charges) required for take-off can be accumulated on the smallest dust particles. Due to the photoeffect the charge of s… ▽ More The most likely cause of levitation of dust above the surface of atmosphereless planets is the electrostatic mechanism. However, the crucial problem in the explanation of this effect is a determination of the reason why a large electric charge (units or even dozens of elementary charges) required for take-off can be accumulated on the smallest dust particles. Due to the photoeffect the charge of such value could be easily accumulated on a solitary dust particle, but if a dust particle has not yet taken off, the average value of its charge is several orders of magnitude lower because of the extremely low charge density on the planet's surface. The paper shows that surface charge density is really small only for averaging over regions of macroscopic size, and on a submicron scale the surface appear to be a collection of chaotic "spots" with charges of different signs. The positively charged "spots" are only slightly larger than the negatively charged spots, which explains the small value of the charge density averaged over macroscopic regions. "Spots" arise due to fluctuations in the fluxes of the photoelectrons taking off and falling back the surface, and the charge density inside the "spots" is sufficient to allow a takeoff of particles with the dimensions less than or on the order of 0.1 micron in the field of a double layer. △ Less

Submitted 29 June, 2017; originally announced June 2017.

arXiv:1706.01311 [pdf]

Trigger-type metamaterials on the base of collective Jahn-Teller effect

Authors: E. V. Rosenfeld

Abstract: Generally, in case of the collective Jahn-Teller effect, a high-symmetry structure of a matrix in which quantum systems with degenerate ground state are inserted becomes distorted. This usually smooth transition can become abrupt only if the matrix by itself is a trigger and JTE merely activates its switching. It is shown in this paper that proper insertion into matrix of quantum systems with the… ▽ More Generally, in case of the collective Jahn-Teller effect, a high-symmetry structure of a matrix in which quantum systems with degenerate ground state are inserted becomes distorted. This usually smooth transition can become abrupt only if the matrix by itself is a trigger and JTE merely activates its switching. It is shown in this paper that proper insertion into matrix of quantum systems with the singlet ground state and degenerate excited state leads to the formation of a new metastable state of the whole system and a stepwise appearance of JTE. A matrix of any nature can be transformed into trigger in this way if one manages to synthesize and insert into it proper quantum active centers with appropriate energy spectrum. Theoretically, this provides advanced possibilities for metamaterials development. △ Less

Submitted 29 May, 2017; originally announced June 2017.

Comments: 9 pages, 4 figues

arXiv:1611.00811 [pdf]

Role of stochastic processes in particle charging due to photoeffect on the Moon

Authors: Eugene V. Rosenfeld, Alexander V. Zakharov

Abstract: Neglecting the effects associated with the solar wind plasma, the photoelectrons are the only elementary particles which create an electrical current through sunlit surface of the moon. They are knocked off of the surface soil, rise above the surface, and then fall back. Therefore, on average, on any unit of surface area there is a positive charge, equal in magnitude to the charge of photoelectron… ▽ More Neglecting the effects associated with the solar wind plasma, the photoelectrons are the only elementary particles which create an electrical current through sunlit surface of the moon. They are knocked off of the surface soil, rise above the surface, and then fall back. Therefore, on average, on any unit of surface area there is a positive charge, equal in magnitude to the charge of photoelectrons flying over this area. However, the charge of any small dust particle can strongly fluctuate discretely: a photoelectron can be either knocked off of the or be reacquired by the particle. The result is a "random walk" in sign and magnitude of the charge of grains. In a few minutes after sunrise, almost every dust particle on the surface has at least one extra or missing electron, and the average modulus of the charge accumulated on a particle is proportional to the square root of the number of "steps" (knocking off /returns of photoelectrons). Therefore, the average value of the modulus of the charge of a fine dust particle exceeds by several orders of magnitude the proportion of the average surface charge attributable to the particle. So dust particles that have ejected a sufficient number of photoelectrons can take off from the surface because of the electric field of the near surface charge double layer. It is shown that: (i) almost half of all dust particles on the illuminated lunar surface are missing at least one electron; (ii) a significant portion of particles up to 100 nm in size emits several photoelectrons and acquire a positive charge, sufficient to take off from the surface; (iii) above the surface there is a "boiling layer" of dust with a maximum thickness of several hundred meters where the average size of the particles and their density non-monotonically depend on the altitude. △ Less

Submitted 1 November, 2016; originally announced November 2016.

Comments: 15 pages, 2figures

arXiv:1602.08913 [pdf]

X-ray experiment provides a way to reveal the distinction between discrete and continuous conformation of myosin head

Authors: E. V. Rosenfeld

Abstract: The corner stone of the classical model after Huxley and Simmons is supposition that a myosin head can reside only in several discrete states and irregularly jumps from one state to another. Until now, it has not been found a way to experimentally verify this supposition although confirmation or refutation of the existence of discrete states is crucial for the solution of myosin motor problem. Her… ▽ More The corner stone of the classical model after Huxley and Simmons is supposition that a myosin head can reside only in several discrete states and irregularly jumps from one state to another. Until now, it has not been found a way to experimentally verify this supposition although confirmation or refutation of the existence of discrete states is crucial for the solution of myosin motor problem. Here I show that a set of equal myosin heads arranged equidistantly along an actin filament produce X-ray pattern which varies with the type of conformation. If the lever arms of all myosin heads reside in one and the same position (continuous conformation), all the heads have the same form-factor and equally scatter electromagnetic wave. In this case, only the geometric factor associated with a spatial ordering of the heads will determine the X-ray pattern. The situation changes if the average lever arm position is the same, but inherently every head can reside only in several diverse discrete states, hop** irregularly from one to another. In this case, the form-factors corresponding to distinct states are dissimilar, and the increments in phases of X-rays scattered by different heads are different as well. Inasmuch as every quantum of radiation interacts with the heads residing in different states, this results in additional interference and some peaks in the X-ray pattern should slacken or even extinct as compared with the pattern from the heads with the continuous-type conformation. The formulas describing both cases are compared in this article. In general, the distinction between X-ray patterns is insignificant, but they could be appreciably different at some stages of conformation process (respective lever arm position depends on the type of discrete model). Consequently, one can with luck attempt to find out this difference using a high-sensitive equipment. △ Less

Submitted 29 February, 2016; originally announced February 2016.

Comments: Anyone who understands the principles of interference can easily check the results of very simple calculations in this text. However, the editors of a number of biophysical journals rejected the MS and refused to discuss details. I believe the result is important and stimulates an experiment. So, I would be sincerely grateful to anyone who could explain me why it was rejected

arXiv:1507.06303 [pdf]

Bond rupture mechanism enables to explain in block asymmetry of elaxation, force-velocity curve and the path of energy dissipation in muscle

Authors: Eugene V. Rosenfeld

Abstract: Bond rupture mechanism enables to explain in block asymmetry of elaxation, force-velocity curve and the path of energy dissipation in muscle Bond rupture mechanism enables to explain in block asymmetry of elaxation, force-velocity curve and the path of energy dissipation in muscle △ Less

Submitted 22 July, 2015; originally announced July 2015.

Comments: in Russian, 11 pages

arXiv:1401.5634 [pdf]

Relation between the charge/discharge processes of dust particles and the dynamics of dust clouds over the Moon surface

Authors: E. V. Rosenfeld, A. V. Zakharov

Abstract: It is shown that the appearance of lunar horizon glow and streamers observed above the lunar terminator described by the dynamic fountain mode byl Stubbs et al. (2006) requires a value of dust particles charge several orders exceed what they obtain on the Lunar surface. To obtain a sufficient charge due to a departure of photoelectrons separated submicron particles have to flow over the surface du… ▽ More It is shown that the appearance of lunar horizon glow and streamers observed above the lunar terminator described by the dynamic fountain mode byl Stubbs et al. (2006) requires a value of dust particles charge several orders exceed what they obtain on the Lunar surface. To obtain a sufficient charge due to a departure of photoelectrons separated submicron particles have to flow over the surface during tens of seconds or even several minutes. Therefore for emergence of dust streamers at the lunar sunrise it is necessary that dust particles do not lose the positive charge during a night. △ Less

Submitted 22 January, 2014; originally announced January 2014.

Comments: in Russian

arXiv:0903.1775 [pdf]

Mechanism of Canted Magnetic Structure Formation in the Absence of Spin-Orbital Interaction

Authors: E. V. Rosenfeld

Abstract: A simple exactly solvable model of canted magnetic structure appearance in the system of crystallographic and chemically equivalent atoms is proposed. The corresponding mechanism originates from the competition of intra- and interatomic exchange interactions. A simple exactly solvable model of canted magnetic structure appearance in the system of crystallographic and chemically equivalent atoms is proposed. The corresponding mechanism originates from the competition of intra- and interatomic exchange interactions. △ Less

Submitted 10 March, 2009; originally announced March 2009.

Comments: PACS nomber: 75.45.+j Macroscopic quantum phenomena in magnetic systems 7 pages, 4 igures

arXiv:q-bio/0703014 [pdf]

Coulomb Interaction as the Source of Muscle Force

Authors: E. V. Rosenfeld

Abstract: Myosin motor is the machine, which performs mechanical work in the course of adenosine triphosphate molecule hydrolysis and myosin head conformations accompanying this process. For displacement of individual fragments of large molecule relative to each other to arise and work to be performed, force must be born inside protein. What kind of interaction generates this force? Models based on Huxl… ▽ More Myosin motor is the machine, which performs mechanical work in the course of adenosine triphosphate molecule hydrolysis and myosin head conformations accompanying this process. For displacement of individual fragments of large molecule relative to each other to arise and work to be performed, force must be born inside protein. What kind of interaction generates this force? Models based on Huxley 1957 theory ascertain relations between chemical reactions rate constants and energies of crossbridge conformations. Nevertheless, understand in the framework of thermodynamics how myosin motor works in principle is impossible: it is smoothly heated device cyclically producing mechanical work (second law). Furthermore, in every working cycle myosin head captures and splits a single molecule. Hence, ordinary dynamic laws rather than stochastic laws govern this process. The simple mechanism of chemomechanical transduction is proposed. The two products of adenosine triphosphate hydrolysis, adenosine diphosphate and inorganic phosphate, have charges of the same sense and Coulomb interaction of these charges produces the force pushing backdoor and rotating converter domain. The velocity of filaments sliding becomes the principal parameter in the model and new mechanism of indirect interaction between the cross-bridges radically different from one suggested by Huxley and Simmons in 1971 appears. The working stroke duration is inversely proportional to the velocity now. Therefore Hill equation appears and the parameter values obtained are in reasonable agreement with experiment. △ Less

Submitted 6 March, 2007; originally announced March 2007.

Comments: 23 pages, 2 figures

arXiv:physics/0610060 [pdf, ps, other]

Is the Fermi field contact and isotropic?

Authors: E. V. Rosenfeld

Abstract: It is shown that the contribution to the induction which at an internal point of a spin density distribution is mathematically described as a local is virtually caused by the summing-up of the fields created by all elements of this distribution. Therefore, the proportionality coefficient between this contact (Fermi) field and magnetic moment density at the point of observation is equal to 8pi/3… ▽ More It is shown that the contribution to the induction which at an internal point of a spin density distribution is mathematically described as a local is virtually caused by the summing-up of the fields created by all elements of this distribution. Therefore, the proportionality coefficient between this contact (Fermi) field and magnetic moment density at the point of observation is equal to 8pi/3 only for spherically symmetrical s-shells. If the symmetry of spin density distribution lowers, the value of this coefficient becomes dependent on the spin direction. As a sequence, in low-symmetry crystals and molecules additional anisotropic contributions to the hyperfine field emerge. △ Less

Submitted 11 October, 2006; v1 submitted 10 October, 2006; originally announced October 2006.

Comments: 10 pages, 1 figure. In this case the output should be produced correctly

Showing 1–29 of 29 results for author: Rosenfeld, E