Skip to main content

Showing 1–22 of 22 results for author: Ohana, R

.
  1. arXiv:2406.02585  [pdf, other

    cs.LG cs.AI stat.ML

    Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task

    Authors: Siavash Golkar, Alberto Bietti, Mariel Pettee, Michael Eickenberg, Miles Cranmer, Keiya Hirashima, Geraud Krawezik, Nicholas Lourie, Michael McCabe, Rudy Morel, Ruben Ohana, Liam Holden Parker, Bruno Régaldo-Saint Blancard, Kyunghyun Cho, Shirley Ho

    Abstract: Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem aimed at enhancing our understanding of Transformers in quantitative and scientific contexts. This task requires precise localization and computation within datas… ▽ More

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2402.19455  [pdf, other

    stat.ML astro-ph.CO cs.CV cs.LG eess.SP

    Listening to the Noise: Blind Denoising with Gibbs Diffusion

    Authors: David Heurtel-Depeiges, Charles C. Margossian, Ruben Ohana, Bruno Régaldo-Saint Blancard

    Abstract: In recent years, denoising problems have become intertwined with the development of deep generative models. In particular, diffusion models are trained like denoisers, and the distribution they model coincide with denoising priors in the Bayesian picture. However, denoising through diffusion-based posterior sampling requires the noise level and covariance to be known, preventing blind denoising. W… ▽ More

    Submitted 25 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: 12+9 pages, 7+5 figures, 1+1 tables; accepted to 2024 International Conference on Machine Learning; code: https://github.com/rubenohana/Gibbs-Diffusion

  3. arXiv:2310.16285  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.IM cs.LG

    Removing Dust from CMB Observations with Diffusion Models

    Authors: David Heurtel-Depeiges, Blakesley Burkhart, Ruben Ohana, Bruno Régaldo-Saint Blancard

    Abstract: In cosmology, the quest for primordial $B$-modes in cosmic microwave background (CMB) observations has highlighted the critical need for a refined model of the Galactic dust foreground. We investigate diffusion-based modeling of the dust foreground and its interest for component separation. Under the assumption of a Gaussian CMB with known cosmology (or covariance matrix), we show that diffusion m… ▽ More

    Submitted 11 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 5+6 pages, 2+3 figures, accepted at the NeurIPS 2023 workshop on "Machine Learning and the Physical Sciences" and selected for a spotlight talk

  4. arXiv:2310.03024  [pdf, other

    astro-ph.IM cs.AI cs.LG

    AstroCLIP: A Cross-Modal Foundation Model for Galaxies

    Authors: Liam Parker, Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation fro… ▽ More

    Submitted 14 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted in Monthly Notices of the Royal Astronomical Society, Presented at the NeurIPS 2023 AI4Science Workshop

  5. arXiv:2310.02994  [pdf, other

    cs.LG cs.AI stat.ML

    Multiple Physics Pretraining for Physical Surrogate Models

    Authors: Michael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Geraud Krawezik, Francois Lanusse, Mariel Pettee, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We introduce multiple physics pretraining (MPP), an autoregressive task-agnostic pretraining approach for physical surrogate modeling. MPP involves training large surrogate models to predict the dynamics of multiple heterogeneous physical systems simultaneously by learning features that are broadly useful across diverse physical tasks. In order to learn effectively in this setting, we introduce a… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  6. arXiv:2310.02989  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    xVal: A Continuous Number Encoding for Large Language Models

    Authors: Siavash Golkar, Mariel Pettee, Michael Eickenberg, Alberto Bietti, Miles Cranmer, Geraud Krawezik, Francois Lanusse, Michael McCabe, Ruben Ohana, Liam Parker, Bruno Régaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: Large Language Models have not yet been broadly adapted for the analysis of scientific datasets due in part to the unique difficulties of tokenizing numbers. We propose xVal, a numerical encoding scheme that represents any real number using just a single token. xVal represents a given real number by scaling a dedicated embedding vector by the number value. Combined with a modified number-inference… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages 7 figures. Supplementary: 5 pages 2 figures

  7. arXiv:2305.12988  [pdf, ps, other

    physics.optics

    Linear Optical Random Projections Without Holography

    Authors: Ruben Ohana, Daniel Hesslow, Daniel Brunner, Sylvain Gigan, Kilian Müller

    Abstract: We introduce a novel method to perform linear optical random projections without the need for holography. Our method consists of a computationally trivial combination of multiple intensity measurements to mitigate the information loss usually associated with the absolute-square non-linearity imposed by optical intensity measurements. Both experimental and numerical findings demonstrate that the re… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 7 pages, 4 figures

    Journal ref: Opt. Express 31, 25881-25888 (2023)

  8. arXiv:2305.07583  [pdf, other

    cs.LG math.OC

    MoMo: Momentum Models for Adaptive Learning Rates

    Authors: Fabian Schaipp, Ruben Ohana, Michael Eickenberg, Aaron Defazio, Robert M. Gower

    Abstract: Training a modern machine learning architecture on a new task requires extensive learning-rate tuning, which comes at a high computational cost. Here we develop new Polyak-type adaptive learning rates that can be used on top of any momentum method, and require less tuning to perform well. We first develop MoMo, a Momentum Model based adaptive learning rate for SGD-M (stochastic gradient descent wi… ▽ More

    Submitted 5 June, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    MSC Class: 90C53; 74S60; 90C06; 62L20; 68W20; 15B52; 65Y20; 68W40 ACM Class: G.1.6

  9. arXiv:2206.03230  [pdf, other

    stat.ML cs.LG

    Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances

    Authors: Ruben Ohana, Kimia Nadjahi, Alain Rakotomamonjy, Liva Ralaivola

    Abstract: The Sliced-Wasserstein distance (SW) is a computationally efficient and theoretically grounded alternative to the Wasserstein distance. Yet, the literature on its statistical properties -- or, more accurately, its generalization properties -- with respect to the distribution of slices, beyond the uniform measure, is scarce. To bring new contributions to this line of research, we leverage the PAC-B… ▽ More

    Submitted 31 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  10. arXiv:2202.02031  [pdf, other

    stat.ML cs.LG stat.CO

    Complex-to-Real Sketches for Tensor Products with Applications to the Polynomial Kernel

    Authors: Jonas Wacker, Ruben Ohana, Maurizio Filippone

    Abstract: Randomized sketches of a tensor product of $p$ vectors follow a tradeoff between statistical efficiency and computational acceleration. Commonly used approaches avoid computing the high-dimensional tensor product explicitly, resulting in a suboptimal dependence of $\mathcal{O}(3^p)$ in the embedding dimension. We propose a simple Complex-to-Real (CtR) modification of well-known sketches that repla… ▽ More

    Submitted 30 April, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 32 pages

  11. arXiv:2108.04217  [pdf, other

    cs.CV cs.LG

    ROPUST: Improving Robustness through Fine-tuning with Photonic Processors and Synthetic Gradients

    Authors: Alessandro Cappelli, Julien Launay, Laurent Meunier, Ruben Ohana, Iacopo Poli

    Abstract: Robustness to adversarial attacks is typically obtained through expensive adversarial training with Projected Gradient Descent. Here we introduce ROPUST, a remarkably simple and efficient method to leverage robust pre-trained models and further increase their robustness, at no cost in natural accuracy. Our technique relies on the use of an Optical Processing Unit (OPU), a photonic co-processor, an… ▽ More

    Submitted 6 July, 2021; originally announced August 2021.

    Comments: 12 pages, 7 figures

  12. arXiv:2107.11814  [pdf, other

    cs.AR cs.ET

    LightOn Optical Processing Unit: Scaling-up AI and HPC with a Non von Neumann co-processor

    Authors: Charles Brossollet, Alessandro Cappelli, Igor Carron, Charidimos Chaintoutis, Amélie Chatelain, Laurent Daudet, Sylvain Gigan, Daniel Hesslow, Florent Krzakala, Julien Launay, Safa Mokaadi, Fabien Moreau, Kilian Müller, Ruben Ohana, Gustave Pariente, Iacopo Poli, Elena Tommasone

    Abstract: We introduce LightOn's Optical Processing Unit (OPU), the first photonic AI accelerator chip available on the market for at-scale Non von Neumann computations, reaching 1500 TeraOPS. It relies on a combination of free-space optics with off-the-shelf components, together with a software API allowing a seamless integration within Python-based processing pipelines. We discuss a variety of use cases… ▽ More

    Submitted 25 July, 2021; originally announced July 2021.

    Comments: Proceedings IEEE Hot Chips 33, 2021

  13. arXiv:2106.03645  [pdf, other

    cs.LG cs.CR

    Photonic Differential Privacy with Direct Feedback Alignment

    Authors: Ruben Ohana, Hamlet J. Medina Ruiz, Julien Launay, Alessandro Cappelli, Iacopo Poli, Liva Ralaivola, Alain Rakotomamonjy

    Abstract: Optical Processing Units (OPUs) -- low-power photonic chips dedicated to large scale random projections -- have been used in previous work to train deep neural networks using Direct Feedback Alignment (DFA), an effective alternative to backpropagation. Here, we demonstrate how to leverage the intrinsic noise of optical random projections to build a differentially private DFA mechanism, making OPUs… ▽ More

    Submitted 25 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Journal ref: NeurIPS 2021

  14. arXiv:2104.14429  [pdf, other

    stat.ML cs.LG

    Photonic co-processors in HPC: using LightOn OPUs for Randomized Numerical Linear Algebra

    Authors: Daniel Hesslow, Alessandro Cappelli, Igor Carron, Laurent Daudet, Raphaël Lafargue, Kilian Müller, Ruben Ohana, Gustave Pariente, Iacopo Poli

    Abstract: Randomized Numerical Linear Algebra (RandNLA) is a powerful class of methods, widely used in High Performance Computing (HPC). RandNLA provides approximate solutions to linear algebra functions applied to large signals, at reduced computational costs. However, the randomization step for dimensionality reduction may itself become the computational bottleneck on traditional hardware. Leveraging near… ▽ More

    Submitted 7 May, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Add "This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 860830"

  15. Adversarial Robustness by Design through Analog Computing and Synthetic Gradients

    Authors: Alessandro Cappelli, Ruben Ohana, Julien Launay, Laurent Meunier, Iacopo Poli, Florent Krzakala

    Abstract: We propose a new defense mechanism against adversarial attacks inspired by an optical co-processor, providing robustness without compromising natural accuracy in both white-box and black-box settings. This hardware co-processor performs a nonlinear fixed random transformation, where the parameters are unknown and impossible to retrieve with sufficient precision for large enough dimensions. In the… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Journal ref: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing,

  16. arXiv:2011.12428  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG cs.NE

    Align, then memorise: the dynamics of learning with feedback alignment

    Authors: Maria Refinetti, Stéphane d'Ascoli, Ruben Ohana, Sebastian Goldt

    Abstract: Direct Feedback Alignment (DFA) is emerging as an efficient and biologically plausible alternative to the ubiquitous backpropagation algorithm for training deep neural networks. Despite relying on random feedback weights for the backward pass, DFA successfully trains state-of-the-art models such as Transformers. On the other hand, it notoriously fails to train convolutional networks. An understand… ▽ More

    Submitted 10 June, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: The accompanying code for this paper is available at https://github.com/sdascoli/dfa-dynamics

    Journal ref: Proceedings of the 38th International Conference on Machine Learning (ICML), PMLR 139, 2021

  17. Experimental Approach to Demonstrating Contextuality for Qudits

    Authors: Adel Sohbi, Ruben Ohana, Isabelle Zaquine, Eleni Diamanti, Damian Markham

    Abstract: We propose a method to experimentally demonstrate contextuality with a family of tests for qudits. The experiment we propose uses a qudit encoded in the path of a single photon and its temporal degrees of freedom. We consider the impact of noise on the effectiveness of these tests, taking the approach of ontologically faithful non-contextuality. In this approach, imperfections in the experimental… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

    Journal ref: Phys. Rev. A 103, 062220 (2021)

  18. arXiv:2006.07310  [pdf, other

    stat.ML cs.LG eess.SP

    Reservoir Computing meets Recurrent Kernels and Structured Transforms

    Authors: Jonathan Dong, Ruben Ohana, Mushegh Rafayelyan, Florent Krzakala

    Abstract: Reservoir Computing is a class of simple yet efficient Recurrent Neural Networks where internal weights are fixed at random and only a linear output layer is trained. In the large size limit, such random neural networks have a deep connection with kernel methods. Our contributions are threefold: a) We rigorously establish the recurrent kernel limit of Reservoir Computing and prove its convergence.… ▽ More

    Submitted 21 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems, v33, pages 16785--16796, 2020

  19. arXiv:2002.12503  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Impact of epitaxial strain on the topological-nontopological phase diagram and semimetallic behavior of InAs/GaSb composite quantum wells

    Authors: H. Irie, T. Akiho, F. Couëdo, R. Ohana, K. Suzuki, K. Onomitsu, K. Muraki

    Abstract: We study the influence of epitaxial strain on the electronic properties of InAs/GaSb composite quantum wells (CQWs), host structures for quantum spin Hall insulators, by transport measurements and eight-band $\mathbf{k\cdot p}$ calculations. Using different substrates and buffer layer structures for crystal growth, we prepare two types of samples with vastly different strain conditions. CQWs with… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 13 pages, 9 figures

    Journal ref: Phys. Rev. B 101, 075433 (2020)

  20. Kernel computations from large-scale random features obtained by Optical Processing Units

    Authors: Ruben Ohana, Jonas Wacker, Jonathan Dong, Sébastien Marmin, Florent Krzakala, Maurizio Filippone, Laurent Daudet

    Abstract: Approximating kernel functions with random features (RFs)has been a successful application of random projections for nonparametric estimation. However, performing random projections presents computational challenges for large-scale problems. Recently, a new optical hardware called Optical Processing Unit (OPU) has been developed for fast and energy-efficient computation of large-scale RFs in the a… ▽ More

    Submitted 2 December, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 5 pages, 3 figures, submitted to ICASSP 2020

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  21. arXiv:1202.6612  [pdf, ps, other

    math.NT

    A Database of Elliptic Curves over Q(sqrt(5)) - First Report

    Authors: Jonathan Bober, Alyson Deines, Ariah Klages-Mundt, Benjamin LeVeque, R. Andrew Ohana, Ashwath Rabindranath, Paul Sharaba, William Stein

    Abstract: We describe a tabulation of (conjecturally) modular elliptic curves over the field Q(sqrt(5)) up to the first curve of rank 2. Using an efficient implementation of an algorithm of Lassina Dembele, we computed tables of Hilbert modular forms of weight (2,2) over Q(sqrt(5)), and via a variety of methods we constructed corresponding elliptic curves, including (again, conjecturally) all elliptic curve… ▽ More

    Submitted 9 July, 2012; v1 submitted 29 February, 2012; originally announced February 2012.

    Comments: 17 pages

  22. arXiv:1007.2667  [pdf, ps, other

    math.NT math.CO

    On well-rounded sublattices of the hexagonal lattice

    Authors: Lenny Fukshansky, Daniel Moore, R. Andrew Ohana, Whitney Zeldow

    Abstract: We produce an explicit parameterization of well-rounded sublattices of the hexagonal lattice in the plane, splitting them into similarity classes. We use this parameterization to study the number, the greatest minimal norm, and the highest signal-to-noise ratio of well-rounded sublattices of the hexagonal lattice of a fixed index. This investigation parallels earlier work by Bernstein, Sloane, and… ▽ More

    Submitted 29 July, 2010; v1 submitted 15 July, 2010; originally announced July 2010.

    Comments: 21 pages (minor correction to the proof of Lemma 2.1); to appear in Discrete Mathematics

    MSC Class: Primary: 11H31; 52C15; Secondary: 05B40; 11E45

    Journal ref: Discrete Mathematics, vol. 310 no. 23 (2010), pg. 3287--3302