Showing 1–27 of 27 results for author: Michaud, J

  1. arXiv:2405.17420  [pdf, other]

    cs.LG

    Survival of the Fittest Representation: A Case Study with Modular Addition

    Authors: Xiaoman Delores Ding, Zifan Carl Guo, Eric J. Michaud, Ziming Liu, Max Tegmark

    Abstract: When a neural network can learn multiple distinct algorithms to solve a task, how does it "choose" between them during training? To approach this question, we take inspiration from ecology: when multiple species coexist, they eventually reach an equilibrium where some survive while others die out. Analogously, we suggest that a neural network at initialization contains many solutions (representati…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2405.14860  [pdf, other]

    cs.LG

    Not All Language Model Features Are Linear

    Authors: Joshua Engels, Isaac Liao, Eric J. Michaud, Wes Gurnee, Max Tegmark

    Abstract: Recent work has proposed the linear representation hypothesis: that language models perform computation by manipulating one-dimensional representations of concepts ("features") in activation space. In contrast, we explore whether some language model representations may be inherently multi-dimensional. We begin by developing a rigorous definition of irreducible multi-dimensional features based on w…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Code and data at https://github.com/JoshEngels/MultiDimensionalFeatures

  3. arXiv:2403.19647  [pdf, other]

    cs.LG cs.AI cs.CL

    Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

    Authors: Samuel Marks, Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller

    Abstract: We introduce methods for discovering and applying sparse feature circuits. These are causally implicated subnetworks of human-interpretable features for explaining language model behaviors. Circuits identified in prior work consist of polysemantic and difficult-to-interpret units like attention heads or neurons, rendering them unsuitable for many downstream applications. In contrast, sparse featur…

    Submitted 31 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Code and data at https://github.com/saprmarks/feature-circuits. Demonstration at https://feature-circuits.xyz

  4. arXiv:2402.11681  [pdf, other]

    cs.CL math.NA

    Opening the black box of language acquisition

    Authors: Jérôme Michaud, Anna Jon-and

    Abstract: Recent advances in large language models using deep learning techniques have renewed interest on how languages can be learned from data. However, it is unclear whether or how these models represent grammatical information from the learned languages. In addition, the models must be pre-trained on large corpora before they can be used. In this work, we propose an alternative, more transparent and co…

    Submitted 18 February, 2024; originally announced February 2024.

  5. arXiv:2402.05110  [pdf, other]

    cs.LG

    Opening the AI black box: program synthesis via mechanistic interpretability

    Authors: Eric J. Michaud, Isaac Liao, Vedang Lad, Ziming Liu, Anish Mudide, Chloe Loughridge, Zifan Carl Guo, Tara Rezaei Kheirkhah, Mateja Vukelić, Max Tegmark

    Abstract: We present MIPS, a novel method for program synthesis based on automated mechanistic interpretability of neural networks trained to perform the desired task, auto-distilling the learned algorithm into Python code. We test MIPS on a benchmark of 62 algorithmic tasks that can be learned by an RNN and find it highly complementary to GPT-4: MIPS solves 32 of them, including 13 that are not solved by G…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 24 pages

  6. arXiv:2312.13742  [pdf, ps, other]

    nucl-ex

    First measurement of the neutron-emission probability with a surrogate reaction in inverse kinematics at a heavy-ion storage ring

    Authors: M. Sguazzin, B. Jurado, J. Pibernat, J. A. Swartz, M. Grieser, J. Glorius, Yu. A. Litvinov, J. Adamczewski-Musch, P. Alfaurt, P. Ascher, L. Audouin, C. Berthelot, B. Blank, K. Blaum, B. Brückner, S. Dellmann, I. Dillmann, C. Domingo-Pardo, M. Dupuis, P. Erbacher, M. Flayol, O. Forstner, D. Freire-Fernández, M. Gerbaux, J. Giovinazzo, et al. (27 additional authors not shown)

    Abstract: Neutron-induced reaction cross sections of short-lived nuclei are imperative to understand the origin of heavy elements in stellar nucleosynthesis and for societal applications, but their measurement is extremely complicated due to the radioactivity of the targets involved. One way of overcoming this issue is to combine surrogate reactions with the unique possibilities offered by heavy-ion storage…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 8 pages and 5 figures

  7. arXiv:2307.15217  [pdf, other]

    cs.AI cs.CL cs.LG

    Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

    Authors: Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, et al. (7 additional authors not shown)

    Abstract: Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and rel…

    Submitted 11 September, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  8. arXiv:2303.13506  [pdf, other]

    cs.LG cond-mat.dis-nn

    The Quantization Model of Neural Scaling

    Authors: Eric J. Michaud, Ziming Liu, Uzay Girit, Max Tegmark

    Abstract: We propose the Quantization Model of neural scaling laws, explaining both the observed power law dropoff of loss with model and data size, and also the sudden emergence of new capabilities with scale. We derive this model from what we call the Quantization Hypothesis, where network knowledge and skills are "quantized" into discrete chunks (quanta). We show that when quanta are learned i…

    Submitted 13 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 24 pages, 18 figures, NeurIPS 2023

  9. arXiv:2303.06969  [pdf]

    physics.acc-ph

    Challenges in low losses and large acceptance ion beam transport

    Authors: F Osswald, E Traykov, T Durand, M Heine, J Michaud, J C Thomas

    Abstract: A prototype of ion beam transport module has been developed at the Institut Pluridisciplinaire Hubert Curien (IPHC) and used as a test bed to investigate key issues related to the efficient transport of ion beams. This includes the reduction of the beam losses, the increase of the acceptance, and the definition of the instrumentation necessary to evaluate the performances. An experiment was perfor…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.09611

  10. arXiv:2211.09611  [pdf]

    physics.acc-ph

    Green beam lines, a challenging concept

    Authors: F. Osswald, E. Traykov, T. Durand, M. Heine, J. Michaud, J. C. Thomas

    Abstract: Due to increasing environmental and economic constraints, optimization of ion beam transport and equipment design becomes essential. The future should be equipped with planet-friendly facilities, that is, solutions that reduce environmental impact and improve economic competitiveness. The tendency to increase the intensity of the current and the power of the beams obliges us and brings us to new c…

    Submitted 17 November, 2022; originally announced November 2022.

  11. Transverse emittance measurement in 2D and 4D performed on a Low Energy Beam Transport line: benchmarking and data analysis

    Authors: F Osswald, T Durand, M Heine, J Michaud, F Poirier, J C Thomas, E Traykov

    Abstract: 2D and 4D transverse phase-space of a low-energy ion-beam is measured with two of the most common emittance scanners. The article covers the description of the installation, the setup, the settings, the experiment and the benchmark of the two emittance meters. We compare the results from three series of measurements and present the advantages and drawbacks of the two systems. Coupling between phas…

    Submitted 9 November, 2022; originally announced November 2022.

  12. arXiv:2210.13447  [pdf, other]

    cs.LG physics.comp-ph

    Precision Machine Learning

    Authors: Eric J. Michaud, Ziming Liu, Max Tegmark

    Abstract: We explore unique considerations involved in fitting ML models to data with very high precision, as is often required for science applications. We empirically compare various function approximation methods and study how they scale with increasing parameters and data. We find that neural networks can often outperform classical approximation methods on high-dimensional examples, by auto-discovering…

    Submitted 24 October, 2022; originally announced October 2022.

  13. arXiv:2210.01117  [pdf, other]

    cs.LG cs.AI physics.data-an stat.ME stat.ML

    Omnigrok: Grokking Beyond Algorithmic Data

    Authors: Ziming Liu, Eric J. Michaud, Max Tegmark

    Abstract: Grokking, the unusual phenomenon for algorithmic datasets where generalization happens long after overfitting the training data, has remained elusive. We aim to understand grokking by analyzing the loss landscapes of neural networks, identifying the mismatch between training and test losses as the cause for grokking. We refer to this as the "LU mechanism" because training and test losses (against…

    Submitted 23 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  14. arXiv:2205.10343  [pdf, other]

    cs.LG cond-mat.dis-nn cond-mat.stat-mech cs.AI physics.class-ph

    Towards Understanding Grokking: An Effective Theory of Representation Learning

    Authors: Ziming Liu, Ouail Kitouni, Niklas Nolte, Eric J. Michaud, Max Tegmark, Mike Williams

    Abstract: We aim to understand grokking, a phenomenon where models generalize long after overfitting their training set. We present both a microscopic analysis anchored by an effective theory and a macroscopic analysis of phase diagrams describing learning performance across hyperparameters. We find that generalization originates from structured representations whose training dynamics and dependence on trai…

    Submitted 14 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted by NeurIPS 2022

  15. arXiv:2203.11214  [pdf, ps, other]

    physics.ins-det nucl-ex

    Status on the DESIR High Resolution Separator Commissioning

    Authors: J. Michaud, P. Alfaurt, A. Balana, B. Blank, L. Daudin, T. Kurtukian Nieto, B. Lachacinski, L. Serani, F. Varenne

    Abstract: Many nuclear reactions used to create radioactive isotopes for nuclear research produce, in addition to the isotope of interest, many contaminants, which are often produced in much larger amounts than the isotope of interest. Many installations using the ISOL approach are therefore equipped with high-resolution mass separators to remove at least isotopes with a different mass number. In the presen…

    Submitted 21 March, 2022; originally announced March 2022.

  16. arXiv:2012.05862  [pdf, other]

    cs.LG

    Understanding Learned Reward Functions

    Authors: Eric J. Michaud, Adam Gleave, Stuart Russell

    Abstract: In many real-world tasks, it is not possible to procedurally specify an RL agent's reward function. In such cases, a reward function must instead be learned from interacting with and observing humans. However, current techniques for reward learning may fail to produce reward functions which accurately reflect user preferences. Absent significant advances in reward learning, it is thus important to…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: Presented at Deep RL Workshop, NeurIPS 2020

  17. arXiv:2010.13871  [pdf, other]

    cs.LG cs.AI

    Examining the causal structures of deep neural networks using information theory

    Authors: Simon Mattsson, Eric J. Michaud, Erik Hoel

    Abstract: Deep Neural Networks (DNNs) are often examined at the level of their response to input, such as analyzing the mutual information between nodes and data sets. Yet DNNs can also be examined at the level of causation, exploring "what does what" within the layers of the network itself. Historically, analyzing the causal structure of DNNs has received less attention than understanding their responses t…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 14 pages, 8 figures

  18. arXiv:2009.12689  [pdf, other]

    astro-ph.IM

    Lunar Opportunities for SETI

    Authors: Eric J. Michaud, Andrew P. V. Siemion, Jamie Drew, S. Pete Worden

    Abstract: A radio telescope placed in lunar orbit, or on the surface of the Moon's farside, could be of great value to the Search for Extraterrestrial Intelligence (SETI). The advantage of such a telescope is that it would be shielded by the body of the Moon from terrestrial sources of radio frequency interference (RFI). While RFI can be identified and ignored by other fields of radio astronomy, the possibl…

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: 7 pages, submitted as a white paper for the National Academy of Sciences Planetary Science and Astrobiology Decadal Survey 2023-2032

  19. arXiv:1912.07881  [pdf, other]

    hep-ex nucl-ex physics.acc-ph

    Storage Ring to Search for Electric Dipole Moments of Charged Particles -- Feasibility Study

    Authors: F. Abusaif, A. Aggarwal, A. Aksentev, B. Alberdi-Esuain, A. Andres, A. Atanasov, L. Barion, S. Basile, M. Berz, C. Böhme, J. Böker, J. Borburgh, N. Canale, C. Carli, I. Ciepał, G. Ciullo, M. Contalbrigo, J. -M. De Conto, S. Dymov, O. Felden, M. Gaisser, R. Gebel, N. Giese, J. Gooding, K. Grigoryev, et al. (76 additional authors not shown)

    Abstract: The proposed method exploits charged particles confined as a storage ring beam (proton, deuteron, possibly $^3$He) to search for an intrinsic electric dipole moment (EDM) aligned along the particle spin axis. Statistical sensitivities could approach $10^{-29}$ e$\cdot$cm. The challenge will be to reduce systematic errors to similar levels. The ring will be adjusted to preserve the spin polarisatio…

    Submitted 25 June, 2021; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: 243 pages

    Report number: CERN Yellow Reports: Monographs, CERN-2021-003

  20. arXiv:1812.08535  [pdf, other]

    physics.acc-ph hep-ex

    Feasibility Study for an EDM Storage Ring

    Authors: F. Abusaif, A. Aggarwal, A. Aksentev, B. Alberdi-Esuain, L. Barion, S. Basile, M. Berz, M. Beyß, C. Böhme, J. Böker, J. Borburgh, C. Carli, I. Ciepał, G. Ciullo, M. Contalbrigo, J. -M. De Conto, S. Dymov, R. Engels, O. Felden, M. Gagoshidze, M. Gaisser, R. Gebel, N. Giese, K. Grigoryev, D. Grzonka, et al. (70 additional authors not shown)

    Abstract: This project exploits charged particles confined as a storage ring beam (proton, deuteron, possibly $^3$He) to search for an intrinsic electric dipole moment (EDM, $\vec d$) aligned along the particle spin axis. Statistical sensitivities can approach $10^{-29}$ e$\cdot$cm. The challenge will be to reduce systematic errors to similar levels. The ring will be adjusted to preserve the spin polarizati…

    Submitted 18 January, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

  21. arXiv:1801.08819  [pdf, ps, other]

    physics.soc-ph cs.SI

    Social Influence with Recurrent Mobility with multiple options

    Authors: Jérôme Michaud, Attila Szilva

    Abstract: In this paper, we discuss the possible generalizations of the Social Influence with Recurrent Mobility (SIRM) model developed in Phys. Rev. Lett. 112, 158701 (2014). Although the SIRM model worked approximately satisfying when US election was modelled, it has its limits: it has been developed only for two-party systems and can lead to unphysical behaviour when one of the parties has extreme vote s…

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: 10 pages, 6 figures

    Journal ref: Phys. Rev. E 97, 062313 (2018)

  22. arXiv:1606.08433  [pdf, ps, other]

    physics.soc-ph cond-mat.stat-mech

    Continuous time limits of the Utterance Selection Model

    Authors: Jérôme Michaud

    Abstract: In this paper, we derive new continuous time limits of the Utterance Selection Model (USM) for language change (Baxter et al., Phys. Rev. E 73, 046118, 2006). This is motivated by the fact that the Fokker-Planck continuous time limit derived in the original version of the USM is only valid for a small range of parameters. We investigate the consequences of relaxing these constraints on param…

    Submitted 5 January, 2017; v1 submitted 27 June, 2016; originally announced June 2016.

    Comments: 21 pages, 10 figures, accepted for publication Physical Review E

  23. arXiv:1606.04020  [pdf, other]

    math.NA astro-ph.SR physics.comp-ph

    The IDSA and the homogeneous sphere: Issues and possible improvements

    Authors: Jérôme Michaud

    Abstract: In this paper, we are concerned with the study of the Isotropic Diffusion Source Approximation (IDSA) of radiative transfer. After having recalled well-known limits of the radiative transfer equation, we present the IDSA and adapt it to the case of the homogeneous sphere. We then show that for this example the IDSA suffers from severe numerical diffic…

    Submitted 13 June, 2016; originally announced June 2016.

    Comments: 25 pages, 8 figures, accepted for publication in DCDS-S

    MSC Class: 65Z05; 35B40; 35Q85; 85A25; 41A25

  24. arXiv:1212.1623  [pdf, ps, other]

    math.AP astro-ph.SR math-ph

    Derivation of the Isotropic Diffusion Source Approximation (IDSA) for Supernova Neutrino Transport by Asymptotic Expansions

    Authors: Heiko Berninger, Emmanuel Frenod, Martin Gander, Mathias Liebendorfer, Jerome Michaud

    Abstract: We present Chapman–Enskog and Hilbert expansions applied to the $\mathcal{O}(v/c)$ Boltzmann equation for the radiative transfer of neutrinos in core-collapse supernovae. Based on the Legendre expansion of the scattering kernel for the collision integral truncated after the second term, we derive the diffusion limit for the Boltzmann equation by truncation of Chapman–Enskog or Hilbert expansions with…

    Submitted 6 August, 2013; v1 submitted 7 December, 2012; originally announced December 2012.

    Comments: SIAM Journal on Mathematical Analysis (2013) 0000-00000

  25. arXiv:1211.6901  [pdf, other]

    astro-ph.SR math.AP

    A Mathematical Description of the IDSA for Supernova Neutrino transport, its discretization and a comparison with a finite volume scheme for Boltzmann's Equation

    Authors: Heiko Berninger, Emmanuel Frenod, Martin Gander, Mathias Liebendörfer, Jérôme Michaud, Nicolas Vasset

    Abstract: In this paper we give an introduction to the Boltzmann equation for neutrino transport used in core collapse supernova models as well as a detailed mathematical description of the Isotropic Diffusion Source Approximation (IDSA). Furthermore, we present a numerical treatment of a reduced Boltzmann model problem based on time splitting and finite volumes and revise the discretization of the I…

    Submitted 29 November, 2012; originally announced November 2012.

  26. The Kolmogorov-Smirnov test for the CMB

    Authors: Mona Frommert, Ruth Durrer, Jérôme Michaud

    Abstract: We investigate the statistics of the cosmic microwave background using the Kolmogorov-Smirnov test. We show that, when we correctly de-correlate the data, the partition function of the Kolmogorov stochasticity parameter is compatible with the Kolmogorov distribution and, contrary to previous claims, the CMB data are compatible with Gaussian fluctuations with the correlation function given by stand…

    Submitted 1 December, 2011; v1 submitted 26 August, 2011; originally announced August 2011.

    Comments: Improved significance of the results (which remain unchanged) by using patches instead of ring segments in the analysis. Added sky maps of the Kolmogorov-parameter for original and de-correlated CMB map

    Journal ref: JCAP 1201 (2012) 009

  27. DUNE: The Dark Universe Explorer

    Authors: A. Refregier, O. Boulade, Y. Mellier, B. Milliard, R. Pain, J. Michaud, F. Safa, A. Amara, P. Astier, E. Barrelet, E. Bertin, S. Boulade, C. Cara, A. Claret, L. Georges, R. Grange, J. Guy, C. Koeck, L. Kroely, C. Magneville, N. Palanque-Delabrouille, N. Regnault, G. Smadja, C. Schimd, Z. Sun

    Abstract: Understanding the nature of Dark Matter and Dark Energy is one of the most pressing issues in cosmology and fundamental physics. The purpose of the DUNE (Dark UNiverse Explorer) mission is to study these two cosmological components with high precision, using a space-based weak lensing survey as its primary science driver. Weak lensing provides a measure of the distribution of dark matter in the…

    Submitted 3 October, 2006; originally announced October 2006.

    Comments: 12 LaTeX pages, including 7 figures and 2 tables. Procs. of SPIE symposium "Astronomical Telescopes and Instrumentation", Orlando, May 2006