
Showing 1–50 of 432 results for author: Wolf, T

  1. arXiv:2407.00424  [pdf, other]

    nlin.SI math-ph nlin.PS

    Exact solitary wave solutions for a coupled gKdV-Schrodinger system by a new ODE reduction method

    Authors: Stephen C. Anco, James Hornick, Sicheng Zhao, Thomas Wolf

    Abstract: A new method is developed for finding exact solitary wave solutions of a generalized Korteweg-de Vries equation with p-power nonlinearity coupled to a linear Schrödinger equation arising in many different physical applications. This method yields 22 solution families, with p=1,2,3,4. No solutions for p>1 were known previously in the literature. For p=1, four of the solution families contain bright…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 45 pages

  2. arXiv:2406.17557  [pdf, other]

    cs.CL

    The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

    Authors: Guilherme Penedo, Hynek Kydlíček, Loubna Ben allal, Anton Lozhkov, Margaret Mitchell, Colin Raffel, Leandro Von Werra, Thomas Wolf

    Abstract: The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for state-of-the-art open LLMs like Llama 3 and Mixtral are not publicly available and very little is known about how they were created. In this work, we introduce FineWeb, a 15-trillion token dataset derived from 96 Common Crawl snapshots that produ…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.13083  [pdf, other]

    physics.ins-det

    Design and Performance of a Magnetic Bottle Electron Spectrometer for High-Energy Photoelectron Spectroscopy

    Authors: Kurtis Borne, Jordan T ONeal, Jun Wang, Erik Isele, Razib Obaid, Nora Berrah, Xinxin Cheng, Philip H Bucksbaum, Justin James, Andrei Kamalov, Kirk A Larsen, Xiang Li, Ming-Fu Lin, Yusong Liu, Agostino Marinelli, Adam Summers, Emily Thierstein, Thomas Wolf, Daniel Rolles, Peter Walter, James P Cryan, Taran Driver

    Abstract: We describe the design and performance of a magnetic bottle electron spectrometer (MBES) for high-energy electron spectroscopy. Our design features a $\sim 2$ m long electron drift tube and electrostatic retardation lens, achieving sub-electronvolt (eV) electron kinetic energy resolution for high energy (several hundred eV) electrons with close to $4\pi$ collection efficiency. A segmented anode…

    Submitted 4 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.10709  [pdf, other]

    cond-mat.str-el

    Intraband collective excitations in fractional Chern insulators are dark

    Authors: Tobias M. R. Wolf, Yung-Chun Chao, Allan H. MacDonald, Jung Jung Su

    Abstract: The low-energy collective excitations of semiconductors and insulators often couple strongly to light, allowing them to be probed optically. We argue here that in fractional Chern insulators intra-band collective excitations are dark in the sense that they couple anomalously weakly to light. This conclusion is based on a relationship between ideal quantum geometry and the structure factor of a Che…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Main text: 5 pages, 3 figures, Supmat: 5 pages, 1 figure; Comments are welcome

  5. arXiv:2406.09528  [pdf, other]

    astro-ph.EP astro-ph.SR

    JWST/NIRCam 4-5 $μ$m Imaging of the Giant Planet AF Lep b

    Authors: Kyle Franson, William O. Balmer, Brendan P. Bowler, Laurent Pueyo, Yifan Zhou, Emily Rickman, Zhoujian Zhang, Sagnick Mukherjee, Tim D. Pearce, Daniella C. Bardalez Gagliuffi, Lauren I. Biddle, Timothy D. Brandt, Rachel Bowens-Rubin, Justin R. Crepp, James W. Davidson, Jr., Jacqueline Faherty, Christian Ginski, Elliott P. Horch, Marvin Morgan, Caroline V. Morley, Marshall D. Perrin, Aniket Sanghi, Maissa Salama, Christopher A. Theissen, Quang H. Tran , et al. (1 additional author not shown)

    Abstract: With a dynamical mass of $3 \, M_\mathrm{Jup}$, the recently discovered giant planet AF Lep b is the lowest-mass imaged planet with a direct mass measurement. Its youth and spectral type near the L/T transition make it a promising target to study the impact of clouds and atmospheric chemistry at low surface gravities. In this work, we present JWST/NIRCam imaging of AF Lep b. Across two epochs, we…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 17 pages, 4 figures, submitted to ApJL

  6. arXiv:2405.14693  [pdf, ps, other]

    astro-ph.EP astro-ph.IM physics.ao-ph physics.geo-ph

    Interpolation and synthesis of sparse samples in exoplanet atmospheric modeling

    Authors: Jacob Haqq-Misra, Eric T. Wolf, Thomas J. Fauchez, Ravi K. Kopparapu

    Abstract: This paper highlights methods from geostatistics that are relevant to the interpretation, intercomparison, and synthesis of atmospheric model data, with a specific application to exoplanet atmospheric modeling. Climate models are increasingly used to study theoretical and observational properties of exoplanets, which include a hierarchy of models ranging from fast and idealized models to those tha…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in the Planetary Science Journal

    Journal ref: PSJ (2024) 5: 140

  7. arXiv:2405.08074  [pdf]

    cond-mat.mes-hall cond-mat.str-el physics.optics

    Optical Imaging of Flavor Order in Flat Band Graphene

    Authors: Tian Xie, Tobias M. Wolf, Siyuan Xu, Zhiyuan Cui, Richen Xiong, Yunbo Ou, Patrick Hays, Ludwig F Holleis, Yi Guo, Owen I Sheekey, Caitlin Patterson, Trevor Arp, Kenji Watanabe, Takashi Taniguchi, Seth Ariel Tongay, Andrea F Young, Allan H. MacDonald, Chenhao Jin

    Abstract: Spin and valley flavor polarization plays a central role in the many-body physics of flat band graphene, with fermi surface reconstructions often accompanied by quantized anomalous Hall and superconducting state observed in a variety of experimental systems. Here we describe an optical technique that sensitively and selectively detects flavor textures via the exciton response of a proximal transit…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 29 pages, 4 figures, with supplementary materials

  8. Impact of Planetary Parameters on Water Clouds Microphysics

    Authors: Huanzhou Yang, Thaddeus D. Komacek, Owen B. Toon, Eric T. Wolf, Tyler D. Robinson, Caroline Chael, Dorian S. Abbot

    Abstract: Potentially habitable exoplanets are targets of great interest for the James Webb Space Telescope and upcoming mission concepts such as the Habitable Worlds Observatory. Clouds strongly affect climate and habitability, but predicting their properties is difficult. In Global Climate Models (GCMs), especially those aiming at simulating Earth, cloud microphysics is often crudely approximated by assum…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 17 pages, 10 figures

    Journal ref: Huanzhou Yang et al 2024 ApJ 966 152

  9. arXiv:2405.03438  [pdf, other]

    cond-mat.mtrl-sci

    Anomalous Nernst effect in the noncollinear antiferromagnet Mn$_5$Si$_3$

    Authors: Christoph Sürgers, Gerda Fischer, Warlley H. Campos, Anna Birk Hellenes, Libor Šmejkal, Jairo Sinova, Michael Merz, Thomas Wolf, Wolfgang Wernsdorfer

    Abstract: Investigating the off-diagonal components of the conductivity and thermoelectric tensor of materials hosting complex antiferromagnetic structures has become a viable method to reveal the effects of topology and chirality on the electronic transport in these systems. In this respect, Mn$_5$Si$_3$ is an interesting metallic compound that exhibits several antiferromagnetic phases below 100 K with dif…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 9 pages, 6 figures

  10. arXiv:2404.06253  [pdf, other]

    cs.CV

    From Barlow Twins to Triplet Training: Differentiating Dementia with Limited Data

    Authors: Yitong Li, Tom Nuno Wolf, Sebastian Pölsterl, Igor Yakushev, Dennis M. Hedderich, Christian Wachinger

    Abstract: Differential diagnosis of dementia is challenging due to overlapping symptoms, with structural magnetic resonance imaging (MRI) being the primary method for diagnosis. Despite the clinical value of computer-aided differential diagnosis, research has been limited, mainly due to the absence of public datasets that contain diverse types of dementia. This leaves researchers with small in-house dataset…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at MIDL 2024

  11. arXiv:2403.14878  [pdf, other]

    hep-ex physics.ins-det

    Offline tagging of radon-induced backgrounds in XENON1T and applicability to other liquid xenon detectors

    Authors: E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, G. Bruno, R. Budnik, T. K. Bui, J. M. R. Cardoso, A. P. Cimental Chavez, A. P. Colijn, J. Conrad , et al. (142 additional authors not shown)

    Abstract: This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity…

    Submitted 19 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 17 pages, 19 figures

  12. arXiv:2403.01045  [pdf, other]

    physics.chem-ph

    Unexpected hydrogen dissociation in thymine: predictions from a novel coupled cluster theory

    Authors: Eirik F. Kjønstad, O. Jonathan Fajen, Alexander C. Paul, Sara Angelico, Dennis Mayer, Markus Gühr, Thomas J. A. Wolf, Todd J. Martínez, Henrik Koch

    Abstract: The fate of thymine upon excitation by ultraviolet radiation has been the subject of intense debate over the past three decades. Today, it is widely believed that its ultrafast excited state decay stems from a radiationless transition from the bright $ππ^*$ state to a dark $nπ^*$ state. However, conflicting theoretical predictions have made the experimental data difficult to interpret. Here we sim…

    Submitted 7 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 42 pages, 23 figures

  13. arXiv:2402.19173  [pdf, other]

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…

    Submitted 29 February, 2024; originally announced February 2024.

  14. arXiv:2402.17685  [pdf, other]

    physics.chem-ph

    Attosecond X-ray Chronoscopy of Core-level Photoemission

    Authors: Jia-Bao Ji, Zhaoheng Guo, Taran Driver, Cynthia S. Trevisan, David Cesar, Xinxin Cheng, Joseph Duris, Paris L. Franz, James Glownia, Xiaochun Gong, Daniel Hammerland, Meng Han, Saijoscha Heck, Matthias Hoffmann, Andrei Kamalov, Kirk A. Larsen, Xiang Li, Ming-Fu Lin, Yuchen Liu, C. William McCurdy, Razib Obaid, Jordan T. ONeal, Thomas N. Rescigno, River R. Robles, Nicholas Sudar , et al. (10 additional authors not shown)

    Abstract: Attosecond photoemission or photoionization delays are a unique probe of the structure and the electronic dynamics of matter. However, spectral congestion and spatial delocalization of valence electron wave functions set fundamental limits to the complexity of systems that can be studied and the information that can be retrieved, respectively. Using attosecond X-ray pulses from LCLS, we demonstrat…

    Submitted 8 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  15. arXiv:2402.12764  [pdf, other]

    physics.atom-ph

    Attosecond Delays in X-ray Molecular Ionization

    Authors: Taran Driver, Miles Mountney, Jun Wang, Lisa Ortmann, Andre Al-Haddad, Nora Berrah, Christoph Bostedt, Elio G. Champenois, Louis F. DiMauro, Joseph Duris, Douglas Garratt, James M. Glownia, Zhaoheng Guo, Daniel Haxton, Erik Isele, Igor Ivanov, Jiabao Ji, Andrei Kamalov, Siqi Li, Ming-Fu Lin, Jon P. Marangos, Razib Obaid, Jordan T. O'Neal, Philipp Rosenberger, Niranjan H. Shivaram , et al. (12 additional authors not shown)

    Abstract: The photoelectric effect is not truly instantaneous, but exhibits attosecond delays that can reveal complex molecular dynamics. Sub-femtosecond duration light pulses provide the requisite tools to resolve the dynamics of photoionization. Accordingly, the past decade has produced a large volume of work on photoionization delays following single photon absorption of an extreme ultraviolet (XUV) phot…

    Submitted 20 February, 2024; originally announced February 2024.

  16. arXiv:2402.10446  [pdf, other]

    physics.ins-det astro-ph.IM hep-ex

    The XENONnT Dark Matter Experiment

    Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, M. Balata, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui , et al. (170 additional authors not shown)

    Abstract: The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 32 pages, 19 figures

  17. arXiv:2401.15250  [pdf, other]

    physics.acc-ph physics.atom-ph

    Experimental Demonstration of Attosecond Pump-Probe Spectroscopy with an X-ray Free-Electron Laser

    Authors: Zhaoheng Guo, Taran Driver, Sandra Beauvarlet, David Cesar, Joseph Duris, Paris L. Franz, Oliver Alexander, Dorian Bohler, Christoph Bostedt, Vitali Averbukh, Xinxin Cheng, Louis F. DiMauro, Gilles Doumy, Ruaridh Forbes, Oliver Gessner, James M. Glownia, Erik Isele, Andrei Kamalov, Kirk A. Larsen, Siqi Li, Xiang Li, Ming-Fu Lin, Gregory A. McCracken, Razib Obaid, Jordan T. ONeal , et al. (25 additional authors not shown)

    Abstract: Pump-probe experiments with sub-femtosecond resolution are the key to understanding electronic dynamics in quantum systems. Here we demonstrate the generation and control of sub-femtosecond pulse pairs from a two-colour X-ray free-electron laser (XFEL). By measuring the delay between the two pulses with an angular streaking diagnostic, we characterise the group velocity of the XFEL and demonstrate…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 55 pages, main manuscript (5 figures) + supplementary materials (25 figures), 30 figures total. Submitted to Nature Photonics

  18. Gate-tunable topological phases in superlattice modulated bilayer graphene

    Authors: Yongxin Zeng, Tobias M. R. Wolf, Chunli Huang, Nemin Wei, Sayed Ali Akbar Ghorashi, Allan H. MacDonald, Jennifer Cano

    Abstract: Superlattice potential modulation can produce flat minibands in Bernal-stacked bilayer graphene. In this work we study how band topology and interaction-induced symmetry-broken phases in this system are controlled by tuning the displacement field and the shape and strength of the superlattice potential. We use an analytic perturbative analysis to demonstrate that topological flat bands are favored…

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: Phys. Rev. B 109, 195406 (2024)

  19. arXiv:2312.09783  [pdf, other]

    cs.LG cs.CV

    Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning

    Authors: Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann, Sebastian Pölsterl, Christian Wachinger

    Abstract: Explaining predictions of black-box neural networks is crucial when applied to decision-critical tasks. Thus, attribution maps are commonly used to identify important image regions, despite prior work showing that humans prefer explanations based on similar examples. To this end, ProtoPNet learns a set of class-representative feature vectors (prototypes) for case-based reasoning. During inference,…

    Submitted 19 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: To be published in proceedings of AAAI Conference on Artificial Intelligence

  20. arXiv:2312.06527  [pdf, other]

    cs.AI cs.CE

    Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models

    Authors: Theodore Wolf, Nantas Nardelli, John Shawe-Taylor, Maria Perez-Ortiz

    Abstract: Governments around the world aspire to ground decision-making on evidence. Many of the foundations of policy making - e.g. sensing patterns that relate to societal needs, developing evidence-based programs, forecasting potential outcomes of policy changes, and monitoring effectiveness of policy programs - have the potential to benefit from the use of large-scale datasets or simulations together wi…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Published at NeurIPS'23 Workshop on Tackling Climate Change with Machine Learning

  21. arXiv:2312.06511  [pdf, other]

    cond-mat.str-el

    Paradigm for finding d-electron heavy fermions: the case of Cr-doped CsFe$_2$As$_2$

    Authors: Matteo Crispino, Pablo Villar Arribi, Anmol Shukla, Frédéric Hardy, Amir-Abbas Haghighirad, Thomas Wolf, Rolf Heid, Christoph Meingast, Tommaso Gorni, Adolfo Avella, Luca de' Medici

    Abstract: We define a general strategy for finding new heavy-fermionic materials without rare-earth elements: doping a Hund metal with pronounced orbital-selective correlations towards half-filling. We argue that in general band structures a possible orbital-selective Mott transition is frustrated by inter-orbital hopping into heavy-fermion behaviour - where d-orbitals provide both the heavy and the light e…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, 8 figures

  22. arXiv:2312.03671  [pdf, other]

    astro-ph.IM astro-ph.EP cs.LG eess.IV

    Direct Exoplanet Detection Using Deep Convolutional Image Reconstruction (ConStruct): A New Algorithm for Post-Processing High-Contrast Images

    Authors: Trevor N. Wolf, Brandon A. Jones, Brendan P. Bowler

    Abstract: We present a novel machine-learning approach for detecting faint point sources in high-contrast adaptive optics imaging datasets. The most widely used algorithms for primary subtraction aim to decouple bright stellar speckle noise from planetary signatures by subtracting an approximation of the temporally evolving stellar noise from each frame in an imaging sequence. Our approach aims to improve t…

    Submitted 6 December, 2023; originally announced December 2023.

  23. arXiv:2311.12983  [pdf, other]

    cs.CL cs.AI

    GAIA: a benchmark for General AI Assistants

    Authors: Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann LeCun, Thomas Scialom

    Abstract: We introduce GAIA, a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency. GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human r…

    Submitted 21 November, 2023; originally announced November 2023.

  24. arXiv:2311.12482  [pdf]

    physics.chem-ph

    Monitoring the evolution of relative product populations at early times during a photochemical reaction

    Authors: Joao Pedro Figueira Nunes, Lea Maria Ibele, Shashank Pathak, Andrew R. Attar, Surjendu Bhattacharyya, Rebecca Boll, Kurtis Borne, Martin Centurion, Benjamin Erk, Ming-Fu Lin, Ruaridh J. G. Forbes, Nate Goff, Christopher S. Hansen, Matthias Hoffmann, David M. P. Holland, Rebecca A. Ingle, Duan Luo, Sri Bhavya Muvva, Alex Reid, Arnaud Rouzée, Artem Rudenko, Sajib Kumar Saha, Xiaozhe Shen, Anbu Selvam Venkatachalam, Xijie Wang , et al. (9 additional authors not shown)

    Abstract: Identifying multiple rival reaction products and transient species formed during ultrafast photochemical reactions and determining their time-evolving relative populations are key steps towards understanding and predicting photochemical outcomes. Yet, most contemporary ultrafast studies struggle with clearly identifying and quantifying competing molecular structures/species amongst the emerging re…

    Submitted 21 November, 2023; originally announced November 2023.

  25. arXiv:2311.11449  [pdf, other]

    cond-mat.str-el

    Quasi-boson approximation yields accurate correlation energy in the 2D electron gas

    Authors: Tobias M. R. Wolf, Chunli Huang

    Abstract: We report the successful adaptation of the quasi-boson approximation, a technique traditionally employed in nuclear physics, to the analysis of the two-dimensional electron gas. We show that the correlation energy estimated from this approximation agrees closely with the results obtained from quantum Monte Carlo simulations. Our methodology comprehensively incorporates the exchange self-energy, di…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 7 pages, 4 figures

  26. arXiv:2311.05640  [pdf, other]

    cs.CL

    FinGPT: Large Generative Models for a Small Language

    Authors: Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Le Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo

    Abstract: Large language models (LLMs) excel in many tasks in NLP and beyond, but most open models have very limited coverage of smaller languages and LLM work tends to focus on languages where nearly unlimited data is available for pretraining. In this work, we study the challenges of creating LLMs for Finnish, a language spoken by less than 0.1% of the world population. We compile an extensive dataset of…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 17 pages (10 main), 7 figures, 5 tables

  27. arXiv:2310.16944  [pdf, other]

    cs.LG cs.CL

    Zephyr: Direct Distillation of LM Alignment

    Authors: Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf

    Abstract: We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on larger models significantly improves task accuracy; however, these models are unaligned, i.e. they do not respond well to natural prompts. To distill this property, we experiment with the use of preference data from AI Feedback (AIF). Start…

    Submitted 25 October, 2023; originally announced October 2023.

  28. arXiv:2309.11996  [pdf, other]

    hep-ex physics.ins-det

    Design and performance of the field cage for the XENONnT experiment

    Authors: E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, D. Cichon , et al. (139 additional authors not shown)

    Abstract: The precision in reconstructing events detected in a dual-phase time projection chamber depends on an homogeneous and well understood electric field within the liquid target. In the XENONnT TPC the field homogeneity is achieved through a double-array field cage, consisting of two nested arrays of field shaping rings connected by an easily accessible resistor chain. Rather than being connected to t…

    Submitted 21 September, 2023; originally announced September 2023.

    Journal ref: Eur. Phys. J. C 84, 138 (2024)

  29. arXiv:2308.03996  [pdf, other]

    physics.chem-ph

    Investigating dissociation pathways of nitrobenzene via mega-electron-volt ultrafast electron diffraction

    Authors: Kareem Hegazy, James Cryan, Renkai Li, Ming-Fu Lin, Brian Moore, Pedro Nunes, Xiaozhe Shen, Stephen Weathersby, Jie Yang, Xijie Wang, Thomas Wolf

    Abstract: As the simplest nitroaromatic compound, nitrobenzene is an interesting model system to explore the rich photochemistry of nitroaromatic compounds. Previous measurements of nitrobenzene's photochemical dynamics have probed structural and electronic properties, which, at times, paint a convoluted and sometimes contradictory description of the photochemical landscape. A sub-picosecond structural prob…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 5 pages, 3 figures, and 1 table

  30. arXiv:2306.16340  [pdf, other]

    physics.ins-det

    Cosmogenic background simulations for the DARWIN observatory at different underground locations

    Authors: M. Adrover, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, B. Antunovic, E. Aprile, M. Babicz, D. Bajpai, E. Barberio, L. Baudis, M. Bazyk, N. Bell, L. Bellagamba, R. Biondi, Y. Biondi, A. Bismark, C. Boehm, A. Breskin, E. J. Brookes, A. Brown, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso , et al. (158 additional authors not shown)

    Abstract: Xenon dual-phase time projections chambers (TPCs) have proven to be a successful technology in studying physical phenomena that require low-background conditions. With 40t of liquid xenon (LXe) in the TPC baseline design, DARWIN will have a high sensitivity for the detection of particle dark matter, neutrinoless double beta decay ($0νββ$), and axion-like particles (ALPs). Although cosmic muons are…

    Submitted 28 June, 2023; originally announced June 2023.

  31. arXiv:2306.11871  [pdf, other]

    hep-ex physics.ins-det

    Search for events in XENON1T associated with Gravitational Waves

    Authors: XENON Collaboration, E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso , et al. (138 additional authors not shown)

    Abstract: We perform a blind search for particle signals in the XENON1T dark matter detector that occur close in time to gravitational wave signals in the LIGO and Virgo observatories. No particle signal is observed in the nuclear recoil, electronic recoil, CE$ν$NS, and S2-only channels within $\pm$ 500 seconds of observations of the gravitational wave signals GW170104, GW170729, GW170817, GW170818, and GW1…

    Submitted 27 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  32. arXiv:2306.03890  [pdf, other]

    cond-mat.supr-con

    Bosonic excitation spectra of superconducting $\mathrm{Bi_2Sr_2CaCu_2O_{8+δ}}$ and $\mathrm{YBa_2Cu_3O_{6+x}}$ extracted from scanning tunneling spectra

    Authors: Thomas Gozlinski, Mirjam Henn, Thomas Wolf, Matthieu Le Tacon, Jörg Schmalian, Wulf Wulfhekel

    Abstract: A detailed interpretation of scanning tunneling spectra obtained on unconventional superconductors enables one to gain information on the pairing boson. Decisive for this approach are inelastic tunneling events. Due to the lack of momentum conservation in tunneling from or to the sharp tip, those are enhanced in the geometry of a scanning tunneling microscope compared to planar tunnel junctions. T…

    Submitted 6 June, 2023; originally announced June 2023.

    Journal ref: Journal of Physics: Condensed Matter 36, 175601 (2024)

  33. arXiv:2305.16264  [pdf, other]

    cs.CL cs.AI cs.LG

    Scaling Data-Constrained Language Models

    Authors: Niklas Muennighoff, Alexander M. Rush, Boaz Barak, Teven Le Scao, Aleksandra Piktus, Nouamane Tazi, Sampo Pyysalo, Thomas Wolf, Colin Raffel

    Abstract: The current trend of scaling language models involves increasing both parameter count and training dataset size. Extrapolating this trend suggests that training dataset size may soon be limited by the amount of text data available on the internet. Motivated by this limit, we investigate scaling language models in data-constrained regimes. Specifically, we run a large set of experiments varying the…

    Submitted 25 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 50 pages (9 main), 39 figures, 15 tables

  34. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu , et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  35. arXiv:2305.05169  [pdf]

    physics.ins-det physics.optics

    A compact single-shot soft X-ray photon spectrometer for free electron laser diagnostics

    Authors: Kirk A. Larsen, Kurtis Borne, Razib Obaid, Andrei Kamalov, Yusong Liu, Xinxin Cheng, Justin James, Taran Driver, Kenan Li, Yanwei Liu, Anne Sakdinawat, Christian David, Thomas J. A. Wolf, James Cryan, Peter Walter, Ming-Fu Lin

    Abstract: The photon spectrum from free-electron laser (FEL) light sources offers valuable information in time-resolved experiments and machine optimization in the spectral and temporal domains. We have developed a compact single-shot photon spectrometer to diagnose soft X-ray spectra. The spectrometer consists of an array of off-axis Fresnel zone plates (FZP) that act as transmission-imaging gratings, a Ce…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 14 pages, 5 figures, 1 table

  36. arXiv:2304.10931  [pdf, other]

    hep-ex physics.ins-det

    Searching for Heavy Dark Matter near the Planck Mass with XENON1T

    Authors: E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, D. Cichon, et al. (142 additional authors not shown)

    Abstract: Multiple viable theoretical models predict heavy dark matter particles with a mass close to the Planck mass, a range relatively unexplored by current experimental measurements. We use 219.4 days of data collected with the XENON1T experiment to conduct a blind search for signals from Multiply-Interacting Massive Particles (MIMPs). Their unique track signature allows a targeted analysis with only 0.…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 7 pages, 6 figures

    Journal ref: Phys. Rev. Lett. 130, 261002 (2023)

  37. arXiv:2304.08332  [pdf, other]

    math.NA

    Multiscale hierarchical decomposition methods for ill-posed problems

    Authors: Stefan Kindermann, Elena Resmerita, Tobias Wolf

    Abstract: The Multiscale Hierarchical Decomposition Method (MHDM) was introduced as an iterative method for total variation regularization, with the aim of recovering details at various scales from images corrupted by additive or multiplicative noise. Given its success beyond image restoration, we extend the MHDM iterates in order to solve larger classes of linear ill-posed problems in Banach spaces. Thus,…

    Submitted 27 September, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  38. Detector signal characterization with a Bayesian network in XENONnT

    Authors: XENON Collaboration, E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, et al. (142 additional authors not shown)

    Abstract: We developed a detector signal characterization model based on a Bayesian network trained on the waveform attributes generated by a dual-phase xenon time projection chamber. By performing inference on the model, we produced a quantitative metric of signal characterization and demonstrate that this metric can be used to determine whether a detector signal is sourced from a scintillation or an ioniz…

    Submitted 26 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 11 pages, 8 figures

    Journal ref: Phys. Rev. D 108, 012016 (2023)

  39. arXiv:2303.17350  [pdf, other]

    cond-mat.mes-hall cond-mat.str-el

    Partial condensation of mobile excitons in graphene multilayers

    Authors: Igor V. Blinov, Chunli Huang, Nemin Wei, Qin Wei, Tobias Wolf, Allan H. MacDonald

    Abstract: At a large displacement field, in rhombohedral and Bernal-stacked graphene a normal paramagnetic state transitions to a correlated state. Recent experiments showed that such systems have several phase transitions as a function of the carrier density. The phase adjacent to a paramagnetic state has anomalously high resistance and reduced degeneracy of the Fermi sea. We show that both phenomena can be…

    Submitted 30 March, 2023; originally announced March 2023.

  40. arXiv:2303.14729  [pdf, other]

    hep-ex astro-ph.CO astro-ph.IM hep-ph physics.ins-det

    First Dark Matter Search with Nuclear Recoils from the XENONnT Experiment

    Authors: XENON Collaboration, E. Aprile, K. Abe, F. Agostini, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, et al. (141 additional authors not shown)

    Abstract: We report on the first search for nuclear recoils from dark matter in the form of weakly interacting massive particles (WIMPs) with the XENONnT experiment which is based on a two-phase time projection chamber with a sensitive liquid xenon mass of $5.9$ t. During the approximately 1.1 tonne-year exposure used for this search, the intrinsic $^{85}$Kr and $^{222}$Rn concentrations in the liquid targe…

    Submitted 5 August, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Limit points are included in the submission file

    Journal ref: Phys. Rev. Lett. 131, 041003 (2023)

  41. arXiv:2303.07717  [pdf, other]

    cs.CV

    HALOS: Hallucination-free Organ Segmentation after Organ Resection Surgery

    Authors: Anne-Marie Rickmann, Murong Xu, Tom Nuno Wolf, Oksana Kovalenko, Christian Wachinger

    Abstract: The wide range of research in deep learning-based medical image segmentation pushed the boundaries in a multitude of applications. A clinically relevant problem that received less attention is the handling of scans with irregular anatomy, e.g., after organ resection. State-of-the-art segmentation models often lead to organ hallucinations, i.e., false-positive predictions of organs, which cannot be…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: To be published in proceedings of Information Processing In Medical Imaging (IPMI) 2023

  42. arXiv:2303.07125  [pdf, other]

    cs.LG cs.CV

    Don't PANIC: Prototypical Additive Neural Network for Interpretable Classification of Alzheimer's Disease

    Authors: Tom Nuno Wolf, Sebastian Pölsterl, Christian Wachinger

    Abstract: Alzheimer's disease (AD) has a complex and multifactorial etiology, which requires integrating information about neuroanatomy, genetics, and cerebrospinal fluid biomarkers for accurate diagnosis. Hence, recent deep learning approaches combined image and tabular information to improve diagnostic performance. However, the black-box nature of such neural networks is still a barrier for clinical appli…

    Submitted 14 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: To be published in proceedings of Information Processing In Medical Imaging 2023

  43. arXiv:2303.03586  [pdf, other]

    physics.chem-ph

    Femtosecond electronic and hydrogen structural dynamics in ammonia imaged with ultrafast electron diffraction

    Authors: Elio G. Champenois, Nanna H. List, Matthew Ware, Mathew Britton, Philip H. Bucksbaum, Xinxin Cheng, Martin Centurion, James P. Cryan, Ruaridh Forbes, Ian Gabalski, Kareem Hegazy, Matthias C. Hoffmann, Andrew J. Howard, Fuhao Ji, Ming-Fu Lin, J. Pedro Nunes, Xiaozhe Shen, Jie Yang, Xijie Wang, Todd J. Martinez, Thomas J. A. Wolf

    Abstract: Directly imaging structural dynamics involving hydrogen atoms by ultrafast diffraction methods is complicated by their low scattering cross-sections. Here we demonstrate that megaelectronvolt ultrafast electron diffraction is sufficiently sensitive to follow hydrogen dynamics in isolated molecules. In a study of the photodissociation of gas phase ammonia, we simultaneously observe signatures of th…

    Submitted 6 March, 2023; originally announced March 2023.

  44. arXiv:2302.12518  [pdf, other]

    astro-ph.EP physics.ao-ph

    3D climate simulations of the Archean find that methane has a strong cooling effect at high concentrations

    Authors: Jake K. Eager-Nash, Nathan J. Mayne, Arwen E. Nicholson, Janke E. Prins, Oakley C. F. Young, Stuart J. Daines, Denis E. Sergeev, F. Hugo Lambert, James Manners, Ian A. Boutle, Eric T. Wolf, Inga E. E. Kamp, Krisztian Kohary, Tim M. Lenton

    Abstract: Methane is thought to have been an important greenhouse gas during the Archean, although its potential warming has been found to be limited at high concentrations due to its high shortwave absorption. We use the Met Office Unified Model, a general circulation model, to further explore the climatic effect of different Archean methane concentrations. Surface warming peaks at a pressure ratio CH$_4$:…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 36 pages, 18 figures

  45. arXiv:2302.05420  [pdf, other]

    astro-ph.EP astro-ph.SR

    Astrometric Accelerations as Dynamical Beacons: A Giant Planet Imaged Inside the Debris Disk of the Young Star AF Lep

    Authors: Kyle Franson, Brendan P. Bowler, Yifan Zhou, Tim D. Pearce, Daniella C. Bardalez Gagliuffi, Lauren Biddle, Timothy D. Brandt, Justin R. Crepp, Trent J. Dupuy, Jacqueline Faherty, Rebecca Jensen-Clem, Marvin Morgan, Aniket Sanghi, Christopher A. Theissen, Quang H. Tran, Trevor A. Wolf

    Abstract: We present the direct imaging discovery of a giant planet orbiting the young star AF Lep, a 1.2 $M_{\odot}$ member of the 24 $\pm$ 3 Myr $β$ Pic moving group. AF Lep was observed as part of our ongoing high-contrast imaging program targeting stars with astrometric accelerations between Hipparcos and Gaia that indicate the presence of substellar companions. Keck/NIRC2 observations in $L'$ with the…

    Submitted 25 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 14 pages, 3 figures, accepted to ApJL

  46. arXiv:2302.02662  [pdf, other]

    cs.LG

    Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

    Authors: Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer

    Abstract: Recent works successfully leveraged Large Language Models' (LLM) abilities to capture abstract knowledge about world's physics to solve decision-making problems. Yet, the alignment between LLMs' knowledge and the environment can be wrong and limit functional competence due to lack of grounding. In this paper, we study an approach (named GLAM) to achieve this alignment through functional grounding:…

    Submitted 6 September, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Journal ref: PMLR 202 (2023):3676-3713

  47. arXiv:2301.13112  [pdf, other]

    stat.ML cs.LG

    Benchmarking optimality of time series classification methods in distinguishing diffusions

    Authors: Zehong Zhang, Fei Lu, Esther Xu Fei, Terry Lyons, Yannis Kevrekidis, Tom Woolf

    Abstract: Statistical optimality benchmarking is crucial for analyzing and designing time series classification (TSC) algorithms. This study proposes to benchmark the optimality of TSC algorithms in distinguishing diffusion processes by the likelihood ratio test (LRT). The LRT is an optimal classifier by the Neyman-Pearson lemma. The LRT benchmarks are computationally efficient because the LRT does not need…

    Submitted 11 April, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 23 pages, 8 figures

    MSC Class: 62M02; 62M10; 62M20

  48. arXiv:2212.11032  [pdf, other]

    physics.ins-det hep-ex

    The Triggerless Data Acquisition System of the XENONnT Experiment

    Authors: E. Aprile, J. Aalbers, K. Abe, F. Agostini, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, et al. (140 additional authors not shown)

    Abstract: The XENONnT detector uses the latest and largest liquid xenon-based time projection chamber (TPC) operated by the XENON Collaboration, aimed at detecting Weakly Interacting Massive Particles and conducting other rare event searches. The XENONnT data acquisition (DAQ) system constitutes an upgraded and expanded version of the XENON1T DAQ system. For its operation, it relies predominantly on commerc…

    Submitted 21 December, 2022; originally announced December 2022.

  49. Ultraviolet Raman Spectroscopy for Remote Detection of Chlorine Gas

    Authors: Arne Walter, Frank Wilsenack, Thomas Wolf, Frank Duschek

    Abstract: As a primary material frequently used in industry, chlorine is relatively easy to obtain and available even in large quantities. Despite its high toxicity, molecular chlorine is readily available since it is an essential educt in the chemical industry. Over the past decades, numerous accidents involving injured and dead victims have occurred. Furthermore, it was already misused as a warfare agent…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 6th International Conference on Frontiers of Diagnostic Technologies, 6 pages

  50. arXiv:2212.04960  [pdf, other]

    cs.CY

    BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model

    Authors: Christopher Akiki, Giada Pistilli, Margot Mieskes, Matthias Gallé, Thomas Wolf, Suzana Ilić, Yacine Jernite

    Abstract: The BigScience Workshop was a value-driven initiative that spanned one and a half years of interdisciplinary research and culminated in the creation of ROOTS, a 1.6TB multilingual dataset that was used to train BLOOM, one of the largest multilingual language models to date. In addition to the technical outcomes and artifacts, the workshop fostered multidisciplinary collaborations around large models…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: Presented at the 2022 NeurIPS Workshop on Broadening Research Collaborations in ML