-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14…
▽ More
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Feature Expansion and enhanced Compression for Class Incremental Learning
Authors:
Quentin Ferdinand,
Gilles Le Chenadec,
Benoit Clement,
Panagiotis Papadakis,
Quentin Oliveau
Abstract:
Class incremental learning consists in training discriminative models to classify an increasing number of classes over time. However, doing so using only the newly added class data leads to the known problem of catastrophic forgetting of the previous classes. Recently, dynamic deep learning architectures have been shown to exhibit a better stability-plasticity trade-off by dynamically adding new f…
▽ More
Class incremental learning consists in training discriminative models to classify an increasing number of classes over time. However, doing so using only the newly added class data leads to the known problem of catastrophic forgetting of the previous classes. Recently, dynamic deep learning architectures have been shown to exhibit a better stability-plasticity trade-off by dynamically adding new feature extractors to the model in order to learn new classes followed by a compression step to scale the model back to its original size, thus avoiding a growing number of parameters. In this context, we propose a new algorithm that enhances the compression of previous class knowledge by cutting and mixing patches of previous class samples with the new images during compression using our Rehearsal-CutMix method. We show that this new data augmentation reduces catastrophic forgetting by specifically targeting past class information and improving its compression. Extensive experiments performed on the CIFAR and ImageNet datasets under diverse incremental learning evaluation protocols demonstrate that our approach consistently outperforms the state-of-the-art . The code will be made available upon publication of our work.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
PID Tuning using Cross-Entropy Deep Learning: a Lyapunov Stability Analysis
Authors:
Hector Kohler,
Benoit Clement,
Thomas Chaffre,
Gilles Le Chenadec
Abstract:
Underwater Unmanned Vehicles (UUVs) have to constantly compensate for the external disturbing forces acting on their body. Adaptive Control theory is commonly used there to grant the control law some flexibility in its response to process variation. Today, learning-based (LB) adaptive methods are leading the field where model-based control structures are combined with deep model-free learning algo…
▽ More
Underwater Unmanned Vehicles (UUVs) have to constantly compensate for the external disturbing forces acting on their body. Adaptive Control theory is commonly used there to grant the control law some flexibility in its response to process variation. Today, learning-based (LB) adaptive methods are leading the field where model-based control structures are combined with deep model-free learning algorithms. This work proposes experiments and metrics to empirically study the stability of such a controller. We perform this stability analysis on a LB adaptive control system whose adaptive parameters are determined using a Cross-Entropy Deep Learning method.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Hybrid Navigation Acceptability and Safety
Authors:
Benoit Clement,
Marie Dubromel,
Paulo E. Santos,
Karl Sammut,
Michelle Oppert,
Feras Dayoub
Abstract:
Autonomous vessels have emerged as a prominent and accepted solution, particularly in the naval defence sector. However, achieving full autonomy for marine vessels demands the development of robust and reliable control and guidance systems that can handle various encounters with manned and unmanned vessels while operating effectively under diverse weather and sea conditions. A significant challeng…
▽ More
Autonomous vessels have emerged as a prominent and accepted solution, particularly in the naval defence sector. However, achieving full autonomy for marine vessels demands the development of robust and reliable control and guidance systems that can handle various encounters with manned and unmanned vessels while operating effectively under diverse weather and sea conditions. A significant challenge in this pursuit is ensuring the autonomous vessels' compliance with the International Regulations for Preventing Collisions at Sea (COLREGs). These regulations present a formidable hurdle for the human-level understanding by autonomous systems as they were originally designed from common navigation practices created since the mid-19th century. Their ambiguous language assumes experienced sailors' interpretation and execution, and therefore demands a high-level (cognitive) understanding of language and agent intentions. These capabilities surpass the current state-of-the-art in intelligent systems. This position paper highlights the critical requirements for a trustworthy control and guidance system, exploring the complexity of adapting COLREGs for safe vessel-on-vessel encounters considering autonomous maritime technology competing and/or cooperating with manned vessels.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-minimum Phase Systems
Authors:
Fatemeh Tavakkoli,
Pouria Sarhadi,
Benoit Clement,
Wasif Naeem
Abstract:
Deep Reinforcement Learning (DRL) techniques have received significant attention in control and decision-making algorithms. Most applications involve complex decision-making systems, justified by the algorithms' computational power and cost. While model-based versions are emerging, model-free DRL approaches are intriguing for their independence from models, yet they remain relatively less explored…
▽ More
Deep Reinforcement Learning (DRL) techniques have received significant attention in control and decision-making algorithms. Most applications involve complex decision-making systems, justified by the algorithms' computational power and cost. While model-based versions are emerging, model-free DRL approaches are intriguing for their independence from models, yet they remain relatively less explored in terms of performance, particularly in applied control. This study conducts a thorough performance analysis comparing the data-driven DRL paradigm with a classical state feedback controller, both designed based on the same cost (reward) function of the linear quadratic regulator (LQR) problem. Twelve additional performance criteria are introduced to assess the controllers' performance, independent of the LQR problem for which they are designed. Two Deep Deterministic Policy Gradient (DDPG)-based controllers are developed, leveraging DDPG's widespread reputation. These controllers are aimed at addressing a challenging setpoint tracking problem in a Non-Minimum Phase (NMP) system. The performance and robustness of the controllers are assessed in the presence of operational challenges, including disturbance, noise, initial conditions, and model uncertainties. The findings suggest that the DDPG controller demonstrates promising behavior under rigorous test conditions. Nevertheless, further improvements are necessary for the DDPG controller to outperform classical methods in all criteria. While DRL algorithms may excel in complex environments owing to the flexibility in the reward function definition, this paper offers practical insights and a comparison framework specifically designed to evaluate these algorithms within the context of control engineering.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Improved Performances and Motivation in Intelligent Tutoring Systems: Combining Machine Learning and Learner Choice
Authors:
Benjamin Clément,
Hélène Sauzéon,
Didier Roy,
Pierre-Yves Oudeyer
Abstract:
Large class sizes pose challenges to personalized learning in schools, which educational technologies, especially intelligent tutoring systems (ITS), aim to address. In this context, the ZPDES algorithm, based on the Learning Progress Hypothesis (LPH) and multi-armed bandit machine learning techniques, sequences exercises that maximize learning progress (LP). This algorithm was previously shown in…
▽ More
Large class sizes pose challenges to personalized learning in schools, which educational technologies, especially intelligent tutoring systems (ITS), aim to address. In this context, the ZPDES algorithm, based on the Learning Progress Hypothesis (LPH) and multi-armed bandit machine learning techniques, sequences exercises that maximize learning progress (LP). This algorithm was previously shown in field studies to boost learning performances for a wider diversity of students compared to a hand-designed curriculum. However, its motivational impact was not assessed. Also, ZPDES did not allow students to express choices. This limitation in agency is at odds with the LPH theory concerned with modeling curiosity-driven learning. We here study how the introduction of such choice possibilities impact both learning efficiency and motivation. The given choice concerns dimensions that are orthogonal to exercise difficulty, acting as a playful feature.
In an extensive field study (265 7-8 years old children, RCT design), we compare systems based either on ZPDES or a hand-designed curriculum, both with and without self-choice. We first show that ZPDES improves learning performance and produces a positive and motivating learning experience. We then show that the addition of choice triggers intrinsic motivation and reinforces the learning effectiveness of the LP-based personalization. In doing so, it strengthens the links between intrinsic motivation and performance progress during the serious game. Conversely, deleterious effects of the playful feature are observed for hand-designed linear paths. Thus, the intrinsic motivation elicited by a playful feature is beneficial only if the curriculum personalization is effective for the learner. Such a result deserves great attention due to increased use of playful features in non adaptive educational technologies.
△ Less
Submitted 16 January, 2024;
originally announced February 2024.
-
Euclid preparation: TBD. The pre-launch Science Ground Segment simulation framework
Authors:
Euclid Collaboration,
S. Serrano,
P. Hudelot,
G. Seidel,
J. E. Pollack,
E. Jullo,
F. Torradeflot,
D. Benielli,
R. Fahed,
T. Auphan,
J. Carretero,
H. Aussel,
P. Casenove,
F. J. Castander,
J. E. Davies,
N. Fourmanoit,
S. Huot,
A. Kara,
E. Keihanen,
S. Kermiche,
K. Okumura,
J. Zoubian,
A. Ealet,
A. Boucaud,
H. Bretonniere
, et al. (251 additional authors not shown)
Abstract:
The European Space Agency's Euclid mission is one of the upcoming generation of large-scale cosmology surveys, which will map the large-scale structure in the Universe with unprecedented precision. The development and validation of the SGS pipeline requires state-of-the-art simulations with a high level of complexity and accuracy that include subtle instrumental features not accounted for previous…
▽ More
The European Space Agency's Euclid mission is one of the upcoming generation of large-scale cosmology surveys, which will map the large-scale structure in the Universe with unprecedented precision. The development and validation of the SGS pipeline requires state-of-the-art simulations with a high level of complexity and accuracy that include subtle instrumental features not accounted for previously as well as faster algorithms for the large-scale production of the expected Euclid data products. In this paper, we present the Euclid SGS simulation framework as applied in a large-scale end-to-end simulation exercise named Science Challenge 8. Our simulation pipeline enables the swift production of detailed image simulations for the construction and validation of the Euclid mission during its qualification phase and will serve as a reference throughout operations. Our end-to-end simulation framework starts with the production of a large cosmological N-body & mock galaxy catalogue simulation. We perform a selection of galaxies down to I_E=26 and 28 mag, respectively, for a Euclid Wide Survey spanning 165 deg^2 and a 1 deg^2 Euclid Deep Survey. We build realistic stellar density catalogues containing Milky Way-like stars down to H<26. Using the latest instrumental models for both the Euclid instruments and spacecraft as well as Euclid-like observing sequences, we emulate with high fidelity Euclid satellite imaging throughout the mission's lifetime. We present the SC8 data set consisting of overlap** visible and near-infrared Euclid Wide Survey and Euclid Deep Survey imaging and low-resolution spectroscopy along with ground-based. This extensive data set enables end-to-end testing of the entire ground segment data reduction and science analysis pipeline as well as the Euclid mission infrastructure, paving the way to future scientific and technical developments and enhancements.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
SUT: Active Defects Probing for Transcompiler Models
Authors:
Mengnan Qi,
Yufan Huang,
Maoquan Wang,
Yongqiang Yao,
Zihan Liu,
Bin Gu,
Colin Clement,
Neel Sundaresan
Abstract:
Automatic Program translation has enormous application value and hence has been attracting significant interest from AI researchers. However, we observe that current program translation models still make elementary syntax errors, particularly, when the target language does not have syntax elements in the source language. Metrics like BLUE, CodeBLUE and computation accuracy may not expose these iss…
▽ More
Automatic Program translation has enormous application value and hence has been attracting significant interest from AI researchers. However, we observe that current program translation models still make elementary syntax errors, particularly, when the target language does not have syntax elements in the source language. Metrics like BLUE, CodeBLUE and computation accuracy may not expose these issues. In this paper we introduce a new metrics for programming language translation and these metrics address these basic syntax errors. We develop a novel active defects probing suite called Syntactic Unit Tests (SUT) which includes a highly interpretable evaluation harness for accuracy and test scoring. Experiments have shown that even powerful models like ChatGPT still make mistakes on these basic unit tests. Specifically, compared to previous program translation task evaluation dataset, its pass rate on our unit tests has decreased by 26.15%. Further our evaluation harness reveal syntactic element errors in which these models exhibit deficiencies.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
Program Translation via Code Distillation
Authors:
Yufan Huang,
Mengnan Qi,
Yongqiang Yao,
Maoquan Wang,
Bin Gu,
Colin Clement,
Neel Sundaresan
Abstract:
Software version migration and program translation are an important and costly part of the lifecycle of large codebases. Traditional machine translation relies on parallel corpora for supervised translation, which is not feasible for program translation due to a dearth of aligned data. Recent unsupervised neural machine translation techniques have overcome data limitations by included techniques s…
▽ More
Software version migration and program translation are an important and costly part of the lifecycle of large codebases. Traditional machine translation relies on parallel corpora for supervised translation, which is not feasible for program translation due to a dearth of aligned data. Recent unsupervised neural machine translation techniques have overcome data limitations by included techniques such as back translation and low level compiler intermediate representations (IR). These methods face significant challenges due to the noise in code snippet alignment and the diversity of IRs respectively. In this paper we propose a novel model called Code Distillation (CoDist) whereby we capture the semantic and structural equivalence of code in a language agnostic intermediate representation. Distilled code serves as a translation pivot for any programming language, leading by construction to parallel corpora which scale to all available source code by simply applying the distillation compiler. We demonstrate that our approach achieves state-of-the-art performance on CodeXGLUE and TransCoder GeeksForGeeks translation benchmarks, with an average absolute increase of 12.7% on the TransCoder GeeksforGeeks translation benchmark compare to TransCoder-ST.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance
Authors:
Thomas Chaffre,
Jonathan Wheare,
Andrew Lammas,
Paulo Santos,
Gilles Le Chenadec,
Karl Sammut,
Benoit Clement
Abstract:
Learning-based adaptive control methods hold the premise of enabling autonomous agents to reduce the effect of process variations with minimal human intervention. However, its application to autonomous underwater vehicles (AUVs) has so far been restricted due to 1) unknown dynamics under the form of sea current disturbance that we can not model properly nor measure due to limited sensor capability…
▽ More
Learning-based adaptive control methods hold the premise of enabling autonomous agents to reduce the effect of process variations with minimal human intervention. However, its application to autonomous underwater vehicles (AUVs) has so far been restricted due to 1) unknown dynamics under the form of sea current disturbance that we can not model properly nor measure due to limited sensor capability and 2) the nonlinearity of AUVs tasks where the controller response at some operating points must be overly conservative in order to satisfy the specification at other operating points. Deep Reinforcement Learning (DRL) can alleviates these limitations by training general-purpose neural network policies, but applications of DRL algorithms to AUVs have been restricted to simulated environments, due to their inherent high sample complexity and distribution shift problem. This paper presents a novel approach, merging the Maximum Entropy Deep Reinforcement Learning framework with a classic model-based control architecture, to formulate an adaptive controller. Within this framework, we introduce a Sim-to-Real transfer strategy comprising the following components: a bio-inspired experience replay mechanism, an enhanced domain randomisation technique, and an evaluation protocol executed on a physical platform. Our experimental assessments demonstrate that this method effectively learns proficient policies from suboptimal simulated models of the AUV, resulting in control performance 3 times higher when transferred to a real-world vehicle, compared to its model-based nonadaptive but optimal counterpart.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Probing the faint end Luminosity Function of Lyman Alpha Emitters at 3<z<7 behind 17 MUSE lensing clusters
Authors:
Tran Thi Thai,
Pham Tuan-Anh,
Roser Pello,
Ilias Goovaerts,
Johan Richard,
Adélaïde Claeyssens,
Guillaume Mahler,
David J. Lagattuta,
Geoffroy de la Vieuville,
Eduard Salvador-Solé,
Thibault Garel,
Franz E. Bauer,
Alexandre Jeanneau,
Benjamin Clément,
Jorryt Matthee
Abstract:
We present a study of the galaxy Lyman-alpha luminosity function (LF) using a sample of 17 lensing clusters observed by the MUSE/VLT. Magnification from strong gravitational lensing by clusters of galaxies and MUSE apabilities allow us to blindly detect LAEs without any photometric pre-selection, reaching the faint luminosity regime. 600 lensed LAEs were selected behind these clusters in the redsh…
▽ More
We present a study of the galaxy Lyman-alpha luminosity function (LF) using a sample of 17 lensing clusters observed by the MUSE/VLT. Magnification from strong gravitational lensing by clusters of galaxies and MUSE apabilities allow us to blindly detect LAEs without any photometric pre-selection, reaching the faint luminosity regime. 600 lensed LAEs were selected behind these clusters in the redshift range 2.9<$z$< 6.7, covering four orders of magnitude in magnification-corrected Lyman-alpha luminosity (39.0<log$L$< 43.0). The method used in this work ($V_{\text{max}}$) follows the recipes originally developed by arXiv:1905.13696(N) (DLV19) with some improvements to better account for the effects of lensing when computing the effective volume. The total co-moving volume at 2.9<$z$<6.7 is $\sim$50 $10^{3}Mpc^{3}$. Our LF points in the bright end (log L)>42 are consistent with those obtained from blank field observations. In the faint luminosity regime, the density of sources is well described by a steep slope, $α\sim-2$ for the global redshift range. Up to log(L)$\sim$41, the steepening of the faint end slope with redshift, suggested by the earlier work of DLV19 is observed, but the uncertainties remain large. A significant flattening is observed towards the faintest end, for the highest redshift bins (log$L$<41). Using face values, the steep slope at the faint-end causes the SFRD to dramatically increase with redshift, implying that LAEs could play a major role in the process of cosmic reionization. The flattening observed towards the faint end for the highest redshift bins still needs further investigation. This turnover is similar to the one observed for the UV LF at $z\geq6$ in lensing clusters, with the same conclusions regarding the reliability of current results (e.g.arXiv:1803.09747(N); arXiv:2205.11526(N)).
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Euclid Preparation XXXIII. Characterization of convolutional neural networks for the identification of galaxy-galaxy strong lensing events
Authors:
Euclid Collaboration,
L. Leuzzi,
M. Meneghetti,
G. Angora,
R. B. Metcalf,
L. Moscardini,
P. Rosati,
P. Bergamini,
F. Calura,
B. Clément,
R. Gavazzi,
F. Gentile,
M. Lochner,
C. Grillo,
G. Vernardos,
N. Aghanim,
A. Amara,
L. Amendola,
S. Andreon,
N. Auricchio,
S. Bardelli,
C. Bodendorf,
D. Bonino,
E. Branchini,
M. Brescia
, et al. (194 additional authors not shown)
Abstract:
Forthcoming imaging surveys will potentially increase the number of known galaxy-scale strong lenses by several orders of magnitude. For this to happen, images of tens of millions of galaxies will have to be inspected to identify potential candidates. In this context, deep learning techniques are particularly suitable for the finding patterns in large data sets, and convolutional neural networks (…
▽ More
Forthcoming imaging surveys will potentially increase the number of known galaxy-scale strong lenses by several orders of magnitude. For this to happen, images of tens of millions of galaxies will have to be inspected to identify potential candidates. In this context, deep learning techniques are particularly suitable for the finding patterns in large data sets, and convolutional neural networks (CNNs) in particular can efficiently process large volumes of images. We assess and compare the performance of three network architectures in the classification of strong lensing systems on the basis of their morphological characteristics. We train and test our models on different subsamples of a data set of forty thousand mock images, having characteristics similar to those expected in the wide survey planned with the ESA mission \Euclid, gradually including larger fractions of faint lenses. We also evaluate the importance of adding information about the colour difference between the lens and source galaxies by repeating the same training on single-band and multi-band images. Our models find samples of clear lenses with $\gtrsim 90\%$ precision and completeness, without significant differences in the performance of the three architectures. Nevertheless, when including lenses with fainter arcs in the training set, the three models' performance deteriorates with accuracy values of $\sim 0.87$ to $\sim 0.75$ depending on the model. Our analysis confirms the potential of the application of CNNs to the identification of galaxy-scale strong lenses. We suggest that specific training with separate classes of lenses might be needed for detecting the faint lenses since the addition of the colour information does not yield a significant improvement in the current analysis, with the accuracy ranging from $\sim 0.89$ to $\sim 0.78$ for the different models.
△ Less
Submitted 26 January, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Search for an interaction mediated by axion-like particles with ultracold neutrons at the PSI
Authors:
N. J. Ayres,
G. Bison,
K. Bodek,
V. Bondar,
T. Bouillaud,
E. Chanel,
P. -J. Chiu,
B. Clement,
C. B. Crawford,
M. Daum,
C. B. Doorenbos,
S. Emmenegger,
M. Fertl,
P. Flaux,
W. C. Griffith,
P. G. Harris,
N. Hild,
M. Kasprzak,
K. Kirch,
V. Kletzl,
P. A. Koss,
J. Krempel,
B. Lauss,
T. Lefort,
P. Mohanmurthy
, et al. (22 additional authors not shown)
Abstract:
We report on a search for a new, short-range, spin-dependent interaction using a modified version of the experimental apparatus used to measure the permanent neutron electric dipole moment at the Paul Scherrer Institute. This interaction, which could be mediated by axion-like particles, concerned the unpolarized nucleons (protons and neutrons) near the material surfaces of the apparatus and polari…
▽ More
We report on a search for a new, short-range, spin-dependent interaction using a modified version of the experimental apparatus used to measure the permanent neutron electric dipole moment at the Paul Scherrer Institute. This interaction, which could be mediated by axion-like particles, concerned the unpolarized nucleons (protons and neutrons) near the material surfaces of the apparatus and polarized ultracold neutrons stored in vacuum. The dominant systematic uncertainty resulting from magnetic-field gradients was controlled to an unprecedented level of approximately 4 pT/cm using an array of optically-pumped cesium vapor magnetometers and magnetic-field maps independently recorded using a dedicated measurement device. No signature of a theoretically predicted new interaction was found, and we set a new limit on the product of the scalar and the pseudoscalar couplings $g_sg_pλ^2 < 8.3 \times 10^{-28}\,\text{m}^2$ (95% C.L.) in a range of $5\,μ\text{m} < λ< 25\,\text{mm}$ for the monopole-dipole interaction. This new result confirms and improves our previous limit by a factor of 2.7 and provides the current tightest limit obtained with free neutrons.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
A search for neutron-to-hidden-neutron oscillations in a ultra-cold neutron beam
Authors:
G. Ban,
J. Chen,
P. -J. Chiu,
B. Clément,
M. Guigue,
T. Jenke,
P. Larue,
T. Lefort,
O. Naviliat-Cuncic,
B. Perriolat,
G. Pignol,
S. Roccia,
W. Saenz-Arevalo,
P. Schmidt-Wellenburg
Abstract:
Models that postulate the existence of hidden sectors address contemporary questions, such as the source of baryogenesis and the nature of dark matter. Among the possible mixing processes, neutron-to-hidden-neutron oscillations have been repeatedly tested with ultra-cold neutron storage and passing-through-wall experiments in the range of small ($δm<2$ peV) and large mass splitting ($δm>10$ neV),…
▽ More
Models that postulate the existence of hidden sectors address contemporary questions, such as the source of baryogenesis and the nature of dark matter. Among the possible mixing processes, neutron-to-hidden-neutron oscillations have been repeatedly tested with ultra-cold neutron storage and passing-through-wall experiments in the range of small ($δm<2$ peV) and large mass splitting ($δm>10$ neV), respectively. In this work, we present a new constraint in the oscillation parameter space derived from neutron disappearance in ultra-cold neutron beam experiments. The overall limit, which covers the intermediate mass-splitting range, is given by $τ_{nn'}> 1$ s for $|δm| \in [2,69]$ peV (95\% C.L.).
△ Less
Submitted 18 March, 2023;
originally announced March 2023.
-
A new step forward in realistic cluster lens mass modelling: Analysis of Hubble Frontier Field Cluster Abell S1063 from joint lensing, X-ray and galaxy kinematics data
Authors:
Benjamin Beauchesne,
Benjamin Clément,
Pascale Hibon,
Marceau Limousin,
Dominique Eckert,
Jean-Paul Kneib,
Johan Richard,
Priyamvada Natarajan,
Mathilde Jauzac,
Mireia Montes,
Guillaume Mahler,
Adélaïde Claeyssens,
Alexandre Jeanneau,
Anton M. Koekemoer,
David Lagattuta,
Amanda Pagul,
Javier Sánchez
Abstract:
We present a new method to simultaneously/self-consistently model the mass distribution of galaxy clusters that combines constraints from strong lensing features, X-ray emission and galaxy kinematics measurements. We are able to successfully decompose clusters into their collisionless and collisional mass components thanks to the X-ray surface brightness, as well as using the dynamics of cluster m…
▽ More
We present a new method to simultaneously/self-consistently model the mass distribution of galaxy clusters that combines constraints from strong lensing features, X-ray emission and galaxy kinematics measurements. We are able to successfully decompose clusters into their collisionless and collisional mass components thanks to the X-ray surface brightness, as well as using the dynamics of cluster members to obtain more accurate masses with the fundamental plane of elliptical galaxies. Knowledge from all observables is included through a consistent Bayesian approach in the likelihood or in physically motivated priors. We apply this method to the galaxy cluster Abell S1063 and produce a mass model that we publicly release with this paper. The resulting mass distribution presents a different ellipticities for the intra-cluster gas and the other large-scale mass components; and deviation from elliptical symmetry in the main halo. We assess the ability of our method to recover the masses of the different elements of the cluster using a mock cluster based on a simplified version of our Abell S1063 model. Thanks to the wealth of information provided by the mass model and the X-ray emission, we also found evidence for an on-going merger event with gas sloshing from a smaller infalling structure into the main cluster. In agreement with previous findings, the total mass, gas profile and gas mass fraction are consistent with small deviations from the hydrostatic equilibrium. This new mass model for Abell S1063 is publicly available as is the software used to construct it through the \textsc{Lenstool} package.
△ Less
Submitted 26 October, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
The impact of human expert visual inspection on the discovery of strong gravitational lenses
Authors:
Karina Rojas,
Thomas E. Collett,
Daniel Ballard,
Mark R. Magee,
Simon Birrer,
Elizabeth Buckley-Geer.,
James H. H. Chan,
Benjamin Clément,
José M. Diego,
Fabrizio Gentile,
Jimena González,
Rémy Joseph,
Jorge Mastache,
Stefan Schuldt,
Crescenzo Tortora,
Tomás Verdugo,
Aprajita Verma,
Tansu Daylan,
Martin Millon,
Neal Jackson,
Simon Dye,
Alejandra Melo,
Guillaume Mahler,
Ricardo L. C. Ogando,
Frédéric Courbin
, et al. (31 additional authors not shown)
Abstract:
We investigate the ability of human 'expert' classifiers to identify strong gravitational lens candidates in Dark Energy Survey like imaging. We recruited a total of 55 people that completed more than 25$\%$ of the project. During the classification task, we present to the participants 1489 images. The sample contains a variety of data including lens simulations, real lenses, non-lens examples, an…
▽ More
We investigate the ability of human 'expert' classifiers to identify strong gravitational lens candidates in Dark Energy Survey like imaging. We recruited a total of 55 people that completed more than 25$\%$ of the project. During the classification task, we present to the participants 1489 images. The sample contains a variety of data including lens simulations, real lenses, non-lens examples, and unlabeled data. We find that experts are extremely good at finding bright, well-resolved Einstein rings, whilst arcs with $g$-band signal-to-noise less than $\sim$25 or Einstein radii less than $\sim$1.2 times the seeing are rarely recovered. Very few non-lenses are scored highly. There is substantial variation in the performance of individual classifiers, but they do not appear to depend on the classifier's experience, confidence or academic position. These variations can be mitigated with a team of 6 or more independent classifiers. Our results give confidence that humans are a reliable pruning step for lens candidates, providing pure and quantifiably complete samples for follow-up studies.
△ Less
Submitted 25 April, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Determination of diffusion coefficients of mercury atoms in various gases from longitudinal spin relaxation in magnetic gradients
Authors:
B. Clément,
M. Guigue,
A. Leredde,
G. Pignol,
D. Rebreyend,
S. Roccia,
S. Touati
Abstract:
We present a method to measure the binary diffusion coefficient of mercury atoms in a gas at room temperature and low pressure. It is based on the measurement of the longitudinal spin relaxation of optically pumped mercury-199 atoms in a magnetic field gradient. We provide a consistent set of diffusion coefficients for helium-3, helium-4, argon, krypton, xenon, nitrogen, carbon dioxide, oxygen, an…
▽ More
We present a method to measure the binary diffusion coefficient of mercury atoms in a gas at room temperature and low pressure. It is based on the measurement of the longitudinal spin relaxation of optically pumped mercury-199 atoms in a magnetic field gradient. We provide a consistent set of diffusion coefficients for helium-3, helium-4, argon, krypton, xenon, nitrogen, carbon dioxide, oxygen, and air.
△ Less
Submitted 19 December, 2022; v1 submitted 14 September, 2022;
originally announced September 2022.
-
Precision modeling of JWST's first cluster lens SMACSJ0723.3-7327
Authors:
Guillaume Mahler,
Mathilde Jauzac,
Johan Richard,
Benjamin Beauchesne,
Harald Ebeling,
David Lagattuta,
Priyamvada Natarajan,
Keren Sharon,
Hakim Atek,
Adélaïde Claeyssens,
Benjamin Clément,
Dominique Eckert,
Alastair Edge,
Jean-Paul Kneib,
Anna Niemiec
Abstract:
Exploiting the fundamentally achromatic nature of gravitational lensing, we present a lens model for the massive galaxy cluster SMACSJ0723.3-7323 (SMACSJ0723, z=0.388) that significantly improves upon earlier work. Building on strong-lensing constraints identified in prior Hubble Space Telescope (HST) observations, the mass model utilizes 21 multiple-image systems, 17 of which were newly discovere…
▽ More
Exploiting the fundamentally achromatic nature of gravitational lensing, we present a lens model for the massive galaxy cluster SMACSJ0723.3-7323 (SMACSJ0723, z=0.388) that significantly improves upon earlier work. Building on strong-lensing constraints identified in prior Hubble Space Telescope (HST) observations, the mass model utilizes 21 multiple-image systems, 17 of which were newly discovered in Early Release Observation (ERO) data from the James Webb Space Telescope (JWST). The resulting lens model maps the cluster mass distribution to an RMS spatial precision of 0.32'' and is publicly available. Consistent with previous analyses, our study shows SMACSJ0723.3-7323 to be well described by a single large-scale component centered on the location of the brightest cluster galaxy. However, satisfying all lensing constraints provided by the JWST data, the model point to the need for the inclusion of an additional, diffuse component west of the cluster. A comparison of the galaxy, mass, and gas distributions in the core of SMACSJ0723 based on HST, JWST, and Chandra data reveals a concentrated regular elliptical profile along with tell-tale signs of a recent merger, possibly proceeding almost along our line of sight. The exquisite sensitivity of JWST's NIRCAM reveals in spectacular fashion both the extended intra-cluster-light distribution and numerous star-forming clumps in magnified background galaxies. The high-precision lens model derived here for SMACSJ0723-7323 demonstrates the unprecedented power of combining HST and JWST data for studies of structure formation and evolution in the distant Universe.
△ Less
Submitted 27 April, 2023; v1 submitted 14 July, 2022;
originally announced July 2022.
-
DeepPERF: A Deep Learning-Based Approach For Improving Software Performance
Authors:
Spandan Garg,
Roshanak Zilouchian Moghaddam,
Colin B. Clement,
Neel Sundaresan,
Chen Wu
Abstract:
Improving software performance is an important yet challenging part of the software development cycle. Today, the majority of performance inefficiencies are identified and patched by performance experts. Recent advancements in deep learning approaches and the wide-spread availability of open source data creates a great opportunity to automate the identification and patching of performance problems…
▽ More
Improving software performance is an important yet challenging part of the software development cycle. Today, the majority of performance inefficiencies are identified and patched by performance experts. Recent advancements in deep learning approaches and the wide-spread availability of open source data creates a great opportunity to automate the identification and patching of performance problems. In this paper, we present DeepPERF, a transformer-based approach to suggest performance improvements for C# applications. We pretrain DeepPERF on English and Source code corpora and followed by finetuning for the task of generating performance improvement patches for C# applications. Our evaluation shows that our model can generate the same performance improvement suggestion as the developer fix in ~53% of the cases, getting ~34% of them verbatim in our expert-verified dataset of performance changes made by C# developers. Additionally, we evaluate DeepPERF on 50 open source C# repositories on GitHub using both benchmark and unit tests and find that our model is able to suggest valid performance improvements that can improve both CPU usage and Memory allocations. So far we've submitted 19 pull-requests with 28 different performance optimizations and 11 of these PRs have been approved by the project owners.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
The `n2EDM MSR' -- a very large magnetically shielded room with an exceptional performance for fundamental physics measurements
Authors:
N. J. Ayres,
G. Ban,
G. Bison,
K. Bodek,
V. Bondar,
T. Bouillaud,
B. Clement,
E. Chanel,
P. -J. Chiu,
C. B. Crawford,
M. Daum,
C. B. Doorenbos,
S. Emmenegger,
A. Fratangelo,
M. Fertl,
W. C. Griffith,
Z. D. Grujic,
P. G. Harris,
K. Kirch,
J. Krempel,
B. Lauss,
T. Lefort,
O. Naviliat-Cuncic,
D. Pais,
F. M. Piegsa
, et al. (19 additional authors not shown)
Abstract:
We present the magnetically shielded room (MSR) for the n2EDM experiment at the Paul Scherrer Institute which features an interior cubic volume with each side of length 2.92m, thus providing an accessible space of 25m3. The MSR has 87 openings up to 220mm diameter to operate the experimental apparatus inside, and an intermediate space between the layers for sensitive signal processing electronics.…
▽ More
We present the magnetically shielded room (MSR) for the n2EDM experiment at the Paul Scherrer Institute which features an interior cubic volume with each side of length 2.92m, thus providing an accessible space of 25m3. The MSR has 87 openings up to 220mm diameter to operate the experimental apparatus inside, and an intermediate space between the layers for sensitive signal processing electronics. The characterization measurements show a remanent magnetic field in the central 1m3 below 100pT, and a field below 600pT in the entire inner volume, up to 4\,cm to the walls. The quasi-static shielding factor at 0.01\,Hz measured with a sinusoidal 2muT peak-to-peak signal is about 100,000 in all three spatial directions and rises fast with frequency to reach 10^8 above 1Hz.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
$n-n'$ Oscillations: Sensitivity of a first UCN beam experiment
Authors:
G. Ban,
J. Chen,
P. -J. Chiu,
B. Clément,
M. Guigue,
T. Jenke,
P. Larue,
T. Lefort,
O. Naviliat-Cuncic,
B. Perriolat,
G. Pignol,
S. Roccia,
W. Saenz-Arevalo,
P. Schmidt-Wellenburg
Abstract:
Oscillations of the neutron into a hidden sector particle are processes predicted in various Standard Model extensions. This extra channel for neutron disappearance has not been tested experimentally in large portions of the oscillation parameter space. Several efforts have been recently made on revising the oscillation time limits at low mass-splitting in ultra-cold neutron (UCN) storage experime…
▽ More
Oscillations of the neutron into a hidden sector particle are processes predicted in various Standard Model extensions. This extra channel for neutron disappearance has not been tested experimentally in large portions of the oscillation parameter space. Several efforts have been recently made on revising the oscillation time limits at low mass-splitting in ultra-cold neutron (UCN) storage experiments, and at larger mass-splitting in passing-through-wall experiments. In this work, we present the expected sensitivity of an experiment searching for neutron hidden neutron oscillations at intermediate mass-splitting via the application of magnetic fields in the range $B_0=30-1100$ $μ$T. This experiment was performed at the Institut-Laue-Langevin using a novel UCN counter to monitor the beam flux. The measured UCN rate and the data collection technique predict a sensitivity on the oscillation time at the level of a couple of seconds.
△ Less
Submitted 17 June, 2022;
originally announced June 2022.
-
Manipulation of gravitational quantum states of a bouncing neutron with the GRANIT spectrometer
Authors:
Benoit Clément,
Stefan Baeßler,
Valery V. Nesvizhevsky,
Emily Perry,
Guillaume Pignol,
Jason A. Pioquinto,
Konstantin V. Protasov,
Dominique Rebreyend,
Damien Roulier,
Lingnan Shen,
Alexander V. Strelkov,
Francis Vezzu
Abstract:
The bouncing neutron is one of the rare system where gravity can be studied in a quantum framework. To this end it is crucial to be able to select some specific gravitational quantum state (GQS). The GRANIT apparatus is the first physics experiment connected to a superthermal helium UCN source. We report on the methods developed for this instrument showing how specific GQS can be favored using a s…
▽ More
The bouncing neutron is one of the rare system where gravity can be studied in a quantum framework. To this end it is crucial to be able to select some specific gravitational quantum state (GQS). The GRANIT apparatus is the first physics experiment connected to a superthermal helium UCN source. We report on the methods developed for this instrument showing how specific GQS can be favored using a step between mirrors and an absorbing slit. We explore the increase of GQS separation efficiency by increasing the absorber roughness amplitude, and find it is feasible but requires a high adjustment precision. We also quantify the transmission of the absorbing slit leading to a measurement of the spatial extension of the neutron vertical wave function $z_0 = \hbar^{2/3}\left(2m^2g\right)^{-1/3} = 5.9\pm0.3\,μ$m.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Generating Examples From CLI Usage: Can Transformers Help?
Authors:
Roshanak Zilouchian Moghaddam,
Spandan Garg,
Colin B. Clement,
Yevhen Mohylevskyy,
Neel Sundaresan
Abstract:
Continuous evolution in modern software often causes documentation, tutorials, and examples to be out of sync with changing interfaces and frameworks. Relying on outdated documentation and examples can lead programs to fail or be less efficient or even less secure. In response, programmers need to regularly turn to other resources on the web such as StackOverflow for examples to guide them in writ…
▽ More
Continuous evolution in modern software often causes documentation, tutorials, and examples to be out of sync with changing interfaces and frameworks. Relying on outdated documentation and examples can lead programs to fail or be less efficient or even less secure. In response, programmers need to regularly turn to other resources on the web such as StackOverflow for examples to guide them in writing software. We recognize that this inconvenient, error-prone, and expensive process can be improved by using machine learning applied to software usage data. In this paper, we present our practical system which uses machine learning on large-scale telemetry data and documentation corpora, generating appropriate and complex examples that can be used to improve documentation. We discuss both feature-based and transformer-based machine learning approaches and demonstrate that our system achieves 100% coverage for the used functionalities in the product, providing up-to-date examples upon every release and reduces the numbers of PRs submitted by software owners writing and editing documentation by >68%. We also share valuable lessons learnt during the 3 years that our production quality system has been deployed for Azure Cloud Command Line Interface (Azure CLI).
△ Less
Submitted 26 April, 2022;
originally announced April 2022.
-
Spatial resolution determination of a position sensitive ultra-cold neutron detector
Authors:
B. Clément,
L. Gesson,
T. Jenke,
V. V. Nesvizhevsky,
G. Pignol,
S. Roccia,
J. -P. Scordillis
Abstract:
The study of the properties of the quantum states of bouncing neutrons requires position sensitive detection with micro-metric spatial resolution. The UCNBoX detector relies on Charge Coupled Devices (CCD) coated with a thin boron-10 conversion layer to detect neutron hits. In this paper, we present an original experimental method to determine the spatial resolution of this device using micrometri…
▽ More
The study of the properties of the quantum states of bouncing neutrons requires position sensitive detection with micro-metric spatial resolution. The UCNBoX detector relies on Charge Coupled Devices (CCD) coated with a thin boron-10 conversion layer to detect neutron hits. In this paper, we present an original experimental method to determine the spatial resolution of this device using micrometric masks. The observed resolution is $2.0\pm0.3~μ$m.
△ Less
Submitted 17 March, 2022;
originally announced March 2022.
-
Training and Evaluating a Jupyter Notebook Data Science Assistant
Authors:
Shubham Chandel,
Colin B. Clement,
Guillermo Serrato,
Neel Sundaresan
Abstract:
We study the feasibility of a Data Science assistant powered by a sequence-to-sequence transformer by training a new model JuPyT5 on all publicly available Jupyter Notebook GitHub repositories and develo** a new metric: Data Science Problems (DSP). DSP is a collection of 1119 problems curated from 306 pedagogical notebooks with 92 dataset dependencies, natural language and Markdown problem descr…
▽ More
We study the feasibility of a Data Science assistant powered by a sequence-to-sequence transformer by training a new model JuPyT5 on all publicly available Jupyter Notebook GitHub repositories and develo** a new metric: Data Science Problems (DSP). DSP is a collection of 1119 problems curated from 306 pedagogical notebooks with 92 dataset dependencies, natural language and Markdown problem descriptions, and assert-based unit tests. These notebooks were designed to test university students' mastery of various Python implementations of Math and Data Science, and we now leverage them to study the ability of JuPyT5 to understand and pass the tests. We analyze the content of DSP, validate its quality, and we find that given 100 sampling attempts JuPyT5 is able to solve 77.5\% of the DSP problems. We further present various ablation and statistical analyses and compare DSP to other recent natural language to code benchmarks.
△ Less
Submitted 30 January, 2022;
originally announced January 2022.
-
Strong lensing in UNIONS: Toward a pipeline from discovery to modeling
Authors:
E. Savary,
K. Rojas,
M. Maus,
B. Clément,
F. Courbin,
R. Gavazzi,
J. H. H. Chan,
C. Lemon,
G. Vernardos,
R. Cañameras,
S. Schuldt,
S. H. Suyu,
J. -C. Cuillandre,
S. Fabbro,
S. Gwyn,
M. J. Hudson,
M. Kilbinger,
D. Scott,
C. Stone
Abstract:
We present a search for galaxy-scale strong gravitational lenses in the initial 2 500 square degrees of the Canada-France Imaging Survey (CFIS). We designed a convolutional neural network (CNN) committee that we applied to a selection of 2 344 002 exquisite-seeing $r$-band images of color-selected luminous red galaxies (LRGs). Our classification uses a realistic training set where the lensing gala…
▽ More
We present a search for galaxy-scale strong gravitational lenses in the initial 2 500 square degrees of the Canada-France Imaging Survey (CFIS). We designed a convolutional neural network (CNN) committee that we applied to a selection of 2 344 002 exquisite-seeing $r$-band images of color-selected luminous red galaxies (LRGs). Our classification uses a realistic training set where the lensing galaxies and the lensed sources are both taken from real data, namely the CFIS $r$-band images themselves and the Hubble Space Telescope (HST). A total of 9 460 candidates obtain a score above 0.5 with the CNN committee. After a visual inspection of the candidates, we find a total of 133 lens candidates, of which 104 are completely new. The set of false positives mainly contains ring, spiral, and merger galaxies, and to a lesser extent galaxies with nearby companions. We classify 32 of the lens candidates as secure lenses and 101 as maybe lenses. For the 32 highest quality lenses, we also fit a singular isothermal ellipsoid mass profile with external shear along with an elliptical Sersic profile for the lens and source light. This automated modeling step provides distributions of properties for both sources and lenses that have Einstein radii in the range $0.5\arcsec<θ_E<2.5\arcsec$. Finally, we introduce a new lens and/or source single-band deblending algorithm based on auto-encoder representation of our candidates. This is the first time an end-to-end lens-finding and modeling pipeline is assembled together, in view of future lens searches in a single band, as will be possible with Euclid.
△ Less
Submitted 24 September, 2022; v1 submitted 22 October, 2021;
originally announced October 2021.
-
Discovery of Strongly Lensed Quasars in the Ultraviolet Near Infrared Optical Northern Survey (UNIONS)
Authors:
J. H. H. Chan,
C. Lemon,
F. Courbin,
R. Gavazzi,
B. Clément,
M. Millon,
E. Paic,
K. Rojas,
E. Savary,
G. Vernardos,
J. -C. Cuillandre,
S. Fabbro,
S. Gwyn,
M. J. Hudson,
M. Kilbinger,
A. McConnachie
Abstract:
We report the discovery of five new doubly-imaged lensed quasars from the first 2500 square degrees of the ongoing Canada-France Imaging Survey (CFIS), which is a component of the Ultraviolet Near Infrared Optical Northern Survey (UNIONS), selected from initial catalogues of either Gaia pairs or MILLIQUAS quasars. We take advantage of the deep, 0.6'' median-seeing $r$-band imaging of CFIS to confi…
▽ More
We report the discovery of five new doubly-imaged lensed quasars from the first 2500 square degrees of the ongoing Canada-France Imaging Survey (CFIS), which is a component of the Ultraviolet Near Infrared Optical Northern Survey (UNIONS), selected from initial catalogues of either Gaia pairs or MILLIQUAS quasars. We take advantage of the deep, 0.6'' median-seeing $r$-band imaging of CFIS to confirm the presence of multiple point sources with similar colour of $u-r$, via convolution of the Laplacian of the point spread function. Requiring similar-colour point sources with flux ratios less than 2.5 mag in $r$-band, reduces the number of candidates from 256314 to 7815. After visual inspection we obtain 30 high-grade candidates, and prioritise spectroscopic follow-up for those showing signs of a lensing galaxy upon subtraction of the point sources. We obtain long-slit spectra for 18 candidates with ALFOSC on the 2.56-m Nordic Optical Telescope (NOT), confirming five new doubly lensed quasars with $1.21<z<3.36$ and angular separations from 0.8'' to 2.5''. One additional system is a probable lensed quasar based on the CFIS imaging and existing SDSS spectrum. We further classify six objects as nearly identical quasars -- still possible lenses but without the detection of a lensing galaxy. Given our recovery rate ($83\%$) of existing optically bright lenses within the CFIS footprint, we expect that a similar strategy, coupled with $u-r$ colour-selection from CFIS alone, will provide an efficient and complete discovery of small-separation lensed quasars of source redshifts below $z=2.7$ within the CFIS $r$-band magnitude limit of 24.1 mag.
△ Less
Submitted 1 December, 2021; v1 submitted 18 October, 2021;
originally announced October 2021.
-
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
Authors:
Colin B. Clement,
Shuai Lu,
Xiaoyu Liu,
Michele Tufano,
Dawn Drain,
Nan Duan,
Neel Sundaresan,
Alexey Svyatkovskiy
Abstract:
Statistical language modeling and translation with transformers have found many successful applications in program understanding and generation tasks, setting high benchmarks for tools in modern software development environments. The finite context window of these neural models means, however, that they will be unable to leverage the entire relevant context of large files and packages for any give…
▽ More
Statistical language modeling and translation with transformers have found many successful applications in program understanding and generation tasks, setting high benchmarks for tools in modern software development environments. The finite context window of these neural models means, however, that they will be unable to leverage the entire relevant context of large files and packages for any given task. While there are many efforts to extend the context window, we introduce an architecture-independent approach for leveraging the syntactic hierarchies of source code for incorporating entire file-level context into a fixed-length window. Using concrete syntax trees of each source file we extract syntactic hierarchies and integrate them into context window by selectively removing from view more specific, less relevant scopes for a given task. We evaluate this approach on code generation tasks and joint translation of natural language and source code in Python programming language, achieving a new state-of-the-art in code completion and summarization for Python in the CodeXGLUE benchmark. We also introduce new CodeXGLUE benchmarks for user-experience-motivated tasks: code completion with normalized literals, method body completion/code summarization conditioned on file-level context.
△ Less
Submitted 17 September, 2021;
originally announced September 2021.
-
Strong lens systems search in the Dark Energy Survey using Convolutional Neural Networks
Authors:
K. Rojas,
E. Savary,
B. Clément,
M. Maus,
F. Courbin,
C. Lemon,
J. H. H. Chan,
G. Vernardos,
R. Joseph,
R. Cañameras,
A. Galan
Abstract:
We performed a search for strong lens galaxy-scale systems in the first data release of the Dark Energy Survey (DES), from a color-selected parent sample of 18~745~029 Luminous Red Galaxies (LRGs). Our search was based on a Convolutional Neural Network (CNN) to grade our LRG selection with values between 0 (non-lens) and 1 (lens). Our training set was data-driven, i.e. using lensed sources taken f…
▽ More
We performed a search for strong lens galaxy-scale systems in the first data release of the Dark Energy Survey (DES), from a color-selected parent sample of 18~745~029 Luminous Red Galaxies (LRGs). Our search was based on a Convolutional Neural Network (CNN) to grade our LRG selection with values between 0 (non-lens) and 1 (lens). Our training set was data-driven, i.e. using lensed sources taken from HST COSMOS images and where the light distribution of the lens plane was taken directly from DES images of our LRGs. A total of 76~582 cutouts obtained a score above 0.9. These were visually inspected and resulted in two catalogs. The first one contains 405 lens candidates, where 90 present clear lensing features and counterparts, while the others 315 require more evidence, such as higher resolution images or spectra to be conclusive. A total of 186 candidates were totally new identified in this search. The second catalog includes 539 ring galaxy candidates that will be useful to train CNNs against this type of false positives. For the 90 best lens candidates we carried out color-based deblending of the lens and source light without fitting any analytical profile to the data. The method turned out to be very efficient in the deblending, even for very compact objects and for objects with very complex morphology. Finally, from the 90 best lens candidates we selected 52 systems having one single deflector, to test an automated modeling pipeline which successfully modeled 79\% of the sample within an acceptable amount of computing time.
△ Less
Submitted 31 August, 2021;
originally announced September 2021.
-
Distilling Transformers for Neural Cross-Domain Search
Authors:
Colin B. Clement,
Chen Wu,
Dawn Drain,
Neel Sundaresan
Abstract:
Pre-trained transformers have recently clinched top spots in the gamut of natural language tasks and pioneered solutions to software engineering tasks. Even information retrieval has not been immune to the charm of the transformer, though their large size and cost is generally a barrier to deployment. While there has been much work in streamlining, caching, and modifying transformer architectures…
▽ More
Pre-trained transformers have recently clinched top spots in the gamut of natural language tasks and pioneered solutions to software engineering tasks. Even information retrieval has not been immune to the charm of the transformer, though their large size and cost is generally a barrier to deployment. While there has been much work in streamlining, caching, and modifying transformer architectures for production, here we explore a new direction: distilling a large pre-trained translation model into a lightweight bi-encoder which can be efficiently cached and queried. We argue from a probabilistic perspective that sequence-to-sequence models are a conceptually ideal---albeit highly impractical---retriever. We derive a new distillation objective, implementing it as a data augmentation scheme. Using natural language source code search as a case study for cross-domain search, we demonstrate the validity of this idea by significantly improving upon the current leader of the CodeSearchNet challenge, a recent natural language code search benchmark.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Improving parametric mass modelling of lensing clusters through a perturbative approach
Authors:
Benjamin Beauchesne,
Benjamin Clément,
Johan Richard,
Jean-Paul Kneib,
-----
Abstract:
We present a new method to model the mass distribution of galaxy clusters that combines a parametric and a free-form approach to reconstruct cluster cores with strong lensing constraints. It aims at combining the advantages of both approaches, by kee** the robustness of the parametric component with an increased flexibility thanks to a free-form surface of B-spline functions. We demonstrate the…
▽ More
We present a new method to model the mass distribution of galaxy clusters that combines a parametric and a free-form approach to reconstruct cluster cores with strong lensing constraints. It aims at combining the advantages of both approaches, by kee** the robustness of the parametric component with an increased flexibility thanks to a free-form surface of B-spline functions. We demonstrate the capabilities of this new approach on the simulated cluster Hera, which has been used to evaluate lensing codes for the analysis of the Frontier Fields clusters. The method leads to better reproduction of the constraints, with an improvement by a factor $\sim3-4$ on the root-mean-square error on multiple-image positions, when compared to parametric-only approaches. The resulting models show a better accuracy in the reconstruction of the amplitude of the convergence field while conserving a high fidelity on other lensing observables already well reproduced. We make this method publicly available through its implementation in the Lenstool software.
△ Less
Submitted 7 August, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
DeepDebug: Fixing Python Bugs Using Stack Traces, Backtranslation, and Code Skeletons
Authors:
Dawn Drain,
Colin B. Clement,
Guillermo Serrato,
Neel Sundaresan
Abstract:
The joint task of bug localization and program repair is an integral part of the software development process. In this work we present DeepDebug, an approach to automated debugging using large, pretrained transformers. We begin by training a bug-creation model on reversed commit data for the purpose of generating synthetic bugs. We apply these synthetic bugs toward two ends. First, we directly tra…
▽ More
The joint task of bug localization and program repair is an integral part of the software development process. In this work we present DeepDebug, an approach to automated debugging using large, pretrained transformers. We begin by training a bug-creation model on reversed commit data for the purpose of generating synthetic bugs. We apply these synthetic bugs toward two ends. First, we directly train a backtranslation model on all functions from 200K repositories. Next, we focus on 10K repositories for which we can execute tests, and create buggy versions of all functions in those repositories that are covered by passing tests. This provides us with rich debugging information such as stack traces and print statements, which we use to finetune our model which was pretrained on raw source code. Finally, we strengthen all our models by expanding the context window beyond the buggy function itself, and adding a skeleton consisting of that function's parent class, imports, signatures, docstrings, and method bodies, in order of priority. On the QuixBugs benchmark, we increase the total number of fixes found by over 50%, while also decreasing the false positive rate from 35% to 5% and decreasing the timeout from six hours to one minute. On our own benchmark of executable tests, our model fixes 68% of all bugs on its first attempt without using traces, and after adding traces it fixes 75% on first attempt. We will open-source our framework and validation set for evaluating on executable tests.
△ Less
Submitted 19 May, 2021;
originally announced May 2021.
-
Map** of the magnetic field to correct systematic effects in a neutron electric dipole moment experiment
Authors:
C. Abel,
N. J. Ayres,
G. Ban,
G. Bison,
K. Bodek,
V. Bondar,
E. Chanel,
P. -J. Chiu,
B. Clément,
C. B. Crawford,
M. Daum,
S. Emmenegger,
L. Ferraris-Bouchez,
M. Fertl,
P. Flaux,
A. Fratangelo,
W. C. Griffith,
Z. D. Grujić,
P. G. Harris,
L. Hayen,
N. Hild,
M. Kasprzak,
K. Kirch,
P. Knowles,
H. -C. Koch
, et al. (28 additional authors not shown)
Abstract:
Experiments dedicated to the measurement of the electric dipole moment of the neutron require outstanding control of the magnetic field uniformity. The neutron electric dipole moment (nEDM) experiment at the Paul Scherrer Institute uses a 199Hg co-magnetometer to precisely monitor magnetic field variations. This co-magnetometer, in the presence of field non-uniformity, is responsible for the large…
▽ More
Experiments dedicated to the measurement of the electric dipole moment of the neutron require outstanding control of the magnetic field uniformity. The neutron electric dipole moment (nEDM) experiment at the Paul Scherrer Institute uses a 199Hg co-magnetometer to precisely monitor magnetic field variations. This co-magnetometer, in the presence of field non-uniformity, is responsible for the largest systematic effect of this measurement. To evaluate and correct that effect, offline measurements of the field non-uniformity were performed during map** campaigns in 2013, 2014 and 2017. We present the results of these campaigns, and the improvement the correction of this effect brings to the neutron electric dipole moment measurement.
△ Less
Submitted 3 May, 2022; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Tail Modulo Cons
Authors:
Frédéric Bour,
Basile Clément,
Gabriel Scherer
Abstract:
OCaml function calls consume space on the system stack. Operating systems set default limits on the stack space which are much lower than the available memory. If a program runs out of stack space, they get the dreaded "Stack Overflow" exception -- they crash. As a result, OCaml programmers have to be careful, when they write recursive functions, to remain in the so-called _tail-recursive_ fragmen…
▽ More
OCaml function calls consume space on the system stack. Operating systems set default limits on the stack space which are much lower than the available memory. If a program runs out of stack space, they get the dreaded "Stack Overflow" exception -- they crash. As a result, OCaml programmers have to be careful, when they write recursive functions, to remain in the so-called _tail-recursive_ fragment, using _tail_ calls that do not consume stack space.
This discipline is a source of difficulties for both beginners and experts. Beginners have to be taught recursion, and then tail-recursion. Experts disagree on the "right" way to write `List.map`. The direct version is beautiful but not tail-recursive, so it crashes on larger inputs. The naive tail-recursive transformation is (slightly) slower than the direct version, and experts may want to avoid that cost. Some libraries propose horrible implementations, unrolling code by hand, to compensate for this performance loss. In general, tail-recursion requires the programmer to manually perform sophisticated program transformations.
In this work we propose an implementation of "Tail Modulo Cons" (TMC) for OCaml. TMC is a program transformation for a fragment of non-tail-recursive functions, that rewrites them in _destination-passing style_. The supported fragment is smaller than other approaches such as continuation-passing-style, but the performance of the transformed code is on par with the direct, non-tail-recursive version. Many useful functions that traverse a recursive datastructure and rebuild another recursive structure are in the TMC fragment, in particular `List.map` (and `List.filter`, `List.append`, etc.). Finally those functions can be written in a way that is beautiful, correct on all inputs, and efficient.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.
-
Johnson-Nyquist Noise Effects in Neutron Electric-Dipole-Moment Experiments
Authors:
N. J. Ayres,
G. Ban,
G. Bison,
K. Bodek,
V. Bondar,
P. -J. Chiu,
B. Clement,
C. B. Crawford,
M. Daum,
S. Emmenegger,
M. Fertl,
A. Fratangelo,
W. C. Griffith,
Z. D. Grujić,
P. G. Harris,
K. Kirch,
P. A. Koss,
B. Lauss,
T. Lefort,
P. Mohanmurthy,
O. Naviliat-Cuncic,
D. Pais,
F. M. Piegsa,
G. Pignol,
D. Rebreyend
, et al. (15 additional authors not shown)
Abstract:
Magnetic Johnson-Nyquist noise (JNN) originating from metal electrodes, used to create a static electric field in neutron electric-dipole-moment (nEDM) experiments, may limit the sensitivity of measurements. We present here the first dedicated study on JNN applied to a large-scale long-measurement-time experiment with the implementation of a co-magnetometry. In this study, we derive surface- and v…
▽ More
Magnetic Johnson-Nyquist noise (JNN) originating from metal electrodes, used to create a static electric field in neutron electric-dipole-moment (nEDM) experiments, may limit the sensitivity of measurements. We present here the first dedicated study on JNN applied to a large-scale long-measurement-time experiment with the implementation of a co-magnetometry. In this study, we derive surface- and volume-averaged root-mean-square normal noise amplitudes at a certain frequency bandwidth for a cylindrical geometry. In addition, we model the source of noise as a finite number of current dipoles and demonstrate a method to simulate temporal and three-dimensional spatial dependencies of JNN. The calculations are applied to estimate the impact of JNN on measurements with the new apparatus, n2EDM, at the Paul Scherrer Institute. We demonstrate that the performances of the optically pumped $^{133}$Cs magnetometers and $^{199}$Hg co-magnetometers, which will be used in the apparatus, are not limited by JNN. Further, we find that in measurements deploying a co-magnetometer system, the impact of JNN is negligible for nEDM searches down to a sensitivity of $4\,\times\,10^{-28}\,e\cdot{\rm cm}$ in a single measurement; therefore, the use of economically and mechanically favored solid aluminum electrodes is possible.
△ Less
Submitted 9 July, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Learning-based vs Model-free Adaptive Control of a MAV under Wind Gust
Authors:
Thomas Chaffre,
Julien Moras,
Adrien Chan-Hon-Tong,
Julien Marzat,
Karl Sammut,
Gilles Le Chenadec,
Benoit Clement
Abstract:
Navigation problems under unknown varying conditions are among the most important and well-studied problems in the control field. Classic model-based adaptive control methods can be applied only when a convenient model of the plant or environment is provided. Recent model-free adaptive control methods aim at removing this dependency by learning the physical characteristics of the plant and/or proc…
▽ More
Navigation problems under unknown varying conditions are among the most important and well-studied problems in the control field. Classic model-based adaptive control methods can be applied only when a convenient model of the plant or environment is provided. Recent model-free adaptive control methods aim at removing this dependency by learning the physical characteristics of the plant and/or process directly from sensor feedback. Although there have been prior attempts at improving these techniques, it remains an open question as to whether it is possible to cope with real-world uncertainties in a control system that is fully based on either paradigm. We propose a conceptually simple learning-based approach composed of a full state feedback controller, tuned robustly by a deep reinforcement learning framework based on the Soft Actor-Critic algorithm. We compare it, in realistic simulations, to a model-free controller that uses the same deep reinforcement learning framework for the control of a micro aerial vehicle under wind gust. The results indicate the great potential of learning-based adaptive control methods in modern dynamical systems.
△ Less
Submitted 5 July, 2021; v1 submitted 29 January, 2021;
originally announced January 2021.
-
The design of the n2EDM experiment
Authors:
N. J. Ayres,
G. Ban,
L. Bienstman,
G. Bison,
K. Bodek,
V. Bondar,
T. Bouillaud,
E. Chanel,
J. Chen,
P. -J. Chiu,
B. Clément,
C. Crawford,
M. Daum,
B. Dechenaux,
C. B. Doorenbos,
S. Emmenegger,
L. Ferraris-Bouchez,
M. Fertl,
A. Fratangelo,
P. Flaux,
D. Goupillière,
W. C. Griffith,
Z. D. Grujic,
P. G. Harris,
K. Kirch
, et al. (36 additional authors not shown)
Abstract:
We present the design of a next-generation experiment, n2EDM, currently under construction at the ultracold neutron source at the Paul Scherrer Institute (PSI) with the aim of carrying out a high-precision search for an electric dipole moment of the neutron. The project builds on experience gained with the previous apparatus operated at PSI until 2017, and is expected to deliver an order of magnit…
▽ More
We present the design of a next-generation experiment, n2EDM, currently under construction at the ultracold neutron source at the Paul Scherrer Institute (PSI) with the aim of carrying out a high-precision search for an electric dipole moment of the neutron. The project builds on experience gained with the previous apparatus operated at PSI until 2017, and is expected to deliver an order of magnitude better sensitivity with provision for further substantial improvements. An overview is given of the experimental method and setup, the sensitivity requirements for the apparatus are derived, and its technical design is described.
△ Less
Submitted 22 January, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
ALMA 1.3 mm Survey of Lensed Submillimeter Galaxies (SMGs) Selected by Herschel: Discovery of Spatially Extended SMGs and Implications
Authors:
Fengwu Sun,
Eiichi Egami,
Timothy D. Rawle,
Gregory L. Walth,
Ian Smail,
Miroslava Dessauges-Zavadsky,
Pablo G. Perez-Gonzalez,
Johan Richard,
Francoise Combes,
H. Ebeling,
Roser Pello,
Paul P. van der Werf,
B. Altieri,
Frederic Boone,
Antonio Cava,
Scott C. Chapman,
Benjamin Clement,
Alexis Finoguenov,
Kimihiko Nakajima,
Wiphu Rujopakarn,
Daniel Schaerer,
Ivan Valtchanov
Abstract:
We present an ALMA 1.3 mm (Band 6) continuum survey of lensed submillimeter galaxies (SMGs) at $z=1.0\sim3.2$ with an angular resolution of $\sim0.2$". These galaxies were uncovered by the Herschel Lensing Survey (HLS), and feature exceptionally bright far-infrared continuum emission ($S_\mathrm{peak} \gtrsim 90$ mJy) owing to their lensing magnification. We detect 29 sources in 20 fields of massi…
▽ More
We present an ALMA 1.3 mm (Band 6) continuum survey of lensed submillimeter galaxies (SMGs) at $z=1.0\sim3.2$ with an angular resolution of $\sim0.2$". These galaxies were uncovered by the Herschel Lensing Survey (HLS), and feature exceptionally bright far-infrared continuum emission ($S_\mathrm{peak} \gtrsim 90$ mJy) owing to their lensing magnification. We detect 29 sources in 20 fields of massive galaxy clusters with ALMA. Using both the Spitzer/IRAC (3.6/4.5 $\mathrm{μm}$) and ALMA data, we have successfully modeled the surface brightness profiles of 26 sources in the rest-frame near- and far-infrared. Similar to previous studies, we find the median dust-to-stellar continuum size ratio to be small ($R_\mathrm{e,dust}/R_\mathrm{e,star} = 0.38\pm0.14$) for the observed SMGs, indicating that star formation is centrally concentrated. This is, however, not the case for two spatially extended main-sequence SMGs with a low surface brightness at 1.3 mm ($\lesssim 0.1$ mJy arcsec$^{-2}$), in which the star formation is distributed over the entire galaxy ($R_\mathrm{e,dust}/R_\mathrm{e,star}>1$). As a whole, our SMG sample shows a tight anti-correlation between ($R_\mathrm{e,dust}/R_\mathrm{e,star}$) and far-infrared surface brightness ($Σ_\mathrm{IR}$) over a factor of $\simeq$ 1000 in $Σ_\mathrm{IR}$. This indicates that SMGs with less vigorous star formation (i.e., lower $Σ_\mathrm{IR}$) lack central starburst and are likely to retain a broader spatial distribution of star formation over the whole galaxies (i.e., larger $R_\mathrm{e,dust}/R_\mathrm{e,star}$). The same trend can be reproduced with cosmological simulations as a result of central starburst and potentially subsequent "inside-out" quenching, which likely accounts for the emergence of compact quiescent galaxies at $z\sim2$.
△ Less
Submitted 10 January, 2021;
originally announced January 2021.
-
PyMT5: multi-mode translation of natural language and Python code with transformers
Authors:
Colin B. Clement,
Dawn Drain,
Jonathan Timcheck,
Alexey Svyatkovskiy,
Neel Sundaresan
Abstract:
Simultaneously modeling source code and natural language has many exciting applications in automated software development and understanding. Pursuant to achieving such technology, we introduce PyMT5, the Python method text-to-text transfer transformer, which is trained to translate between all pairs of Python method feature combinations: a single model that can both predict whole methods from natu…
▽ More
Simultaneously modeling source code and natural language has many exciting applications in automated software development and understanding. Pursuant to achieving such technology, we introduce PyMT5, the Python method text-to-text transfer transformer, which is trained to translate between all pairs of Python method feature combinations: a single model that can both predict whole methods from natural language documentation strings (docstrings) and summarize code into docstrings of any common style. We present an analysis and modeling effort of a large-scale parallel corpus of 26 million Python methods and 7.7 million method-docstring pairs, demonstrating that for docstring and method generation, PyMT5 outperforms similarly-sized auto-regressive language models (GPT2) which were English pre-trained or randomly initialized. On the CodeSearchNet test set, our best model predicts 92.1% syntactically correct method bodies, achieved a BLEU score of 8.59 for method generation and 16.3 for docstring generation (summarization), and achieved a ROUGE-L F-score of 24.8 for method generation and 36.7 for docstring generation.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
An Atlas of MUSE Observations towards Twelve Massive Lensing Clusters
Authors:
Johan Richard,
Adélaïde Claeyssens,
David J. Lagattuta,
Lucia Guaita,
Franz E. Bauer,
Roser Pello,
David Carton,
Roland Bacon,
Geneviève Soucail,
Gonzalo Prieto Lyon,
Jean-Paul Kneib,
Guillaume Mahler,
Benjamin Clément,
Wilfried Mercier,
Andrei Variu,
Amélie Tamone,
Harald Ebeling,
Kasper B. Schmidt,
Themiya Nanayakkara,
Michael Maseda,
Peter M. Weilbacher,
Nicolas Bouché,
Rychard J. Bouwens,
Lutz Wisotzki,
Geoffroy de la Vieuville
, et al. (3 additional authors not shown)
Abstract:
Spectroscopic surveys of massive galaxy clusters reveal the properties of faint background galaxies, thanks to the magnification provided by strong gravitational lensing. We present a systematic analysis of integral-field-spectroscopy observations of 12 massive clusters, conducted with the Multi Unit Spectroscopic Explorer (MUSE). All data were taken under very good seeing conditions (0.6") in eff…
▽ More
Spectroscopic surveys of massive galaxy clusters reveal the properties of faint background galaxies, thanks to the magnification provided by strong gravitational lensing. We present a systematic analysis of integral-field-spectroscopy observations of 12 massive clusters, conducted with the Multi Unit Spectroscopic Explorer (MUSE). All data were taken under very good seeing conditions (0.6") in effective exposure times between two and 15 hrs per pointing, for a total of 125 hrs. Our observations cover a total solid angle of ~23 arcmin$^2$ in the direction of clusters, many of which were previously studied by the MACS, Frontier Fields, GLASS and CLASH programs. The achieved emission line detection limit at 5$σ$ for a point source varies between (0.77--1.5)$\times$10$^{-18}$ erg\,s$^{-1}$\,cm$^{-2}$ at 7000Å. We present our developed strategy to reduce these observational data, detect sources and determine their redshifts. We construct robust mass models for each cluster to further confirm our redshift measurements using strong-lensing constraints, and identify a total of 312 strongly lensed sources producing 939 multiple images. The final redshift catalogs contain more than 3300 robust redshifts, of which 40\% are for cluster members and $\sim$30\% for lensed Lyman-$α$ emitters. 14\% of all sources are line emitters not seen in the available HST images, even at the depth of the FFs ($\sim29$ AB). We find that the magnification distribution of the lensed sources in the high-magnification regime ($μ{=}$ 2--25) follows the theoretical expectation of $N(z)\proptoμ^{-2}$. The quality of this dataset, number of lensed sources, and number of strong-lensing constraints enables detailed studies of the physical properties of both the lensing cluster and the background galaxies. The full data products from this work are made available to the community. [abridged]
△ Less
Submitted 1 January, 2021; v1 submitted 21 September, 2020;
originally announced September 2020.
-
Measurement of the permanent electric dipole moment of the neutron
Authors:
C. Abel,
S. Afach,
N. J. Ayres,
C. A. Baker,
G. Ban,
G. Bison,
K. Bodek,
V. Bondar,
M. Burghoff,
E. Chanel,
Z. Chowdhuri,
P. -J. Chiu,
B. Clement,
C. B. Crawford,
M. Daum,
S. Emmenegger,
L. Ferraris-Bouchez,
M. Fertl,
P. Flaux,
B. Franke,
A. Fratangelo,
P. Geltenbort,
K. Green,
W. C. Griffith,
M. van der Grinten
, et al. (59 additional authors not shown)
Abstract:
We present the result of an experiment to measure the electric dipole moment (EDM) of the neutron at the Paul Scherrer Institute using Ramsey's method of separated oscillating magnetic fields with ultracold neutrons (UCN). Our measurement stands in the long history of EDM experiments probing physics violating time reversal invariance. The salient features of this experiment were the use of a Hg-19…
▽ More
We present the result of an experiment to measure the electric dipole moment (EDM) of the neutron at the Paul Scherrer Institute using Ramsey's method of separated oscillating magnetic fields with ultracold neutrons (UCN). Our measurement stands in the long history of EDM experiments probing physics violating time reversal invariance. The salient features of this experiment were the use of a Hg-199 co-magnetometer and an array of optically pumped cesium vapor magnetometers to cancel and correct for magnetic field changes. The statistical analysis was performed on blinded datasets by two separate groups while the estimation of systematic effects profited from an unprecedented knowledge of the magnetic field. The measured value of the neutron EDM is $d_{\rm n} = (0.0\pm1.1_{\rm stat}\pm0.2_{\rm sys})\times10^{-26}e\,{\rm cm}$.
△ Less
Submitted 31 January, 2020;
originally announced January 2020.
-
The BUFFALO HST Survey
Authors:
Charles L. Steinhardt,
Mathilde Jauzac,
Ana Acebron,
Hakim Atek,
Peter Capak,
Iary Davidzon,
Dominique Eckert,
David Harvey,
Anton M. Koekemoer,
Claudia D. P. Lagos,
Guillaume Mahler,
Mireia Montes,
Anna Niemiec,
Mario Nonino,
P. A. Oesch,
Johan Richard,
Steven A. Rodney,
Matthieu Schaller,
Keren Sharon,
Louis-Gregory Strolger,
Joseph Allingham,
Adam Amara,
Yannick Bah'e,
Celine Boehm,
Sownak Bose
, et al. (70 additional authors not shown)
Abstract:
The Beyond Ultra-deep Frontier Fields and Legacy Observations (BUFFALO) is a 101 orbit + 101 parallel Cycle 25 Hubble Space Telescope Treasury program taking data from 2018-2020. BUFFALO will expand existing coverage of the Hubble Frontier Fields (HFF) in WFC3/IR F105W, F125W, and F160W and ACS/WFC F606W and F814W around each of the six HFF clusters and flanking fields. This additional area has no…
▽ More
The Beyond Ultra-deep Frontier Fields and Legacy Observations (BUFFALO) is a 101 orbit + 101 parallel Cycle 25 Hubble Space Telescope Treasury program taking data from 2018-2020. BUFFALO will expand existing coverage of the Hubble Frontier Fields (HFF) in WFC3/IR F105W, F125W, and F160W and ACS/WFC F606W and F814W around each of the six HFF clusters and flanking fields. This additional area has not been observed by HST but is already covered by deep multi-wavelength datasets, including Spitzer and Chandra. As with the original HFF program, BUFFALO is designed to take advantage of gravitational lensing from massive clusters to simultaneously find high-redshift galaxies which would otherwise lie below HST detection limits and model foreground clusters to study properties of dark matter and galaxy assembly. The expanded area will provide a first opportunity to study both cosmic variance at high redshift and galaxy assembly in the outskirts of the large HFF clusters. Five additional orbits are reserved for transient followup. BUFFALO data including mosaics, value-added catalogs and cluster mass distribution models will be released via MAST on a regular basis, as the observations and analysis are completed for the six individual clusters.
△ Less
Submitted 13 February, 2020; v1 submitted 27 January, 2020;
originally announced January 2020.
-
Visualizing probabilistic models in Minkowski space with intensive symmetrized Kullback-Leibler embedding
Authors:
Han Kheng Teoh,
Katherine N. Quinn,
Jaron Kent-Dobias,
Colin B. Clement,
Qingyang Xu,
James P. Sethna
Abstract:
We show that the predicted probability distributions for any $N$-parameter statistical model taking the form of an exponential family can be explicitly and analytically embedded isometrically in a $N{+}N$-dimensional Minkowski space. That is, the model predictions can be visualized as control parameters are varied, preserving the natural distance between probability distributions. All pairwise dis…
▽ More
We show that the predicted probability distributions for any $N$-parameter statistical model taking the form of an exponential family can be explicitly and analytically embedded isometrically in a $N{+}N$-dimensional Minkowski space. That is, the model predictions can be visualized as control parameters are varied, preserving the natural distance between probability distributions. All pairwise distances between model instances are given by the symmetrized Kullback-Leibler divergence. We give formulas for these intensive symmetrized Kullback Leibler (isKL) coordinate embeddings, and illustrate the resulting visualizations with the Bernoulli (coin toss) problem, the ideal gas, $n$ sided die, the nonlinear least squares fit, and the Gaussian fit. We highlight how isKL can be used to determine the minimum number of parameters needed to describe probabilistic data, and conclude by visualizing the prediction space of the two-dimensional Ising model, where we examine the manifold behavior near its critical point.
△ Less
Submitted 2 July, 2020; v1 submitted 12 December, 2019;
originally announced December 2019.
-
Reconstruction of Current Densities from Magnetic Images by Bayesian Inference
Authors:
Colin B. Clement,
James P. Sethna,
Katja C. Nowack
Abstract:
Electronic transport is at the heart of many phenomena in condensed matter physics and material science. Magnetic imaging is a non-invasive tool for detecting electric current in materials and devices. A two-dimensional current density can be reconstructed from an image of a single component of the magnetic field produced by the current. In this work, we approach the reconstruction problem in the…
▽ More
Electronic transport is at the heart of many phenomena in condensed matter physics and material science. Magnetic imaging is a non-invasive tool for detecting electric current in materials and devices. A two-dimensional current density can be reconstructed from an image of a single component of the magnetic field produced by the current. In this work, we approach the reconstruction problem in the framework of Bayesian inference, i.e. we solve for the most likely current density given an image obtained by a magnetic probe. To enforce a sensible current density priors are used to associate a cost with unphysical features such as pixel-to-pixel oscillations or current outside the device boundary. Beyond previous work, our approach does not require analytically tractable priors and therefore creates flexibility to use priors that have not been explored in the context of current reconstruction. Here, we implement several such priors that have desirable properties. A challenging aspect of imposing a prior is choosing the optimal strength. We describe an empirical way to determine the appropriate strength of the prior. We test our approach on numerically generated examples. Our code is released in an open-source \texttt{python} package called \texttt{pysquid}.
△ Less
Submitted 6 July, 2021; v1 submitted 28 October, 2019;
originally announced October 2019.
-
Spectral variations of Lyman-alpha emission within strongly lensed sources observed with MUSE
Authors:
Adélaïde Claeyssens,
Johan Richard,
Jéméry Blaizot,
Thibault Garel,
Floriane Leclercq,
Vera Patricio,
Anne Verhamme,
Lutz Wisotzki,
Roland Bacon,
David Carton,
Benjamin Clément,
Edmund Christian Herenz,
Raffaella Anna Marino,
Sowgat Muzahid,
Rike Saust,
Joop Schaye
Abstract:
We present an analysis of HI Lyman-alpha emission in deep VLT/MUSE observations of two highly magnified and extended galaxies at z=3.5 and 4.03, including a newly discovered, almost complete Einstein ring. While these Lyman-alpha haloes are intrinsically similar to the ones typically seen in other MUSE deep fields, the benefits of gravitational lensing allows us to construct exceptionally detailed…
▽ More
We present an analysis of HI Lyman-alpha emission in deep VLT/MUSE observations of two highly magnified and extended galaxies at z=3.5 and 4.03, including a newly discovered, almost complete Einstein ring. While these Lyman-alpha haloes are intrinsically similar to the ones typically seen in other MUSE deep fields, the benefits of gravitational lensing allows us to construct exceptionally detailed maps of Lyman-alpha line properties at sub-kpc scales. By combining all multiple images, we are able to observe complex structures in the Lyman-alpha emission and uncover small (~ 120 km/s in Lyman-alpha peak shift), but significant at > 4 sigma, systematic variations in the shape of the Lyman-alpha line profile within each halo. Indeed, we observe a global trend for the line peak shift to become redder at large radii, together with a strong correlation between the peak wavelength and line width. This systematic intrahalo variation is markedly similar to the object-to-object variations obtained from the integrated properties of recent large samples. Regions of high surface brightness correspond to relatively small line shifts, which could indicate that Lyman-alpha emission escapes preferentially from regions where the line profile has been less severely affected by scattering of Lyman-alpha photons.
△ Less
Submitted 4 September, 2019;
originally announced September 2019.
-
Faint end of the $z \sim 3-7$ luminosity function of Lyman-alpha emitters behind lensing clusters observed with MUSE
Authors:
G. de La Vieuville,
D. Bina,
R. Pello,
G. Mahler,
J. Richard,
A. B. Drake,
E. C. Herenz,
F. E. Bauer,
B. Clément,
D. Lagattuta,
N. Laporte,
J. Martinez,
V. Patriìcio,
L. Wisotzki,
J. Zabl,
R. J. Bouwens,
T. Contini,
T. Garel,
B. Guiderdoni,
R. A. Marino,
M. V. Maseda,
J. Matthee,
J. Schaye,
G. Soucail
Abstract:
We present the results obtained with VLT/MUSE on the faint-end of the Lyman-alpha luminosity function (LF) based on deep observations of four lensing clusters. The precise aim of the present study is to further constrain the abundance of Lyman-alpha emitters (LAEs) by taking advantage of the magnification provided by lensing clusters. We blindly selected a sample of 156 LAEs, with redshifts betwee…
▽ More
We present the results obtained with VLT/MUSE on the faint-end of the Lyman-alpha luminosity function (LF) based on deep observations of four lensing clusters. The precise aim of the present study is to further constrain the abundance of Lyman-alpha emitters (LAEs) by taking advantage of the magnification provided by lensing clusters. We blindly selected a sample of 156 LAEs, with redshifts between $2.9 \le z \le 6.7$ and magnification-corrected luminosities in the range $ 39 \lesssim \log L_{Ly_α}$ [erg s$^{-1}$] $\lesssim 43$. The price to pay to benefit from magnification is a reduction of the effective volume of the survey, together with a more complex analysis procedure. To properly take into account the individual differences in detection conditions (including lensing configurations, spatial and spectral morphologies) when computing the LF, a new method based on the 1/Vmax approach was implemented. The LAE LF has been obtained in four different redshift bins with constraints down to $\log L_{Ly_α} = 40.5$. From our data only, no significant evolution of LF mean slope can be found. When performing a Schechter analysis including data from the literature to complete the present sample a steep faint-end slope was measured varying from $α= -1.69^{+0.08}_{-0.08}$ to $α= -1.87^{+0.12}_{-0.12}$ between the lowest and the highest redshift bins. The contribution of the LAE population to the star formation rate density at $z \sim 6$ is $\lesssim 50$% depending on the luminosity limit considered, which is of the same order as the Lyman-break galaxy (LBG) contribution. The evolution of the LAE contribution with redshift depends on the assumed escape fraction of Lyman-alpha photons, and appears to slightly increase with increasing redshift when this fraction is conservatively set to one. (abridged)
△ Less
Submitted 29 July, 2019; v1 submitted 31 May, 2019;
originally announced May 2019.
-
On the Use of ArXiv as a Dataset
Authors:
Colin B. Clement,
Matthew Bierbaum,
Kevin P. O'Keeffe,
Alexander A. Alemi
Abstract:
The arXiv has collected 1.5 million pre-print articles over 28 years, hosting literature from scientific fields including Physics, Mathematics, and Computer Science. Each pre-print features text, figures, authors, citations, categories, and other metadata. These rich, multi-modal features, combined with the natural graph structure---created by citation, affiliation, and co-authorship---makes the a…
▽ More
The arXiv has collected 1.5 million pre-print articles over 28 years, hosting literature from scientific fields including Physics, Mathematics, and Computer Science. Each pre-print features text, figures, authors, citations, categories, and other metadata. These rich, multi-modal features, combined with the natural graph structure---created by citation, affiliation, and co-authorship---makes the arXiv an exciting candidate for benchmarking next-generation models. Here we take the first necessary steps toward this goal, by providing a pipeline which standardizes and simplifies access to the arXiv's publicly available data. We use this pipeline to extract and analyze a 6.7 million edge citation graph, with an 11 billion word corpus of full-text research articles. We present some baseline classification results, and motivate application of more exciting generative graph models.
△ Less
Submitted 30 April, 2019;
originally announced May 2019.
-
Infrared Galaxies in the Field of the Massive Cluster Abell S1063: Discovery of a Luminous Kiloparsec-Sized HII Region in a Gravitationally Lensed IR-Luminous Galaxy at $z=0.6$
Authors:
Gregory L. Walth,
Eiichi Egami,
Benjamin Clément,
Timothy D. Rawle,
Marie Rex,
Johan Richard,
Pablo Pérez-González,
Frédéric Boone,
Miroslava Dessauges-Zavadsky,
Jeff Portouw,
Benjamin Weiner,
Ian McGreer,
Evan Schneider
Abstract:
Using the Spitzer Space Telescope and Herschel Space Observatory, we have conducted a survey of infrared galaxies in the field of the galaxy cluster Abell S1063 (AS1063) at $z=0.347$, which is one of the most massive clusters known and a target of the HST CLASH and Frontier-Field surveys. The Spitzer/MIPS 24 $μ$m and Herschel/PACS & SPIRE images revealed that the core of AS1063 is surprisingly dev…
▽ More
Using the Spitzer Space Telescope and Herschel Space Observatory, we have conducted a survey of infrared galaxies in the field of the galaxy cluster Abell S1063 (AS1063) at $z=0.347$, which is one of the most massive clusters known and a target of the HST CLASH and Frontier-Field surveys. The Spitzer/MIPS 24 $μ$m and Herschel/PACS & SPIRE images revealed that the core of AS1063 is surprisingly devoid of infrared sources, showing only a few detectable sources within the central r$\sim1^{\prime}$. There is, however, one particularly bright source (2.3 mJy at 24 $μ$m; 106 mJy at 160 $μ$m), which corresponds to a background galaxy at $z=0.61$. The modest magnification factor (4.0$\times$) implies that this galaxy is intrinsically IR-luminous (L$_{\rm IR}=3.1\times10^{11}\ \rm L_{\odot}$). What is particularly interesting about this galaxy is that HST optical/near-infrared images show a remarkably bright and large (1 kpc) clump at one edge of the disk. Our follow-up optical/near-infrared spectroscopy shows Balmer (H$α$-H8) and forbidden emission from this clump ([OII] $λ$3727, [OIII] $λλ$4959,5007, [NII] $λλ$6548,6583), indicating that it is a HII region. The HII region appears to have formed in-situ, as kinematically it is part of a rotating disk, and there is no evidence of nearby interacting galaxies. With an extinction correction of A$_{\rm V}=1.5$ mag, the star formation rate of this giant HII region is $\sim$10 M$_{\odot}$ yr$^{-1}$, which is exceptionally large, even for high redshift HII regions. Such a large and luminous HII region is often seen at $z\sim2$ but quite rare in the nearby Universe.
△ Less
Submitted 15 April, 2019;
originally announced April 2019.
-
On the Representation of Partially Specified Implementations and its Application to the Optimization of Linear Algebra Kernels on GPU
Authors:
Ulysse Beaugnon,
Basile Clément,
Nicolas Tollenaere,
Albert Cohen
Abstract:
Traditional optimizing compilers rely on rewrite rules to iteratively apply program transformations. This iterative approach hides optimization opportunities behind intermediate transformation steps. For instance, vectorization can only be applied to the innermost loop in a nest: one must first perform a loop interchange before even considering vectorization of an outer loop. In contrast, we propo…
▽ More
Traditional optimizing compilers rely on rewrite rules to iteratively apply program transformations. This iterative approach hides optimization opportunities behind intermediate transformation steps. For instance, vectorization can only be applied to the innermost loop in a nest: one must first perform a loop interchange before even considering vectorization of an outer loop. In contrast, we propose an implementation framework representing programs as sets of possible implementation decisions. Specifying one decision can have an impact on others in a bidirectional manner: specifying that a loop must be vectorized prevents other loops from being nested inside it; conversely, specifying a loop as an outer loop will prevent it from being vectorized. These optimization decisions commute, obviating the pass ordering problem. We present a constraint programming system to formally define, represent and explore such implementation spaces. We also propose an exploration strategy combining tree search and branch-and-bound; the strength and novelty of this strategy reside in an analytical model of the lower bound on the execution time of a set of possible implementations. We showcase our approach on the construction and exploration of an implementation space for linear algebra kernels running on GPUs. We show this search space is expressive enough to represent complex decisions that fundamentally change the structure of the generated code. We also present preliminary results competitive with the performance of native GPU libraries.
△ Less
Submitted 6 April, 2019;
originally announced April 2019.
-
Probing 3D Structure with a Large MUSE Mosaic: Extending the Mass Model of Frontier Field Abell 370
Authors:
David J. Lagattuta,
Johan Richard,
Franz E. Bauer,
Benjamin Clément,
Guillaume Mahler,
Geneviève Soucail,
David Carton,
Jean-Paul Kneib,
Nicolas Laporte,
Johany Martinez,
Vera Patrício,
Anna V. Payne,
Roser Pelló,
Kasper B. Schmidt,
Geoffroy de la Vieuville
Abstract:
We present an updated strong-lensing analysis of the massive cluster Abell 370 (A370), continuing the work first presented in Lagattuta et al. (2017). In this new analysis, we take advantage of the deeper imaging data from the Hubble Space Telescope (HST) Frontier Fields program, as well as a large spectroscopic mosaic obtained with the Multi-Unit Spectroscopic Explorer (MUSE). Thanks to the exten…
▽ More
We present an updated strong-lensing analysis of the massive cluster Abell 370 (A370), continuing the work first presented in Lagattuta et al. (2017). In this new analysis, we take advantage of the deeper imaging data from the Hubble Space Telescope (HST) Frontier Fields program, as well as a large spectroscopic mosaic obtained with the Multi-Unit Spectroscopic Explorer (MUSE). Thanks to the extended coverage of this mosaic, we probe the full 3D distribution of galaxies in the field, giving us a unique picture of the extended structure of the cluster and its surroundings. Our final catalog contains 584 redshifts, representing the largest spectroscopic catalog of A370 to date. Constructing the model, we measure a total mass distribution that is quantitatively similar to our previous work -- though to ensure a low rms error in the model fit, we invoke a significantly large external shear term. Using the redshift catalog, we search for other bound groups of galaxies, which may give rise to a more physical interpretation of this shear. We identify three structures in narrow redshift ranges along the line of sight, highlighting possible infalling substructures into the main cluster halo. We also discover additional substructure candidates in low-resolution imaging at larger projected radii. More spectroscopic coverage of these regions (pushing close to the A370 virial radius) and more extended, high-resolution imaging will be required to investigate this possibility, further advancing the analysis of these interesting developments.
△ Less
Submitted 3 April, 2019;
originally announced April 2019.