Robust Errant Beam Prognostics with Conditional Modeling for Particle Accelerators

Authors: Kishansingh Rajput, Malachi Schram, Willem Blokland, Yasir Alanazi, Pradeep Ramuhalli, Alexander Zhukov, Charles Peters, Ricardo Vilalta

Abstract: Particle accelerators are complex and comprise thousands of components, with many pieces of equipment running at their peak power. Consequently, particle accelerators can fault and abort operations for numerous reasons. These faults impact the availability of particle accelerators during scheduled run-time and hamper the efficiency and the overall science output. To avoid these faults, we apply anomaly detection techniques to predict any unusual behavior and perform preemptive actions to improve the total availability of particle accelerators. Semi-supervised Machine Learning (ML) based anomaly detection approaches such as autoencoders and variational autoencoders are often used for such tasks. However, supervised ML techniques such as Siamese Neural Network (SNN) models can outperform unsupervised or semi-supervised approaches for anomaly detection by leveraging the label information. One of the challenges specific to anomaly detection for particle accelerators is the data's variability due to system configuration changes. To address this challenge, we employ Conditional Siamese Neural Network (CSNN) models and Conditional Variational Auto Encoder (CVAE) models to predict errant beam pulses at the Spallation Neutron Source (SNS) under different system configuration conditions and compare their performance. We demonstrate that CSNN outperforms CVAE in our application.

Submitted 19 February, 2024; v1 submitted 22 November, 2023; originally announced December 2023.

Comments: Under review at Machine Learning: Science and Technology Journal

arXiv:2303.14090 [pdf, ps, other]

Physics-informed neural networks in the recreation of hydrodynamic simulations from dark matter

Authors: Zhenyu Dai, Ben Moews, Ricardo Vilalta, Romeel Dave

Abstract: Physics-informed neural networks have emerged as a coherent framework for building predictive models that combine statistical patterns with domain knowledge. The underlying notion is to enrich the optimization loss function with known relationships to constrain the space of possible solutions. Hydrodynamic simulations are a core constituent of modern cosmology, while the required computations are both expensive and time-consuming. At the same time, the comparatively fast simulation of dark matter requires fewer resources, which has led to the emergence of machine learning algorithms for baryon inpainting as an active area of research; here, recreating the scatter found in hydrodynamic simulations is an ongoing challenge. This paper presents the first application of physics-informed neural networks to baryon inpainting by combining advances in neural network architectures with physical constraints, injecting theory on baryon conversion efficiency into the model loss function. We also introduce a punitive prediction comparison based on the Kullback-Leibler divergence, which enforces scatter reproduction. By simultaneously extracting the complete set of baryonic properties for the Simba suite of cosmological simulations, our results demonstrate improved accuracy of baryonic predictions based on dark matter halo properties, successful recovery of the fundamental metallicity relation, and retrieval of scatter that traces the target simulation's distribution.

Submitted 19 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

arXiv:2203.13876 [pdf, other]

doi 10.1103/PhysRevC.107.054911

Mapping out the thermodynamic stability of a QCD equation of state with a critical point using active learning

Authors: D. Mroczek, M. Hjorth-Jensen, J. Noronha-Hostler, P. Parotto, C. Ratti, R. Vilalta

Abstract: The Beam Energy Scan Theory (BEST) collaboration's equation of state (EoS) incorporates a 3D Ising model critical point into the Quantum Chromodynamics (QCD) equation of state from lattice simulations. However, it contains 4 free parameters related to the size and location of the critical region in the QCD phase diagram. Certain combinations of the free parameters lead to acausal or unstable realizations of the EoS that should not be considered. In this work, we use an active learning framework to rule out pathological EoS efficiently. We find that checking stability and causality for a small portion of the parameters' range is sufficient to construct algorithms that perform with $>$96% accuracy across the entire parameter space. Though in this work we focus on a specific case, our approach can be generalized to any EoS containing a parameter space-class correspondence.

Submitted 29 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

Comments: 14 pages, 9 figures

arXiv:2111.01815 [pdf, other]

doi 10.3847/2041-8213/ac7054

A New Constraint on the Nuclear Equation of State from Statistical Distributions of Compact Remnants of Supernovae

Authors: Mikhail M. Meskhi, Noah E. Wolfe, Zhenyu Dai, Carla Frohlich, Jonah M. Miller, Raymond K. W. Wong, Ricardo Vilalta

Abstract: Understanding how matter behaves at the highest densities and temperatures is a major open problem in both nuclear physics and relativistic astrophysics. This physics is often encapsulated in the so-called high-temperature nuclear equation of state, which influences compact binary mergers, core-collapse supernovae, and many more phenomena. One such case is the type (either black hole or neutron star) and mass of the remnant of the core collapse of a massive star. For each of six candidate equations of state, we use a very large suite of spherically symmetric supernova models to generate a suite of synthetic populations of such remnants. We then compare these synthetic populations to the observed remnant population. We thus provide a novel constraint on the high-temperature nuclear equation of state and describe which EOS candidates are more or less favored by this metric.

Submitted 27 March, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

Comments: Accepted by ApJ Letters; revised version; 3 figures, 2 tables

Report number: LA-UR-21-30803

arXiv:2110.13041 [pdf, other]

doi 10.3389/fdata.2022.787421

Applications and Techniques for Fast Machine Learning in Science

Authors: Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci-Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bahr, Jurgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomas E. Muller Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Kyle J Hazelwood, et al. (62 additional authors not shown)

Abstract: In this community review report, we discuss applications and techniques for fast machine learning (ML) in science: the concept of integrating powerful ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML across a number of scientific domains; techniques for training and implementing performant and resource-efficient ML algorithms; and computing architectures, platforms, and technologies for deploying these algorithms. We also present overlapping challenges across the multiple scientific domains where common solutions can be found. This community report is intended to give plenty of examples and inspiration for scientific discovery through integrated and accelerated ML solutions. This is followed by a high-level overview and organization of technical advances, including an abundance of pointers to source material, which can enable these breakthroughs.

Submitted 25 October, 2021; originally announced October 2021.

Comments: 66 pages, 13 figures, 5 tables

Report number: FERMILAB-PUB-21-502-AD-E-SCD

Journal ref: Front. Big Data 5, 787421 (2022)

arXiv:2105.14027 [pdf, other]

doi 10.21468/SciPostPhys.12.1.043

The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider

Authors: T. Aarrestad, M. van Beekveld, M. Bona, A. Boveia, S. Caron, J. Davies, A. De Simone, C. Doglioni, J. M. Duarte, A. Farbin, H. Gupta, L. Hendriks, L. Heinrich, J. Howarth, P. Jawahar, A. Jueid, J. Lastow, A. Leinweber, J. Mamuzic, E. Merényi, A. Morandini, P. Moskvitina, C. Nellist, J. Ngadiuba, B. Ostdiek, et al. (14 additional authors not shown)

Abstract: We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenge aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 billion simulated LHC events corresponding to $10~\rm{fb}^{-1}$ of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.

Submitted 9 December, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

Comments: v1: 54 pages, 24 figures. v2: 56 pages, citations added, extend discussion of look-elsewhere-effect, results unchanged; v3. minor typos and updated references

Journal ref: SciPost Phys. 12, 043 (2022)

arXiv:2101.07852 [pdf, other]

Learning Abstract Task Representations

Authors: Mikhail M. Meskhi, Adriano Rivolli, Rafael G. Mantovani, Ricardo Vilalta

Abstract: A proper form of data characterization can guide the process of learning-algorithm selection and model-performance estimation. The field of meta-learning has provided a rich body of work describing effective forms of data characterization using different families of meta-features (statistical, model-based, information-theoretic, topological, etc.). In this paper, we start with the abundant set of existing meta-features and propose a method to induce new abstract meta-features as latent variables in a deep neural network. We discuss the pitfalls of using traditional meta-features directly and argue for the importance of learning high-level task properties. We demonstrate our methodology using a deep neural network as a feature extractor. We demonstrate that 1) induced meta-models mapping abstract meta-features to generalization performance outperform other methods by ~18% on average, and 2) abstract meta-features attain high feature-relevance scores.

Submitted 28 January, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

Journal ref: AAAI Workshop on Meta-Learning 2021

arXiv:2010.05941 [pdf, other]

Active learning with RESSPECT: Resource allocation for extragalactic astronomical transients

Authors: Noble Kennamer, Emille E. O. Ishida, Santiago Gonzalez-Gaitan, Rafael S. de Souza, Alexander Ihler, Kara Ponder, Ricardo Vilalta, Anais Moller, David O. Jones, Mi Dai, Alberto Krone-Martins, Bruno Quint, Sreevarsha Sreejith, Alex I. Malz, Lluis Galbany

Abstract: The recent increase in volume and complexity of available astronomical data has led to a wide use of supervised machine learning techniques. Active learning strategies have been proposed as an alternative to optimize the distribution of scarce labeling resources. However, due to the specific conditions in which labels can be acquired, fundamental assumptions, such as sample representativeness and labeling cost stability, cannot be fulfilled. The Recommendation System for Spectroscopic follow-up (RESSPECT) project aims to enable the construction of optimized training samples for the Rubin Observatory Legacy Survey of Space and Time (LSST), taking into account a realistic description of the astronomical data environment. In this work, we test the robustness of active learning techniques in a realistic simulated astronomical data scenario. Our experiment takes into account the evolution of training and pool samples, different costs per object, and two different sources of budget. Results show that traditional active learning strategies significantly outperform random sampling. Nevertheless, more complex batch strategies are not able to significantly overcome simple uncertainty sampling techniques. Our findings illustrate three important points: 1) active learning strategies are a powerful tool to optimize the label-acquisition task in astronomy, 2) for upcoming large surveys like LSST, such techniques allow us to tailor the construction of the training sample for the first day of the survey, and 3) the peculiar data environment related to the detection of astronomical transients is a fertile ground that calls for the development of tailored machine learning algorithms.

Submitted 26 October, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

Comments: Accepted to the 2020 IEEE Symposium Series on Computational Intelligence

arXiv:2005.08583 [pdf, ps, other]

doi 10.1093/mnras/staa3204

Ridges in the Dark Energy Survey for cosmic trough identification

Authors: Ben Moews, Morgan A. Schmitz, Andrew J. Lawler, Joe Zuntz, Alex I. Malz, Rafael S. de Souza, Ricardo Vilalta, Alberto Krone-Martins, Emille E. O. Ishida

Abstract: Cosmic voids and their corresponding redshift-projected mass densities, known as troughs, play an important role in our attempt to model the large-scale structure of the Universe. Understanding these structures enables us to compare the standard model with alternative cosmologies, constrain the dark energy equation of state, and distinguish between different gravitational theories. In this paper, we extend the subspace-constrained mean shift algorithm, a recently introduced method to estimate density ridges, and apply it to 2D weak lensing mass density maps from the Dark Energy Survey Y1 data release to identify curvilinear filamentary structures. We compare the obtained ridges with previous approaches to extract trough structure in the same data, and apply curvelets as an alternative wavelet-based method to constrain densities. We then invoke the Wasserstein distance between noisy and noiseless simulations to validate the denoising capabilities of our method. Our results demonstrate the viability of ridge estimation as a precursor for denoising weak lensing observables to recover the large-scale structure, paving the way for a more versatile and effective search for troughs.

Submitted 14 November, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

Comments: 12 pages, 5 figures, accepted for publication in MNRAS

MSC Class: 85A40; 62G07; 62P35; 85A35

arXiv:2002.12220 [pdf, other]

Les Houches 2019 Physics at TeV Colliders: New Physics Working Group Report

Authors: G. Brooijmans, A. Buckley, S. Caron, A. Falkowski, B. Fuks, A. Gilbert, W. J. Murray, M. Nardecchia, J. M. No, R. Torre, T. You, G. Zevi Della Porta, G. Alguero, J. Y. Araz, S. Banerjee, G. Bélanger, T. Berger-Hryn'ova, J. Bernigaud, A. Bharucha, D. Buttazzo, J. M. Butterworth, G. Cacciapaglia, A. Coccaro, L. Corpe, N. Desai, et al. (65 additional authors not shown)

Abstract: This report presents the activities of the 'New Physics' working group for the 'Physics at TeV Colliders' workshop (Les Houches, France, 10-28 June, 2019). These activities include studies of direct searches for new physics, approaches to exploit published data to constrain new physics, as well as the development of tools to further facilitate these investigations. Benefits of machine learning for both the search for new physics and the interpretation of these searches are also presented.

Submitted 27 February, 2020; originally announced February 2020.

Comments: Proceedings of the BSM Session of the Les Houches 2019 workshop, 227 pages

arXiv:1911.02479 [pdf, ps, other]

Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era

Authors: Brian Nord, Andrew J. Connolly, Jamie Kinney, Jeremy Kubica, Gautaum Narayan, Joshua E. G. Peek, Chad Schafer, Erik J. Tollerud, Camille Avestruz, G. Jogesh Babu, Simon Birrer, Douglas Burke, João Caldeira, Douglas A. Caldwell, Joleen K. Carlberg, Yen-Chi Chen, Chuanfei Dong, Eric D. Feigelson, V. Zach Golkhou, Vinay Kashyap, T. S. Li, Thomas Loredo, Luisa Lucie-Smith, Kaisey S. Mandel, J. R. Martínez-Galarza, et al. (13 additional authors not shown)

Abstract: The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt (e.g., via the onset of artificial intelligence), which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).

Submitted 4 November, 2019; originally announced November 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1905.05116

Report number: FERMILAB-FN-1093-A-AE-SCD

arXiv:1903.04425 [pdf, other]

Dark Matter Science in the Era of LSST

Authors: Keith Bechtol, Alex Drlica-Wagner, Kevork N. Abazajian, Muntazir Abidi, Susmita Adhikari, Yacine Ali-Haïmoud, James Annis, Behzad Ansarinejad, Robert Armstrong, Jacobo Asorey, Carlo Baccigalupi, Arka Banerjee, Nilanjan Banik, Charles Bennett, Florian Beutler, Simeon Bird, Simon Birrer, Rahul Biswas, Andrea Biviano, Jonathan Blazek, Kimberly K. Boddy, Ana Bonaca, Julian Borrill, Sownak Bose, Jo Bovy, et al. (155 additional authors not shown)

Abstract: Astrophysical observations currently provide the only robust, empirical measurements of dark matter. In the coming decade, astrophysical observations will guide other experimental efforts, while simultaneously probing unique regions of dark matter parameter space. This white paper summarizes astrophysical observations that can constrain the fundamental physics of dark matter in the era of LSST. We describe how astrophysical observations will inform our understanding of the fundamental properties of dark matter, such as particle mass, self-interaction strength, non-gravitational interactions with the Standard Model, and compact object abundances. Additionally, we highlight theoretical work and experimental/observational facilities that will complement LSST to strengthen our understanding of the fundamental characteristics of dark matter.

Submitted 11 March, 2019; originally announced March 2019.

Comments: 11 pages, 2 figures, Science Whitepaper for Astro 2020, more information at https://lsstdarkmatter.github.io

arXiv:1902.01055 [pdf, other]

Probing the Fundamental Nature of Dark Matter with the Large Synoptic Survey Telescope

Authors: Alex Drlica-Wagner, Yao-Yuan Mao, Susmita Adhikari, Robert Armstrong, Arka Banerjee, Nilanjan Banik, Keith Bechtol, Simeon Bird, Kimberly K. Boddy, Ana Bonaca, Jo Bovy, Matthew R. Buckley, Esra Bulbul, Chihway Chang, George Chapline, Johann Cohen-Tanugi, Alessandro Cuoco, Francis-Yan Cyr-Racine, William A. Dawson, Ana Díaz Rivero, Cora Dvorkin, Denis Erkal, Christopher D. Fassnacht, Juan García-Bellido, Maurizio Giannotti, et al. (75 additional authors not shown)

Abstract: Astrophysical and cosmological observations currently provide the only robust, empirical measurements of dark matter. Future observations with the Large Synoptic Survey Telescope (LSST) will provide necessary guidance for the experimental dark matter program. This white paper represents a community effort to summarize the science case for studying the fundamental physics of dark matter with LSST. We discuss how LSST will inform our understanding of the fundamental properties of dark matter, such as particle mass, self-interaction strength, non-gravitational couplings to the Standard Model, and compact object abundances. Additionally, we discuss the ways that LSST will complement other experiments to strengthen our understanding of the fundamental characteristics of dark matter. More information on the LSST dark matter effort can be found at https://lsstdarkmatter.github.io/.

Submitted 24 April, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

Comments: 96 pages, 22 figures, 1 table

Report number: FERMILAB-PUB-19-048-A-AE

arXiv:1812.10403 [pdf, other]

Transfer Learning in Astronomy: A New Machine-Learning Paradigm

Authors: Ricardo Vilalta

Abstract: The widespread dissemination of machine learning tools in science, particularly in astronomy, has revealed the limitation of working with simple single-task scenarios in which any task in need of a predictive model is looked at in isolation, ignoring the existence of other similar tasks. In contrast, a new generation of techniques is emerging where predictive models can take advantage of previous experience to leverage information from similar tasks. The new emerging area is referred to as transfer learning. In this paper, I briefly describe the motivation behind the use of transfer learning techniques, and explain how such techniques can be used to solve popular problems in astronomy. As an example, a prevalent problem in astronomy is to estimate the class of an object (e.g., Supernova Ia) using a generation of photometric light-curve datasets where data abounds, but class labels are scarce; such analysis can benefit from spectroscopic data where class labels are known with high confidence, but the data sample is small. Transfer learning provides a robust and practical solution to leverage information from one domain to improve the accuracy of a model built on a different domain. In the example above, transfer learning would look to overcome the difficulty in the compatibility of models between spectroscopic data and photometric data, since data properties such as size, class priors, and underlying distributions are all expected to be significantly different.

Submitted 20 December, 2018; originally announced December 2018.

arXiv:1812.09786 [pdf, ps, other]

doi 10.1103/PhysRevD.99.123529

Stress testing the dark energy equation of state imprint on supernova data

Authors: Ben Moews, Rafael S. de Souza, Emille E. O. Ishida, Alex I. Malz, Caroline Heneka, Ricardo Vilalta, Joe Zuntz

Abstract: This work determines the degree to which a standard Lambda-CDM analysis based on type Ia supernovae can identify deviations from a cosmological constant in the form of a redshift-dependent dark energy equation of state w(z). We introduce and apply a novel random curve generator to simulate instances of w(z) from constraint families with increasing distinction from a cosmological constant. After producing a series of mock catalogs of binned type Ia supernovae corresponding to each w(z) curve, we perform a standard Lambda-CDM analysis to estimate the corresponding posterior densities of the absolute magnitude of type Ia supernovae, the present-day matter density, and the equation of state parameter. Using the Kullback-Leibler divergence between posterior densities as a difference measure, we demonstrate that a standard type Ia supernova cosmology analysis has limited sensitivity to extensive redshift dependencies of the dark energy equation of state. In addition, we report that larger redshift-dependent departures from a cosmological constant do not necessarily manifest as more easily detectable incompatibilities with the Lambda-CDM model. Our results suggest that physics beyond the standard model may simply be hidden in plain sight.

Submitted 5 July, 2019; v1 submitted 23 December, 2018; originally announced December 2018.

Comments: 14 pages, 9 figures

MSC Class: 85A40; 62P35; 68W20

Journal ref: Phys. Rev. D 99, 123529 (2019)

arXiv:1812.08839 [pdf, other]

doi 10.1088/1538-3873/aaf1fc

A General Approach to Domain Adaptation with Applications in Astronomy

Authors: Ricardo Vilalta, Kinjal Dhar Gupta, Dainis Boumber, Mikhail M. Meskhi

Abstract: The ability to build a model on a source task and subsequently adapt such model on a new target task is a pervasive need in many astronomical applications. The problem is generally known as transfer learning in machine learning, where domain adaptation is a popular scenario. An example is to build a predictive model on spectroscopic data to identify Supernovae Ia, while subsequently trying to adapt such model on photometric data. In this paper we propose a new general approach to domain adaptation that does not rely on the proximity of source and target distributions. Instead we simply assume a strong similarity in model complexity across domains, and use active learning to mitigate the dependency on source examples. Our work leads to a new formulation for the likelihood as a function of empirical error using a theoretical learning bound; the result is a novel mapping from generalization error to a likelihood estimation. Results using two real astronomical problems, Supernova Ia classification and identification of Mars landforms, show two main advantages with our approach: increased accuracy performance and substantial savings in computational cost.

Submitted 20 December, 2018; originally announced December 2018.

arXiv:1808.05355 [pdf, other]

Conceptual Domain Adaptation Using Deep Learning

Authors: Behrang Mehrparvar, Ricardo Vilalta

Abstract: Deep learning has recently been shown to be instrumental in the problem of domain adaptation, where the goal is to learn a model on a target domain using a similar --but not identical-- source domain. The rationale for coupling both techniques is the possibility of extracting common concepts across domains. Considering (strictly) local representations, traditional deep learning assumes common concepts must be captured in the same hidden units. We contend that jointly training a model with source and target data using a single deep network is prone to failure when there is inherently lower-level representational discrepancy between the two domains; such discrepancy leads to a misalignment of corresponding concepts in separate hidden units. We introduce a search framework to correctly align high-level representations when training deep networks; such framework leads to the notion of conceptual --as opposed to representational-- domain adaptation.

Submitted 16 August, 2018; originally announced August 2018.

arXiv:1804.03765 [pdf, other]

doi 10.1093/mnras/sty3015

Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning

Authors: E. E. O. Ishida, R. Beck, S. Gonzalez-Gaitan, R. S. de Souza, A. Krone-Martins, J. W. Barrett, N. Kennamer, R. Vilalta, J. M. Burgess, B. Quint, A. Z. Vitorelli, A. Mahabal, E. Gangler

Abstract: We report a framework for spectroscopic follow-up design for optimizing supernova photometric classification. The strategy accounts for the unavoidable mismatch between spectroscopic and photometric samples, and can be used even in the beginning of a new survey -- without any initial training set. The framework falls under the umbrella of active learning (AL), a class of algorithms that aims to minimize labelling costs by identifying a few, carefully chosen, objects which have high potential in improving the classifier predictions. As a proof of concept, we use the simulated data released after the Supernova Photometric Classification Challenge (SNPCC) and a random forest classifier. Our results show that, using only 12% of the number of training objects in the SNPCC spectroscopic sample, this approach is able to double purity results. Moreover, in order to take into account multiple spectroscopic observations in the same night, we propose a semi-supervised batch-mode AL algorithm which selects a set of $N=5$ most informative objects at each night. In comparison with the initial state using the traditional approach, our method achieves 2.3 times higher purity and comparable figure of merit results after only 180 days of observation, or 800 queries (73% of the SNPCC spectroscopic sample size). Such results were obtained using the same amount of spectroscopic time necessary to observe the original SNPCC spectroscopic sample, showing that this type of strategy is feasible with current available spectroscopic resources.
The code used in this work is available in the COINtoolbox: https://github.com/COINtoolbox/ActSNClass.

Submitted 3 January, 2019; v1 submitted 10 April, 2018; originally announced April 2018.

Comments: 18 pages, 15 figures - replace to match journal version

Journal ref: MNRAS, Volume 483, Issue 1, 11 February 2019, Pages 2-18

arXiv:1803.07328 [pdf]

doi 10.5281/zenodo.398834

End-to-end 5G services via an SDN/NFV-based multi-tenant network and cloud testbed

Authors: Raul Muñoz, Josep Mangues-Bafalluy, Nikolaos Bartzoudis, Ricard Vilalta, Ricardo Martínez, Ramon Casellas, Nicola Baldo, José Núñez-Martínez, Manuel Requena-Esteso, Oriol Font-Bach, Marco Miozzo, Pol Henarejos, Ana Pérez-Neira, Miquel Payaró

Abstract: 5G has a main requirement of highly flexible, ultralow latency and ultra-high bandwidth virtualized infrastructure in order to deliver end-to-end services. This requirement can be met by efficiently integrating all network segments (radio access, aggregation and core) with heterogeneous wireless and optical technologies (5G, mmWave, LTE/LTE-A, Wi-Fi, Ethernet, MPLS, WDM, software-defined optical transmission, etc.), and massive computing and storage cloud services (offered in edge/core data centers). This paper introduces the preliminary architecture aiming at integrating three consolidated and standalone experimental infrastructures at CTTC, in order to deploy the required end-to-end top-to-bottom converged infrastructure pointed out above for testing and developing advanced 5G services.

Submitted 20 March, 2018; originally announced March 2018.

arXiv:1803.07310 [pdf, other]

doi 10.1109/MVT.2015.2508320

The CTTC 5G end-to-end experimental platform: Integrating heterogeneous wireless/optical networks, distributed cloud, and IoT devices

Authors: Raul Muñóz, Josep Mangues, Ricard Vilalta, Christos Verikoukis, Jesús Alonso-Zarate, Nikolaos Bartzoudis, Apostolos Georgiadis, Miquel Payaró, Ana Pérez-Neira, Ramon Casellas, Ricardo Martínez, José Núñez-Martínez, Manuel Requena-Esteso, David Pubill, Oriol Font-Bach, Pol Henarejos, Jordi Serra, Francisco Vazquez-Gallego

Abstract: The Internet of Things (IoT) will facilitate a wide variety of applications in different domains, such as smart cities, smart grids, industrial automation (Industry 4.0), smart driving, assistance of the elderly, and home automation. Billions of heterogeneous smart devices with different application requirements will be connected to the networks and will generate huge aggregated volumes of data that will be processed in distributed cloud infrastructures. On the other hand, there is also a general trend to deploy functions as software (SW) instances in cloud infrastructures [e.g., network function virtualization (NFV) or mobile edge computing (MEC)]. Thus, the next generation of mobile networks, the fifth-generation (5G), will need not only to develop new radio interfaces or waveforms to cope with the expected traffic growth but also to integrate heterogeneous networks from end to end (E2E) with distributed cloud resources to deliver E2E IoT and mobile services. This article presents the E2E 5G platform that is being developed by the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), the first known platform capable of reproducing such an ambitious scenario.

Submitted 20 March, 2018; originally announced March 2018.

arXiv:1703.07607 [pdf, other]

doi 10.1093/mnras/stx2156

A probabilistic approach to emission-line galaxy classification

Authors: R. S. de Souza, M. L. L. Dantas, M. V. Costa-Duarte, E. D. Feigelson, M. Killedar, P. -Y. Lablanche, R. Vilalta, A. Krone-Martins, R. Beck, F. Gieseke

Abstract: We invoke a Gaussian mixture model (GMM) to jointly analyse two traditional emission-line classification schemes of galaxy ionization sources: the Baldwin-Phillips-Terlevich (BPT) and $\rm W_{Hα}$ vs. [NII]/H$α$ (WHAN) diagrams, using spectroscopic data from the Sloan Digital Sky Survey Data Release 7 and SEAGal/STARLIGHT datasets. We apply a GMM to empirically define classes of galaxies in a three-dimensional space spanned by the $\log$ [OIII]/H$β$, $\log$ [NII]/H$α$, and $\log$ EW(H$α$) optical parameters. The best-fit GMM based on several statistical criteria suggests a solution around four Gaussian components (GCs), which are capable of explaining up to 97 per cent of the data variance. Using elements of information theory, we compare each GC to their respective astronomical counterpart. GC1 and GC4 are associated with star-forming galaxies, suggesting the need to define a new starburst subgroup. GC2 is associated with BPT's Active Galaxy Nuclei (AGN) class and WHAN's weak AGN class. GC3 is associated with BPT's composite class and WHAN's strong AGN class. Conversely, there is no statistical evidence -- based on four GCs -- for the existence of a Seyfert/LINER dichotomy in our sample. Notwithstanding, the inclusion of an additional GC5 unravels it. The GC5 appears associated with the LINER and Passive galaxies on the BPT and WHAN diagrams respectively.
Subtleties aside, we demonstrate the potential of our methodology to recover/unravel different objects inside the wilderness of astronomical datasets, without lacking the ability to convey physically interpretable results. The probabilistic classifications from the GMM analysis are publicly available within the COINtoolbox (https://cointoolbox.github.io/GMM_Catalogue/).

Submitted 18 August, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

Comments: Accepted for publication in MNRAS

arXiv:1512.06810 [pdf, other]

doi 10.1093/mnras/stw1228

Exploring the spectroscopic diversity of type Ia supernovae with DRACULA: a machine learning approach

Authors: Michele Sasdelli, E. E. O. Ishida, R. Vilalta, M. Aguena, V. C. Busti, H. Camacho, A. M. M. Trindade, F. Gieseke, R. S. de Souza, Y. T. Fantaye, P. A. Mazzali

Abstract: The existence of multiple subclasses of type Ia supernovae (SNeIa) has been the subject of great debate in the last decade. One major challenge inevitably met when trying to infer the existence of one or more subclasses is the time consuming, and subjective, process of subclass definition. In this work, we show how machine learning tools facilitate identification of subtypes of SNeIa through the establishment of a hierarchical group structure in the continuous space of spectral diversity formed by these objects. Using Deep Learning, we were capable of performing such identification in a 4 dimensional feature space (+1 for time evolution), while the standard Principal Component Analysis barely achieves similar results using 15 principal components. This is evidence that the progenitor system and the explosion mechanism can be described by a small number of initial physical parameters. As a proof of concept, we show that our results are in close agreement with a previously suggested classification scheme and that our proposed method can grasp the main spectral features behind the definition of such subtypes. This allows the confirmation of the velocity of lines as a first order effect in the determination of SNIa subtypes, followed by 91bg-like events. Given the expected data deluge in the forthcoming years, our proposed approach is essential to allow a quick and statistically coherent identification of SNeIa subtypes (and outliers).
All tools used in this work were made publicly available in the Python package Dimensionality Reduction And Clustering for Unsupervised Learning in Astronomy (DRACULA) and can be found within COINtoolbox (https://github.com/COINtoolbox/DRACULA).

Submitted 30 June, 2016; v1 submitted 21 December, 2015; originally announced December 2015.

Comments: 16 pages, 12 figures, accepted for publication in MNRAS

arXiv:1409.7696 [pdf, other]

doi 10.1016/j.ascom.2015.04.002

The Overlooked Potential of Generalized Linear Models in Astronomy - I: Binomial Regression

Authors: R. S. de Souza, E. Cameron, M. Killedar, J. Hilbe, R. Vilalta, U. Maio, V. Biffi, B. Ciardi, J. D. Riggs

Abstract: Revealing hidden patterns in astronomical data is often the path to fundamental scientific breakthroughs; meanwhile the complexity of scientific inquiry increases as more subtle relationships are sought. Contemporary data analysis problems often elude the capabilities of classical statistical techniques, suggesting the use of cutting edge statistical methods. In this light, astronomers have overlooked a whole family of statistical techniques for exploratory data analysis and robust regression, the so-called Generalized Linear Models (GLMs). In this paper -- the first in a series aimed at illustrating the power of these methods in astronomical applications -- we elucidate the potential of a particular class of GLMs for handling binary/binomial data, the so-called logit and probit regression techniques, from both a maximum likelihood and a Bayesian perspective. As a case in point, we present the use of these GLMs to explore the conditions of star formation activity and metal enrichment in primordial minihaloes from cosmological hydro-simulations including detailed chemistry, gas physics, and stellar feedback. We predict that for a dark mini-halo with metallicity $\approx 1.3 \times 10^{-4} Z_{\bigodot}$, an increase of $1.2 \times 10^{-2}$ in the gas molecular fraction, increases the probability of star formation occurrence by a factor of 75%.
Finally, we highlight the use of receiver operating characteristic curves as a diagnostic for binary classifiers, and ultimately we use these to demonstrate the competitive predictive performance of GLMs against the popular technique of artificial neural networks.

Submitted 4 April, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

Comments: 20 pages, 10 figures, 3 tables, accepted for publication in Astronomy and Computing

Showing 1–23 of 23 results for author: Vilalta, R