-
Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Authors:
Everlyn Asiko Chimoto,
Jay Gala,
Orevaoghene Ahia,
Julia Kreutzer,
Bruce A. Bassett,
Sara Hooker
Abstract:
Neural Machine Translation models are extremely data and compute-hungry. However, not all data points contribute equally to model training and generalization. Data pruning to remove the low-value data points has the benefit of drastically reducing the compute budget without significant drop in model performance. In this paper, we propose a new data pruning technique: Checkpoints Across Time (CAT),…
▽ More
Neural Machine Translation models are extremely data and compute-hungry. However, not all data points contribute equally to model training and generalization. Data pruning to remove the low-value data points has the benefit of drastically reducing the compute budget without significant drop in model performance. In this paper, we propose a new data pruning technique: Checkpoints Across Time (CAT), that leverages early model training dynamics to identify the most relevant data points for model performance. We benchmark CAT against several data pruning techniques including COMET-QE, LASER and LaBSE. We find that CAT outperforms the benchmarks on Indo-European languages on multiple test sets. When applied to English-German, English-French and English-Swahili translation tasks, CAT achieves comparable performance to using the full dataset, while pruning up to 50% of training data. We inspect the data points that CAT selects and find that it tends to favour longer sentences and sentences with unique or rare words.
△ Less
Submitted 21 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
The Dark Energy Survey Supernova Program: Cosmological Analysis and Systematic Uncertainties
Authors:
M. Vincenzi,
D. Brout,
P. Armstrong,
B. Popovic,
G. Taylor,
M. Acevedo,
R. Camilleri,
R. Chen,
T. M. Davis,
S. R. Hinton,
L. Kelsey,
R. Kessler,
J. Lee,
C. Lidman,
A. Möller,
H. Qu,
M. Sako,
B. Sanchez,
D. Scolnic,
M. Smith,
M. Sullivan,
P. Wiseman,
J. Asorey,
B. A. Bassett,
D. Carollo
, et al. (71 additional authors not shown)
Abstract:
We present the full Hubble diagram of photometrically-classified Type Ia supernovae (SNe Ia) from the Dark Energy Survey supernova program (DES-SN). DES-SN discovered more than 20,000 SN candidates and obtained spectroscopic redshifts of 7,000 host galaxies. Based on the light-curve quality, we select 1635 photometrically-identified SNe Ia with spectroscopic redshift 0.10$< z <$1.13, which is the…
▽ More
We present the full Hubble diagram of photometrically-classified Type Ia supernovae (SNe Ia) from the Dark Energy Survey supernova program (DES-SN). DES-SN discovered more than 20,000 SN candidates and obtained spectroscopic redshifts of 7,000 host galaxies. Based on the light-curve quality, we select 1635 photometrically-identified SNe Ia with spectroscopic redshift 0.10$< z <$1.13, which is the largest sample of supernovae from any single survey and increases the number of known $z>0.5$ supernovae by a factor of five. In a companion paper, we present cosmological results of the DES-SN sample combined with 194 spectroscopically-classified SNe Ia at low redshift as an anchor for cosmological fits. Here we present extensive modeling of this combined sample and validate the entire analysis pipeline used to derive distances. We show that the statistical and systematic uncertainties on cosmological parameters are $σ_{Ω_M,{\rm stat+sys}}^{Λ{\rm CDM}}=$0.017 in a flat $Λ$CDM model, and $(σ_{Ω_M},σ_w)_{\rm stat+sys}^{w{\rm CDM}}=$(0.082, 0.152) in a flat $w$CDM model. Combining the DES SN data with the highly complementary CMB measurements by Planck Collaboration (2020) reduces uncertainties on cosmological parameters by a factor of 4. In all cases, statistical uncertainties dominate over systematics. We show that uncertainties due to photometric classification make up less than 10% of the total systematic uncertainty budget. This result sets the stage for the next generation of SN cosmology surveys such as the Vera C. Rubin Observatory's Legacy Survey of Space and Time.
△ Less
Submitted 22 January, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
The Dark Energy Survey: Cosmology Results With ~1500 New High-redshift Type Ia Supernovae Using The Full 5-year Dataset
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Acevedo,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
J. Annis,
P. Armstrong,
J. Asorey,
S. Avila,
D. Bacon,
B. A. Bassett,
K. Bechtol,
P. H. Bernardinelli,
G. M. Bernstein,
E. Bertin,
J. Blazek,
S. Bocquet,
D. Brooks,
D. Brout,
E. Buckley-Geer,
D. L. Burke
, et al. (134 additional authors not shown)
Abstract:
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscop…
▽ More
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscopic redshifts are acquired from a dedicated follow-up survey of the host galaxies. After accounting for the likelihood of each SN being a SN Ia, we find 1635 DES SNe in the redshift range $0.10<z<1.13$ that pass quality selection criteria sufficient to constrain cosmological parameters. This quintuples the number of high-quality $z>0.5$ SNe compared to the previous leading compilation of Pantheon+, and results in the tightest cosmological constraints achieved by any SN data set to date. To derive cosmological constraints we combine the DES supernova data with a high-quality external low-redshift sample consisting of 194 SNe Ia spanning $0.025<z<0.10$. Using SN data alone and including systematic uncertainties we find $Ω_{\rm M}=0.352\pm 0.017$ in flat $Λ$CDM. Supernova data alone now require acceleration ($q_0<0$ in $Λ$CDM) with over $5σ$ confidence. We find $(Ω_{\rm M},w)=(0.264^{+0.074}_{-0.096},-0.80^{+0.14}_{-0.16})$ in flat $w$CDM. For flat $w_0w_a$CDM, we find $(Ω_{\rm M},w_0,w_a)=(0.495^{+0.033}_{-0.043},-0.36^{+0.36}_{-0.30},-8.8^{+3.7}_{-4.5})$. Including Planck CMB data, SDSS BAO data, and DES $3\times2$-point data gives $(Ω_{\rm M},w)=(0.321\pm0.007,-0.941\pm0.026)$. In all cases dark energy is consistent with a cosmological constant to within $\sim2σ$. In our analysis, systematic errors on cosmological parameters are subdominant compared to statistical errors; paving the way for future photometrically classified supernova analyses.
△ Less
Submitted 6 June, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
Planning to Learn: A Novel Algorithm for Active Learning during Model-Based Planning
Authors:
Rowan Hodson,
Bruce Bassett,
Charel van Hoof,
Benjamin Rosman,
Mark Solms,
Jonathan P. Shock,
Ryan Smith
Abstract:
Active Inference is a recent framework for modeling planning under uncertainty. Empirical and theoretical work have now begun to evaluate the strengths and weaknesses of this approach and how it might be improved. A recent extension - the sophisticated inference (SI) algorithm - improves performance on multi-step planning problems through recursive decision tree search. However, little work to dat…
▽ More
Active Inference is a recent framework for modeling planning under uncertainty. Empirical and theoretical work have now begun to evaluate the strengths and weaknesses of this approach and how it might be improved. A recent extension - the sophisticated inference (SI) algorithm - improves performance on multi-step planning problems through recursive decision tree search. However, little work to date has been done to compare SI to other established planning algorithms. SI was also developed with a focus on inference as opposed to learning. The present paper has two aims. First, we compare performance of SI to Bayesian reinforcement learning (RL) schemes designed to solve similar problems. Second, we present an extension of SI - sophisticated learning (SL) - that more fully incorporates active learning during planning. SL maintains beliefs about how model parameters would change under the future observations expected under each policy. This allows a form of counterfactual retrospective inference in which the agent considers what could be learned from current or past observations given different future observations. To accomplish these aims, we make use of a novel, biologically inspired environment designed to highlight the problem structure for which SL offers a unique solution. Here, an agent must continually search for available (but changing) resources in the presence of competing affordances for information gain. Our simulations show that SL outperforms all other algorithms in this context - most notably, Bayes-adaptive RL and upper confidence bound algorithms, which aim to solve multi-step planning problems using similar principles (i.e., directed exploration and counterfactual reasoning). These results provide added support for the utility of Active Inference in solving this class of biologically-relevant problems and offer added tools for testing hypotheses about human cognition.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Authors:
Christiaan Jacobs,
Nathanaël Carraz Rakotonirina,
Everlyn Asiko Chimoto,
Bruce A. Bassett,
Herman Kamper
Abstract:
We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data f…
▽ More
We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data from well-resourced languages to spot keywords in data in the unseen target language. In contrast to ASR, the AWE approach only requires a few keyword exemplars. In controlled experiments on Wolof and Swahili where training and test data are from the same domain, an ASR model trained on just five minutes of data outperforms the AWE approach. But in an in-the-wild test on Swahili radio broadcasts with actual hate speech keywords, the AWE model (using one minute of template data) is more robust, giving similar performance to an ASR system trained on 30 hours of labelled data.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Trajectory Based RFI Subtraction and Calibration for Radio Interferometry
Authors:
Chris Finlay,
Bruce A. Bassett,
Martin Kunz,
Nadeem Oozeer
Abstract:
Radio interferometry calibration and Radio Frequency Interference (RFI) removal are usually done separately. Here we show that jointly modelling the antenna gains and RFI has significant benefits when the RFI follows precise trajectories, such as for satellites. One surprising benefit is improved calibration solutions, by leveraging the RFI signal itself. We present tabascal (TrAjectory BAsed RFI…
▽ More
Radio interferometry calibration and Radio Frequency Interference (RFI) removal are usually done separately. Here we show that jointly modelling the antenna gains and RFI has significant benefits when the RFI follows precise trajectories, such as for satellites. One surprising benefit is improved calibration solutions, by leveraging the RFI signal itself. We present tabascal (TrAjectory BAsed RFI Subtraction and CALibration), a new algorithm that jointly models the RFI and calibration parameters in visibilities. We test tabascal on simulated MeerKAT calibration observations contaminated by satellite-based RFI. We obtain gain estimates that are both unbiased and up to an order of magnitude better constrained compared to uncontaminated data. When combined with an ad hoc RFI subtraction scheme, tabascal solutions can be further applied to an adjacent target observation: 5 minutes of calibration data results in an image with about a third the noise achieved when using flagging alone. The recovered flux distribution of RFI subtracted data was on par with uncontaminated data. In contrast, RFI flagging alone resulted in a higher detection threshold and consistent underestimation of source fluxes. For a mean RFI amplitude of 17 Jy, using RFI subtraction leads to less than 1% loss of data compared to 75% data loss from an ideal $3σ$ flagging algorithm, a very significant increase in data available for science analysis. Although we have examined the case of satellite RFI, tabascal should work for any RFI moving on parameterizable trajectories, relative to the phase centre, such as planes and/or objects fixed to the ground.
△ Less
Submitted 28 June, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Nature and Evolution of UHF and L-band Radio Frequency Interference at the MeerKAT Radio Telescope
Authors:
Isaac Sihlangu,
Nadeem Oozeer,
Bruce Bassett
Abstract:
Radio Frequency Interference (RFI) is unwanted noise that swamps the desired astronomical signal. Radio astronomers have always had to deal with RFI detection and excision around telescope sites, but little has been done to understand the full scope, nature and evolution of RFI in a unified way. We undertake this for the MeerKAT array using a probabilistic multidimensional framework approach focus…
▽ More
Radio Frequency Interference (RFI) is unwanted noise that swamps the desired astronomical signal. Radio astronomers have always had to deal with RFI detection and excision around telescope sites, but little has been done to understand the full scope, nature and evolution of RFI in a unified way. We undertake this for the MeerKAT array using a probabilistic multidimensional framework approach focussing on UHF-band and L-band data. In the UHF- band, RFI is dominated by the allocated Global System for Mobile (GSM) Communications, flight Distance Measuring Equipment (DME), and UHF-TV bands. The L-band suffers from known RFI sources such as DMEs, GSM, and the Global Positioning System (GPS) satellites. In the "clean" MeerKAT band, we noticed the RFI occupancy changing with time and direction for both the L-band and UHF band. For example, we saw a significant increase (300% increase) in the fraction of L-band flagged data in November 2018 compared to June 2018. This increase seems to correlate with construction activity on site. In the UHF-band, we found that the early morning is least impacted by RFI and other outliers. We also found a dramatic decrease in DME RFI during the hard lockdown due to the COVID-19 pandemic. The work presented here allows us to characterise the evolution of RFI at the MeerKAT site. Any observatory can adopt it to understand the behaviour of RFI within its surroundings.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
Very Low Resource Sentence Alignment: Luhya and Swahili
Authors:
Everlyn Asiko Chimoto,
Bruce A. Bassett
Abstract:
Language-agnostic sentence embeddings generated by pre-trained models such as LASER and LaBSE are attractive options for mining large datasets to produce parallel corpora for low-resource machine translation. We test LASER and LaBSE in extracting bitext for two related low-resource African languages: Luhya and Swahili. For this work, we created a new parallel set of nearly 8000 Luhya-English sente…
▽ More
Language-agnostic sentence embeddings generated by pre-trained models such as LASER and LaBSE are attractive options for mining large datasets to produce parallel corpora for low-resource machine translation. We test LASER and LaBSE in extracting bitext for two related low-resource African languages: Luhya and Swahili. For this work, we created a new parallel set of nearly 8000 Luhya-English sentences which allows a new zero-shot test of LASER and LaBSE. We find that LaBSE significantly outperforms LASER on both languages. Both LASER and LaBSE however perform poorly at zero-shot alignment on Luhya, achieving just 1.5% and 22.0% successful alignments respectively (P@1 score). We fine-tune the embeddings on a small set of parallel Luhya sentences and show significant gains, improving the LaBSE alignment accuracy to 53.3%. Further, restricting the dataset to sentence embedding pairs with cosine similarity above 0.7 yielded alignments with over 85% accuracy.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
Learning to Detect Interesting Anomalies
Authors:
Alireza Vafaei Sadr,
Bruce A. Bassett,
Emmanuel Sekyi
Abstract:
Anomaly detection algorithms are typically applied to static, unchanging, data features hand-crafted by the user. But how does a user systematically craft good features for anomalies that have never been seen? Here we couple deep learning with active learning -- in which an Oracle iteratively labels small amounts of data selected algorithmically over a series of rounds -- to automatically and dyna…
▽ More
Anomaly detection algorithms are typically applied to static, unchanging, data features hand-crafted by the user. But how does a user systematically craft good features for anomalies that have never been seen? Here we couple deep learning with active learning -- in which an Oracle iteratively labels small amounts of data selected algorithmically over a series of rounds -- to automatically and dynamically improve the data features for efficient outlier detection. This approach, AHUNT, shows excellent performance on MNIST, CIFAR10, and Galaxy-DESI data, significantly outperforming both standard anomaly detection and active learning algorithms with static feature spaces. Beyond improved performance, AHUNT also allows the number of anomaly classes to grow organically in response to Oracle's evaluations. Extensive ablation studies explore the impact of Oracle question selection strategy and loss function on performance. We illustrate how the dynamic anomaly class taxonomy represents another step towards fully personalized rankings of different anomaly classes that reflect a user's interests, allowing the algorithm to learn to ignore statistically significant but uninteresting outliers (e.g., noise). This should prove useful in the era of massive astronomical datasets serving diverse sets of users who can only review a tiny subset of the incoming data.
△ Less
Submitted 28 October, 2022;
originally announced October 2022.
-
COMET-QE and Active Learning for Low-Resource Machine Translation
Authors:
Everlyn Asiko Chimoto,
Bruce A. Bassett
Abstract:
Active learning aims to deliver maximum benefit when resources are scarce. We use COMET-QE, a reference-free evaluation metric, to select sentences for low-resource neural machine translation. Using Swahili, Kinyarwanda and Spanish for our experiments, we show that COMET-QE significantly outperforms two variants of Round Trip Translation Likelihood (RTTL) and random sentence selection by up to 5 B…
▽ More
Active learning aims to deliver maximum benefit when resources are scarce. We use COMET-QE, a reference-free evaluation metric, to select sentences for low-resource neural machine translation. Using Swahili, Kinyarwanda and Spanish for our experiments, we show that COMET-QE significantly outperforms two variants of Round Trip Translation Likelihood (RTTL) and random sentence selection by up to 5 BLEU points for 20k sentences selected by Active Learning on a 30k baseline. This suggests that COMET-QE is a powerful tool for sentence selection in the very low-resource limit.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
A Hitchhiker's Guide to Anomaly Detection with Astronomaly
Authors:
Michelle Lochner,
Bruce A. Bassett
Abstract:
The next generation of telescopes such as the SKA and the Rubin Observatory will produce enormous data sets, requiring automated anomaly detection to enable scientific discovery. Here, we present an overview and friendly user guide to the Astronomaly framework for active anomaly detection in astronomical data. Astronomaly uses active learning to combine the raw processing power of machine learning…
▽ More
The next generation of telescopes such as the SKA and the Rubin Observatory will produce enormous data sets, requiring automated anomaly detection to enable scientific discovery. Here, we present an overview and friendly user guide to the Astronomaly framework for active anomaly detection in astronomical data. Astronomaly uses active learning to combine the raw processing power of machine learning with the intuition and experience of a human user, enabling personalised recommendations of interesting anomalies. It makes use of a Python backend to perform data processing, feature extraction and machine learning to detect anomalous objects; and a JavaScript frontend to allow interaction with the data, labelling of interesting anomalous and active learning. Astronomaly is designed to be modular, extendable and run on almost any type of astronomical data. In this paper, we detail the structure of the Astronomaly code and provide guidelines for basic usage.
△ Less
Submitted 25 January, 2022;
originally announced January 2022.
-
The Dark Energy Survey Supernova Program: Cosmological biases from supernova photometric classification
Authors:
M. Vincenzi,
M. Sullivan,
A. Möller,
P. Armstrong,
B. A. Bassett,
D. Brout,
D. Carollo,
A. Carr,
T. M. Davis,
C. Frohmaier,
L. Galbany,
K. Glazebrook,
O. Graur,
L. Kelsey,
R. Kessler,
E. Kovacs,
G. F. Lewis,
C. Lidman,
U. Malik,
R. C. Nichol,
B. Popovic,
M. Sako,
D. Scolnic,
M. Smith,
G. Taylor
, et al. (59 additional authors not shown)
Abstract:
Cosmological analyses of samples of photometrically-identified Type Ia supernovae (SNe Ia) depend on understanding the effects of 'contamination' from core-collapse and peculiar SN Ia events. We employ a rigorous analysis on state-of-the-art simulations of photometrically identified SN Ia samples and determine cosmological biases due to such 'non-Ia' contamination in the Dark Energy Survey (DES) 5…
▽ More
Cosmological analyses of samples of photometrically-identified Type Ia supernovae (SNe Ia) depend on understanding the effects of 'contamination' from core-collapse and peculiar SN Ia events. We employ a rigorous analysis on state-of-the-art simulations of photometrically identified SN Ia samples and determine cosmological biases due to such 'non-Ia' contamination in the Dark Energy Survey (DES) 5-year SN sample. As part of the analysis, we test on our DES simulations the performance of SuperNNova, a photometric SN classifier based on recurrent neural networks. Depending on the choice of non-Ia SN models in both the simulated data sample and training sample, contamination ranges from 0.8-3.5 %, with the efficiency of the classification from 97.7-99.5 %. Using the Bayesian Estimation Applied to Multiple Species (BEAMS) framework and its extension 'BEAMS with Bias Correction' (BBC), we produce a redshift-binned Hubble diagram marginalised over contamination and corrected for selection effects and we use it to constrain the dark energy equation-of-state, $w$. Assuming a flat universe with Gaussian $Ω_M$ prior of $0.311\pm0.010$, we show that biases on $w$ are $<0.008$ when using SuperNNova and accounting for a wide range of non-Ia SN models in the simulations. Systematic uncertainties associated with contamination are estimated to be at most $σ_{w, \mathrm{syst}}=0.004$. This compares to an expected statistical uncertainty of $σ_{w,\mathrm{stat}}=0.039$ for the DES-SN sample, thus showing that contamination is not a limiting uncertainty in our analysis. We also measure biases due to contamination on $w_0$ and $w_a$ (assuming a flat universe), and find these to be $<$0.009 in $w_0$ and $<$0.108 in $w_a$, hence 5 to 10 times smaller than the statistical uncertainties expected from the DES-SN sample.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
The Hydrogen Intensity and Real-time Analysis eXperiment: 256-Element Array Status and Overview
Authors:
Devin Crichton,
Moumita Aich,
Adam Amara,
Kevin Bandura,
Bruce A. Bassett,
Carlos Bengaly,
Pascale Berner,
Shruti Bhatporia,
Martin Bucher,
Tzu-Ching Chang,
H. Cynthia Chiang,
Jean-Francois Cliche,
Carolyn Crichton,
Romeel Dave,
Dirk I. L. de Villiers,
Matt A. Dobbs,
Aaron M. Ewall-Wice,
Scott Eyono,
Christopher Finlay,
Sindhu Gaddam,
Ken Ganga,
Kevin G. Gayley,
Kit Gerodias,
Tim Gibbon,
Austin Gumba
, et al. (75 additional authors not shown)
Abstract:
The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) is a radio interferometer array currently in development, with an initial 256-element array to be deployed at the South African Radio Astronomy Observatory (SARAO) Square Kilometer Array (SKA) site in South Africa. Each of the 6m, $f/0.23$ dishes will be instrumented with dual-polarisation feeds operating over a frequency range of 40…
▽ More
The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) is a radio interferometer array currently in development, with an initial 256-element array to be deployed at the South African Radio Astronomy Observatory (SARAO) Square Kilometer Array (SKA) site in South Africa. Each of the 6m, $f/0.23$ dishes will be instrumented with dual-polarisation feeds operating over a frequency range of 400-800 MHz. Through intensity map** of the 21 cm emission line of neutral hydrogen, HIRAX will provide a cosmological survey of the distribution of large-scale structure over the redshift range of $0.775 < z < 2.55$ over $\sim$15,000 square degrees of the southern sky. The statistical power of such a survey is sufficient to produce $\sim$7 percent constraints on the dark energy equation of state parameter when combined with measurements from the Planck satellite. Additionally, HIRAX will provide a highly competitive platform for radio transient and HI absorber science while enabling a multitude of cross-correlation studies. In this paper, we describe the science goals of the experiment, overview of the design and status of the sub-components of the telescope system, and describe the expected performance of the initial 256-element array as well as the planned future expansion to the final, 1024-element array.
△ Less
Submitted 17 January, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
CRKSPH-compatible discretization of the SUPG and SAAF transport equations
Authors:
Brody R. Bassett,
J. Michael Owen
Abstract:
The self-adjoint angular flux and streamline-upwind Petrov-Galerkin transport equations are discretized using reproducing kernels with the collocation method to produce a discretization that is compatible with conservative reproducing kernel smoothed particle hydrodynamics. A novel second derivative is derived for the diffusion-like term in the self-adjoint angular flux equation. The resulting equ…
▽ More
The self-adjoint angular flux and streamline-upwind Petrov-Galerkin transport equations are discretized using reproducing kernels with the collocation method to produce a discretization that is compatible with conservative reproducing kernel smoothed particle hydrodynamics. A novel second derivative is derived for the diffusion-like term in the self-adjoint angular flux equation. The resulting equations involve only evaluations of kernels and physical data at the nodal centers.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Low-Resource Neural Machine Translation for Southern African Languages
Authors:
Evander Nyoni,
Bruce A. Bassett
Abstract:
Low-resource African languages have not fully benefited from the progress in neural machine translation because of a lack of data. Motivated by this challenge we compare zero-shot learning, transfer learning and multilingual learning on three Bantu languages (Shona, isiXhosa and isiZulu) and English. Our main target is English-to-isiZulu translation for which we have just 30,000 sentence pairs, 28…
▽ More
Low-resource African languages have not fully benefited from the progress in neural machine translation because of a lack of data. Motivated by this challenge we compare zero-shot learning, transfer learning and multilingual learning on three Bantu languages (Shona, isiXhosa and isiZulu) and English. Our main target is English-to-isiZulu translation for which we have just 30,000 sentence pairs, 28% of the average size of our other corpora. We show the importance of language similarity on the performance of English-to-isiZulu transfer learning based on English-to-isiXhosa and English-to-Shona parent models whose BLEU scores differ by 5.2. We then demonstrate that multilingual learning surpasses both transfer learning and zero-shot learning on our dataset, with BLEU score improvements relative to the baseline English-to-isiZulu model of 9.9, 6.1 and 2.0 respectively. Our best model also improves the previous SOTA BLEU score by more than 10.
△ Less
Submitted 3 April, 2021; v1 submitted 1 April, 2021;
originally announced April 2021.
-
Astronomaly: Personalised Active Anomaly Detection in Astronomical Data
Authors:
Michelle Lochner,
Bruce A. Bassett
Abstract:
Survey telescopes such as the Vera C. Rubin Observatory and the Square Kilometre Array will discover billions of static and dynamic astronomical sources. Properly mined, these enormous datasets will likely be wellsprings of rare or unknown astrophysical phenomena. The challenge is that the datasets are so large that most data will never be seen by human eyes; currently the most robust instrument w…
▽ More
Survey telescopes such as the Vera C. Rubin Observatory and the Square Kilometre Array will discover billions of static and dynamic astronomical sources. Properly mined, these enormous datasets will likely be wellsprings of rare or unknown astrophysical phenomena. The challenge is that the datasets are so large that most data will never be seen by human eyes; currently the most robust instrument we have to detect relevant anomalies. Machine learning is a useful tool for anomaly detection in this regime. However, it struggles to distinguish between interesting anomalies and irrelevant data such as instrumental artefacts or rare astronomical sources that are simply not of interest to a particular scientist. Active learning combines the flexibility and intuition of the human brain with the raw processing power of machine learning. By strategically choosing specific objects for expert labelling, it minimises the amount of data that scientists have to look through while maximising potential scientific return. Here we introduce Astronomaly: a general anomaly detection framework with a novel active learning approach designed to provide personalised recommendations. Astronomaly can operate on most types of astronomical data, including images, light curves and spectra. We use the Galaxy Zoo dataset to demonstrate the effectiveness of Astronomaly, as well as simulated data to thoroughly test our new active learning approach. We find that for both datasets, Astronomaly roughly doubles the number of interesting anomalies found in the first 100 objects viewed by the user. Astronomaly is easily extendable to include new feature extraction techniques, anomaly detection algorithms and even different active learning approaches. The code is publicly available at https://github.com/MichelleLochner/astronomaly.
△ Less
Submitted 6 October, 2021; v1 submitted 21 October, 2020;
originally announced October 2020.
-
Deep Evolution for Facial Emotion Recognition
Authors:
Emmanuel Dufourq,
Bruce A. Bassett
Abstract:
Deep facial expression recognition faces two challenges that both stem from the large number of trainable parameters: long training times and a lack of interpretability. We propose a novel method based on evolutionary algorithms, that deals with both challenges by massively reducing the number of trainable parameters, whilst simultaneously retaining classification performance, and in some cases ac…
▽ More
Deep facial expression recognition faces two challenges that both stem from the large number of trainable parameters: long training times and a lack of interpretability. We propose a novel method based on evolutionary algorithms, that deals with both challenges by massively reducing the number of trainable parameters, whilst simultaneously retaining classification performance, and in some cases achieving superior performance. We are robustly able to reduce the number of parameters on average by 95% (e.g. from 2M to 100k parameters) with no loss in classification accuracy. The algorithm learns to choose small patches from the image, relative to the nose, which carry the most important information about emotion, and which coincide with typical human choices of important features. Our work implements a novel form attention and shows that evolutionary algorithms are a valuable addition to machine learning in the deep learning era, both for reducing the number of parameters for facial expression recognition and for providing interpretable features that can help reduce bias.
△ Less
Submitted 13 October, 2020; v1 submitted 29 September, 2020;
originally announced September 2020.
-
Multidimensional RFI Framework for Characterising Radio Astronomy Observatories
Authors:
Isaac Sihlangu,
Nadeem Oozeer,
Bruce A. Bassett
Abstract:
Radio Frequency Interference (RFI) has historically plagued radio astronomy, worsening with the rapid spread of electronics and increasing telescope sensitivity. We present a multi-dimensional probabilistic framework for characterising the RFI environment around a radio astronomy site that uses automatically flagged data from the array itself. We illustrate the framework using about 1500 hours of…
▽ More
Radio Frequency Interference (RFI) has historically plagued radio astronomy, worsening with the rapid spread of electronics and increasing telescope sensitivity. We present a multi-dimensional probabilistic framework for characterising the RFI environment around a radio astronomy site that uses automatically flagged data from the array itself. We illustrate the framework using about 1500 hours of commissioning data from the MeerKAT radio telescope; producing a 6-dimensional array that yields both average RFI occupancy as well as confidence intervals around the mean as a function of key variables (frequency, direction, baseline, time). Our results provide the first detailed view of the MeerKAT RFI environment at high sensitivity as a function of direction, frequency, time of day and baseline. They allow us to track the historical evolution of the RFI and to quantify fluctuations which can be used for alerting on new RFI. As expected we find the major RFI contributors for MeerKAT site are from Global Positioning System (GPS) satellites, flight Distance Measurement Equipment (DME) and the Global System for Mobile (GSM) Communications. Beyond characterising RFI environments our approach allows observers access to the prior probability of RFI in any combination of tracked variables, allowing for more efficient observation planning and data excision.
△ Less
Submitted 20 August, 2020;
originally announced August 2020.
-
Meshless discretization of the discrete-ordinates transport equation with integration based on Voronoi cells
Authors:
Brody R. Bassett,
J. Michael Owen
Abstract:
The time-dependent radiation transport equation is discretized using the meshless-local Petrov-Galerkin method with reproducing kernels. The integration is performed using a Voronoi tessellation, which creates a partition of unity that only depends on the position and extent of the kernels. The resolution of the integration automatically follows the particles and requires no manual adjustment. The…
▽ More
The time-dependent radiation transport equation is discretized using the meshless-local Petrov-Galerkin method with reproducing kernels. The integration is performed using a Voronoi tessellation, which creates a partition of unity that only depends on the position and extent of the kernels. The resolution of the integration automatically follows the particles and requires no manual adjustment. The discretization includes streamline-upwind Petrov-Galerkin stabilization to prevent oscillations and improve numerical conditioning. The angular quadrature is selectively refineable to increase angular resolution in chosen directions. The time discretization is done using backward Euler. The transport solve for each direction and the solve for the scattering source are both done using Krylov iterative methods. Results indicate first-order convergence in time and second-order convergence in space for linear reproducing kernels.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
Climate & BCG: Effects on COVID-19 Death Growth Rates
Authors:
Chris Finlay,
Bruce A. Bassett
Abstract:
Multiple studies have suggested the spread of COVID-19 is affected by factors such as climate, BCG vaccinations, pollution and blood type. We perform a joint study of these factors using the death growth rates of 40 regions worldwide with both machine learning and Bayesian methods. We find weak, non-significant (< 3$σ$) evidence for temperature and relative humidity as factors in the spread of COV…
▽ More
Multiple studies have suggested the spread of COVID-19 is affected by factors such as climate, BCG vaccinations, pollution and blood type. We perform a joint study of these factors using the death growth rates of 40 regions worldwide with both machine learning and Bayesian methods. We find weak, non-significant (< 3$σ$) evidence for temperature and relative humidity as factors in the spread of COVID-19 but little or no evidence for BCG vaccination prevalence or $\text{PM}_{2.5}$ pollution. The only variable detected at a statistically significant level (>3$σ$) is the rate of positive COVID-19 tests, with higher positive rates correlating with higher daily growth of deaths.
△ Less
Submitted 10 July, 2020;
originally announced July 2020.
-
Deep Learning improves identification of Radio Frequency Interference
Authors:
Alireza Vafaei Sadr,
Bruce A. Bassett,
Nadeem Oozeer,
Yabebal Fantaye,
Chris Finlay
Abstract:
Flagging of Radio Frequency Interference (RFI) is an increasingly important challenge in radio astronomy. We present R-Net, a deep convolutional ResNet architecture that significantly outperforms existing algorithms -- including the default MeerKAT RFI flagger, and deep U-Net architectures -- across all metrics including AUC, F1-score and MCC. We demonstrate the robustness of this improvement on b…
▽ More
Flagging of Radio Frequency Interference (RFI) is an increasingly important challenge in radio astronomy. We present R-Net, a deep convolutional ResNet architecture that significantly outperforms existing algorithms -- including the default MeerKAT RFI flagger, and deep U-Net architectures -- across all metrics including AUC, F1-score and MCC. We demonstrate the robustness of this improvement on both single dish and interferometric simulations and, using transfer learning, on real data. Our R-Net model's precision is approximately $90\%$ better than the current MeerKAT flagger at $80\%$ recall and has a 35\% higher F1-score with no additional performance cost. We further highlight the effectiveness of transfer learning from a model initially trained on simulated MeerKAT data and fine-tuned on real, human-flagged, KAT-7 data. Despite the wide differences in the nature of the two telescope arrays, the model achieves an AUC of 0.91, while the best model without transfer learning only reaches an AUC of 0.67. We consider the use of phase information in our models but find that without calibration the phase adds almost no extra information relative to amplitude data only. Our results strongly suggest that deep learning on simulations, boosted by transfer learning on real data, will likely play a key role in the future of RFI flagging of radio astronomy data.
△ Less
Submitted 12 October, 2020; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Efficient smoothed particle radiation hydrodynamics II: Radiation hydrodynamics
Authors:
Brody R. Bassett,
J. Michael Owen,
Thomas A. Brunner
Abstract:
The radiation hydrodynamics equations for smoothed particle hydrodynamics are derived by operator splitting the radiation and hydrodynamics terms, including necessary terms for material motion, and discretizing each of the sets of equations separately in time and space. The implicit radiative transfer discussed in the first paper of this series is coupled to explicit smoothed particle hydrodynamic…
▽ More
The radiation hydrodynamics equations for smoothed particle hydrodynamics are derived by operator splitting the radiation and hydrodynamics terms, including necessary terms for material motion, and discretizing each of the sets of equations separately in time and space. The implicit radiative transfer discussed in the first paper of this series is coupled to explicit smoothed particle hydrodynamics. The result is a multi-material meshless radiation hydrodynamics code with arbitrary opacities and equations of state that performs well for problems with significant material motion. The code converges with second-order accuracy in space and first-order accuracy in time to the semianalytic solution for the Lowrie radiative shock problem and has competitive performance compared to a mesh-based radiation hydrodynamics code for a multi-material problem in two dimensions and an ablation problem inspired by inertial confinement fusion in two and three dimensions.
△ Less
Submitted 16 November, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
Efficient smoothed particle radiation hydrodynamics I: Thermal radiative transfer
Authors:
Brody R. Bassett,
J. Michael Owen,
Thomas A. Brunner
Abstract:
This work presents efficient solution techniques for radiative transfer in the smoothed particle hydrodynamics discretization. Two choices that impact efficiency are how the material and radiation energy are coupled, which determines the number of iterations needed to converge the emission source, and how the radiation diffusion equation is solved, which must be done in each iteration. The coupled…
▽ More
This work presents efficient solution techniques for radiative transfer in the smoothed particle hydrodynamics discretization. Two choices that impact efficiency are how the material and radiation energy are coupled, which determines the number of iterations needed to converge the emission source, and how the radiation diffusion equation is solved, which must be done in each iteration. The coupled material and radiation energy equations are solved using an inexact Newton iteration scheme based on nonlinear elimination, which reduces the number of Newton iterations needed to converge within each time step. During each Newton iteration, the radiation diffusion equation is solved using Krylov iterative methods with a multigrid preconditioner, which abstracts and optimizes much of the communication when running in parallel. The code is verified for an infinite medium problem, a one-dimensional Marshak wave, and a two and three-dimensional manufactured problem, and exhibits first-order convergence in time and second-order convergence in space. For these problems, the number of iterations needed to converge the inexact Newton scheme and the diffusion equation are independent of the number of spatial points and the number of processors.
△ Less
Submitted 16 November, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
The Mystery of Photometric Twins DES17X1boj and DES16E2bjy
Authors:
M. Pursiainen,
C. Gutierrez,
P. Wiseman,
M. Childress,
M. Smith,
C. Frohmaier,
C. Angus,
N. Castro Segura,
L. Kelsey,
M. Sullivan,
L. Galbany,
P. Nugent,
B. A. Bassett,
D. Brout,
D. Carollo,
C. B. D'Andrea,
T. M. Davis,
R. J. Foley,
M. Grayling,
S. R. Hinton,
C. Inserra,
R. Kessler,
C. Lidman,
E. Macaulay,
M. March
, et al. (58 additional authors not shown)
Abstract:
We present an analysis of DES17X1boj and DES16E2bjy, two peculiar transients discovered by the Dark Energy Survey (DES). They exhibit nearly identical double-peaked light curves which reach very different maximum luminosities (M$_\mathrm{r}$ = -15.4 and M$_\mathrm{r}$ = -17.9, respectively). The light curve evolution of these events is highly atypical and has not been reported before. The transien…
▽ More
We present an analysis of DES17X1boj and DES16E2bjy, two peculiar transients discovered by the Dark Energy Survey (DES). They exhibit nearly identical double-peaked light curves which reach very different maximum luminosities (M$_\mathrm{r}$ = -15.4 and M$_\mathrm{r}$ = -17.9, respectively). The light curve evolution of these events is highly atypical and has not been reported before. The transients are found in different host environments: DES17X1boj was found near the nucleus of a spiral galaxy, while DES16E2bjy is located in the outskirts of a passive red galaxy. Early photometric data is well fitted with a blackbody and the resulting moderate photospheric expansion velocities (1800 km/s for DES17X1boj and 4800 km/s for DES16E2bjy) suggest an explosive or eruptive origin. Additionally, a feature identified as high-velocity CaII absorption (v $\approx$ 9400km/s) in the near-peak spectrum of DES17X1boj may imply that it is a supernova. While similar light curve evolution suggests a similar physical origin for these two transients, we are not able to identify or characterise the progenitors.
△ Less
Submitted 7 April, 2020; v1 submitted 27 November, 2019;
originally announced November 2019.
-
A Flexible Framework for Anomaly Detection via Dimensionality Reduction
Authors:
Alireza Vafaei Sadr,
Bruce A. Bassett,
Martin Kunz
Abstract:
Anomaly detection is challenging, especially for large datasets in high dimensions. Here we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. We release DRAMA, a general python package that implements the general framework with a wide range of built-in options. We test DRAMA on a wide variety of simulated and real datasets, in up to 3000 d…
▽ More
Anomaly detection is challenging, especially for large datasets in high dimensions. Here we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. We release DRAMA, a general python package that implements the general framework with a wide range of built-in options. We test DRAMA on a wide variety of simulated and real datasets, in up to 3000 dimensions, and find it robust and highly competitive with commonly-used anomaly detection algorithms, especially in high dimensions. The flexibility of the DRAMA framework allows for significant optimization once some examples of anomalies are available, making it ideal for online anomaly detection, active learning and highly unbalanced datasets.
△ Less
Submitted 9 September, 2019;
originally announced September 2019.
-
Bayesian Anomaly Detection and Classification
Authors:
Ethan Roberts,
Bruce A. Bassett,
Michelle Lochner
Abstract:
Statistical uncertainties are rarely incorporated in machine learning algorithms, especially for anomaly detection. Here we present the Bayesian Anomaly Detection And Classification (BADAC) formalism, which provides a unified statistical approach to classification and anomaly detection within a hierarchical Bayesian framework. BADAC deals with uncertainties by marginalising over the unknown, true,…
▽ More
Statistical uncertainties are rarely incorporated in machine learning algorithms, especially for anomaly detection. Here we present the Bayesian Anomaly Detection And Classification (BADAC) formalism, which provides a unified statistical approach to classification and anomaly detection within a hierarchical Bayesian framework. BADAC deals with uncertainties by marginalising over the unknown, true, value of the data. Using simulated data with Gaussian noise, BADAC is shown to be superior to standard algorithms in both classification and anomaly detection performance in the presence of uncertainties, though with significantly increased computational cost. Additionally, BADAC provides well-calibrated classification probabilities, valuable for use in scientific pipelines. We show that BADAC can work in online mode and is fairly robust to model errors, which can be diagnosed through model-selection methods. In addition it can perform unsupervised new class detection and can naturally be extended to search for anomalous subsets of data. BADAC is therefore ideal where computational cost is not a limiting factor and statistical rigour is important. We discuss approximations to speed up BADAC, such as the use of Gaussian processes, and finally introduce a new metric, the Rank-Weighted Score (RWS), that is particularly suited to evaluating the ability of algorithms to detect anomalies.
△ Less
Submitted 22 February, 2019;
originally announced February 2019.
-
First Cosmology Results Using Type Ia Supernovae From the Dark Energy Survey: Survey Overview and Supernova Spectroscopy
Authors:
C. B. D'Andrea,
M. Smith,
M. Sullivan,
R. C. Nichol,
R. C. Thomas,
A. G. Kim,
A. Möller,
M. Sako,
F. J. Castander,
A. V. Filippenko,
R. J. Foley,
L. Galbany,
S. González-Gaitán,
E. Kasai,
R. P. Kirshner,
C. Lidman,
D. Scolnic,
D. Brout,
T. M. Davis,
R. R. Gupta,
S. R. Hinton,
R. Kessler,
J. Lasker,
E. Macaulay,
R. C. Wolf
, et al. (86 additional authors not shown)
Abstract:
We present spectroscopy from the first three seasons of the Dark Energy Survey Supernova Program (DES-SN). We describe the supernova spectroscopic program in full: strategy, observations, data reduction, and classification. We have spectroscopically confirmed 307 supernovae, including 251 type Ia supernovae (SNe Ia) over a redshift range of $0.017 < z < 0.85$. We determine the effective spectrosco…
▽ More
We present spectroscopy from the first three seasons of the Dark Energy Survey Supernova Program (DES-SN). We describe the supernova spectroscopic program in full: strategy, observations, data reduction, and classification. We have spectroscopically confirmed 307 supernovae, including 251 type Ia supernovae (SNe Ia) over a redshift range of $0.017 < z < 0.85$. We determine the effective spectroscopic selection function for our sample, and use it to investigate the redshift-dependent bias on the distance moduli of SNe Ia we have classified. We also provide a full overview of the strategy, observations, and data products of DES-SN, which has discovered 12,015 likely supernovae during these first three seasons. The data presented here are used for the first cosmology analysis by DES-SN ('DES-SN3YR'), the results of which are given in DES Collaboration (2018a).
△ Less
Submitted 23 November, 2018;
originally announced November 2018.
-
Classification of Multiwavelength Transients with Machine Learning
Authors:
K. Sooknunan,
M. Lochner,
Bruce A. Bassett,
H. V. Peiris,
R. Fender,
A. J. Stewart,
M. Pietka,
P. A. Woudt,
J. D. McEwen,
O. Lahav
Abstract:
With the advent of powerful telescopes such as the Square Kilometer Array and the Vera C. Rubin Observatory, we are entering an era of multiwavelength transient astronomy that will lead to a dramatic increase in data volume. Machine learning techniques are well suited to address this data challenge and rapidly classify newly detected transients. We present a multiwavelength classification algorith…
▽ More
With the advent of powerful telescopes such as the Square Kilometer Array and the Vera C. Rubin Observatory, we are entering an era of multiwavelength transient astronomy that will lead to a dramatic increase in data volume. Machine learning techniques are well suited to address this data challenge and rapidly classify newly detected transients. We present a multiwavelength classification algorithm consisting of three steps: (1) interpolation and augmentation of the data using Gaussian processes; (2) feature extraction using wavelets; and (3) classification with random forests. Augmentation provides improved performance at test time by balancing the classes and adding diversity into the training set. In the first application of machine learning to the classification of real radio transient data, we apply our technique to the Green Bank Interferometer and other radio light curves. We find we are able to accurately classify most of the 11 classes of radio variables and transients after just eight hours of observations, achieving an overall test accuracy of 78 percent. We fully investigate the impact of the small sample size of 82 publicly available light curves and use data augmentation techniques to mitigate the effect. We also show that on a significantly larger simulated representative training set that the algorithm achieves an overall accuracy of 97 percent, illustrating that the method is likely to provide excellent performance on future surveys. Finally, we demonstrate the effectiveness of simultaneous multiwavelength observations by showing how incorporating just one optical data point into the analysis improves the accuracy of the worst performing class by 19 percent.
△ Less
Submitted 8 March, 2021; v1 submitted 20 November, 2018;
originally announced November 2018.
-
First Cosmology Results Using Type Ia Supernovae from the Dark Energy Survey: Effects of Chromatic Corrections to Supernova Photometry on Measurements of Cosmological Parameters
Authors:
J. Lasker,
R. Kessler,
D. Scolnic,
D. Brout,
C. B. D'Andrea,
T. M. Davis,
S. R. Hinton,
A. G. Kim,
C. Lidman,
E. Macaulay,
A. Möller,
M. Sako,
M. Smith,
M. Sullivan,
J. Asorey,
B. A. Bassett,
D. L. Burke,
J. Calcino,
D. Carollo,
M. Childress,
J. Frieman,
J. K. Hoormann,
E. Kasai,
T. S. Li,
M. March
, et al. (56 additional authors not shown)
Abstract:
Calibration uncertainties have been the leading systematic uncertainty in recent analyses using type Ia Supernovae (SNe Ia) to measure cosmological parameters. To improve the calibration, we present the application of Spectral Energy Distribution (SED)-dependent "chromatic corrections" to the supernova light-curve photometry from the Dark Energy Survey (DES). These corrections depend on the combin…
▽ More
Calibration uncertainties have been the leading systematic uncertainty in recent analyses using type Ia Supernovae (SNe Ia) to measure cosmological parameters. To improve the calibration, we present the application of Spectral Energy Distribution (SED)-dependent "chromatic corrections" to the supernova light-curve photometry from the Dark Energy Survey (DES). These corrections depend on the combined atmospheric and instrumental transmission function for each exposure, and they affect photometry at the 0.01 mag (1%) level, comparable to systematic uncertainties in calibration and photometry. Fitting our combined DES and low-z SN Ia sample with Baryon Acoustic Oscillation (BAO) and Cosmic Microwave Background (CMB) priors for the cosmological parameters $Ω_{\rm m}$ (the fraction of the critical density of the universe comprised of matter) and w (the dark energy equation of state parameter), we compare those parameters before and after applying the corrections. We find the change in w and $Ω_{\rm m}$ due to not including chromatic corrections are -0.002 and 0.000, respectively, for the DES-SN3YR sample with BAO and CMB priors, consistent with a larger DES-SN3YR-like simulation, which has a w-change of 0.0005 with an uncertainty of 0.008 and an $Ω_{\rm m}$ change of 0.000 with an uncertainty of 0.002 . However, when considering samples on individual CCDs we find large redshift-dependent biases (approximately 0.02 in distance modulus) for supernova distances.
△ Less
Submitted 7 November, 2018; v1 submitted 6 November, 2018;
originally announced November 2018.
-
First Cosmology Results Using Type Ia Supernovae From the Dark Energy Survey: Photometric Pipeline and Light Curve Data Release
Authors:
D. Brout,
M. Sako,
D. Scolnic,
R. Kessler,
C. B. D'Andrea,
T. M. Davis,
S. R. Hinton,
A. G. Kim,
J. Lasker,
E. Macaulay,
A. Möller,
R. C. Nichol,
M. Smith,
M. Sullivan,
R. C. Wolf,
S. Allam,
B. A. Bassett,
P. Brown,
F. J. Castander,
M. Childress,
R. J. Foley,
L. Galbany,
K. Herner,
E. Kasai,
M. March
, et al. (67 additional authors not shown)
Abstract:
We present griz light curves of 251 Type Ia Supernovae (SNe Ia) from the first 3 years of the Dark Energy Survey Supernova Program's (DES-SN) spectroscopically classified sample. The photometric pipeline described in this paper produces the calibrated fluxes and associated uncertainties used in the cosmological parameter analysis (Brout et al. 2018-SYS, DES Collaboration et al. 2018) by employing…
▽ More
We present griz light curves of 251 Type Ia Supernovae (SNe Ia) from the first 3 years of the Dark Energy Survey Supernova Program's (DES-SN) spectroscopically classified sample. The photometric pipeline described in this paper produces the calibrated fluxes and associated uncertainties used in the cosmological parameter analysis (Brout et al. 2018-SYS, DES Collaboration et al. 2018) by employing a scene modeling approach that simultaneously forward models a variable transient flux and temporally constant host galaxy. We inject artificial point sources onto DECam images to test the accuracy of our photometric method. Upon comparison of input and measured artificial supernova fluxes, we find flux biases peak at 3 mmag. We require corrections to our photometric uncertainties as a function of host galaxy surface brightness at the transient location, similar to that seen by the DES Difference Imaging Pipeline used to discover transients. The public release of the light curves can be found at https://des.ncsa.illinois.edu/releases/sn.
△ Less
Submitted 1 June, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
First Cosmology Results Using Type Ia Supernovae From the Dark Energy Survey: Analysis, Systematic Uncertainties, and Validation
Authors:
D. Brout,
D. Scolnic,
R. Kessler,
C. B. D'Andrea,
T. M. Davis,
R. R. Gupta,
S. R. Hinton,
A. G. Kim,
J. Lasker,
C. Lidman,
E. Macaulay,
A. Möller,
R. C. Nichol,
M. Sako,
M. Smith,
M. Sullivan,
B. Zhang,
P. Andersen,
J. Asorey,
A. Avelino,
B. A. Bassett,
P. Brown,
J. Calcino,
D. Carollo,
P. Challis
, et al. (100 additional authors not shown)
Abstract:
We present the analysis underpinning the measurement of cosmological parameters from 207 spectroscopically classified type Ia supernovae (SNe Ia) from the first three years of the Dark Energy Survey Supernova Program (DES-SN), spanning a redshift range of 0.017<$z$<0.849. We combine the DES-SN sample with an external sample of 122 low-redshift ($z$<0.1) SNe Ia, resulting in a "DES-SN3YR" sample of…
▽ More
We present the analysis underpinning the measurement of cosmological parameters from 207 spectroscopically classified type Ia supernovae (SNe Ia) from the first three years of the Dark Energy Survey Supernova Program (DES-SN), spanning a redshift range of 0.017<$z$<0.849. We combine the DES-SN sample with an external sample of 122 low-redshift ($z$<0.1) SNe Ia, resulting in a "DES-SN3YR" sample of 329 SNe Ia. Our cosmological analyses are blinded: after combining our DES-SN3YR distances with constraints from the Cosmic Microwave Background (CMB; Planck Collaboration 2016), our uncertainties in the measurement of the dark energy equation-of-state parameter, $w$, are .042 (stat) and .059 (stat+syst) at 68% confidence. We provide a detailed systematic uncertainty budget, which has nearly equal contributions from photometric calibration, astrophysical bias corrections, and instrumental bias corrections. We also include several new sources of systematic uncertainty. While our sample is <1/3 the size of the Pantheon sample, our constraints on $w$ are only larger by 1.4$\times$, showing the impact of the DES SN Ia light curve quality. We find that the traditional stretch and color standardization parameters of the DES SNe Ia are in agreement with earlier SN Ia samples such as Pan-STARRS1 and the Supernova Legacy Survey. However, we find smaller intrinsic scatter about the Hubble diagram (0.077 mag). Interestingly, we find no evidence for a Hubble residual step ( 0.007 $\pm$ 0.018 mag) as a function of host galaxy mass for the DES subset, in 2.4$σ$ tension with previous measurements. We also present novel validation methods of our sample using simulated SNe Ia inserted in DECam images and using large catalog-level simulations to test for biases in our analysis pipelines.
△ Less
Submitted 1 June, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
First Cosmological Results using Type Ia Supernovae from the Dark Energy Survey: Measurement of the Hubble Constant
Authors:
E. Macaulay,
R. C. Nichol,
D. Bacon,
D. Brout,
T. M. Davis,
B. Zhang,
B. A. Bassett,
D. Scolnic,
A. Möller,
C. B. D'Andrea,
S. R. Hinton,
R. Kessler,
A. G. Kim,
J. Lasker,
C. Lidman,
M. Sako,
M. Smith,
M. Sullivan,
T. M. C. Abbott,
S. Allam,
J. Annis,
J. Asorey,
S. Avila,
K. Bechtol,
D. Brooks
, et al. (84 additional authors not shown)
Abstract:
We present an improved measurement of the Hubble constant (H_0) using the 'inverse distance ladder' method, which adds the information from 207 Type Ia supernovae (SNe Ia) from the Dark Energy Survey (DES) at redshift 0.018 < z < 0.85 to existing distance measurements of 122 low redshift (z < 0.07) SNe Ia (Low-z) and measurements of Baryon Acoustic Oscillations (BAOs). Whereas traditional measurem…
▽ More
We present an improved measurement of the Hubble constant (H_0) using the 'inverse distance ladder' method, which adds the information from 207 Type Ia supernovae (SNe Ia) from the Dark Energy Survey (DES) at redshift 0.018 < z < 0.85 to existing distance measurements of 122 low redshift (z < 0.07) SNe Ia (Low-z) and measurements of Baryon Acoustic Oscillations (BAOs). Whereas traditional measurements of H_0 with SNe Ia use a distance ladder of parallax and Cepheid variable stars, the inverse distance ladder relies on absolute distance measurements from the BAOs to calibrate the intrinsic magnitude of the SNe Ia. We find H_0 = 67.8 +/- 1.3 km s-1 Mpc-1 (statistical and systematic uncertainties, 68% confidence). Our measurement makes minimal assumptions about the underlying cosmological model, and our analysis was blinded to reduce confirmation bias. We examine possible systematic uncertainties and all are below the statistical uncertainties. Our H_0 value is consistent with estimates derived from the Cosmic Microwave Background assuming a LCDM universe (Planck Collaboration et al. 2018).
△ Less
Submitted 27 May, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
Cosmological Constraints from Multiple Probes in the Dark Energy Survey
Authors:
DES Collaboration,
T. M. C. Abbott,
A. Alarcon,
S. Allam,
P. Andersen,
F. Andrade-Oliveira,
J. Annis,
J. Asorey,
A. Avelino,
S. Avila,
D. Bacon,
N. Banik,
B. A. Bassett,
E. Baxter,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
E. Bertin,
J. Blazek,
S. L. Bridle,
D. Brooks,
D. Brout,
D. L. Burke,
J. Calcino,
H. Camacho
, et al. (144 additional authors not shown)
Abstract:
The combination of multiple observational probes has long been advocated as a powerful technique to constrain cosmological parameters, in particular dark energy. The Dark Energy Survey has measured 207 spectroscopically--confirmed Type Ia supernova lightcurves; the baryon acoustic oscillation feature; weak gravitational lensing; and galaxy clustering. Here we present combined results from these pr…
▽ More
The combination of multiple observational probes has long been advocated as a powerful technique to constrain cosmological parameters, in particular dark energy. The Dark Energy Survey has measured 207 spectroscopically--confirmed Type Ia supernova lightcurves; the baryon acoustic oscillation feature; weak gravitational lensing; and galaxy clustering. Here we present combined results from these probes, deriving constraints on the equation of state, $w$, of dark energy and its energy density in the Universe. Independently of other experiments, such as those that measure the cosmic microwave background, the probes from this single photometric survey rule out a Universe with no dark energy, finding $w=-0.80^{+0.09}_{-0.11}$. The geometry is shown to be consistent with a spatially flat Universe, and we obtain a constraint on the baryon density of $Ω_b=0.069^{+0.009}_{-0.012}$ that is independent of early Universe measurements. These results demonstrate the potential power of large multi-probe photometric surveys and pave the way for order of magnitude advances in our constraints on properties of dark energy and cosmology over the next decade.
△ Less
Submitted 6 May, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
First Cosmology Results using Type Ia Supernovae from the Dark Energy Survey: Constraints on Cosmological Parameters
Authors:
T. M. C. Abbott,
S. Allam,
P. Andersen,
C. Angus,
J. Asorey,
A. Avelino,
S. Avila,
B. A. Bassett,
K. Bechtol,
G. M. Bernstein,
E. Bertin,
D. Brooks,
D. Brout,
P. Brown,
D. L. Burke,
J. Calcino,
A. Carnero Rosell,
D. Carollo,
M. Carrasco Kind,
J. Carretero,
R. Casas,
F. J. Castander,
R. Cawthon,
P. Challis,
M. Childress
, et al. (119 additional authors not shown)
Abstract:
We present the first cosmological parameter constraints using measurements of type Ia supernovae (SNe Ia) from the Dark Energy Survey Supernova Program (DES-SN). The analysis uses a subsample of 207 spectroscopically confirmed SNe Ia from the first three years of DES-SN, combined with a low-redshift sample of 122 SNe from the literature. Our "DES-SN3YR" result from these 329 SNe Ia is based on a s…
▽ More
We present the first cosmological parameter constraints using measurements of type Ia supernovae (SNe Ia) from the Dark Energy Survey Supernova Program (DES-SN). The analysis uses a subsample of 207 spectroscopically confirmed SNe Ia from the first three years of DES-SN, combined with a low-redshift sample of 122 SNe from the literature. Our "DES-SN3YR" result from these 329 SNe Ia is based on a series of companion analyses and improvements covering SN Ia discovery, spectroscopic selection, photometry, calibration, distance bias corrections, and evaluation of systematic uncertainties. For a flat LCDM model we find a matter density Omega_m = 0.331 +_ 0.038. For a flat wCDM model, and combining our SN Ia constraints with those from the cosmic microwave background (CMB), we find a dark energy equation of state w = -0.978 +_ 0.059, and Omega_m = 0.321 +_ 0.018. For a flat w0waCDM model, and combining probes from SN Ia, CMB and baryon acoustic oscillations, we find w0 = -0.885 +_ 0.114 and wa = -0.387 +_ 0.430. These results are in agreement with a cosmological constant and with previous constraints using SNe Ia (Pantheon, JLA).
△ Less
Submitted 10 May, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
DeepSource: Point Source Detection using Deep Learning
Authors:
A. Vafaei Sadr,
Etienne. E. Vos,
Bruce A. Bassett,
Zafiirah Hosenie,
N. Oozeer,
Michelle Lochner
Abstract:
Point source detection at low signal-to-noise is challenging for astronomical surveys, particularly in radio interferometry images where the noise is correlated. Machine learning is a promising solution, allowing the development of algorithms tailored to specific telescope arrays and science cases. We present DeepSource - a deep learning solution - that uses convolutional neural networks to achiev…
▽ More
Point source detection at low signal-to-noise is challenging for astronomical surveys, particularly in radio interferometry images where the noise is correlated. Machine learning is a promising solution, allowing the development of algorithms tailored to specific telescope arrays and science cases. We present DeepSource - a deep learning solution - that uses convolutional neural networks to achieve these goals. DeepSource enhances the Signal-to-Noise Ratio (SNR) of the original map and then uses dynamic blob detection to detect sources. Trained and tested on two sets of 500 simulated 1 deg x 1 deg MeerKAT images with a total of 300,000 sources, DeepSource is essentially perfect in both purity and completeness down to SNR = 4 and outperforms PyBDSF in all metrics. For uniformly-weighted images it achieves a Purity x Completeness (PC) score at SNR = 3 of 0.73, compared to 0.31 for the best PyBDSF model. For natural-weighting we find a smaller improvement of ~40% in the PC score at SNR = 3. If instead we ask where either of the purity or completeness first drop to 90%, we find that DeepSource reaches this value at SNR = 3.6 compared to the 4.3 of PyBDSF (natural-weighting). A key advantage of DeepSource is that it can learn to optimally trade off purity and completeness for any science case under consideration. Our results show that deep learning is a promising approach to point source detection in astronomical images.
△ Less
Submitted 7 July, 2018;
originally announced July 2018.
-
Automated Classification of Text Sentiment
Authors:
Emmanuel Dufourq,
Bruce A. Bassett
Abstract:
The ability to identify sentiment in text, referred to as sentiment analysis, is one which is natural to adult humans. This task is, however, not one which a computer can perform by default. Identifying sentiments in an automated, algorithmic manner will be a useful capability for business and research in their search to understand what consumers think about their products or services and to under…
▽ More
The ability to identify sentiment in text, referred to as sentiment analysis, is one which is natural to adult humans. This task is, however, not one which a computer can perform by default. Identifying sentiments in an automated, algorithmic manner will be a useful capability for business and research in their search to understand what consumers think about their products or services and to understand human sociology. Here we propose two new Genetic Algorithms (GAs) for the task of automated text sentiment analysis. The GAs learn whether words occurring in a text corpus are either sentiment or amplifier words, and their corresponding magnitude. Sentiment words, such as 'horrible', add linearly to the final sentiment. Amplifier words in contrast, which are typically adjectives/adverbs like 'very', multiply the sentiment of the following word. This increases, decreases or negates the sentiment of the following word. The sentiment of the full text is then the sum of these terms. This approach grows both a sentiment and amplifier dictionary which can be reused for other purposes and fed into other machine learning algorithms. We report the results of multiple experiments conducted on large Amazon data sets. The results reveal that our proposed approach was able to outperform several public and/or commercial sentiment analysis algorithms.
△ Less
Submitted 5 April, 2018;
originally announced April 2018.
-
Cosmic String Detection with Tree-Based Machine Learning
Authors:
A. Vafaei Sadr,
M. Farhang,
S. M. S. Movahed,
B. Bassett,
M. Kunz
Abstract:
We explore the use of random forest and gradient boosting, two powerful tree-based machine learning algorithms, for the detection of cosmic strings in maps of the cosmic microwave background (CMB), through their unique Gott-Kaiser-Stebbins effect on the temperature anisotropies.The information in the maps is compressed into feature vectors before being passed to the learning units. The feature vec…
▽ More
We explore the use of random forest and gradient boosting, two powerful tree-based machine learning algorithms, for the detection of cosmic strings in maps of the cosmic microwave background (CMB), through their unique Gott-Kaiser-Stebbins effect on the temperature anisotropies.The information in the maps is compressed into feature vectors before being passed to the learning units. The feature vectors contain various statistical measures of processed CMB maps that boost the cosmic string detectability. Our proposed classifiers, after training, give results improved over or similar to the claimed detectability levels of the existing methods for string tension, $Gμ$. They can make $3σ$ detection of strings with $Gμ\gtrsim 2.1\times 10^{-10}$ for noise-free, $0.9'$-resolution CMB observations. The minimum detectable tension increases to $Gμ\gtrsim 3.0\times 10^{-8}$ for a more realistic, CMB S4-like (II) strategy, still a significant improvement over the previous results.
△ Less
Submitted 12 January, 2018;
originally announced January 2018.
-
Painting galaxies into dark matter halos using machine learning
Authors:
Shankar Agarwal,
Romeel Davé,
Bruce A. Bassett
Abstract:
We develop a machine learning (ML) framework to populate large dark matter-only simulations with baryonic galaxies. Our ML framework takes input halo properties including halo mass, environment, spin, and recent growth history, and outputs central galaxy and halo baryonic properties including stellar mass ($M_*$), star formation rate (SFR), metallicity ($Z$), neutral ($\rm HI$) and molecular (…
▽ More
We develop a machine learning (ML) framework to populate large dark matter-only simulations with baryonic galaxies. Our ML framework takes input halo properties including halo mass, environment, spin, and recent growth history, and outputs central galaxy and halo baryonic properties including stellar mass ($M_*$), star formation rate (SFR), metallicity ($Z$), neutral ($\rm HI$) and molecular ($\rm H_2$) hydrogen mass. We apply this to the MUFASA cosmological hydrodynamic simulation, and show that it recovers the mean trends of output quantities with halo mass highly accurately, including following the sharp drop in SFR and gas in quenched massive galaxies. However, the scatter around the mean relations is under-predicted. Examining galaxies individually, at $z=0$ the stellar mass and metallicity are accurately recovered ($σ\lesssim 0.2$~dex), but SFR and $\rm HI$ show larger scatter ($σ\gtrsim 0.3$~dex); these values improve somewhat at $z=1,2$. Remarkably, ML quantitatively recovers second parameter trends in galaxy properties, e.g. that galaxies with higher gas content and lower metallicity have higher SFR at a given $M_*$. Testing various ML algorithms, we find that none perform significantly better than the others, nor does ensembling improve performance, likely because none of the algorithms reproduce the large observed scatter around the mean properties. For the random forest algorithm, we find that halo mass and nearby ($\sim 200$~kpc) environment are the most important predictive variables followed by growth history, while halo spin and $\sim$Mpc scale environment are not important. Finally we study the impact of additionally inputting key baryonic properties $M_*$, SFR and $Z$, as would be available e.g. from an equilibrium model, and show that particularly providing the SFR enables $\rm HI$ to be recovered substantially more accurately.
△ Less
Submitted 2 May, 2018; v1 submitted 8 December, 2017;
originally announced December 2017.
-
EDEN: Evolutionary Deep Networks for Efficient Machine Learning
Authors:
Emmanuel Dufourq,
Bruce A. Bassett
Abstract:
Deep neural networks continue to show improved performance with increasing depth, an encouraging trend that implies an explosion in the possible permutations of network architectures and hyperparameters for which there is little intuitive guidance. To address this increasing complexity, we propose Evolutionary DEep Networks (EDEN), a computationally efficient neuro-evolutionary algorithm which int…
▽ More
Deep neural networks continue to show improved performance with increasing depth, an encouraging trend that implies an explosion in the possible permutations of network architectures and hyperparameters for which there is little intuitive guidance. To address this increasing complexity, we propose Evolutionary DEep Networks (EDEN), a computationally efficient neuro-evolutionary algorithm which interfaces to any deep neural network platform, such as TensorFlow. We show that EDEN evolves simple yet successful architectures built from embedding, 1D and 2D convolutional, max pooling and fully connected layers along with their hyperparameters. Evaluation of EDEN across seven image and sentiment classification datasets shows that it reliably finds good networks -- and in three cases achieves state-of-the-art results -- even on a single GPU, in just 6-24 hours. Our study provides a first attempt at applying neuro-evolution to the creation of 1D convolutional networks for sentiment analysis including the optimisation of the embedding layer.
△ Less
Submitted 26 September, 2017;
originally announced September 2017.
-
Text Compression for Sentiment Analysis via Evolutionary Algorithms
Authors:
Emmanuel Dufourq,
Bruce A. Bassett
Abstract:
Can textual data be compressed intelligently without losing accuracy in evaluating sentiment? In this study, we propose a novel evolutionary compression algorithm, PARSEC (PARts-of-Speech for sEntiment Compression), which makes use of Parts-of-Speech tags to compress text in a way that sacrifices minimal classification accuracy when used in conjunction with sentiment analysis algorithms. An analys…
▽ More
Can textual data be compressed intelligently without losing accuracy in evaluating sentiment? In this study, we propose a novel evolutionary compression algorithm, PARSEC (PARts-of-Speech for sEntiment Compression), which makes use of Parts-of-Speech tags to compress text in a way that sacrifices minimal classification accuracy when used in conjunction with sentiment analysis algorithms. An analysis of PARSEC with eight commercial and non-commercial sentiment analysis algorithms on twelve English sentiment data sets reveals that accurate compression is possible with (0%, 1.3%, 3.3%) loss in sentiment classification accuracy for (20%, 50%, 75%) data compression with PARSEC using LingPipe, the most accurate of the sentiment algorithms. Other sentiment analysis algorithms are more severely affected by compression. We conclude that significant compression of text data is possible for sentiment analysis depending on the accuracy demands of the specific application and the specific sentiment analysis algorithm used.
△ Less
Submitted 20 September, 2017;
originally announced September 2017.
-
MeerKLASS: MeerKAT Large Area Synoptic Survey
Authors:
Mario G. Santos,
Michelle Cluver,
Matt Hilton,
Matt Jarvis,
Gyula I. G. Jozsa,
Lerothodi Leeuw,
Oleg Smirnov,
Russ Taylor,
Filipe Abdalla,
Jose Afonso,
David Alonso,
David Bacon,
Bruce A. Bassett,
Gianni Bernardi,
Philip Bull,
Stefano Camera,
H. Cynthia Chiang,
Sergio Colafrancesco,
Pedro G. Ferreira,
Jose Fonseca,
Kurt van der Heyden,
Ian Heywood,
Kenda Knowles,
Michelle Lochner,
Yin-Zhe Ma
, et al. (13 additional authors not shown)
Abstract:
We discuss the ground-breaking science that will be possible with a wide area survey, using the MeerKAT telescope, known as MeerKLASS (MeerKAT Large Area Synoptic Survey). The current specifications of MeerKAT make it a great fit for science applications that require large survey speeds but not necessarily high angular resolutions. In particular, for cosmology, a large survey over…
▽ More
We discuss the ground-breaking science that will be possible with a wide area survey, using the MeerKAT telescope, known as MeerKLASS (MeerKAT Large Area Synoptic Survey). The current specifications of MeerKAT make it a great fit for science applications that require large survey speeds but not necessarily high angular resolutions. In particular, for cosmology, a large survey over $\sim 4,000 \, {\rm deg}^2$ for $\sim 4,000$ hours will potentially provide the first ever measurements of the baryon acoustic oscillations using the 21cm intensity map** technique, with enough accuracy to impose constraints on the nature of dark energy. The combination with multi-wavelength data will give unique additional information, such as exquisite constraints on primordial non-Gaussianity using the multi-tracer technique, as well as a better handle on foregrounds and systematics. Such a wide survey with MeerKAT is also a great match for HI galaxy studies, providing unrivalled statistics in the pre-SKA era for galaxies resolved in the HI emission line beyond local structures at z > 0.01. It will also produce a large continuum galaxy sample down to a depth of about 5\,$μ$Jy in L-band, which is quite unique over such large areas and will allow studies of the large-scale structure of the Universe out to high redshifts, complementing the galaxy HI survey to form a transformational multi-wavelength approach to study galaxy dynamics and evolution. Finally, the same survey will supply unique information for a range of other science applications, including a large statistical investigation of galaxy clusters as well as produce a rotation measure map across a huge swathe of the sky. The MeerKLASS survey will be a crucial step on the road to using SKA1-MID for cosmological applications and other commensal surveys, as described in the top priority SKA key science projects (abridged).
△ Less
Submitted 18 September, 2017;
originally announced September 2017.
-
The MeerKAT International GHz Tiered Extragalactic Exploration (MIGHTEE) Survey
Authors:
Matt J. Jarvis,
A. R. Taylor,
I. Agudo,
James R. Allison,
R. P. Deane,
B. Frank,
N. Gupta,
I. Heywood,
N. Maddox,
K. McAlpine,
Mario G. Santos,
A. M. M. Scaife,
M. Vaccari,
J. T. L. Zwart,
E. Adams,
D. J. Bacon,
A. J. Baker,
Bruce. A. Bassett,
P. N. Best,
R. Beswick,
S. Blyth,
Michael L. Brown,
M. Bruggen,
M. Cluver,
S. Colafranceso
, et al. (32 additional authors not shown)
Abstract:
The MIGHTEE large survey project will survey four of the most well-studied extragalactic deep fields, totalling 20 square degrees to $μ$Jy sensitivity at Giga-Hertz frequencies, as well as an ultra-deep image of a single ~1 square degree MeerKAT pointing. The observations will provide radio continuum, spectral line and polarisation information. As such, MIGHTEE, along with the excellent multi-wave…
▽ More
The MIGHTEE large survey project will survey four of the most well-studied extragalactic deep fields, totalling 20 square degrees to $μ$Jy sensitivity at Giga-Hertz frequencies, as well as an ultra-deep image of a single ~1 square degree MeerKAT pointing. The observations will provide radio continuum, spectral line and polarisation information. As such, MIGHTEE, along with the excellent multi-wavelength data already available in these deep fields, will allow a range of science to be achieved. Specifically, MIGHTEE is designed to significantly enhance our understanding of, (i) the evolution of AGN and star-formation activity over cosmic time, as a function of stellar mass and environment, free of dust obscuration; (ii) the evolution of neutral hydrogen in the Universe and how this neutral gas eventually turns into stars after moving through the molecular phase, and how efficiently this can fuel AGN activity; (iii) the properties of cosmic magnetic fields and how they evolve in clusters, filaments and galaxies. MIGHTEE will reach similar depth to the planned SKA all-sky survey, and thus will provide a pilot to the cosmology experiments that will be carried out by the SKA over a much larger survey volume.
△ Less
Submitted 6 September, 2017;
originally announced September 2017.
-
Automated Problem Identification: Regression vs Classification via Evolutionary Deep Networks
Authors:
Emmanuel Dufourq,
Bruce A. Bassett
Abstract:
Regression or classification? This is perhaps the most basic question faced when tackling a new supervised learning problem. We present an Evolutionary Deep Learning (EDL) algorithm that automatically solves this by identifying the question type with high accuracy, along with a proposed deep architecture. Typically, a significant amount of human insight and preparation is required prior to executi…
▽ More
Regression or classification? This is perhaps the most basic question faced when tackling a new supervised learning problem. We present an Evolutionary Deep Learning (EDL) algorithm that automatically solves this by identifying the question type with high accuracy, along with a proposed deep architecture. Typically, a significant amount of human insight and preparation is required prior to executing machine learning algorithms. For example, when creating deep neural networks, the number of parameters must be selected in advance and furthermore, a lot of these choices are made based upon pre-existing knowledge of the data such as the use of a categorical cross entropy loss function. Humans are able to study a dataset and decide whether it represents a classification or a regression problem, and consequently make decisions which will be applied to the execution of the neural network. We propose the Automated Problem Identification (API) algorithm, which uses an evolutionary algorithm interface to TensorFlow to manipulate a deep neural network to decide if a dataset represents a classification or a regression problem. We test API on 16 different classification, regression and sentiment analysis datasets with up to 10,000 features and up to 17,000 unique target values. API achieves an average accuracy of $96.3\%$ in identifying the problem type without hardcoding any insights about the general characteristics of regression or classification problems. For example, API successfully identifies classification problems even with 1000 target values. Furthermore, the algorithm recommends which loss function to use and also recommends a neural network architecture. Our work is therefore a step towards fully automated machine learning.
△ Less
Submitted 3 July, 2017;
originally announced July 2017.
-
zBEAMS: A unified solution for supernova cosmology with redshift uncertainties
Authors:
Ethan Roberts,
Michelle Lochner,
José Fonseca,
Bruce A. Bassett,
Pierre-Yves Lablanche,
Shankar Agarwal
Abstract:
Supernova cosmology without spectra will be an important component of future surveys such as LSST. This lack of supernova spectra results in uncertainty in the redshifts which, if ignored, leads to significantly biased estimates of cosmological parameters. Here we present a hierarchical Bayesian formalism -- zBEAMS -- that addresses this problem by marginalising over the unknown or uncertain super…
▽ More
Supernova cosmology without spectra will be an important component of future surveys such as LSST. This lack of supernova spectra results in uncertainty in the redshifts which, if ignored, leads to significantly biased estimates of cosmological parameters. Here we present a hierarchical Bayesian formalism -- zBEAMS -- that addresses this problem by marginalising over the unknown or uncertain supernova redshifts to produce unbiased cosmological estimates that are competitive with supernova data with spectroscopically confirmed redshifts. zBEAMS provides a unified treatment of both photometric redshifts and host galaxy misidentification (occurring due to chance galaxy alignments or faint hosts), effectively correcting the inevitable contamination in the Hubble diagram. Like its predecessor BEAMS, our formalism also takes care of non-Ia supernova contamination by marginalising over the unknown supernova type. We illustrate this technique with simulations of supernovae with photometric redshifts and host galaxy misidentification. A novel feature of the photometric redshift case is the important role played by the redshift distribution of the supernovae.
△ Less
Submitted 26 October, 2017; v1 submitted 25 April, 2017;
originally announced April 2017.
-
Age-dating Luminous Red Galaxies observed with the Southern African Large Telescope
Authors:
A. L. Ratsimbazafy,
S. I. Loubser,
S. M. Crawford,
C. M. Cress,
B. A. Bassett,
R. C. Nichol,
P. Väisänen
Abstract:
We measure a value for the cosmic expansion of $H(z) = 89 \pm 23$(stat) $\pm$ 44(syst) km s$^{-1}$ Mpc$^{-1}$ at a redshift of $z \simeq 0.47$ based on the differential age technique. This technique, also known as cosmic chronometers, uses the age difference between two redshifts for a passively evolving population of galaxies to calculate the expansion rate of the Universe. Our measurement is bas…
▽ More
We measure a value for the cosmic expansion of $H(z) = 89 \pm 23$(stat) $\pm$ 44(syst) km s$^{-1}$ Mpc$^{-1}$ at a redshift of $z \simeq 0.47$ based on the differential age technique. This technique, also known as cosmic chronometers, uses the age difference between two redshifts for a passively evolving population of galaxies to calculate the expansion rate of the Universe. Our measurement is based on analysis of high quality spectra of Luminous Red Galaxies (LRGs) obtained with the Southern African Large Telescope (SALT) in two narrow redshift ranges of $z \simeq 0.40$ and $z \simeq 0.55$ as part of an initial pilot study. Ages were estimated by fitting single stellar population models to the observed spectra. This measurement presents one of the best estimates of $H(z)$ via this method at $z\sim0.5$ to date.
△ Less
Submitted 21 November, 2017; v1 submitted 1 February, 2017;
originally announced February 2017.
-
Early observations of the nearby type Ia supernova SN 2015F
Authors:
R. Cartier,
M. Sullivan,
R. Firth,
G. Pignata,
P. Mazzali,
K. Maguire,
M. J. Childress,
I. Arcavi,
C. Ashall,
B. Bassett,
S. M. Crawford,
C. Frohmaier,
L. Galbany,
A. Gal-Yam,
G. Hosseinzadeh,
D. A. Howell,
C. Inserra,
J. Johansson,
E. K. Kasai,
C. McCully,
S. Prajs,
S. Prentice,
S. Schulze,
S. J. Smartt,
K. W. Smith
, et al. (3 additional authors not shown)
Abstract:
We present photometry and time-series spectroscopy of the nearby type Ia supernova (SN Ia) SN 2015F over $-16$ days to $+80$ days relative to maximum light, obtained as part of the Public ESO Spectroscopic Survey of Transient Objects (PESSTO). SN 2015F is a slightly sub-luminous SN Ia with a decline rate of $Δm15(B)=1.35 \pm 0.03$ mag, placing it in the region between normal and SN 1991bg-like eve…
▽ More
We present photometry and time-series spectroscopy of the nearby type Ia supernova (SN Ia) SN 2015F over $-16$ days to $+80$ days relative to maximum light, obtained as part of the Public ESO Spectroscopic Survey of Transient Objects (PESSTO). SN 2015F is a slightly sub-luminous SN Ia with a decline rate of $Δm15(B)=1.35 \pm 0.03$ mag, placing it in the region between normal and SN 1991bg-like events. Our densely-sampled photometric data place tight constraints on the epoch of first light and form of the early-time light curve. The spectra exhibit photospheric C II $λ6580$ absorption until $-4$ days, and high-velocity Ca II is particularly strong at $<-10$ days at expansion velocities of $\simeq$23000\kms. At early times, our spectral modelling with syn++ shows strong evidence for iron-peak elements (Fe II, Cr II, Ti II, and V II) expanding at velocities $>14000$ km s$^{-1}$, suggesting mixing in the outermost layers of the SN ejecta. Although unusual in SN Ia spectra, including V II in the modelling significantly improves the spectral fits. Intriguingly, we detect an absorption feature at $\sim$6800 Å that persists until maximum light. Our favoured explanation for this line is photospheric Al II, which has never been claimed before in SNe Ia, although detached high-velocity C II material could also be responsible. In both cases the absorbing material seems to be confined to a relatively narrow region in velocity space. The nucleosynthesis of detectable amounts of Al II would argue against a low-metallicity white dwarf progenitor. We also show that this 6800 Å feature is weakly present in other normal SN Ia events, and common in the SN 1991bg-like sub-class.
△ Less
Submitted 14 October, 2016; v1 submitted 14 September, 2016;
originally announced September 2016.
-
Bayes Factors via Savage-Dickey Supermodels
Authors:
A. Mootoovaloo,
Bruce A. Bassett,
M. Kunz
Abstract:
We outline a new method to compute the Bayes Factor for model selection which bypasses the Bayesian Evidence. Our method combines multiple models into a single, nested, Supermodel using one or more hyperparameters. Since the models are now nested the Bayes Factors between the models can be efficiently computed using the Savage-Dickey Density Ratio (SDDR). In this way model selection becomes a prob…
▽ More
We outline a new method to compute the Bayes Factor for model selection which bypasses the Bayesian Evidence. Our method combines multiple models into a single, nested, Supermodel using one or more hyperparameters. Since the models are now nested the Bayes Factors between the models can be efficiently computed using the Savage-Dickey Density Ratio (SDDR). In this way model selection becomes a problem of parameter estimation. We consider two ways of constructing the supermodel in detail: one based on combined models, and a second based on combined likelihoods. We report on these two approaches for a Gaussian linear model for which the Bayesian evidence can be calculated analytically and a toy nonlinear problem. Unlike the combined model approach, where a standard Monte Carlo Markov Chain (MCMC) struggles, the combined-likelihood approach fares much better in providing a reliable estimate of the log-Bayes Factor. This scheme potentially opens the way to computationally efficient ways to compute Bayes Factors in high dimensions that exploit the good scaling properties of MCMC, as compared to methods such as nested sampling that fail for high dimensions.
△ Less
Submitted 7 September, 2016;
originally announced September 2016.
-
Application of Bayesian graphs to SN Ia data analysis and compression
Authors:
Cong Ma,
Pier-Stefano Corasaniti,
Bruce A. Bassett
Abstract:
Bayesian graphical models are an efficient tool for modelling complex data and derive self-consistent expressions of the posterior distribution of model parameters. We apply Bayesian graphs to perform statistical analyses of Type Ia supernova (SN Ia) luminosity distance measurements from the joint light-curve analysis (JLA) data set. In contrast to the $χ^2$ approach used in previous studies, the…
▽ More
Bayesian graphical models are an efficient tool for modelling complex data and derive self-consistent expressions of the posterior distribution of model parameters. We apply Bayesian graphs to perform statistical analyses of Type Ia supernova (SN Ia) luminosity distance measurements from the joint light-curve analysis (JLA) data set. In contrast to the $χ^2$ approach used in previous studies, the Bayesian inference allows us to fully account for the standard-candle parameter dependence of the data covariance matrix. Comparing with $χ^2$ analysis results, we find a systematic offset of the marginal model parameter bounds. We demonstrate that the bias is statistically significant in the case of the SN Ia standardization parameters with a maximal 6 $σ$ shift of the SN light-curve colour correction. In addition, we find that the evidence for a host galaxy correction is now only 2.4 $σ$. Systematic offsets on the cosmological parameters remain small, but may increase by combining constraints from complementary cosmological probes. The bias of the $χ^2$ analysis is due to neglecting the parameter-dependent log-determinant of the data covariance, which gives more statistical weight to larger values of the standardization parameters. We find a similar effect on compressed distance modulus data. To this end, we implement a fully consistent compression method of the JLA data set that uses a Gaussian approximation of the posterior distribution for fast generation of compressed data. Overall, the results of our analysis emphasize the need for a fully consistent Bayesian statistical approach in the analysis of future large SN Ia data sets.
△ Less
Submitted 8 September, 2016; v1 submitted 28 March, 2016;
originally announced March 2016.
-
Bayesian Inference for Radio Observations - Going beyond deconvolution
Authors:
Michelle Lochner,
Bruce A. Bassett,
Martin Kunz,
Iniyan Natarajan,
Nadeem Oozeer,
Oleg Smirnov,
Jon Zwart
Abstract:
Radio interferometers suffer from the problem of missing information in their data, due to the gaps between the antennas. This results in artifacts, such as bright rings around sources, in the images obtained. Multiple deconvolution algorithms have been proposed to solve this problem and produce cleaner radio images. However, these algorithms are unable to correctly estimate uncertainties in deriv…
▽ More
Radio interferometers suffer from the problem of missing information in their data, due to the gaps between the antennas. This results in artifacts, such as bright rings around sources, in the images obtained. Multiple deconvolution algorithms have been proposed to solve this problem and produce cleaner radio images. However, these algorithms are unable to correctly estimate uncertainties in derived scientific parameters or to always include the effects of instrumental errors. We propose an alternative technique called Bayesian Inference for Radio Observations (BIRO) which uses a Bayesian statistical framework to determine the scientific parameters and instrumental errors simultaneously directly from the raw data, without making an image. We use a simple simulation of Westerbork Synthesis Radio Telescope data including pointing errors and beam parameters as instrumental effects, to demonstrate the use of BIRO.
△ Less
Submitted 14 September, 2015;
originally announced September 2015.
-
Type Ia supernova Hubble diagram with near-infrared and optical observations
Authors:
V. Stanishev,
A. Goobar,
R. Amanullah,
B. Bassett,
Y. T. Fantaye,
P. Garnavich,
R. Hlozek,
J. Nordin,
P. M. Okouma,
L. Ostman,
M. Sako,
R. Scalzo,
M. Smith
Abstract:
We main goal of this paper is to test whether the NIR peak magnitudes of SNe Ia could be accurately estimated with only a single observation obtained close to maximum light, provided the time of B band maximum and the optical stretch parameter are known. We obtained multi-epoch UBVRI and single-epoch J and H photometric observations of 16 SNe Ia in the redshift range z=0.037-0.183, doubling the le…
▽ More
We main goal of this paper is to test whether the NIR peak magnitudes of SNe Ia could be accurately estimated with only a single observation obtained close to maximum light, provided the time of B band maximum and the optical stretch parameter are known. We obtained multi-epoch UBVRI and single-epoch J and H photometric observations of 16 SNe Ia in the redshift range z=0.037-0.183, doubling the leverage of the current SN Ia NIR Hubble diagram and the number of SNe beyond redshift 0.04. This sample was analyzed together with 102 NIR and 458 optical light curves (LCs) of normal SNe Ia from the literature. The analysis of 45 well-sampled NIR LCs shows that a single template accurately describes them if its time axis is stretched with the optical stretch parameter. This allows us to estimate the NIR peak magnitudes even with one observation obtained within 10 days from B-band maximum. We find that the NIR Hubble residuals show weak correlation with DM_15 and E(B-V), and for the first time we report a possible dependence on the J_max-H_max color. The intrinsic NIR luminosity scatter of SNe Ia is estimated to be around 0.10 mag, which is smaller than what can be derived for a similarly heterogeneous sample at optical wavelengths. In conclusion, we find that SNe Ia are at least as good standard candles in the NIR as in the optical. We showed that it is feasible to extended the NIR SN Ia Hubble diagram to z=0.2 with very modest sampling of the NIR LCs, if complemented by well-sampled optical LCs. Our results suggest that the most efficient way to extend the NIR Hubble diagram to high redshift would be to obtain a single observation close to the NIR maximum. (abridged)
△ Less
Submitted 23 March, 2018; v1 submitted 28 May, 2015;
originally announced May 2015.