-
A Bayesian Approach for Prioritising Driving Behaviour Investigations in Telematic Auto Insurance Policies
Authors:
Mark McLeod,
Bernardo Perez-Orozco,
Nika Lee,
Davide Zilli
Abstract:
Automotive insurers increasingly have access to telematic information via black-box recorders installed in the insured vehicle, and wish to identify undesirable behaviour which may signify increased risk or uninsured activities. However, identification of such behaviour with machine learning is non-trivial, and results are far from perfect, requiring human investigation to verify suspected cases.…
▽ More
Automotive insurers increasingly have access to telematic information via black-box recorders installed in the insured vehicle, and wish to identify undesirable behaviour which may signify increased risk or uninsured activities. However, identification of such behaviour with machine learning is non-trivial, and results are far from perfect, requiring human investigation to verify suspected cases. An appropriately formed priority score, generated by automated analysis of GPS data, allows underwriters to make more efficient use of their time, improving detection of the behaviour under investigation.
An example of such behaviour is the use of a privately insured vehicle for commercial purposes, such as delivering meals and parcels. We first make use of trip GPS and accelerometer data, augmented by geospatial information, to train an imperfect classifier for delivery driving on a per-trip basis. We make use of a mixture of Beta-Binomial distributions to model the propensity of a policyholder to undertake trips which result in a positive classification as being drawn from either a rare high-scoring or common low-scoring group, and learn the parameters of this model using MCMC. This model provides us with a posterior probability that any policyholder will be a regular generator of automated alerts given any number of trips and alerts. This posterior probability is converted to a priority score, which was used to select the most valuable candidates for manual investigation.
Testing over a 1-year period ranked policyholders by likelihood of commercial driving activity on a weekly basis. The top 0.9% have been reviewed at least once by the underwriters at the time of writing, and of those 99.4% have been confirmed as correctly identified, showing the approach has achieved a significant improvement in efficiency of human resource allocation compared to manual searching.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Behavioral event detection and rate estimation for autonomous vehicle evaluation
Authors:
Maria A. Terres,
Aiyou Chen,
Ruixuan Rachel Zhou,
Claire M. McLeod
Abstract:
Autonomous vehicles are continually increasing their presence on public roads. However, before any new autonomous driving software can be approved, it must first undergo a rigorous assessment of driving quality. These quality evaluations typically focus on estimating the frequency of (undesirable) behavioral events. While rate estimation would be straight-forward with complete data, in the autonom…
▽ More
Autonomous vehicles are continually increasing their presence on public roads. However, before any new autonomous driving software can be approved, it must first undergo a rigorous assessment of driving quality. These quality evaluations typically focus on estimating the frequency of (undesirable) behavioral events. While rate estimation would be straight-forward with complete data, in the autonomous driving setting this estimation is greatly complicated by the fact that \textit{detecting} these events within large driving logs is a non-trivial task that often involves human reviewers. In this paper we outline a \textit{streaming partial tiered event review} configuration that ensures both high recall and high precision on the events of interest. In addition, the framework allows for valid streaming estimates at any phase of the data collection process, even when labels are incomplete, for which we develop the maximum likelihood estimate and show it is unbiased. Constructing honest and effective confidence intervals (CI) for these rate estimates, particularly for rare safety-critical events, is a novel and challenging statistical problem due to the complexity of the data likelihood. We develop and compare several CI approximations, including a novel Gamma CI method that approximates the exact but intractable distribution with a weighted sum of independent Poisson random variables. There is a clear trade-off between statistical coverage and interval width across the different CI methods, and the extent of this trade-off varies depending on the specific application settings (e.g., rare vs. common events). In particular, we argue that our proposed CI method is the best-suited when estimating the rate of safety-critical events where guaranteed coverage of the true parameter value is a prerequisite to safely launching a new ADS on public roads.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Enhanced diastereocontrol via strong light-matter interactions in an optical cavity
Authors:
Nam Vu,
Grace M. McLeod,
Kenneth Hanson,
A. Eugene DePrince III
Abstract:
The enantiopurification of racemic mixtures of chiral molecules is important for a range of applications. Recent work has shown that chiral group-directed photoisomerization is a promising approach to enantioenrich racemic mixtures of BINOL, but increased control of the diasteriomeric excess (de) is necessary for its broad utility. Here we develop a cavity quantum electrodynamics (QED) generalizat…
▽ More
The enantiopurification of racemic mixtures of chiral molecules is important for a range of applications. Recent work has shown that chiral group-directed photoisomerization is a promising approach to enantioenrich racemic mixtures of BINOL, but increased control of the diasteriomeric excess (de) is necessary for its broad utility. Here we develop a cavity quantum electrodynamics (QED) generalization of time-dependent density functional theory and demonstrate computationally that strong light-matter coupling can alter the de of chiral group-directed photoisomerization of BINOL. The relative orientation of the cavity mode polarization and the molecules in the cavity dictates the nature of the cavity interactions, which either enhance the de of the (R)-BINOL diasteriomer (from 17% to $\approx$ 40%) or invert the favorability to the (S)-BINOL derivative (to $\approx$ 34% de). The latter outcome is particularly remarkable because it indicates that the preference in diasteriomer can be influenced via orientational control, without changing the chirality of the directing group. We demonstrate that the observed effect stems from cavity-induced changes to the Kohn-Sham orbitals of the ground state.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Continual Auxiliary Task Learning
Authors:
Matthew McLeod,
Chunlok Lo,
Matthew Schlegel,
Andrew Jacobsen,
Raksha Kumaraswamy,
Martha White,
Adam White
Abstract:
Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather useful data for those off-policy predictions. In this work, we investigate a reinforcement learning syste…
▽ More
Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather useful data for those off-policy predictions. In this work, we investigate a reinforcement learning system designed to learn a collection of auxiliary tasks, with a behavior policy learning to take actions to improve those auxiliary predictions. We highlight the inherent non-stationarity in this continual auxiliary task learning problem, for both prediction learners and the behavior learner. We develop an algorithm based on successor features that facilitates tracking under non-stationary rewards, and prove the separation into learning successor features and rewards provides convergence rate improvements. We conduct an in-depth study into the resulting multi-prediction learning system.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
The Two Body Problem in the Presence of Dark Energy and Modified Gravity: Application to the Local Group
Authors:
Michael McLeod,
Ofer Lahav
Abstract:
We explore mass estimation of the Local Group via the use of the simple, dynamical `timing argument' in the context of a variety of theories of dark energy and modified gravity: a cosmological constant, a perfect fluid with constant equation of state $w$, quintessence (minimally coupled scalar field), MOND, and symmetrons (coupled scalar field). We explore generic coupled scalar field theories, wi…
▽ More
We explore mass estimation of the Local Group via the use of the simple, dynamical `timing argument' in the context of a variety of theories of dark energy and modified gravity: a cosmological constant, a perfect fluid with constant equation of state $w$, quintessence (minimally coupled scalar field), MOND, and symmetrons (coupled scalar field). We explore generic coupled scalar field theories, with the symmetron model as an explicit example. We find that theories which attempt to eliminate dark matter by fitting rotation curves produce mass estimates in the timing argument which are not compatible with the luminous mass of the galaxies alone. Assuming that the galaxies are approaching their first encounter, MOND gives of around $2.7\times 10^{10} M_\odot$, roughly 10\% of the luminous mass of the LG, although a higher mass can be obtained in the case of a previous fly-by event between the MW and M31. The symmetron model suggests a mass too high to be explained without additional dark matter ($\mathcal{O}(10^{12}) M_\odot$), suggesting that there is a missing mass problem in this model. We also demonstrate that tensions in measurements of $H_0$ can produce an uncertainty in the Local Group mass estimate comparable to observational uncertainties on the separation and relative velocity of the galaxies, with values for the mass ranging from $4.5 - 5.4 \times 10^{12} M_{\odot}$ varying $h$ between 0.67 and 0.76.
△ Less
Submitted 7 August, 2020; v1 submitted 26 March, 2019;
originally announced March 2019.
-
Upper Bound of Neutrino Masses from Combined Cosmological Observations and Particle Physics Experiments
Authors:
Arthur Loureiro,
Andrei Cuceu,
Filipe B. Abdalla,
Bruno Moraes,
Lorne Whiteway,
Michael McLeod,
Sreekumar T. Balan,
Ofer Lahav,
Aurélien Benoit-Lévy,
Marc Manera,
Richard P. Rollins,
Henrique S. Xavier
Abstract:
We investigate the impact of prior models on the upper bound of the sum of neutrino masses, $\sum m_ν$. We use data from Large Scale Structure of galaxies, Cosmic Microwave Background, Type Ia SuperNovae, and Big Bang Nucleosynthesis. We probe physically motivated neutrino mass models (respecting oscillation experiment constraints) and compare them to constraints using standard cosmological approx…
▽ More
We investigate the impact of prior models on the upper bound of the sum of neutrino masses, $\sum m_ν$. We use data from Large Scale Structure of galaxies, Cosmic Microwave Background, Type Ia SuperNovae, and Big Bang Nucleosynthesis. We probe physically motivated neutrino mass models (respecting oscillation experiment constraints) and compare them to constraints using standard cosmological approximations. The former give a consistent upper bound of $\sum m_ν \lesssim 0.26$ eV ($95\%$ CI) and yields a strong competitive upper bound for the lightest neutrino mass species, $m_0^ν < 0.086$ eV ($95\%$ CI). By contrast one of the approximations, which is somewhat inconsistent with oscillation experiments, yields an upper bound of $\sum m_ν \lesssim 0.15$ eV ($95\%$ CI), which differs substantially from the former upper bound. We, therefore, argue that cosmological neutrino mass and hierarchy determination should be pursued using physically motivated models since approximations might lead to incorrect and nonphysical upper bounds.
△ Less
Submitted 27 August, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
Cosmological Measurements from Angular Power Spectra Analysis of BOSS DR12 Tomography
Authors:
Arthur Loureiro,
Bruno Moraes,
Filipe B. Abdalla,
Andrei Cuceu,
Michael McLeod,
Lorne Whiteway,
Sreekumar T. Balan,
Aurélien Benoit-Lévy,
Ofer Lahav,
Marc Manera,
Richard Rollins,
Henrique S. Xavier
Abstract:
We constrain cosmological parameters by analysing the angular power spectra of the Baryon Oscillation Spectroscopic Survey DR12 galaxies, a spectroscopic follow-up of around 1.3 million SDSS galaxies over 9,376 deg$^2$ with an effective volume of $\sim 6.5$ (Gpc $h^{-1}$)$^3$ in the redshift range $0.15 \leq z < 0.80$. We split this sample into 13 tomographic bins ($Δz = 0.05$); angular power spec…
▽ More
We constrain cosmological parameters by analysing the angular power spectra of the Baryon Oscillation Spectroscopic Survey DR12 galaxies, a spectroscopic follow-up of around 1.3 million SDSS galaxies over 9,376 deg$^2$ with an effective volume of $\sim 6.5$ (Gpc $h^{-1}$)$^3$ in the redshift range $0.15 \leq z < 0.80$. We split this sample into 13 tomographic bins ($Δz = 0.05$); angular power spectra were calculated using a Pseudo-$C_{\ell}$ estimator, and covariance matrices were estimated using log-normal simulated maps. Cosmological constraints obtained from these data were combined with constraints from Planck CMB experiment as well as the JLA supernovae compilation. Considering a $w$CDM cosmological model measured on scales up to $k_{max} = 0.07h$ Mpc$^{-1}$, we constrain a constant dark energy equation-of-state with a $\sim 4\%$ error at the 1-$σ$ level: $w_0 = -0.993^{+0.046}_{-0.043}$, together with $Ω_m = 0.330\pm 0.012$, $Ω_b = 0.0505 \pm 0.002$, $S_8 \equiv σ_8 \sqrt{Ω_m/0.3} = 0.863 \pm 0.016$, and $h = 0.661 \pm 0.012$. For the same combination of datasets, but now considering a $Λ$CDM model with massive neutrinos and the same scale cut, we find: $Ω_m = 0.328 \pm 0.009$, $Ω_b = 0.05017^{+0.0009}_{-0.0008}$, $S_8 = 0.862 \pm 0.017$, and $h = 0.663^{+0.006}_{-0.007}$ and a 95\% credible interval (CI) upper limit of $\sum m_ν < 0.14$ eV for a normal hierarchy. These results are competitive if not better than standard analyses with the same dataset, and demonstrate this should be a method of choice for future surveys, opening the door for their full exploitation in cross-correlations probes.
△ Less
Submitted 27 August, 2019; v1 submitted 19 September, 2018;
originally announced September 2018.
-
Optimization, fast and slow: optimally switching between local and Bayesian optimization
Authors:
Mark McLeod,
Michael A. Osborne,
Stephen J. Roberts
Abstract:
We develop the first Bayesian Optimization algorithm, BLOSSOM, which selects between multiple alternative acquisition functions and traditional local optimization at each step. This is combined with a novel stop** condition based on expected regret. This pairing allows us to obtain the best characteristics of both local and Bayesian optimization, making efficient use of function evaluations whil…
▽ More
We develop the first Bayesian Optimization algorithm, BLOSSOM, which selects between multiple alternative acquisition functions and traditional local optimization at each step. This is combined with a novel stop** condition based on expected regret. This pairing allows us to obtain the best characteristics of both local and Bayesian optimization, making efficient use of function evaluations while yielding superior convergence to the global minimum on a selection of optimization problems, and also halting optimization once a principled and intuitive stop** condition has been fulfilled.
△ Less
Submitted 22 May, 2018;
originally announced May 2018.
-
Fast Information-theoretic Bayesian Optimisation
Authors:
Binxin Ru,
Mark McLeod,
Diego Granziol,
Michael A. Osborne
Abstract:
Information-theoretic Bayesian optimisation techniques have demonstrated state-of-the-art performance in tackling important global optimisation problems. However, current information-theoretic approaches require many approximations in implementation, introduce often-prohibitive computational overhead and limit the choice of kernels available to model the objective. We develop a fast information-th…
▽ More
Information-theoretic Bayesian optimisation techniques have demonstrated state-of-the-art performance in tackling important global optimisation problems. However, current information-theoretic approaches require many approximations in implementation, introduce often-prohibitive computational overhead and limit the choice of kernels available to model the objective. We develop a fast information-theoretic Bayesian Optimisation method, FITBO, that avoids the need for sampling the global minimiser, thus significantly reducing computational overhead. Moreover, in comparison with existing approaches, our method faces fewer constraints on kernel choice and enjoys the merits of dealing with the output space. We demonstrate empirically that FITBO inherits the performance associated with information-theoretic Bayesian optimisation, while being even faster than simpler Bayesian optimisation approaches, such as Expected Improvement.
△ Less
Submitted 6 June, 2018; v1 submitted 2 November, 2017;
originally announced November 2017.
-
Practical Bayesian Optimization for Variable Cost Objectives
Authors:
Mark McLeod,
Michael A. Osborne,
Stephen J. Roberts
Abstract:
We propose a novel Bayesian Optimization approach for black-box functions with an environmental variable whose value determines the tradeoff between evaluation cost and the fidelity of the evaluations. Further, we use a novel approach to sampling support points, allowing faster construction of the acquisition function. This allows us to achieve optimization with lower overheads than previous appro…
▽ More
We propose a novel Bayesian Optimization approach for black-box functions with an environmental variable whose value determines the tradeoff between evaluation cost and the fidelity of the evaluations. Further, we use a novel approach to sampling support points, allowing faster construction of the acquisition function. This allows us to achieve optimization with lower overheads than previous approaches and is implemented for a more general class of problem. We show this approach to be effective on synthetic and real world benchmark problems.
△ Less
Submitted 15 May, 2018; v1 submitted 13 March, 2017;
originally announced March 2017.
-
A Joint Analysis for Cosmology and Photometric Redshift Calculation Using Cross Correlations
Authors:
Michael McLeod,
Sreekumar T. Balan,
Filipe B. Abdalla
Abstract:
We present a method of calibrating the properties of photometric redshift bins as part of a larger Markov Chain Monte Carlo (MCMC) analysis for the inference of cosmological parameters. The redshift bins are characterised by their mean and variance, which are varied as free parameters and marginalised over when obtaining the cosmological parameters. We demonstrate that the likelihood function for…
▽ More
We present a method of calibrating the properties of photometric redshift bins as part of a larger Markov Chain Monte Carlo (MCMC) analysis for the inference of cosmological parameters. The redshift bins are characterised by their mean and variance, which are varied as free parameters and marginalised over when obtaining the cosmological parameters. We demonstrate that the likelihood function for cross-correlations in an angular power spectrum framework tightly constrains the properties of bins such that they may be well determined, reducing their influence on cosmological parameters and avoiding the bias from poorly estimated redshift distributions. We demonstrate that even with only three photometric and three spectroscopic bins, we can recover accurate estimates of the mean redshift of a bin to within $Δμ\approx 3-4 \times10^{-3}$ and the width of the bin to $Δσ\approx 1\times10^{-3}$ for galaxies near $z = 1$. This indicates that we may be able to bring down the photometric redshift errors to a level which is in line with the requirements for the next generation of cosmological experiments.
△ Less
Submitted 1 December, 2016;
originally announced December 2016.
-
Estimating the Mass of the Local Group using Machine Learning Applied to Numerical Simulations
Authors:
Michael McLeod,
Noam Libeskind,
Ofer Lahav,
Yehuda Hoffman
Abstract:
We revisit the estimation of the combined mass of the Milky Way and Andromeda (M31), which dominate the mass of the Local Group. We make use of an ensemble of 30,190 halo pairs from the Small MultiDark simulation, assuming a $Λ$CDM (Cosmological Constant with Cold Dark Matter) cosmology, to investigate the relationship between the bound mass and parameters characterising the orbit of the binary an…
▽ More
We revisit the estimation of the combined mass of the Milky Way and Andromeda (M31), which dominate the mass of the Local Group. We make use of an ensemble of 30,190 halo pairs from the Small MultiDark simulation, assuming a $Λ$CDM (Cosmological Constant with Cold Dark Matter) cosmology, to investigate the relationship between the bound mass and parameters characterising the orbit of the binary and their local environment with the aid of machine learning methods (artificial neural networks, ANN). Results from the ANN are most successful when information about the velocity shear is provided, which demonstrates the flexibility of machine learning to model physical phenomena and readily incorporate new information as it becomes available. The resulting estimate for the Local Group mass, when shear information is included, is $4.9 \times 10^{12} M_\odot$, with an error of $\pm0.8 \times 10^{12} M_\odot$ from the 68% uncertainty in observables, and a 68% confidence interval of $^{+1.3}_{-1.4} \times 10^{12}M_\odot$ from the intrinsic scatter from the differences between the model and simulation masses. We also consider a recently reported large transverse velocity of M31 relative to the Milky Way, and produce an alternative mass estimate of $3.6\pm0.3\pm1.4 \times 10^{12}M_\odot$. Although different methods predict similar values for the most likely mass of the LG, application of ANN compared to the Timing Argument reduces the scatter in the log mass by over half when tested on samples from the simulation.
△ Less
Submitted 25 October, 2017; v1 submitted 8 June, 2016;
originally announced June 2016.
-
ALMA North American Integration Center Front-End Test System
Authors:
Geoffrey A. Ediss,
Joshua Crabtree,
Kirk Crady,
Erik Gaines,
Morgan McLeod,
Greg Morris,
Rick Williams,
Antonio Perfetto,
John Webber
Abstract:
The Atacama Large Millimeter/submillimeter (ALMA) Array Front End (FE) system is the first element in a complex chain of signal receiving, conversion, processing and recording. 70 Front Ends will be required for the project. The Front End is designed to receive signals in ten different frequency bands. In the initial phase of operations, the antennas will be fully equipped with six bands. These ar…
▽ More
The Atacama Large Millimeter/submillimeter (ALMA) Array Front End (FE) system is the first element in a complex chain of signal receiving, conversion, processing and recording. 70 Front Ends will be required for the project. The Front End is designed to receive signals in ten different frequency bands. In the initial phase of operations, the antennas will be fully equipped with six bands. These are Band 3 (84-116 GHz), Band 4 (125-163 GHz), Band 6 (211-275 GHz), Band 7 (275-373 GHz), Band 8 (385-500 GHz) and Band 9 (602-720 GHz). It is planned to equip the antennas with the missing bands at a later stage of ALMA operations, with a few Band 5 (163-211 GHz) and Band 10 (787-950 GHz) receivers in use before the end of the construction project.
The ALMA Front End is far superior to any existing receiver systems; spin-offs of the ALMA prototypes are leading to improved sensitivities in existing millimeter and submillimeter observatories. The Front End units are comprised of numerous elements, produced at different locations in Europe, North America and East Asia and are integrated at several Front End integration centers (FEIC) to insure timely delivery of all the units to Chile. The North American FEIC (NA FEIC) is at the National Radio Astronomy Observatory facility in Charlottesville, Virginia, USA.
This paper describes the design and performance of the test set used at the NA FEIC to check the performance of the Front Ends, following integration and prior to shipment to Chile.
△ Less
Submitted 1 September, 2010;
originally announced September 2010.