Search | arXiv e-print repository

arXiv:2008.00065 [pdf, other]

State Readout of a Trapped Ion Qubit Using a Trap-Integrated Superconducting Photon Detector

Authors: S. L. Todaro, V. B. Verma, K. C. McCormick, D. T. C. Allcock, R. P. Mirin, D. J. Wineland, S. W. Nam, A. C. Wilson, D. Leibfried, D. H. Slichter

Abstract: We report high-fidelity state readout of a trapped ion qubit using a trap-integrated photon detector. We determine the hyperfine qubit state of a single $^9$Be$^+$ ion held in a surface-electrode rf ion trap by counting state-dependent ion fluorescence photons with a superconducting nanowire single-photon detector (SNSPD) fabricated into the trap structure. The average readout fidelity is 0.9991(1… ▽ More We report high-fidelity state readout of a trapped ion qubit using a trap-integrated photon detector. We determine the hyperfine qubit state of a single $^9$Be$^+$ ion held in a surface-electrode rf ion trap by counting state-dependent ion fluorescence photons with a superconducting nanowire single-photon detector (SNSPD) fabricated into the trap structure. The average readout fidelity is 0.9991(1), with a mean readout duration of 46 $μ$s, and is limited by the polarization impurity of the readout laser beam and by off-resonant optical pum**. Because there are no intervening optical elements between the ion and the detector, we can use the ion fluorescence as a self-calibrated photon source to determine the detector quantum efficiency and its dependence on photon incidence angle and polarization. △ Less

Submitted 31 July, 2020; originally announced August 2020.

Comments: 15 pages, 11 figures, including supplemental material

Journal ref: Phys. Rev. Lett. 126, 010501 (2021)

arXiv:2007.09015 [pdf, other]

Capacity Value of Solar Power and Other Variable Generation

Authors: S. Awara, M. Lynch, S. Pfenninger, K. Schell, R. Sioshansi, I. Staffell, N. Samaan, S. H. Tindemans, A. L. Wilson, S. Zachary, H. Zareipour, C. J. Dent

Abstract: This paper reviews methods that are used for adequacy risk assessment considering solar power and for assessment of the capacity value of solar power. The properties of solar power are described as seen from the perspective of the power-system operator, comparing differences in energy availability and capacity factors with those of wind power. Methodologies for risk calculations considering variab… ▽ More This paper reviews methods that are used for adequacy risk assessment considering solar power and for assessment of the capacity value of solar power. The properties of solar power are described as seen from the perspective of the power-system operator, comparing differences in energy availability and capacity factors with those of wind power. Methodologies for risk calculations considering variable generation are surveyed, including the probability background, statistical-estimation approaches, and capacity-value metrics. Issues in incorporating variable generation in capacity markets are described, followed by a review of applied studies considering solar power. Finally, recommendations for further research are presented. △ Less

Submitted 17 July, 2020; originally announced July 2020.

Comments: 8 pages, 3 figures, submitted to IEEE TPS

arXiv:2007.06762 [pdf]

Dynamics of B-cell repertoires and emergence of cross-reactive responses in COVID-19 patients with different disease severity

Authors: Zachary Montague, Huibin Lv, Jakub Otwinowski, William S. DeWitt, Giulio Isacchini, Garrick K. Yip, Wilson W. Ng, Owen Tak-Yin Tsang, Meng Yuan, Hejun Liu, Ian A. Wilson, J. S. Malik Peiris, Nicholas C. Wu, Armita Nourmohammad, Chris Ka Pun Mok

Abstract: COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multipl… ▽ More COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19. △ Less

Submitted 5 April, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

arXiv:2007.01374 [pdf, other]

Smt-Switch: a solver-agnostic C++ API for SMT Solving

Authors: Makai Mann, Amalee Wilson, Cesare Tinelli, Clark Barrett

Abstract: This extended abstract describes work in progress on Smt-Switch, an open-source, solver-agnostic API for SMT solving. Smt-Switch provides an abstract interface, which can be implemented by different SMT solvers. Smt-Switch provides simple, uniform, and high-performance access to SMT solving for applications in areas such as automated reasoning, planning, and formal verification. The interface allo… ▽ More This extended abstract describes work in progress on Smt-Switch, an open-source, solver-agnostic API for SMT solving. Smt-Switch provides an abstract interface, which can be implemented by different SMT solvers. Smt-Switch provides simple, uniform, and high-performance access to SMT solving for applications in areas such as automated reasoning, planning, and formal verification. The interface allows the user to create, traverse, and manipulate terms, as well as to dynamically dispatch queries to different underlying SMT solvers. △ Less

Submitted 12 July, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

Comments: This version adds a reference to metaSMT. 11 pages, 1 figure, to be included in SMT Workshop 2020: http://smt-workshop.cs.uiowa.edu/2020/

arXiv:2006.09157 [pdf, other]

Selecting Diverse Models for Scientific Insight

Authors: Laura J. Wendelberger, Brian J. Reich, Alyson G. Wilson

Abstract: Model selection often aims to choose a single model, assuming that the form of the model is correct. However, there may be multiple possible underlying explanatory patterns in a set of predictors that could explain a response. Model selection without regard for model uncertainty can fail to bring these patterns to light. We explore multi-model penalized regression (MMPR) to acknowledge model uncer… ▽ More Model selection often aims to choose a single model, assuming that the form of the model is correct. However, there may be multiple possible underlying explanatory patterns in a set of predictors that could explain a response. Model selection without regard for model uncertainty can fail to bring these patterns to light. We explore multi-model penalized regression (MMPR) to acknowledge model uncertainty in the context of penalized regression. We examine how different penalty settings can promote either shrinkage or sparsity of coefficients in separate models. The method is tuned to explicitly limit model similarity. A choice of penalty form that enforces variable selection is applied to predict stacking fault energy (SFE) from steel alloy composition. The aim is to identify multiple models with different subsets of covariates that explain a single type of response. △ Less

Submitted 15 December, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

Comments: 37 Pages, 14 Figures. Presented at Conference on Data Analysis (CoDA) 2020 (Feb 25-27)

arXiv:2006.08545 [pdf, other]

Why Normalizing Flows Fail to Detect Out-of-Distribution Data

Authors: Polina Kirichenko, Pavel Izmailov, Andrew Gordon Wilson

Abstract: Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems. Normalizing flows are flexible deep generative models that often surprisingly fail to distinguish between in- and out-of-distribution data: a flow trained on pictures of clothing assigns higher likelihood to handwritten digits. We investigate why normalizing flows perform poorly for OOD detection. We demonstra… ▽ More Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems. Normalizing flows are flexible deep generative models that often surprisingly fail to distinguish between in- and out-of-distribution data: a flow trained on pictures of clothing assigns higher likelihood to handwritten digits. We investigate why normalizing flows perform poorly for OOD detection. We demonstrate that flows learn local pixel correlations and generic image-to-latent-space transformations which are not specific to the target image dataset. We show that by modifying the architecture of flow coupling layers we can bias the flow towards learning the semantic structure of the target data, improving OOD detection. Our investigation reveals that properties that enable flows to generate high-fidelity images can have a detrimental effect on OOD detection. △ Less

Submitted 15 June, 2020; originally announced June 2020.

Comments: Code is available at https://github.com/PolinaKirichenko/flows_ood

arXiv:2006.06900 [pdf, other]

Improving GAN Training with Probability Ratio Clip** and Sample Reweighting

Authors: Yue Wu, Pan Zhou, Andrew Gordon Wilson, Eric P. Xing, Zhiting Hu

Abstract: Despite success on a wide range of problems related to vision, generative adversarial networks (GANs) often suffer from inferior performance due to unstable training, especially for text generation. To solve this issue, we propose a new variational GAN training framework which enjoys superior training stability. Our approach is inspired by a connection of GANs and reinforcement learning under a va… ▽ More Despite success on a wide range of problems related to vision, generative adversarial networks (GANs) often suffer from inferior performance due to unstable training, especially for text generation. To solve this issue, we propose a new variational GAN training framework which enjoys superior training stability. Our approach is inspired by a connection of GANs and reinforcement learning under a variational perspective. The connection leads to (1) probability ratio clip** that regularizes generator training to prevent excessively large updates, and (2) a sample re-weighting mechanism that improves discriminator training by downplaying bad-quality fake samples. Moreover, our variational GAN framework can provably overcome the training issue in many GANs that an optimal discriminator cannot provide any informative gradient to training generator. By plugging the training approach in diverse state-of-the-art GAN architectures, we obtain significantly improved performance over a range of tasks, including text generation, text style transfer, and image generation. △ Less

Submitted 30 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

Comments: NeurIPS 2020 camera ready version (citations updated)

arXiv:2006.00326 [pdf, ps, other]

doi 10.1002/env.2642

Bayesian Nonparametric Monotone Regression

Authors: Ander Wilson, Jessica Tryner, Christian L'Orange, John Volckens

Abstract: In many applications there is interest in estimating the relation between a predictor and an outcome when the relation is known to be monotone or otherwise constrained due to the physical processes involved. We consider one such application--inferring time-resolved aerosol concentration from a low-cost differential pressure sensor. The objective is to estimate a monotone function and make inferenc… ▽ More In many applications there is interest in estimating the relation between a predictor and an outcome when the relation is known to be monotone or otherwise constrained due to the physical processes involved. We consider one such application--inferring time-resolved aerosol concentration from a low-cost differential pressure sensor. The objective is to estimate a monotone function and make inference on the scaled first derivative of the function. We proposed Bayesian nonparametric monotone regression which uses a Bernstein polynomial basis to construct the regression function and puts a Dirichlet process prior on the regression coefficients. The base measure of the Dirichlet process is a finite mixture of a mass point at zero and a truncated normal. This construction imposes monotonicity while clustering the basis functions. Clustering the basis functions reduces the parameter space and allows the estimated regression function to be linear. With the proposed approach we can make closed-formed inference on the derivative of the estimated function including full quantification of uncertainty. In a simulation study the proposed method performs similar to other monotone regression approaches when the true function is wavy but performs better when the true function is linear. We apply the method to estimate time-resolved aerosol concentration with a newly-developed portable aerosol monitor. The R package bnmr is made available to implement the method. △ Less

Submitted 30 May, 2020; originally announced June 2020.

Journal ref: Environmetrics 2020

arXiv:2005.07673 [pdf]

Epidemic models with geography

Authors: Alan Wilson

Abstract: Most epidemic models are spatially aggregate and the index which is most used for planning and policy numbers, the r number, typically refers to a single system of interest. Even if r numbers are calculated for each of adjacent areas, regions or countries for example, there is no interaction between them. Here we aim to offer a fine-grained geography: models of epidemics in spatially disaggregated… ▽ More Most epidemic models are spatially aggregate and the index which is most used for planning and policy numbers, the r number, typically refers to a single system of interest. Even if r numbers are calculated for each of adjacent areas, regions or countries for example, there is no interaction between them. Here we aim to offer a fine-grained geography: models of epidemics in spatially disaggregated systems with interactions. This offers the possibility of new insights into the dynamics of epidemics and of policies aimed at mitigation and control. △ Less

Submitted 15 May, 2020; originally announced May 2020.

Comments: 5 pages

arXiv:2004.14645 [pdf, other]

doi 10.1051/0004-6361/202038296

Discovery and characterization of the exoplanets WASP-148b and c. A transiting system with two interacting giant planets

Authors: G. Hebrard, R. F. Diaz, A. C. M. Correia, A. Collier Cameron, J. Laskar, D. Pollacco, J. -M. Almenara, D. R. Anderson, S. C. C. Barros, I. Boisse, A. S. Bonomo, F. Bouchy, G. Boue, P. Boumis, D. J. A. Brown, S. Dalal, M. Deleuil, O. Demangeon, A. P. Doyle, C. A. Haswell, C. Hellier, H. Osborn, F. Kiefer, U. C. Kolb, K. Lam , et al. (17 additional authors not shown)

Abstract: We present the discovery and characterization of WASP-148, a new extrasolar system that includes at least two giant planets. The host star is a slowly rotating inactive late-G dwarf with a V=12 magnitude. The planet WASP-148b is a hot Jupiter of 0.72 R_Jup and 0.29 M_Jup that transits its host with an orbital period of 8.80 days. We found the planetary candidate with the SuperWASP photometric surv… ▽ More We present the discovery and characterization of WASP-148, a new extrasolar system that includes at least two giant planets. The host star is a slowly rotating inactive late-G dwarf with a V=12 magnitude. The planet WASP-148b is a hot Jupiter of 0.72 R_Jup and 0.29 M_Jup that transits its host with an orbital period of 8.80 days. We found the planetary candidate with the SuperWASP photometric survey, then characterized it with the SOPHIE spectrograph. Our radial velocity measurements subsequently revealed a second planet in the system, WASP-148c, with an orbital period of 34.5 days and a minimum mass of 0.40 M_Jup. No transits of this outer planet were detected. The orbits of both planets are eccentric and fall near the 4:1 mean-motion resonances. This configuration is stable on long timescales, but induces dynamical interactions so that the orbits differ slightly from purely Keplerian orbits. In particular, WASP-148b shows transit-timing variations of typically 15 minutes, making it the first interacting system with transit-timing variations that is detected on ground-based light curves. We establish that the mutual inclination of the orbital plane of the two planets cannot be higher than 35 degrees, and the true mass of WASP-148c is below 0.60 M_Jup. We present photometric and spectroscopic observations of this system that cover a time span of ten years. We also provide their Keplerian and Newtonian analyses; these analyses should be significantly improved through future TESS~observations. △ Less

Submitted 24 June, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

Comments: 16 pages, 10 figures, 7 tables, A&A in press

Journal ref: A&A 640, A32 (2020)

arXiv:2003.10314 [pdf, other]

doi 10.1038/s41586-020-2421-7

A remnant planetary core in the hot-Neptune desert

Authors: David J. Armstrong, Théo A. Lopez, Vardan Adibekyan, Richard A. Booth, Edward M. Bryant, Karen A. Collins, Alexandre Emsenhuber, Chelsea X. Huang, George W. King, Jorge Lillo-box, Jack J. Lissauer, Elisabeth C. Matthews, Olivier Mousis, Louise D. Nielsen, Hugh Osborn, Jon Otegi, Nuno C. Santos, Sérgio G. Sousa, Keivan G. Stassun, Dimitri Veras, Carl Ziegler, Jack S. Acton, Jose M. Almenara, David R. Anderson, David Barrado , et al. (69 additional authors not shown)

Abstract: The interiors of giant planets remain poorly understood. Even for the planets in the Solar System, difficulties in observation lead to large uncertainties in the properties of planetary cores. Exoplanets that have undergone rare evolutionary processes provide a route to understanding planetary interiors. Planets found in and near the typically barren hot-Neptune 'desert' (a region in mass-radius s… ▽ More The interiors of giant planets remain poorly understood. Even for the planets in the Solar System, difficulties in observation lead to large uncertainties in the properties of planetary cores. Exoplanets that have undergone rare evolutionary processes provide a route to understanding planetary interiors. Planets found in and near the typically barren hot-Neptune 'desert' (a region in mass-radius space that contains few planets) have proved to be particularly valuable in this regard. These planets include HD149026b, which is thought to have an unusually massive core, and recent discoveries such as LTT9779b and NGTS-4b, on which photoevaporation has removed a substantial part of their outer atmospheres. Here we report observations of the planet TOI-849b, which has a radius smaller than Neptune's but an anomalously large mass of $39.1^{+2.7}_{-2.6}$ Earth masses and a density of $5.2^{+0.7}_{-0.8}$ grams per cubic centimetre, similar to Earth's. Interior structure models suggest that any gaseous envelope of pure hydrogen and helium consists of no more than $3.9^{+0.8}_{-0.9}$ per cent of the total planetary mass. The planet could have been a gas giant before undergoing extreme mass loss via thermal self-disruption or giant planet collisions, or it could have avoided substantial gas accretion, perhaps through gap opening or late formation. Although photoevaporation rates cannot account for the mass loss required to reduce a Jupiter-like gas giant, they can remove a small (a few Earth masses) hydrogen and helium envelope on timescales of several billion years, implying that any remaining atmosphere on TOI-849b is likely to be enriched by water or other volatiles from the planetary interior. We conclude that TOI-849b is the remnant core of a giant planet. △ Less

Submitted 16 July, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

Comments: Published in Nature. This is a preprint of the article, before minor changes made during the refereeing and editing process. The published PDF is at https://www.nature.com/articles/s41586-020-2421-7 and can be accessed for free by following this link: https://rdcu.be/b5miB . Abstract updated to match published version

arXiv:2003.09060 [pdf, other]

doi 10.1364/JOSAB.475467

VECSEL systems for quantum information processing with trapped beryllium ions

Authors: S. C. Burd, J. -P. Penttinen, P. -Y. Hou, H. M. Knaack, S. Ranta, M. Mäki, E. Kantola, M. Guina, D. H. Slichter, D. Leibfried, A. C. Wilson

Abstract: Two vertical-external-cavity surface-emitting laser (VECSEL) systems producing ultraviolet (UV) radiation at 235 nm and 313 nm are demonstrated. The systems are suitable for quantum information processing applications with trapped beryllium ions. Each system consists of a compact, single-frequency, continuous-wave VECSEL producing high-power near-infrared light, tunable over tens of nanometers. On… ▽ More Two vertical-external-cavity surface-emitting laser (VECSEL) systems producing ultraviolet (UV) radiation at 235 nm and 313 nm are demonstrated. The systems are suitable for quantum information processing applications with trapped beryllium ions. Each system consists of a compact, single-frequency, continuous-wave VECSEL producing high-power near-infrared light, tunable over tens of nanometers. One system generates 2.4 W at 940 nm, using a gain mirror based on GaInAs/GaAs quantum wells, which is converted to 54 mW of 235 nm light for photoionization of neutral beryllium atoms. The other system uses a novel gain mirror based on GaInNAs/GaAs quantum-wells, enabling wavelength extension with manageable strain in the GaAs lattice. This system generates 1.6 W at 1252 nm, which is converted to 41 mW of 313 nm light that is used to laser cool trapped $^{9}$Be$^{+}$ ions and to implement quantum state preparation and detection. The 313 nm system is also suitable for implementing high-fidelity quantum gates, and more broadly, our results extend the capabilities of VECSEL systems for applications in atomic, molecular, and optical physics. △ Less

Submitted 19 March, 2020; originally announced March 2020.

Comments: 8 pages, 7 figures

Journal ref: JOSA B 40, 773 (2023)

arXiv:2003.03520 [pdf, other]

doi 10.1002/qute.202000028

Ion transport and reordering in a two-dimensional trap array

Authors: Y. Wan, R. Jördens, S. D. Erickson, J. J. Wu, R. Bowler, T. R. Tan, P. -Y. Hou, D. J. Wineland, A. C. Wilson, D. Leibfried

Abstract: Scaling quantum information processors is a challenging task, requiring manipulation of a large number of qubits with high fidelity and a high degree of connectivity. For trapped ions, this could be realized in a two-dimensional array of interconnected traps in which ions are separated, transported and recombined to carry out quantum operations on small subsets of ions. Here, we use a junction con… ▽ More Scaling quantum information processors is a challenging task, requiring manipulation of a large number of qubits with high fidelity and a high degree of connectivity. For trapped ions, this could be realized in a two-dimensional array of interconnected traps in which ions are separated, transported and recombined to carry out quantum operations on small subsets of ions. Here, we use a junction connecting orthogonal linear segments in a two-dimensional (2D) trap array to reorder a two-ion crystal. The secular motion of the ions experiences low energy gain and the internal qubit levels maintain coherence during the reordering process, therefore demonstrating a promising method for providing all-to-all connectivity in a large-scale, two- or three-dimensional trapped-ion quantum information processor. △ Less

Submitted 7 March, 2020; originally announced March 2020.

arXiv:2003.02139 [pdf, other]

Rethinking Parameter Counting in Deep Models: Effective Dimensionality Revisited

Authors: Wesley J. Maddox, Gregory Benton, Andrew Gordon Wilson

Abstract: Neural networks appear to have mysterious generalization properties when using parameter counting as a proxy for complexity. Indeed, neural networks often have many more parameters than there are data points, yet still provide good generalization performance. Moreover, when we measure generalization as a function of parameters, we see double descent behaviour, where the test error decreases, incre… ▽ More Neural networks appear to have mysterious generalization properties when using parameter counting as a proxy for complexity. Indeed, neural networks often have many more parameters than there are data points, yet still provide good generalization performance. Moreover, when we measure generalization as a function of parameters, we see double descent behaviour, where the test error decreases, increases, and then again decreases. We show that many of these properties become understandable when viewed through the lens of effective dimensionality, which measures the dimensionality of the parameter space determined by the data. We relate effective dimensionality to posterior contraction in Bayesian deep learning, model selection, width-depth tradeoffs, double descent, and functional diversity in loss surfaces, leading to a richer understanding of the interplay between parameters and functions in deep models. We also show that effective dimensionality compares favourably to alternative norm- and flatness- based generalization measures. △ Less

Submitted 25 May, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

arXiv:2003.00617 [pdf, other]

Approximate Cross-validation: Guarantees for Model Assessment and Selection

Authors: Ashia Wilson, Maximilian Kasy, Lester Mackey

Abstract: Cross-validation (CV) is a popular approach for assessing and selecting predictive models. However, when the number of folds is large, CV suffers from a need to repeatedly refit a learning procedure on a large number of training datasets. Recent work in empirical risk minimization (ERM) approximates the expensive refitting with a single Newton step warm-started from the full training set optimizer… ▽ More Cross-validation (CV) is a popular approach for assessing and selecting predictive models. However, when the number of folds is large, CV suffers from a need to repeatedly refit a learning procedure on a large number of training datasets. Recent work in empirical risk minimization (ERM) approximates the expensive refitting with a single Newton step warm-started from the full training set optimizer. While this can greatly reduce runtime, several open questions remain including whether these approximations lead to faithful model selection and whether they are suitable for non-smooth objectives. We address these questions with three main contributions: (i) we provide uniform non-asymptotic, deterministic model assessment guarantees for approximate CV; (ii) we show that (roughly) the same conditions also guarantee model selection performance comparable to CV; (iii) we provide a proximal Newton extension of the approximate CV framework for non-smooth prediction problems and develop improved assessment guarantees for problems such as l1-regularized ERM. △ Less

Submitted 10 June, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

arXiv:2002.12880 [pdf, other]

Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data

Authors: Marc Finzi, Samuel Stanton, Pavel Izmailov, Andrew Gordon Wilson

Abstract: The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equi… ▽ More The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group with a surjective exponential map. Incorporating equivariance to a new group requires implementing only the group exponential and logarithm maps, enabling rapid prototy**. Showcasing the simplicity and generality of our method, we apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems. For Hamiltonian systems, the equivariance of our models is especially impactful, leading to exact conservation of linear and angular momentum. △ Less

Submitted 24 September, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

Comments: ICML 2020. Code available at https://github.com/mfinzi/LieConv

arXiv:2002.08791 [pdf, other]

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

Authors: Andrew Gordon Wilson, Pavel Izmailov

Abstract: The key distinguishing property of a Bayesian approach is marginalization, rather than using a single setting of weights. Bayesian marginalization can particularly improve the accuracy and calibration of modern deep neural networks, which are typically underspecified by the data, and can represent many compelling but different solutions. We show that deep ensembles provide an effective mechanism f… ▽ More The key distinguishing property of a Bayesian approach is marginalization, rather than using a single setting of weights. Bayesian marginalization can particularly improve the accuracy and calibration of modern deep neural networks, which are typically underspecified by the data, and can represent many compelling but different solutions. We show that deep ensembles provide an effective mechanism for approximate Bayesian marginalization, and propose a related approach that further improves the predictive distribution by marginalizing within basins of attraction, without significant overhead. We also investigate the prior over functions implied by a vague distribution over neural network weights, explaining the generalization properties of such models from a probabilistic perspective. From this perspective, we explain results that have been presented as mysterious and distinct to neural network generalization, such as the ability to fit images with random labels, and show that these results can be reproduced with Gaussian processes. We also show that Bayesian model averaging alleviates double descent, resulting in monotonic performance improvements with increased flexibility. Finally, we provide a Bayesian perspective on tempering for calibrating predictive distributions. △ Less

Submitted 30 March, 2022; v1 submitted 20 February, 2020; originally announced February 2020.

Comments: 31 pages, 19 figures

arXiv:2002.08424 [pdf, other]

doi 10.1088/1748-0221/15/06/P06033

Construction of precision wire readout planes for the Short-Baseline Near Detector (SBND)

Authors: R. Acciarri, C. Adams, C. Andreopoulos, J. Asaadi, M. Babicz, C. Backhouse, W. Badgett, L. F. Bagby, D. Barker, C. Barnes, A. Basharina-Freshville, V. Basque, A. Baxter, M. C. Q. Bazetto, O. Beltramello, M. Betancourt, A. Bhanderi, A. Bhat, M. R. M. Bishai, A. Bitadze, A. S. T. Blake, J. Boissevain, C. Bonifazi, J. Y. Book, D. Brailsford , et al. (170 additional authors not shown)

Abstract: The Short-Baseline Near Detector time projection chamber is unique in the design of its charge readout planes. These anode plane assemblies (APAs) have been fabricated and assembled to meet strict accuracy and precision requirements: wire spacing of 3 mm +/- 0.5 mm and wire tension of 7 N +/- 1 N across 3,964 wires per APA, and flatness within 0.5 mm over the 4 m +/- 2.5 m extent of each APA. This… ▽ More The Short-Baseline Near Detector time projection chamber is unique in the design of its charge readout planes. These anode plane assemblies (APAs) have been fabricated and assembled to meet strict accuracy and precision requirements: wire spacing of 3 mm +/- 0.5 mm and wire tension of 7 N +/- 1 N across 3,964 wires per APA, and flatness within 0.5 mm over the 4 m +/- 2.5 m extent of each APA. This paper describes the design, manufacture and assembly of these key detector components, with a focus on the quality assurance at each stage. △ Less

Submitted 24 April, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: 42 pages, 45 figures. Prepared for submission to JINST

arXiv:2002.05323 [pdf, other]

Top of the Batch: Interviews and the Match

Authors: Federico Echenique, Ruy Gonzalez, Alistair Wilson, Leeat Yariv

Abstract: Most doctors in the NRMP are matched to one of their most-preferred internship programs. Since various surveys indicate similarities across doctors' preferences, this suggests a puzzle. How can nearly everyone get a position in a highly-desirable program when positions in each program are scarce? We provide one possible explanation for this puzzle. We show that the patterns observed in the NRMP da… ▽ More Most doctors in the NRMP are matched to one of their most-preferred internship programs. Since various surveys indicate similarities across doctors' preferences, this suggests a puzzle. How can nearly everyone get a position in a highly-desirable program when positions in each program are scarce? We provide one possible explanation for this puzzle. We show that the patterns observed in the NRMP data may be an artifact of the interview process that precedes the match. Our analysis highlights the importance of interactions occurring outside of a matching clearinghouse for resulting outcomes, and casts doubts on analysis of clearinghouses that take reported preferences at face value. △ Less

Submitted 8 December, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

arXiv:2002.02474 [pdf, other]

Design of a Fully Actuated Robotic Hand With Multiple Gelsight Tactile Sensors

Authors: Achu Wilson, Shaoxiong Wang, Branden Romero, Edward Adelson

Abstract: This work details the design of a novel two finger robot gripper with multiple Gelsight based optical-tactile sensors covering the inner surface of the hand. The multiple Gelsight sensors can gather the surface topology of the object from multiple views simultaneously as well as can track the shear and tensile stress. In addition, other sensing modalities enable the hand to gather the thermal, aco… ▽ More This work details the design of a novel two finger robot gripper with multiple Gelsight based optical-tactile sensors covering the inner surface of the hand. The multiple Gelsight sensors can gather the surface topology of the object from multiple views simultaneously as well as can track the shear and tensile stress. In addition, other sensing modalities enable the hand to gather the thermal, acoustic and vibration information from the object being grasped. The force controlled gripper is fully actuated so that it can be used for various grasp configurations and can also be used for in-hand manipulation tasks. Here we present the design of such a gripper. △ Less

Submitted 6 February, 2020; originally announced February 2020.

arXiv:2002.01755 [pdf, other]

doi 10.1051/0004-6361/201937080

Three planets transiting the evolved star EPIC 249893012: a hot 8.8-M$_\oplus$ super-Earth and two warm 14.7 and 10.2-M$_\oplus$ sub-Neptunes

Authors: D. Hidalgo, E. Pallé, R. Alonso, D. Gandolfi, M. Fridlund, G. Nowak, R. Luque, T. Hirano, A. B. Justesen, W. D. Cochran, O. Barragan, L. Spina, F. Rodler, S. Albrecht, D. Anderson, P. Amado, E. Bryant, J. A. Caballero, J. Cabrera, Sz. Csizmadia, F. Dai, J. De Leon, H. J. Deeg, Ph. Eigmuller, M. Endl , et al. (35 additional authors not shown)

Abstract: We report the discovery of a new planetary system with three transiting planets, one super-Earth and two sub-Neptunes, that orbit EPIC\,249893012, a G8\,IV-V evolved star ($M_\star$\,=\,1.05\,$\pm$\,0.05\,$M_\odot$, $R_\star$\,=\,1.71\,$\pm$\,0.04\,$R_\odot$, $T_\mathrm{eff}$\,=5430\,$\pm$\,85\,K). The star is just leaving the main sequence. We combined \ktwo \ photometry with IRCS adaptive-optics… ▽ More We report the discovery of a new planetary system with three transiting planets, one super-Earth and two sub-Neptunes, that orbit EPIC\,249893012, a G8\,IV-V evolved star ($M_\star$\,=\,1.05\,$\pm$\,0.05\,$M_\odot$, $R_\star$\,=\,1.71\,$\pm$\,0.04\,$R_\odot$, $T_\mathrm{eff}$\,=5430\,$\pm$\,85\,K). The star is just leaving the main sequence. We combined \ktwo \ photometry with IRCS adaptive-optics imaging and HARPS, HARPS-N, and CARMENES high-precision radial velocity measurements to confirm the planetary system, determine the stellar parameters, and measure radii, masses, and densities of the three planets. With an orbital period of $3.5949^{+0.0007}_{-0.0007}$ days, a mass of $8.75^{+1.09}_{-1.08}\ M_{\oplus}$ , and a radius of $1.95^{+0.09}_{-0.08}\ R_{\oplus}$, the inner planet b is compatible with nickel-iron core and a silicate mantle ($ρ_b= 6.39^{+1.19}_{-1.04}$ g cm$^{-3}$). Planets c and d with orbital periods of $15.624^{+0.001}_{-0.001}$ and $35.747^{+0.005}_{-0.005}$ days, respectively, have masses and radii of $14.67^{+1,84}_{-1.89}\ M_{\oplus}$ and $3.67^{+0.17}_{-0.14}\ R_{\oplus}$ and $10.18^{+2.46}_{-2.42}\ M_{\oplus}$ and $3.94^{+0.13}_{-0.12}\ R_{\oplus}$, respectively, yielding a mean density of $1.62^{+0.30}_{-0.29}$ and $0.91^{+0.25}_{-0.23}$ g cm$^{-3}$, respectively. The radius of planet b lies in the transition region between rocky and gaseous planets, but its density is consistent with a rocky composition. Its semimajor axis and the corresponding photoevaporation levels to which the planet has been exposed might explain its measured density today. In contrast, the densities and semimajor axes of planets c and d suggest a very thick atmosphere. The singularity of this system, which orbits a slightly evolved star that is just leaving the main sequence, makes it a good candidate for a deeper study from a dynamical point of view. △ Less

Submitted 5 February, 2020; originally announced February 2020.

Comments: Accepted for publication in A\& A

Journal ref: A&A 636, A89 (2020)

arXiv:2001.10995 [pdf, ps, other]

The Case for Bayesian Deep Learning

Authors: Andrew Gordon Wilson

Abstract: The key distinguishing property of a Bayesian approach is marginalization instead of optimization, not the prior, or Bayes rule. Bayesian inference is especially compelling for deep neural networks. (1) Neural networks are typically underspecified by the data, and can represent many different but high performing models corresponding to different settings of parameters, which is exactly when margin… ▽ More The key distinguishing property of a Bayesian approach is marginalization instead of optimization, not the prior, or Bayes rule. Bayesian inference is especially compelling for deep neural networks. (1) Neural networks are typically underspecified by the data, and can represent many different but high performing models corresponding to different settings of parameters, which is exactly when marginalization will make the biggest difference for both calibration and accuracy. (2) Deep ensembles have been mistaken as competing approaches to Bayesian methods, but can be seen as approximate Bayesian marginalization. (3) The structure of neural networks gives rise to a structured prior in function space, which reflects the inductive biases of neural networks that help them generalize. (4) The observed correlation between parameters in flat regions of the loss and a diversity of solutions that provide good generalization is further conducive to Bayesian marginalization, as flat regions occupy a large volume in a high dimensional space, and each different solution will make a good contribution to a Bayesian model average. (5) Recent practical advances for Bayesian deep learning provide improvements in accuracy and calibration compared to standard training, while retaining scalability. △ Less

Submitted 29 January, 2020; originally announced January 2020.

arXiv:2001.08988 [pdf]

doi 10.1016/j.jclinepi.2020.07.014

Towards a Framework for the Design, Implementation and Reporting of Methodology Sco** Reviews

Authors: Glen P. Martin, David Jenkins, Lucy Bull, Rose Sisk, Li**g Lin, William Hulme, Anthony Wilson, Wenjuan Wang, Michael Barrowman, Camilla Sammut-Powell, Alexander Pate, Matthew Sperrin, Niels Peek

Abstract: Background: In view of the growth of published papers, there is an increasing need for studies that summarise scientific research. An increasingly common review is a 'Methodology sco** review', which provides a summary of existing analytical methods, techniques and software, proposed or applied in research articles, which address an analytical problem or further an analytical approach. However,… ▽ More Background: In view of the growth of published papers, there is an increasing need for studies that summarise scientific research. An increasingly common review is a 'Methodology sco** review', which provides a summary of existing analytical methods, techniques and software, proposed or applied in research articles, which address an analytical problem or further an analytical approach. However, guidelines for their design, implementation and reporting are limited. Methods: Drawing on the experiences of the authors, which were consolidated through a series of face-to-face workshops, we summarise the challenges inherent in conducting a methodology sco** review and offer suggestions of best practice to promote future guideline development. Results: We identified three challenges of conducting a methodology sco** review. First, identification of search terms; one cannot usually define the search terms a priori and the language used for a particular method can vary across the literature. Second, the scope of the review requires careful consideration since new methodology is often not described (in full) within abstracts. Third, many new methods are motivated by a specific clinical question, where the methodology may only be documented in supplementary materials. We formulated several recommendations that build upon existing review guidelines. These recommendations ranged from an iterative approach to defining search terms through to screening and data extraction processes. Conclusion: Although methodology sco** reviews are an important aspect of research, there is currently a lack of guidelines to standardise their design, implementation and reporting. We recommend a wider discussion on this topic. △ Less

Submitted 16 January, 2020; originally announced January 2020.

Comments: 22 pages, 2 tables

Journal ref: Journal of Clinical Epidemiology. (2020)

arXiv:2001.08834 [pdf, other]

doi 10.1093/mnras/staa197

Mass determinations of the three mini-Neptunes transiting TOI-125

Authors: L. D. Nielsen, D. Gandolfi, D. J. Armstrong, J. S. Jenkins, M. Fridlund, N. C. Santos, F. Dai, V. Adibekyan, R. Luque, J. H. Steffen, M. Esposito, F. Meru, S. Sabotta, E. Bolmont, D. Kossakowski, J. F. Otegi, F. Murgas, M. Stalport, F. ~Rodler, M. R. Díaz, N. T. ~Kurtovic, G. Ricker, R. Vanderspek, D. W. Latham, S. Seager , et al. (55 additional authors not shown)

Abstract: The Transiting Exoplanet Survey Satellite, TESS, is currently carrying out an all-sky search for small planets transiting bright stars. In the first year of the TESS survey, steady progress was made in achieving the mission's primary science goal of establishing bulk densities for 50 planets smaller than Neptune. During that year, TESS's observations were focused on the southern ecliptic hemispher… ▽ More The Transiting Exoplanet Survey Satellite, TESS, is currently carrying out an all-sky search for small planets transiting bright stars. In the first year of the TESS survey, steady progress was made in achieving the mission's primary science goal of establishing bulk densities for 50 planets smaller than Neptune. During that year, TESS's observations were focused on the southern ecliptic hemisphere, resulting in the discovery of three mini-Neptunes orbiting the star TOI-125, a V=11.0 K0 dwarf. We present intensive HARPS radial velocity observations, yielding precise mass measurements for TOI-125b, TOI-125c and TOI-125d. TOI-125b has an orbital period of 4.65 days, a radius of $2.726 \pm 0.075 ~\mathrm{R_{\rm E}}$, a mass of $ 9.50 \pm 0.88 ~\mathrm{M_{\rm E}}$ and is near the 2:1 mean motion resonance with TOI-125c at 9.15 days. TOI-125c has a similar radius of $2.759 \pm 0.10 ~\mathrm{R_{\rm E}}$ and a mass of $ 6.63 \pm 0.99 ~\mathrm{M_{\rm E}}$, being the puffiest of the three planets. TOI-125d, has an orbital period of 19.98 days and a radius of $2.93 \pm 0.17~\mathrm{R_{\rm E}}$ and mass $13.6 \pm 1.2 ~\mathrm{M_{\rm E}}$. For TOI-125b and TOI-125d we find unusual high eccentricities of $0.19\pm 0.04$ and $0.17^{+0.08}_{-0.06}$, respectively. Our analysis also provides upper mass limits for the two low-SNR planet candidates in the system; for TOI-125.04 ($R_P=1.36 ~\mathrm{R_{\rm E}}$, $P=$0.53 days) we find a $2σ$ upper mass limit of $1.6~\mathrm{M_{\rm E}}$, whereas TOI-125.05 ( $R_P=4.2^{+2.4}_{-1.4} ~\mathrm{R_{\rm E}}$, $P=$ 13.28 days) is unlikely a viable planet candidate with upper mass limit $2.7~\mathrm{M_{\rm E}}$. We discuss the internal structure of the three confirmed planets, as well as dynamical stability and system architecture for this intriguing exoplanet system. △ Less

Submitted 23 January, 2020; originally announced January 2020.

Comments: Accepted for publication in MNRAS

arXiv:2001.08681 [pdf, other]

Bayesian estimates of transmission line outage rates that consider line dependencies

Authors: Kai Zhou, James R. Cruise, Chris J. Dent, Ian Dobson, Louis Wehenkel, Zhaoyu Wang, Amy L. Wilson

Abstract: Transmission line outage rates are fundamental to power system reliability analysis. Line outages are infrequent, occurring only about once a year, so outage data are limited. We propose a Bayesian hierarchical model that leverages line dependencies to better estimate outage rates of individual transmission lines from limited outage data. The Bayesian estimates have a lower standard deviation than… ▽ More Transmission line outage rates are fundamental to power system reliability analysis. Line outages are infrequent, occurring only about once a year, so outage data are limited. We propose a Bayesian hierarchical model that leverages line dependencies to better estimate outage rates of individual transmission lines from limited outage data. The Bayesian estimates have a lower standard deviation than estimating the outage rates simply by dividing the number of outages by the number of years of data, especially when the number of outages is small. The Bayesian model produces more accurate individual line outage rates, as well as estimates of the uncertainty of these rates. Better estimates of line outage rates can improve system risk assessment, outage prediction, and maintenance scheduling. △ Less

Submitted 23 January, 2020; originally announced January 2020.

arXiv:2001.07629 [pdf, other]

Efficient Computation of the Magnetic Polarizability Tensor Spectral Signature using POD

Authors: B. A. Wilson, P. D. Ledger

Abstract: Our interest lies in the identification of hidden conducting permeable objects from measurements of the perturbed magnetic field in metal detection taken over range of low frequencies. The magnetic polarizability tensor (MPT) provides a characterisation of a conducting permeable object using a small number of coefficients, has explicit formula for their calculation and a well understood frequency… ▽ More Our interest lies in the identification of hidden conducting permeable objects from measurements of the perturbed magnetic field in metal detection taken over range of low frequencies. The magnetic polarizability tensor (MPT) provides a characterisation of a conducting permeable object using a small number of coefficients, has explicit formula for their calculation and a well understood frequency behaviour, which we call its spectral signature. However, to compute such signatures, and build a library of them for object classification, requires repeated solution of a direct (full order) problem, which is typically accomplished using a finite element discretisation. To overcome this issue, we propose an efficient reduced order model (ROM) using a proper orthogonal decomposition (POD) for the rapid computation of MPT spectral signatures. Our ROM benefits from output certificates, which give bounds on the accuracy of the predicted outputs with respect to the full order model solutions. To further increase the efficiency of the computation of the MPT spectral signature, we provide scaling results, which enable an immediate calculation of the signature under changes in the object size or conductivity. We illustrate our approach by application to a range of homogenous and inhomogeneous conducting permeable objects. △ Less

Submitted 21 January, 2020; originally announced January 2020.

MSC Class: 65N30; 35R30; 35B30

arXiv:1912.13025 [pdf, other]

Semi-Supervised Learning with Normalizing Flows

Authors: Pavel Izmailov, Polina Kirichenko, Marc Finzi, Andrew Gordon Wilson

Abstract: Normalizing flows transform a latent distribution through an invertible neural network for a flexible and pleasingly simple approach to generative modelling, while preserving an exact likelihood. We propose FlowGMM, an end-to-end approach to generative semi supervised learning with normalizing flows, using a latent Gaussian mixture model. FlowGMM is distinct in its simplicity, unified treatment of… ▽ More Normalizing flows transform a latent distribution through an invertible neural network for a flexible and pleasingly simple approach to generative modelling, while preserving an exact likelihood. We propose FlowGMM, an end-to-end approach to generative semi supervised learning with normalizing flows, using a latent Gaussian mixture model. FlowGMM is distinct in its simplicity, unified treatment of labelled and unlabelled data with an exact likelihood, interpretability, and broad applicability beyond image data. We show promising results on a wide range of applications, including AG-News and Yahoo Answers text data, tabular data, and semi-supervised image classification. We also show that FlowGMM can discover interpretable structure, provide real-time optimization-free feature visualizations, and specify well calibrated predictive distributions. △ Less

Submitted 30 December, 2019; originally announced December 2019.

arXiv:1912.12834 [pdf, other]

Randomly Projected Additive Gaussian Processes for Regression

Authors: Ian A. Delbridge, David S. Bindel, Andrew Gordon Wilson

Abstract: Gaussian processes (GPs) provide flexible distributions over functions, with inductive biases controlled by a kernel. However, in many applications Gaussian processes can struggle with even moderate input dimensionality. Learning a low dimensional projection can help alleviate this curse of dimensionality, but introduces many trainable hyperparameters, which can be cumbersome, especially in the sm… ▽ More Gaussian processes (GPs) provide flexible distributions over functions, with inductive biases controlled by a kernel. However, in many applications Gaussian processes can struggle with even moderate input dimensionality. Learning a low dimensional projection can help alleviate this curse of dimensionality, but introduces many trainable hyperparameters, which can be cumbersome, especially in the small data regime. We use additive sums of kernels for GP regression, where each kernel operates on a different random projection of its inputs. Surprisingly, we find that as the number of random projections increases, the predictive performance of this approach quickly converges to the performance of a kernel operating on the original full dimensional inputs, over a wide range of data sets, even if we are projecting into a single dimension. As a consequence, many problems can remarkably be reduced to one dimensional input spaces, without learning a transformation. We prove this convergence and its rate, and additionally propose a deterministic approach that converges more quickly than purely random projections. Moreover, we demonstrate our approach can achieve faster inference and improved predictive accuracy for high-dimensional inputs compared to kernels in the original input space. △ Less

Submitted 30 December, 2019; originally announced December 2019.

arXiv:1911.13296 [pdf, other]

doi 10.1051/0004-6361/201937254

The SOPHIE search for northern extrasolar planets. XVI. HD 158259: A compact planetary system in a near-3:2 mean motion resonance chain

Authors: N. C. Hara, F. Bouchy, M. Stalport, I. Boisse, J. Rodrigues, J-. B. Delisle, A. Santerne, G. W. Henry, L. Arnold, N. Astudillo-Defru, S. Borgniet, X. Bonfils, V. Bourrier, B. Brugger, B. Courcol, S. Dalal, M. Deleuil, X. Delfosse, O. Demangeon, R. F. D'iaz, X. Dumusque, T. Forveille, G. Hébrard, M. Hobson, F. Kiefer , et al. (10 additional authors not shown)

Abstract: Since 2011, the SOPHIE spectrograph has been used to search for Neptunes and super-Earths in the Northern Hemisphere. As part of this observational program, 290 radial velocity measurements of the 6.4 V magnitude star HD 158259 were obtained. Additionally, TESS photometric measurements of this target are available. We present an analysis of the SOPHIE data and compare our results with the output o… ▽ More Since 2011, the SOPHIE spectrograph has been used to search for Neptunes and super-Earths in the Northern Hemisphere. As part of this observational program, 290 radial velocity measurements of the 6.4 V magnitude star HD 158259 were obtained. Additionally, TESS photometric measurements of this target are available. We present an analysis of the SOPHIE data and compare our results with the output of the TESS pipeline. The radial velocity data, ancillary spectroscopic indices, and ground-based photometric measurements were analyzed with classical and $\ell_1$ periodograms. The stellar activity was modeled as a correlated Gaussian noise and its impact on the planet detection was measured with a new technique. The SOPHIE data support the detection of five planets, each with $m \sin i \approx 6 M_\oplus$, orbiting HD 158259 in 3.4, 5.2, 7.9, 12, and 17.4 days. Though a planetary origin is strongly favored, the 17.4 d signal is classified as a planet candidate due to a slightly lower statistical significance and to its proximity to the expected stellar rotation period. The data also present low frequency variations, most likely originating from a magnetic cycle and instrument systematics. Furthermore, the TESS pipeline reports a significant signal at 2.17 days corresponding to a planet of radius $\approx 1.2 R_\oplus$. A compatible signal is seen in the radial velocities, which confirms the detection of an additional planet and yields a $\approx 2 M_\oplus$ mass estimate. We find a system of five planets and a strong candidate near a 3:2 mean motion resonance chain orbiting HD 158259. The planets are found to be outside of the two and three body resonances. △ Less

Submitted 26 March, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

Comments: Accepted for publication in Astronomy & Astrophysics

arXiv:1911.12628 [pdf, other]

doi 10.1093/mnras/staa1078

Detection of Na, K and H$_2$O in the hazy atmosphere of WASP-6b

Authors: Aarynn L. Carter, Nikolay Nikolov, David K. Sing, Munazza K. Alam, Jayesh M. Goyal, Thomas Mikal-Evans, Hannah R. Wakeford, Gregory W. Henry, Sam Morrell, Mercedes López-Morales, Barry Smalley, Panayotis Lavvas, Joanna K. Barstow, Antonio García Muñoz, Paul A. Wilson, Neale P. Gibson

Abstract: We present new observations of the transmission spectrum of the hot Jupiter WASP-6b both from the ground with the Very Large Telescope (VLT) FOcal Reducer and Spectrograph (FORS2) from 0.45-0.83 $μ$m, and space with the Transiting Exoplanet Survey Satellite (TESS) from 0.6-1.0 $μ$m and the Hubble Space Telescope (HST) Wide Field Camera 3 from 1.12-1.65 $μ$m. Archival data from the HST Space Telesc… ▽ More We present new observations of the transmission spectrum of the hot Jupiter WASP-6b both from the ground with the Very Large Telescope (VLT) FOcal Reducer and Spectrograph (FORS2) from 0.45-0.83 $μ$m, and space with the Transiting Exoplanet Survey Satellite (TESS) from 0.6-1.0 $μ$m and the Hubble Space Telescope (HST) Wide Field Camera 3 from 1.12-1.65 $μ$m. Archival data from the HST Space Telescope Imaging Spectrograph (STIS) and Spitzer is also reanalysed on a common Gaussian process framework, of which the STIS data show a good overall agreement with the overlap** FORS2 data. We also explore the effects of stellar heterogeneity on our observations and its resulting implications towards determining the atmospheric characteristics of WASP-6b. Independent of our assumptions for the level of stellar heterogeneity we detect Na I, K I and H$_2$O absorption features and constrain the elemental oxygen abundance to a value of [O/H] $\simeq -0.9\pm0.3$ relative to solar. In contrast, we find that the stellar heterogeneity correction can have significant effects on the retrieved distributions of the [Na/H] and [K/H] abundances, primarily through its degeneracy with the slo** optical opacity of scattering haze species within the atmosphere. Our results also show that despite this presence of haze, WASP-6b remains a favourable object for future atmospheric characterisation with upcoming missions such as the James Webb Space Telescope. △ Less

Submitted 6 May, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

Comments: Accepted to MNRAS

arXiv:1911.04783 [pdf, other]

Permutation group algorithms based on directed graphs (extended version)

Authors: Christopher Jefferson, Markus Pfeiffer, Rebecca Waldecker, Wilf A. Wilson

Abstract: We introduce a new framework for solving an important class of computational problems involving finite permutation groups, which includes calculating set stabilisers, intersections of subgroups, and isomorphisms of combinatorial structures. Our techniques generalise 'partition backtrack', which is the current state-of-the-art algorithm introduced by Jeffrey Leon in 1991, and which has inspired our… ▽ More We introduce a new framework for solving an important class of computational problems involving finite permutation groups, which includes calculating set stabilisers, intersections of subgroups, and isomorphisms of combinatorial structures. Our techniques generalise 'partition backtrack', which is the current state-of-the-art algorithm introduced by Jeffrey Leon in 1991, and which has inspired our work. Our backtrack search algorithms are organised around vertex- and arc-labelled directed graphs, which allow us to represent many problems more richly than do ordered partitions. We present the theory underpinning our framework, and we include the results of experiments showing that our techniques often result in smaller search spaces than does partition backtrack. An implementation of our algorithms is available as free software in the GraphBacktracking package for GAP. △ Less

Submitted 1 July, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

Comments: Extended version of arXiv:2106.13132; now with a note pointing to and recommending this shorter version; 55 pages, 13 figures, 4 tables

MSC Class: 20-08 (Primary) 20B05 (Secondary)

arXiv:1911.02012 [pdf, other]

doi 10.1093/mnras/staa277

TOI-132 b: A short-period planet in the Neptune desert transiting a $V=11.3$ G-type star

Authors: Matías R. Díaz, James S. Jenkins, Davide Gandolfi, Eric D. Lopez, Maritza G. Soto, Pía Cortés-Zuleta, Zaira M. Berdiñas, Keivan G. Stassun, Karen A. Collins, José I. Vines, Carl Ziegler, Malcolm Fridlund, Eric J. N. Jensen, Felipe Murgas, Alexandre Santerne, Paul A. Wilson, Massimiliano Esposito, Artie P. Hatzes, Marshall C. Johnson, Kristine W. F. Lam, John H. Livingston, Vincent Van Eylen, Norio Narita, César Briceño, Kevin I. Collins , et al. (23 additional authors not shown)

Abstract: The Neptune desert is a feature seen in the radius-mass-period plane, whereby a notable dearth of short period, Neptune-like planets is found. Here we report the {\it TESS} discovery of a new short-period planet in the Neptune desert, orbiting the G-type dwarf TYC\,8003-1117-1 (TOI-132). {\it TESS} photometry shows transit-like dips at the level of $\sim$1400 ppm occurring every $\sim$2.11 days. H… ▽ More The Neptune desert is a feature seen in the radius-mass-period plane, whereby a notable dearth of short period, Neptune-like planets is found. Here we report the {\it TESS} discovery of a new short-period planet in the Neptune desert, orbiting the G-type dwarf TYC\,8003-1117-1 (TOI-132). {\it TESS} photometry shows transit-like dips at the level of $\sim$1400 ppm occurring every $\sim$2.11 days. High-precision radial velocity follow-up with HARPS confirmed the planetary nature of the transit signal and provided a semi-amplitude radial velocity variation of $\sim$11.5 m s$^{-1}$, which, when combined with the stellar mass of $0.97\pm0.06$ $M_{\odot}$, provides a planetary mass of 22.83$^{+1.81}_{-1.80}$ $M_{\oplus}$. Modeling the {\it TESS} high-quality light curve returns a planet radius of 3.43$^{+0.13}_{-0.14}$ $R_{\oplus}$, and therefore the planet bulk density is found to be 3.11$^{+0.44}_{-0.450}$ g cm$^{-3}$. Planet structure models suggest that the bulk of the planet mass is in the form of a rocky core, with an atmospheric mass fraction of 4.3$^{+1.2}_{-2.3}$\%. TOI-132 b is a {\it TESS} Level 1 Science Requirement candidate, and therefore priority follow-up will allow the search for additional planets in the system, whilst hel** to constrain low-mass planet formation and evolution models, particularly valuable for better understanding the Neptune desert. △ Less

Submitted 18 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

Comments: 12 pages, 10 figures, 4 tables. Submitted to MNRAS. Comments welcome. Missing labels, Typos fixed

arXiv:1910.14178 [pdf, ps, other]

doi 10.1103/PhysRevA.101.042334

Laser-free trapped-ion entangling gates with simultaneous insensitivity to qubit and motional decoherence

Authors: R. T. Sutherland, R. Srinivas, S. C. Burd, H. M. Knaack, A. C. Wilson, D. J. Wineland, D. Leibfried, D. T. C. Allcock, D. H. Slichter, S. B. Libby

Abstract: The dominant error sources for state-of-the-art laser-free trapped-ion entangling gates are decoherence of the qubit state and the ion motion. The effect of these decoherence mechanisms can be suppressed with additional control fields, or with techniques that have the disadvantage of reducing gate speed. Here, we propose using a near-motional-frequency magnetic field gradient to implement a laser-… ▽ More The dominant error sources for state-of-the-art laser-free trapped-ion entangling gates are decoherence of the qubit state and the ion motion. The effect of these decoherence mechanisms can be suppressed with additional control fields, or with techniques that have the disadvantage of reducing gate speed. Here, we propose using a near-motional-frequency magnetic field gradient to implement a laser-free gate that is simultaneously resilient to both types of decoherence, does not require additional control fields, and has a relatively small cost in gate speed. △ Less

Submitted 30 March, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

Journal ref: Phys. Rev. A 101, 042334 (2020)

arXiv:1910.13565 [pdf, other]

Function-Space Distributions over Kernels

Authors: Gregory W. Benton, Wesley J. Maddox, Jayson P. Salkey, Julio Albinati, Andrew Gordon Wilson

Abstract: Gaussian processes are flexible function approximators, with inductive biases controlled by a covariance kernel. Learning the kernel is the key to representation learning and strong predictive performance. In this paper, we develop functional kernel learning (FKL) to directly infer functional posteriors over kernels. In particular, we place a transformed Gaussian process over a spectral density, t… ▽ More Gaussian processes are flexible function approximators, with inductive biases controlled by a covariance kernel. Learning the kernel is the key to representation learning and strong predictive performance. In this paper, we develop functional kernel learning (FKL) to directly infer functional posteriors over kernels. In particular, we place a transformed Gaussian process over a spectral density, to induce a non-parametric distribution over kernel functions. The resulting approach enables learning of rich representations, with support for any stationary kernel, uncertainty over the values of the kernel, and an interpretable specification of a prior directly over kernels, without requiring sophisticated initialization or manual intervention. We perform inference through elliptical slice sampling, which is especially well suited to marginalizing posteriors with the strongly correlated priors typical to function space modelling. We develop our approach for non-uniform, large-scale, multi-task, and multidimensional data, and show promising performance in a wide range of settings, including interpolation, extrapolation, and kernel recovery experiments. △ Less

Submitted 29 October, 2019; originally announced October 2019.

Comments: Published at NeurIPS 2019

arXiv:1910.06403 [pdf, other]

BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization

Authors: Maximilian Balandat, Brian Karrer, Daniel R. Jiang, Samuel Daulton, Benjamin Letham, Andrew Gordon Wilson, Eytan Bakshy

Abstract: Bayesian optimization provides sample-efficient global optimization for a broad range of applications, including automatic machine learning, engineering, physics, and experimental design. We introduce BoTorch, a modern programming framework for Bayesian optimization that combines Monte-Carlo (MC) acquisition functions, a novel sample average approximation optimization approach, auto-differentiatio… ▽ More Bayesian optimization provides sample-efficient global optimization for a broad range of applications, including automatic machine learning, engineering, physics, and experimental design. We introduce BoTorch, a modern programming framework for Bayesian optimization that combines Monte-Carlo (MC) acquisition functions, a novel sample average approximation optimization approach, auto-differentiation, and variance reduction techniques. BoTorch's modular design facilitates flexible specification and optimization of probabilistic models written in PyTorch, simplifying implementation of new acquisition functions. Our approach is backed by novel theoretical convergence results and made practical by a distinctive algorithmic foundation that leverages fast predictive distributions, hardware acceleration, and deterministic optimization. We also propose a novel "one-shot" formulation of the Knowledge Gradient, enabled by a combination of our theoretical and software contributions. In experiments, we demonstrate the improved sample efficiency of BoTorch relative to other popular libraries. △ Less

Submitted 8 December, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

Journal ref: Advances in Neural Information Processing Systems 33, 2020

arXiv:1910.05885 [pdf, other]

Parallelized Training of Restricted Boltzmann Machines using Markov-Chain Monte Carlo Methods

Authors: Pei Yang, Srinivas Varadharajan, Lucas A. Wilson, Don D. Smith II, John A Lockman III, Vineet Gundecha, Quy Ta

Abstract: Restricted Boltzmann Machine (RBM) is a generative stochastic neural network that can be applied to collaborative filtering technique used by recommendation systems. Prediction accuracy of the RBM model is usually better than that of other models for recommendation systems. However, training the RBM model involves Markov-Chain Monte Carlo (MCMC) method, which is computationally expensive. In this… ▽ More Restricted Boltzmann Machine (RBM) is a generative stochastic neural network that can be applied to collaborative filtering technique used by recommendation systems. Prediction accuracy of the RBM model is usually better than that of other models for recommendation systems. However, training the RBM model involves Markov-Chain Monte Carlo (MCMC) method, which is computationally expensive. In this paper, we have successfully applied distributed parallel training using Horovod framework to improve the training time of the RBM model. Our tests show that the distributed training approach of the RBM model has a good scaling efficiency. We also show that this approach effectively reduces the training time to little over 12 minutes on 64 CPU nodes compared to 5 hours on a single CPU node. This will make RBM models more practically applicable in recommendation systems. △ Less

Submitted 13 October, 2019; originally announced October 2019.

arXiv:1910.04123 [pdf, other]

The Disparate Equilibria of Algorithmic Decision Making when Individuals Invest Rationally

Authors: Lydia T. Liu, Ashia Wilson, Nika Haghtalab, Adam Tauman Kalai, Christian Borgs, Jennifer Chayes

Abstract: The long-term impact of algorithmic decision making is shaped by the dynamics between the deployed decision rule and individuals' response. Focusing on settings where each individual desires a positive classification---including many important applications such as hiring and school admissions, we study a dynamic learning setting where individuals invest in a positive outcome based on their group's… ▽ More The long-term impact of algorithmic decision making is shaped by the dynamics between the deployed decision rule and individuals' response. Focusing on settings where each individual desires a positive classification---including many important applications such as hiring and school admissions, we study a dynamic learning setting where individuals invest in a positive outcome based on their group's expected gain and the decision rule is updated to maximize institutional benefit. By characterizing the equilibria of these dynamics, we show that natural challenges to desirable long-term outcomes arise due to heterogeneity across groups and the lack of realizability. We consider two interventions, decoupling the decision rule by group and subsidizing the cost of investment. We show that decoupling achieves optimal outcomes in the realizable case but has discrepant effects that may depend on the initial conditions otherwise. In contrast, subsidizing the cost of investment is shown to create better equilibria for the disadvantaged group even in the absence of realizability. △ Less

Submitted 4 October, 2019; originally announced October 2019.

Comments: 30 pages, 7 figures

arXiv:1909.12457 [pdf, other]

Nonlinear Dynamic Models of Conflict via Multiplexed Interaction Networks

Authors: Gerardo Aquino, Weisi Guo, Alan Wilson

Abstract: The risk of conflict is exasperated by a multitude of internal and external factors. Current multivariate analysis paints diverse causal risk profiles that vary with time. However, these profiles evolve and a universal model to understand that evolution remains absent. Most of the current conflict analysis is data-driven and conducted at the individual country or region level, often in isolation.… ▽ More The risk of conflict is exasperated by a multitude of internal and external factors. Current multivariate analysis paints diverse causal risk profiles that vary with time. However, these profiles evolve and a universal model to understand that evolution remains absent. Most of the current conflict analysis is data-driven and conducted at the individual country or region level, often in isolation. Consistent consideration of multi-scale interactions and their non-linear dynamics is missing. Here, we develop a multiplexed network model, where each city is modelled as a non-linear bi-stable system with stable states in either war or peace. The causal factor categories which exasperate the risk of conflict are each modelled as a network layer. We consider 3 layers: (1) core geospatial network of interacting cities reflecting ground level interactions, (2) cultural network of interacting countries reflecting cultural grou**s, and (3) political network of interacting countries reflecting alliances. Together, they act as drivers to push cities towards or pull cities away from war. Using a variety of data sources relative to 2002-2016, we show, that our model correctly predicts the transitions from war to peace and peace to war with F1 score of 0.78 to 0.92 worldwide at the city scale resolution. As many conflicts during this period are auto-regressive (e.g. the War on Terror in Afghanistan and Iraq, the Narco War across the Americas), we can predict the emergence of new war or new peace. We demonstrate successful predictions across a wide range of conflict genres and we perform causal discovery by identifying which model component led to the correct prediction. In the cases of Somalia (2008-13), Myanmar (2013-15), Colombia (2011-14), Libya (2014-16), and Yemen (2011-13) we identify the set of most likely causal factors and how it may differ across a country and change over time. △ Less

Submitted 26 September, 2019; originally announced September 2019.

Comments: 16 pages,10 figures, 7 tables

arXiv:1909.08464 [pdf, other]

doi 10.1016/j.physletb.2020.135323

New data on $\vecγ \vec{p}\rightarrow ηp$ with polarized photons and protons and their implications for $N^* \to Nη$ decays

Authors: J. Müller, J. Hartmann, M. Grüner, F. Afzal, A. V. Anisovich, B. Bantes, D. Bayadilov, R. Beck, M. Becker, Y. Beloglazov, M. Berlin, M. Bichow, S. Böse, K. -T. Brinkmann, T. Challand, V. Crede, F. Dietz, M. Dieterle, P. Drexler, H. Dutz, H. Eberhardt, D. Elsner, R. Ewald, K. Fornet-Ponse, S. Friedrich , et al. (64 additional authors not shown)

Abstract: The polarization observables $T, E, P, H$, and $G$ in photoproduction of $η$ mesons off protons are measured for photon energies from threshold to $W=2400\,$MeV ($T$), 2280 MeV ($E$), 1620 MeV ($P, H$), or 1820 MeV ($G$), covering nearly the full solid angle. The data are compared to predictions from the SAID, MAID, JüBo, and BnGa partial-wave analyses. A refit within the BnGa approach including f… ▽ More The polarization observables $T, E, P, H$, and $G$ in photoproduction of $η$ mesons off protons are measured for photon energies from threshold to $W=2400\,$MeV ($T$), 2280 MeV ($E$), 1620 MeV ($P, H$), or 1820 MeV ($G$), covering nearly the full solid angle. The data are compared to predictions from the SAID, MAID, JüBo, and BnGa partial-wave analyses. A refit within the BnGa approach including further data yields precise branching ratios for the $Nη$ decay of nucleon resonances. A $Nη$-branching ratio of $0.33\pm 0.04$ for $N(1650)1/2^-$ is found, which reduces the large and controversially discussed $Nη$-branching ratio difference of the two lowest mass $J^P=1/2^-$-resonances significantly. △ Less

Submitted 18 September, 2019; originally announced September 2019.

Comments: 10 pages, 11 figures

arXiv:1909.00739 [pdf, other]

doi 10.1051/0004-6361/201935113

The detection and characterisation of 54 massive companions with the SOPHIE spectrograph -- 7 new brown dwarfs and constraints on the BD desert

Authors: F. Kiefer, G. Hébrard, J. Sahlmann, S. G. Sousa, T. Forveille, N. Santos, M. Mayor, M. Deleuil, P. A. Wilson, S. Dalal, R. F. Díaz, G. W. Henry, J. Hagelberg, M. J. Hobson, O. Demangeon, V. Bourrier, X. Delfosse, L. Arnold, N. Astudillo-Defru, J. -L. Beuzit, I. Boisse, X. Bonfils, S. Borgniet, F. Bouchy, B. Courcol , et al. (13 additional authors not shown)

Abstract: Brown-dwarfs are substellar objects with masses intermediate between planets and stars within about 13-80Mjup. While isolated BDs are most likely produced by gravitational collapse in molecular clouds down to masses of a few Mjup, a non-negligible fraction of low-mass companions might be formed through the planet formation channel in protoplanetary disks. The upper mass limit of objects formed wit… ▽ More Brown-dwarfs are substellar objects with masses intermediate between planets and stars within about 13-80Mjup. While isolated BDs are most likely produced by gravitational collapse in molecular clouds down to masses of a few Mjup, a non-negligible fraction of low-mass companions might be formed through the planet formation channel in protoplanetary disks. The upper mass limit of objects formed within disks is still observationally unknown, the main reason being the strong dearth of BD companions at orbital periods shorter than 10 years, a.k.a. the BD desert. To address this question, we aim at determining the best statistics of secondary companions within the 10-100Mjup range and within 10 au from the primary star, while minimising observational bias. We made an extensive use of the RV surveys of FGK stars below 60pc distance to the Sun and in the northern hemisphere performed with the SOPHIE spectrograph at the Observatoire de Haute-Provence. We derived the Keplerian solutions of the RV variations of 54 sources. Public astrometric data of the Hipparcos and Gaia missions allowed constraining the mass of the companion for most sources. We introduce GASTON, a new method to derive inclination combining RVs Keplerian and astrometric excess noise from Gaia DR1. We report the discovery of 12 new BD candidates. For 5 of them, additional astrometric data led to revise their mass in the M-dwarf regime. Among the 7 remaining objects, 4 are confirmed BD companions, and 3 others are likely in this mass regime. We also report the detection of 42 M-dwarfs within 90Mjup-0.52Msun. The resulting Msin(i)-P distribution of BD candidates shows a clear drop in the detection rate below 80-day orbital period. Above that limit, the BD desert reveals rather wet, with a uniform distribution of the Msin(i). We derive a minimum BD-detection frequency around Solar-like stars of 2.0+/-0.5%. △ Less

Submitted 2 September, 2019; originally announced September 2019.

Comments: 51 pages, 20 figures, 16 tables

Journal ref: A&A 631, A125 (2019)

arXiv:1908.06598 [pdf, other]

Chromatic nonsymmetric polynomials of Dyck graphs are slide-positive

Authors: Vasu Tewari, Andrew Timothy Wilson, Philip B. Zhang

Abstract: Motivated by the study of Macdonald polynomials, J. Haglund and A. Wilson introduced a nonsymmetric polynomial analogue of the chromatic quasisymmetric function called the \emph{chromatic nonsymmetric polynomial} of a Dyck graph. We give a positive expansion for this polynomial in the basis of fundamental slide polynomials using recent work of Assaf-Bergeron on flagged $(P,ρ)$-partitions. We then… ▽ More Motivated by the study of Macdonald polynomials, J. Haglund and A. Wilson introduced a nonsymmetric polynomial analogue of the chromatic quasisymmetric function called the \emph{chromatic nonsymmetric polynomial} of a Dyck graph. We give a positive expansion for this polynomial in the basis of fundamental slide polynomials using recent work of Assaf-Bergeron on flagged $(P,ρ)$-partitions. We then derive the known expansion for the chromatic quasisymmetric function of Dyck graphs in terms of Gessel's fundamental basis by taking a backstable limit of our expansion. △ Less

Submitted 19 August, 2019; originally announced August 2019.

arXiv:1907.13050 [pdf, other]

Using extreme value theory for the estimation of risk metrics for capacity adequacy assessment

Authors: Amy L Wilson, Stan Zachary

Abstract: This paper investigates the use of extreme value theory for modelling the distribution of demand-net-of-wind for capacity adequacy assessment. Extreme value theory approaches are well-established and mathematically justified methods for estimating the tails of a distribution and so are ideally suited for problems in capacity adequacy, where normally only the tails of the relevant distributions are… ▽ More This paper investigates the use of extreme value theory for modelling the distribution of demand-net-of-wind for capacity adequacy assessment. Extreme value theory approaches are well-established and mathematically justified methods for estimating the tails of a distribution and so are ideally suited for problems in capacity adequacy, where normally only the tails of the relevant distributions are significant. The extreme value theory peaks over threshold approach is applied directly to observations of demand-net-of-wind, meaning that no assumption is needed about the nature of any dependence between demand and wind. The methodology is tested on data from Great Britain and compared to two alternative approaches: use of the empirical distribution of demand-net-of-wind and use of a model which assumes independence between demand and wind. Extreme value theory is shown to produce broadly similar estimates of risk metrics to the use of the above empirical distribution but with smaller sampling uncertainty. Estimates of risk metrics differ when the approach assuming independence is used, especially when data across different historical years are pooled, suggesting that assuming independence might result in the over- or under-estimation of risk metrics. △ Less

Submitted 30 July, 2019; originally announced July 2019.

Comments: 8 pages, 4 figures

arXiv:1907.08562 [pdf, other]

First demonstration of ionization cooling by the Muon Ionization Cooling Experiment

Authors: M. Bogomilov, R. Tsenov, G. Vankova-Kirilova, Y. P. Song, J. Y. Tang, Z. H. Li, R. Bertoni, M. Bonesini, F. Chignoli, R. Mazza, V. Palladino, A. de Bari, D. Orestano, L. Tortora, Y. Kuno, H. Sakamoto, A. Sato, S. Ishimoto, M. Chung, C. K. Sung, F. Filthaut, D. Jokovic, D. Maletic, M. Savic, N. Jovancevic , et al. (110 additional authors not shown)

Abstract: High-brightness muon beams of energy comparable to those produced by state-of-the-art electron, proton and ion accelerators have yet to be realised. Such beams have the potential to carry the search for new phenomena in lepton-antilepton collisions to extremely high energy and also to provide uniquely well-characterised neutrino beams. A muon beam may be created through the decay of pions produced… ▽ More High-brightness muon beams of energy comparable to those produced by state-of-the-art electron, proton and ion accelerators have yet to be realised. Such beams have the potential to carry the search for new phenomena in lepton-antilepton collisions to extremely high energy and also to provide uniquely well-characterised neutrino beams. A muon beam may be created through the decay of pions produced in the interaction of a proton beam with a target. To produce a high-brightness beam from such a source requires that the phase space volume occupied by the muons be reduced (cooled). Ionization cooling is the novel technique by which it is proposed to cool the beam. The Muon Ionization Cooling Experiment collaboration has constructed a section of an ionization cooling cell and used it to provide the first demonstration of ionization cooling. We present these ground-breaking measurements. △ Less

Submitted 19 July, 2019; originally announced July 2019.

Comments: 19 pages and 6 figures

Report number: RAL-P-2019-003

arXiv:1907.07504 [pdf, other]

Subspace Inference for Bayesian Deep Learning

Authors: Pavel Izmailov, Wesley J. Maddox, Polina Kirichenko, Timur Garipov, Dmitry Vetrov, Andrew Gordon Wilson

Abstract: Bayesian inference was once a gold standard for learning with neural networks, providing accurate full predictive distributions and well calibrated uncertainty. However, scaling Bayesian inference techniques to deep neural networks is challenging due to the high dimensionality of the parameter space. In this paper, we construct low-dimensional subspaces of parameter space, such as the first princi… ▽ More Bayesian inference was once a gold standard for learning with neural networks, providing accurate full predictive distributions and well calibrated uncertainty. However, scaling Bayesian inference techniques to deep neural networks is challenging due to the high dimensionality of the parameter space. In this paper, we construct low-dimensional subspaces of parameter space, such as the first principal components of the stochastic gradient descent (SGD) trajectory, which contain diverse sets of high performing models. In these subspaces, we are able to apply elliptical slice sampling and variational inference, which struggle in the full parameter space. We show that Bayesian model averaging over the induced posterior in these subspaces produces accurate predictions and well calibrated predictive uncertainty for both regression and image classification. △ Less

Submitted 17 July, 2019; originally announced July 2019.

Comments: Published at UAI 2019

arXiv:1907.05973 [pdf, other]

doi 10.5547/01956574.43.4.szac

The integration of variable generation and storage into electricity capacity markets

Authors: Stan Zachary, Amy Wilson, Chris Dent

Abstract: We show how to value both variable generation and energy storage to enable them to be integrated fairly and optimally into electricity capacity markets. We develop theory based on balancing expected energy unserved against costs of capacity procurement, and in which the optimal procurement is that necessary to meet an appropriate reliability standard. For conventional generation the theory reduces… ▽ More We show how to value both variable generation and energy storage to enable them to be integrated fairly and optimally into electricity capacity markets. We develop theory based on balancing expected energy unserved against costs of capacity procurement, and in which the optimal procurement is that necessary to meet an appropriate reliability standard. For conventional generation the theory reduces to that already in common use. Further the valuation of both variable generation and storage coincides with the traditional risk-based approach based on equivalent firm capacity. The determination of the equivalent firm capacity of storage requires particular care; this is due both to the flexibility with which storage added to an existing system may be scheduled, and also because, when any resource is added to an existing system, storage already within that system may be flexibly rescheduled. We illustrate the theory with an example based on the GB system. △ Less

Submitted 7 August, 2021; v1 submitted 12 July, 2019; originally announced July 2019.

Comments: To appear in The Energy Journal

arXiv:1907.00268 [pdf, ps, other]

The valley version of the Extended Delta Conjecture

Authors: Dun Qiu, Andrew Timothy Wilson

Abstract: The Shuffle Theorem of Carlsson and Mellit gives a combinatorial expression for the bigraded Frobenius characteristic of the ring of diagonal harmonics, and the Delta Conjecture of Haglund, Remmel and the second author provides two generalizations of the Shuffle Theorem to the delta operator expression $Δ'_{e_k} e_n$. Haglund et al. also propose the Extended Delta Conjecture for the delta operator… ▽ More The Shuffle Theorem of Carlsson and Mellit gives a combinatorial expression for the bigraded Frobenius characteristic of the ring of diagonal harmonics, and the Delta Conjecture of Haglund, Remmel and the second author provides two generalizations of the Shuffle Theorem to the delta operator expression $Δ'_{e_k} e_n$. Haglund et al. also propose the Extended Delta Conjecture for the delta operator expression $Δ'_{e_k} Δ_{h_r}e_n$, which is analogous to the rise version of the Delta Conjecture. Recently, D'Adderio, Iraci and Wyngaerd proved the rise version of the Extended Delta Conjecture at the case when $t=0$. In this paper, we propose a new valley version of the Extended Delta Conjecture. Then, we work on the combinatorics of extended ordered multiset partitions to prove that the two conjectures for $Δ'_{e_k} Δ_{h_r}e_n$ are equivalent when $t$ or $q$ equals 0, thus proving the valley version of the Extended Delta Conjecture when $t$ or $q$ equals 0. △ Less

Submitted 9 July, 2019; v1 submitted 29 June, 2019; originally announced July 2019.

Comments: 28 pages, 9 figures

MSC Class: 05E05

arXiv:1906.10045 [pdf, other]

A closed-loop all-electronic pixel-wise adaptive imaging system for high dynamic range video

Authors: Jie, Zhang, Jonathan P. Newman, Xiao Wang, Chetan Singh Thakur, John Rattray, Ralph Etienne-Cummings, Matthew A. Wilson

Abstract: We demonstrated a CMOS imaging system that adapts each pixel's exposure and sampling rate to capture high dynamic range (HDR) videos. The system consist of a custom designed image sensor with pixel-wise exposure configurability and a real-time pixel exposure controller. These parts operate in a closed-loop to sample, detect and optimize each pixel's exposure and sampling rate to minimize local reg… ▽ More We demonstrated a CMOS imaging system that adapts each pixel's exposure and sampling rate to capture high dynamic range (HDR) videos. The system consist of a custom designed image sensor with pixel-wise exposure configurability and a real-time pixel exposure controller. These parts operate in a closed-loop to sample, detect and optimize each pixel's exposure and sampling rate to minimize local region's underexposure, overexposure and motion blurring. Exposure control is implemented using all-integrated electronics without external optical modulation. This reduces overall system size and power consumption. The image sensor is implemented using a standard 130nm CMOS process while the exposure controller is implemented on a computer. We performed experiments under complex lighting and motion condition to test performance of the system, and demonstrate the benefit of pixel-wise adaptive imaging on the performance of computer vision tasks such as segmentation, motion estimation and object recognition. △ Less

Submitted 24 June, 2019; originally announced June 2019.

Comments: 9 pages, 8 figures

arXiv:1906.08846 [pdf, ps, other]

doi 10.1080/00927872.2024.2321301

Octonions, Albert vectors and the group $\mathrm{E}_6(F)$

Authors: John N. Bray, Yegor Stepanov, Robert A. Wilson

Abstract: We present a uniform approach to the construction of the groups of type $\mathrm{E}_6$ over arbitrary fields without using Lie theory. This gives a simple description of the group generators and some of the subgroup structure. In the finite case our approach also permits relatively straightforward computation of the group order. We present a uniform approach to the construction of the groups of type $\mathrm{E}_6$ over arbitrary fields without using Lie theory. This gives a simple description of the group generators and some of the subgroup structure. In the finite case our approach also permits relatively straightforward computation of the group order. △ Less

Submitted 13 April, 2024; v1 submitted 20 June, 2019; originally announced June 2019.

Comments: 30 pages

MSC Class: 20H20

Journal ref: Communications in Algebra, 1-29, 2024

arXiv:1906.03315 [pdf, ps, other]

Vandermondes in superspace

Authors: Brendon Rhoades, Andrew Timothy Wilson

Abstract: Superspace of rank $n$ is a $\mathbb{Q}$-algebra with $n$ commuting generators $x_1, \dots, x_n$ and $n$ anticommuting generators $θ_1, \dots, θ_n$. We present an extension of the Vandermonde determinant to superspace which depends on a sequence $\mathbf{a} = (a_1, \dots, a_r)$ of nonnegative integers of length $r \leq n$. We use superspace Vandermondes to construct graded representations of the s… ▽ More Superspace of rank $n$ is a $\mathbb{Q}$-algebra with $n$ commuting generators $x_1, \dots, x_n$ and $n$ anticommuting generators $θ_1, \dots, θ_n$. We present an extension of the Vandermonde determinant to superspace which depends on a sequence $\mathbf{a} = (a_1, \dots, a_r)$ of nonnegative integers of length $r \leq n$. We use superspace Vandermondes to construct graded representations of the symmetric group. This construction recovers hook-shaped Tanisaki quotients, the coinvariant ring for the Delta Conjecture constructed by Haglund, Rhoades, and Shimozono, and a superspace quotient related to positroids and Chern plethysm constructed by Billey, Rhoades, and Tewari. We define a notion of partial differentiation with respect to anticommuting variables to construct doubly graded modules from superspace Vandermondes. These doubly graded modules carry a natural ring structure which satisfies a 2-dimensional version of Poincaré duality. The application of polarization operators gives rise to other bigraded modules which give a conjectural module for the symmetric function $Δ'_{e_{k-1}} e_n$ appearing in the Delta Conjecture of Haglund, Remmel, and Wilson. △ Less

Submitted 16 July, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

Comments: 32 pages

arXiv:1905.07121 [pdf, other]

Simple Black-box Adversarial Attacks

Authors: Chuan Guo, Jacob R. Gardner, Yurong You, Andrew Gordon Wilson, Kilian Q. Weinberger

Abstract: We propose an intriguingly simple method for the construction of adversarial images in the black-box setting. In constrast to the white-box scenario, constructing black-box adversarial images has the additional constraint on query budget, and efficient attacks remain an open problem to date. With only the mild assumption of continuous-valued confidence scores, our highly query-efficient algorithm… ▽ More We propose an intriguingly simple method for the construction of adversarial images in the black-box setting. In constrast to the white-box scenario, constructing black-box adversarial images has the additional constraint on query budget, and efficient attacks remain an open problem to date. With only the mild assumption of continuous-valued confidence scores, our highly query-efficient algorithm utilizes the following simple iterative principle: we randomly sample a vector from a predefined orthonormal basis and either add or subtract it to the target image. Despite its simplicity, the proposed method can be used for both untargeted and targeted attacks -- resulting in previously unprecedented query efficiency in both settings. We demonstrate the efficacy and efficiency of our algorithm on several real world settings including the Google Cloud Vision API. We argue that our proposed algorithm should serve as a strong baseline for future black-box attacks, in particular because it is extremely fast and its implementation requires less than 20 lines of PyTorch code. △ Less

Submitted 15 August, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

Comments: Published at ICML 2019

Showing 201–250 of 662 results for author: Wilson, A