Skip to main content

Showing 1–20 of 20 results for author: Feige, I

.
  1. arXiv:2306.05484  [pdf, other

    stat.ME cs.LG stat.ML

    Task-specific experimental design for treatment effect estimation

    Authors: Bethany Connolly, Kim Moore, Tobias Schwedes, Alexander Adam, Gary Willis, Ilya Feige, Christopher Frye

    Abstract: Understanding causality should be a core requirement of any attempt to build real impact through AI. Due to the inherent unobservability of counterfactuals, large randomised trials (RCTs) are the standard for causal inference. But large experiments are generically expensive, and randomisation carries its own costs, e.g. when suboptimal decisions are trialed. Recent work has proposed more sample-ef… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: To appear in ICML 2023; 8 pages, 7 figures, 4 appendices

  2. arXiv:2010.12464  [pdf, other

    cs.LG stat.ML

    Representation Learning for High-Dimensional Data Collection under Local Differential Privacy

    Authors: Alex Mansbridge, Gregory Barbour, Davide Piras, Michael Murray, Christopher Frye, Ilya Feige, David Barber

    Abstract: The collection of individuals' data has become commonplace in many industries. Local differential privacy (LDP) offers a rigorous approach to preserving privacy whereby the individual privatises their data locally, allowing only their perturbed datum to leave their possession. LDP thus provides a provable privacy guarantee to the individual against both adversaries and database administrators. Exi… ▽ More

    Submitted 14 May, 2022; v1 submitted 23 October, 2020; originally announced October 2020.

  3. arXiv:2010.07389  [pdf, other

    cs.LG cs.AI stat.ML

    Explainability for fair machine learning

    Authors: Tom Begley, Tobias Schwedes, Christopher Frye, Ilya Feige

    Abstract: As the decisions made or influenced by machine learning models increasingly impact our lives, it is crucial to detect, understand, and mitigate unfairness. But even simply determining what "unfairness" should mean in a given context is non-trivial: there are many competing definitions, and choosing between them often requires a deep understanding of the underlying task. It is thus tempting to use… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 8 pages, 3 figures, 2 tables, 1 appendix

  4. arXiv:2010.07384  [pdf, other

    cs.LG cs.AI stat.ML

    Human-interpretable model explainability on high-dimensional data

    Authors: Damien de Mijolla, Christopher Frye, Markus Kunesch, John Mansir, Ilya Feige

    Abstract: The importance of explainability in machine learning continues to grow, as both neural-network architectures and the data they model become increasingly complex. Unique challenges arise when a model's input features become high dimensional: on one hand, principled model-agnostic approaches to explainability become too computationally expensive; on the other, more efficient explainability algorithm… ▽ More

    Submitted 20 December, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: 8 pages, 6 figures, 1 appendix

  5. arXiv:2010.03467  [pdf, other

    stat.ML cs.CV cs.LG

    Learning Deep-Latent Hierarchies by Stacking Wasserstein Autoencoders

    Authors: Benoit Gaujac, Ilya Feige, David Barber

    Abstract: Probabilistic models with hierarchical-latent-variable structures provide state-of-the-art results amongst non-autoregressive, unsupervised density-based models. However, the most common approach to training such models based on Variational Autoencoders (VAEs) often fails to leverage deep-latent hierarchies; successful approaches require complex inference and optimisation schemes. Optimal Transpor… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  6. arXiv:2010.03459  [pdf, other

    stat.ML cs.CV cs.LG

    Learning disentangled representations with the Wasserstein Autoencoder

    Authors: Benoit Gaujac, Ilya Feige, David Barber

    Abstract: Disentangled representation learning has undoubtedly benefited from objective function surgery. However, a delicate balancing act of tuning is still required in order to trade off reconstruction fidelity versus disentanglement. Building on previous successes of penalizing the total correlation in the latent variables, we propose TCWAE (Total Correlation Wasserstein Autoencoder). Working in the WAE… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  7. arXiv:2006.01272  [pdf, other

    cs.LG cs.AI stat.ML

    Shapley explainability on the data manifold

    Authors: Christopher Frye, Damien de Mijolla, Tom Begley, Laurence Cowton, Megan Stanley, Ilya Feige

    Abstract: Explainability in AI is crucial for model development, compliance with regulation, and providing operational nuance to predictions. The Shapley framework for explainability attributes a model's predictions to its input features in a mathematically principled and model-agnostic way. However, general implementations of Shapley explainability make an untenable assumption: that the model's features ar… ▽ More

    Submitted 20 December, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: To appear in ICLR 2021; 9 pages, 6 figures, 2 appendices

  8. arXiv:1910.06358  [pdf, other

    stat.ML cs.AI cs.LG

    Asymmetric Shapley values: incorporating causal knowledge into model-agnostic explainability

    Authors: Christopher Frye, Colin Rowat, Ilya Feige

    Abstract: Explaining AI systems is fundamental both to the development of high performing models and to the trust placed in them by their users. The Shapley framework for explainability has strength in its general applicability combined with its precise, rigorous foundation: it provides a common, model-agnostic language for AI explainability and uniquely satisfies a set of intuitive mathematical axioms. How… ▽ More

    Submitted 20 December, 2021; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: To appear in NeurIPS 2020; 9 pages, 2 figures, 2 appendices

  9. Binary JUNIPR: an interpretable probabilistic model for discrimination

    Authors: Anders Andreassen, Ilya Feige, Christopher Frye, Matthew D. Schwartz

    Abstract: JUNIPR is an approach to unsupervised learning in particle physics that scaffolds a probabilistic model for jets around their representation as binary trees. Separate JUNIPR models can be learned for different event or jet types, then compared and explored for physical insight. The relative probabilities can also be used for discrimination. In this paper, we show how the training of the separate m… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: 6 pages, 3 figures

    Journal ref: Phys. Rev. Lett. 123, 182001 (2019)

  10. arXiv:1902.06766  [pdf, other

    cs.AI cs.LG stat.ML

    Parenting: Safe Reinforcement Learning from Human Input

    Authors: Christopher Frye, Ilya Feige

    Abstract: Autonomous agents trained via reinforcement learning present numerous safety concerns: reward hacking, negative side effects, and unsafe exploration, among others. In the context of near-future autonomous agents, operating in environments where humans understand the existing dangers, human involvement in the learning process has proved a promising approach to AI Safety. Here we demonstrate that a… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: 9 pages, 4 figures, 1 table

  11. arXiv:1902.03251  [pdf, other

    stat.ML cs.AI cs.LG

    Invariant-equivariant representation learning for multi-class data

    Authors: Ilya Feige

    Abstract: Representations learnt through deep neural networks tend to be highly informative, but opaque in terms of what information they learn to encode. We introduce an approach to probabilistic modelling that learns to represent data with two separate deep representations: an invariant representation that encodes the information of the class from which the data belongs, and an equivariant representation… ▽ More

    Submitted 19 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 8 pages, 5 figures, 2 tables, 2 appendices

    Journal ref: ICML 2019

  12. arXiv:1806.04480  [pdf, other

    stat.ML cs.LG

    Improving latent variable descriptiveness with AutoGen

    Authors: Alex Mansbridge, Roberto Fierimonte, Ilya Feige, David Barber

    Abstract: Powerful generative models, particularly in Natural Language Modelling, are commonly trained by maximizing a variational lower bound on the data log likelihood. These models often suffer from poor use of their latent variable, with ad-hoc annealing factors used to encourage retention of information in the latent variable. We discuss an alternative and general approach to latent variable modelling,… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 8 pages, 2 figures, 5 tables

  13. arXiv:1806.04465  [pdf, other

    stat.ML cs.LG

    Gaussian mixture models with Wasserstein distance

    Authors: Benoit Gaujac, Ilya Feige, David Barber

    Abstract: Generative models with both discrete and continuous latent variables are highly motivated by the structure of many real-world data sets. They present, however, subtleties in training often manifesting in the discrete latent being under leveraged. In this paper, we show that such models are more amenable to training when using the Optimal Transport framework of Wasserstein Autoencoders. We find our… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 8 pages, 5 figures

  14. JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics

    Authors: Anders Andreassen, Ilya Feige, Christopher Frye, Matthew D. Schwartz

    Abstract: In applications of machine learning to particle physics, a persistent challenge is how to go beyond discrimination to learn about the underlying physics. To this end, a powerful tool would be a framework for unsupervised learning, where the machine learns the intricate high-dimensional contours of the data upon which it is trained, without reference to pre-established labels. In order to approach… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: 37 pages, 24 figures

  15. A Complete Basis of Helicity Operators for Subleading Factorization

    Authors: Ilya Feige, Daniel W. Kolodrubetz, Ian Moult, Iain W. Stewart

    Abstract: Factorization theorems underly our ability to make predictions for many processes involving the strong interaction. Although typically formulated at leading power, the study of factorization at subleading power is of interest both for improving the precision of calculations, as well as for understanding the all orders structure of QCD. We use the SCET helicity operator formalism to construct a com… ▽ More

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: 110 pages, many figures

    Report number: MIT-CTP 4597

  16. Streamlining resummed QCD calculations using Monte Carlo integration

    Authors: David Farhi, Ilya Feige, Marat Freytsis, Matthew D. Schwartz

    Abstract: Some of the most arduous and error-prone aspects of precision resummed calculations are related to the partonic hard process, having nothing to do with the resummation. In particular, interfacing to parton-distribution functions, combining various channels, and performing the phase space integration can be limiting factors in completing calculations. Conveniently, however, most of these tasks are… ▽ More

    Submitted 22 July, 2015; originally announced July 2015.

    Comments: 30 pages, 10 figures, code included with submission

  17. Removing phase-space restrictions in factorized cross sections

    Authors: Ilya Feige, Matthew D. Schwartz, Kai Yan

    Abstract: Factorization in gauge theories holds at the amplitude or amplitude-squared level for states of given soft or collinear momenta. When performing phase-space integrals over such states, one would generally like to avoid putting in explicit cuts to separate soft from collinear momenta. Removing these cuts induces an overcounting of the soft-collinear region and adds new infrared-ultraviolet divergen… ▽ More

    Submitted 18 February, 2015; originally announced February 2015.

    Comments: 32 pages, 1 figure

    Journal ref: Phys. Rev. D 91, 094027 (2015)

  18. Hard-Soft-Collinear Factorization to All Orders

    Authors: Ilya Feige, Matthew D. Schwartz

    Abstract: We provide a precise statement of hard-soft-collinear factorization of scattering amplitudes and prove it to all orders in perturbation theory. Factorization is formulated as the equality at leading power of scattering amplitudes in QCD with other amplitudes in QCD computed from a product of operator matrix elements. The equivalence is regulator independent and gauge independent. As the formulatio… ▽ More

    Submitted 6 March, 2015; v1 submitted 25 March, 2014; originally announced March 2014.

    Comments: 88 pages. Version 3 is updated to match the PRD article

    Journal ref: Phys. Rev. D 90, 105020 (2014)

  19. An on-shell approach to factorization

    Authors: Ilya Feige, Matthew D. Schwartz

    Abstract: Factorization is possible due to the universal behavior of Yang-Mills theories in soft and collinear limits. Here, we take a small step towards a more transparent understanding of these limits by proving a form of perturbative factorization at tree- level using on-shell spinor helicity methods. We present a concrete and self-contained expression of factorization in which matrix elements in QCD are… ▽ More

    Submitted 11 July, 2013; v1 submitted 26 June, 2013; originally announced June 2013.

    Comments: 48 pages

  20. Precision Jet Substructure from Boosted Event Shapes

    Authors: Ilya Feige, Matthew D. Schwartz, Iain W. Stewart, Jesse Thaler

    Abstract: Jet substructure has emerged as a critical tool for LHC searches, but studies so far have relied heavily on shower Monte Carlo simulations, which formally approximate QCD at leading-log level. We demonstrate that systematic higher-order QCD computations of jet substructure can be carried out by boosting global event shapes by a large momentum Q, and accounting for effects due to finite jet size, i… ▽ More

    Submitted 17 September, 2012; v1 submitted 17 April, 2012; originally announced April 2012.

    Comments: 5 pages, 7 figures

    Journal ref: Phys. Rev. Lett. 109, 092001 (2012)