Skip to main content

Showing 1–11 of 11 results for author: Benton, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.07837  [pdf, other

    cs.LG

    Measuring Feature Sparsity in Language Models

    Authors: Mingyang Deng, Lucas Tao, Joe Benton

    Abstract: Recent works have proposed that activations in language models can be modelled as sparse linear combinations of vectors corresponding to features of input text. Under this assumption, these works aimed to reconstruct feature directions using sparse coding. We develop metrics to assess the success of these sparse coding techniques and test the validity of the linearity and sparsity assumptions. We… ▽ More

    Submitted 13 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  2. arXiv:2308.03686  [pdf, other

    stat.ML cs.LG

    Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization

    Authors: Joe Benton, Valentin De Bortoli, Arnaud Doucet, George Deligiannidis

    Abstract: Denoising diffusions are a powerful method to generate approximate samples from high-dimensional data distributions. Recent results provide polynomial bounds on their convergence rate, assuming $L^2$-accurate scores. Until now, the tightest bounds were either superlinear in the data dimension or required strong smoothness assumptions. We provide the first convergence bounds which are linear in the… ▽ More

    Submitted 5 March, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  3. arXiv:2305.16860  [pdf, ps, other

    stat.ML cs.LG

    Error Bounds for Flow Matching Methods

    Authors: Joe Benton, George Deligiannidis, Arnaud Doucet

    Abstract: Score-based generative models are a popular class of generative modelling techniques relying on stochastic differential equations (SDE). From their inception, it was realized that it was also possible to perform generation using ordinary differential equations (ODE) rather than SDE. This led to the introduction of the probability flow ODE approach and denoising diffusion implicit models. Flow matc… ▽ More

    Submitted 11 February, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  4. arXiv:2211.03595  [pdf, other

    stat.ML cs.LG

    From Denoising Diffusions to Denoising Markov Models

    Authors: Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusions are state-of-the-art generative models exhibiting remarkable empirical performance. They work by diffusing the data distribution into a Gaussian distribution and then learning to reverse this noising process to obtain synthetic datapoints. The denoising diffusion relies on approximations of the logarithmic derivatives of the noised data densities using score matching. Such mod… ▽ More

    Submitted 18 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  5. arXiv:2210.06226  [pdf, other

    stat.ML cs.LG

    Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics

    Authors: Kamélia Daudel, Joe Benton, Yuyang Shi, Arnaud Doucet

    Abstract: Several algorithms involving the Variational Rényi (VR) bound have been proposed to minimize an alpha-divergence between a target posterior distribution and a variational distribution. Despite promising empirical results, those algorithms resort to biased stochastic gradient descent procedures and thus lack theoretical guarantees. In this paper, we formalize and study the VR-IWAE bound, a generali… ▽ More

    Submitted 19 July, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  6. arXiv:2210.01892  [pdf, other

    cs.NE cs.AI cs.LG

    Polysemanticity and Capacity in Neural Networks

    Authors: Adam Scherlis, Kshitij Sachan, Adam S. Jermyn, Joe Benton, Buck Shlegeris

    Abstract: Individual neurons in neural networks often represent a mixture of unrelated features. This phenomenon, called polysemanticity, can make interpreting neural networks more difficult and so we aim to understand its causes. We propose doing so through the lens of feature \emph{capacity}, which is the fractional dimension each feature consumes in the embedding space. We show that in a toy model the op… ▽ More

    Submitted 11 July, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 22 pages, 7 figures. Corrected typos in Figure 7, improved notation to distinguish column and row vectors, corrected proof in Appendix A, and other misc changes

  7. arXiv:2205.14987  [pdf, other

    stat.ML cs.LG

    A Continuous Time Framework for Discrete Denoising Models

    Authors: Andrew Campbell, Joe Benton, Valentin De Bortoli, Tom Rainforth, George Deligiannidis, Arnaud Doucet

    Abstract: We provide the first complete continuous time framework for denoising diffusion models of discrete data. This is achieved by formulating the forward noising process and corresponding reverse time generative process as Continuous Time Markov Chains (CTMCs). The model can be efficiently trained using a continuous time version of the ELBO. We simulate the high dimensional CTMC using techniques develo… ▽ More

    Submitted 14 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 44 pages, 15 figures; NeurIPS 2022

  8. arXiv:2203.15030  [pdf, other

    cs.AI cs.LG cs.MA cs.RO eess.SY

    Solving Disjunctive Temporal Networks with Uncertainty under Restricted Time-Based Controllability using Tree Search and Graph Neural Networks

    Authors: Kevin Osanlou, Jeremy Frank, Andrei Bursuc, Tristan Cazenave, Eric Jacopin, Christophe Guettier, J. Benton

    Abstract: Planning under uncertainty is an area of interest in artificial intelligence. We present a novel approach based on tree search and graph machine learning for the scheduling problem known as Disjunctive Temporal Networks with Uncertainty (DTNU). Dynamic Controllability (DC) of DTNUs seeks a reactive scheduling strategy to satisfy temporal constraints in response to uncontrollable action durations.… ▽ More

    Submitted 30 March, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Thirty-Sixth AAAI Conference on Artificial Intelligence. This version includes the technical appendix. arXiv admin note: substantial text overlap with arXiv:2108.01068

    Journal ref: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

  9. arXiv:2108.01068  [pdf, other

    cs.AI cs.LG cs.RO

    Time-based Dynamic Controllability of Disjunctive Temporal Networks with Uncertainty: A Tree Search Approach with Graph Neural Network Guidance

    Authors: Kevin Osanlou, Jeremy Frank, J. Benton, Andrei Bursuc, Christophe Guettier, Eric Jacopin, Tristan Cazenave

    Abstract: Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications. We study the problem of dynamic controllability (DC) of disjunctive temporal networks with uncertainty (DTNU), which seeks a strategy to satisfy all constraints in response to uncontrollable action durations. We introduce a more restricted, stronger form of controlla… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Journal ref: International Conference on Automated Planning and Scheduling 2020. Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning (PRL)

  10. arXiv:1411.0156  [pdf, other

    cs.AI

    Surrogate Search As a Way to Combat Harmful Effects of Ill-behaved Evaluation Functions

    Authors: William Cushing, J. Benton, Patrick Eyerich, Subbarao Kambhampati

    Abstract: Recently, several researchers have found that cost-based satisficing search with A* often runs into problems. Although some "work arounds" have been proposed to ameliorate the problem, there has been little concerted effort to pinpoint its origin. In this paper, we argue that the origins of this problem can be traced back to the fact that most planners that try to optimize cost also use cost-based… ▽ More

    Submitted 1 November, 2014; originally announced November 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1103.3687

  11. arXiv:1103.3687  [pdf, other

    cs.AI

    Cost Based Satisficing Search Considered Harmful

    Authors: William Cushing, J. Benton, Subbarao Kambhampati

    Abstract: Recently, several researchers have found that cost-based satisficing search with A* often runs into problems. Although some "work arounds" have been proposed to ameliorate the problem, there has not been any concerted effort to pinpoint its origin. In this paper, we argue that the origins can be traced back to the wide variance in action costs that is observed in most planning domains. We show tha… ▽ More

    Submitted 18 March, 2011; originally announced March 2011.

    Comments: Longer version of an extended abstract from SOCS 2010