Skip to main content

Showing 1–10 of 10 results for author: Tristan, J

.
  1. arXiv:2111.14630  [pdf, ps, other

    cs.LG cs.LO math.LO stat.ML

    On computable learning of continuous features

    Authors: Nathanael Ackerman, Julian Asilis, Jieqi Di, Cameron Freer, Jean-Baptiste Tristan

    Abstract: We introduce definitions of computable PAC learning for binary classification over computable metric spaces. We provide sufficient conditions for learners that are empirical risk minimizers (ERM) to be computable, and bound the strong Weihrauch degree of an ERM learner under more general conditions. We also give a presentation of a hypothesis class that does not admit any proper computable PAC lea… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 16 pages

  2. arXiv:2106.13798  [pdf, other

    cs.LG stat.ML

    Conjugate Energy-Based Models

    Authors: Hao Wu, Babak Esmaeili, Michael Wick, Jean-Baptiste Tristan, Jan-Willem van de Meent

    Abstract: In this paper, we propose conjugate energy-based models (CEBMs), a new class of energy-based models that define a joint density over data and latent variables. The joint density of a CEBM decomposes into an intractable distribution over data and a tractable posterior over latent variables. CEBMs have similar use cases as variational autoencoders, in the sense that they learn an unsupervised mappin… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  3. arXiv:2010.11855  [pdf, other

    cs.CL cs.AI cs.LG

    Detecting and Exorcising Statistical Demons from Language Models with Anti-Models of Negative Data

    Authors: Michael L. Wick, Kate Silverstein, Jean-Baptiste Tristan, Adam Pocock, Mark Johnson

    Abstract: It's been said that "Language Models are Unsupervised Multitask Learners." Indeed, self-supervised language models trained on "positive" examples of English text generalize in desirable ways to many natural language tasks. But if such models can stray so far from an initial self-supervision objective, a wayward model might generalize in undesirable ways too, say to nonsensical "negative" examples… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  4. arXiv:2007.06776  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Verification of ML Systems via Reparameterization

    Authors: Jean-Baptiste Tristan, Joseph Tassarotti, Koundinya Vajjha, Michael L. Wick, Anindya Banerjee

    Abstract: As machine learning is increasingly used in essential systems, it is important to reduce or eliminate the incidence of serious bugs. A growing body of research has developed machine learning algorithms with formal guarantees about performance, robustness, or fairness. Yet, the analysis of these algorithms is often complex, and implementing such systems in practice introduces room for error. Proof… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  5. arXiv:1911.04594  [pdf, other

    cs.LG stat.ML

    Rate-Regularization and Generalization in VAEs

    Authors: Alican Bozkurt, Babak Esmaeili, Jean-Baptiste Tristan, Dana H. Brooks, Jennifer G. Dy, Jan-Willem van de Meent

    Abstract: Variational autoencoders optimize an objective that combines a reconstruction loss (the distortion) and a KL term (the rate). The rate is an upper bound on the mutual information, which is often interpreted as a regularizer that controls the degree of compression. We here examine whether inclusion of the rate also acts as an inductive bias that improves generalization. We perform rate-distortion a… ▽ More

    Submitted 25 March, 2021; v1 submitted 11 November, 2019; originally announced November 2019.

  6. arXiv:1911.00385  [pdf, other

    cs.LG cs.LO stat.ML

    A Formal Proof of PAC Learnability for Decision Stumps

    Authors: Joseph Tassarotti, Koundinya Vajjha, Anindya Banerjee, Jean-Baptiste Tristan

    Abstract: We present a formal proof in Lean of probably approximately correct (PAC) learnability of the concept class of decision stumps. This classic result in machine learning theory derives a bound on error probabilities for a simple type of classifier. Though such a proof appears simple on paper, analytic and measure-theoretic subtleties arise when carrying it out fully formally. Our proof is structured… ▽ More

    Submitted 7 January, 2021; v1 submitted 1 November, 2019; originally announced November 2019.

    Comments: 13 pages, appeared in Certified Programs and Proofs (CPP) 2021

  7. arXiv:1810.01400  [pdf, other

    cs.LG stat.ML

    Sketching for Latent Dirichlet-Categorical Models

    Authors: Joseph Tassarotti, Jean-Baptiste Tristan, Michael Wick

    Abstract: Recent work has explored transforming data sets into smaller, approximate summaries in order to scale Bayesian inference. We examine a related problem in which the parameters of a Bayesian model are very large and expensive to store in memory, and propose more compact representations of parameter values that can be used during inference. We focus on a class of graphical models that we refer to as… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: 20 pages

  8. arXiv:1707.08608  [pdf, ps, other

    cs.CL

    Gradient-based Inference for Networks with Output Constraints

    Authors: Jay Yoon Lee, Sanket Vaibhav Mehta, Michael Wick, Jean-Baptiste Tristan, Jaime Carbonell

    Abstract: Practitioners apply neural networks to increasingly complex problems in natural language processing, such as syntactic parsing and semantic role labeling that have rich output structures. Many such structured-prediction problems require deterministic constraints on the output values; for example, in sequence-to-sequence syntactic parsing, we require that the sequential outputs encode valid trees.… ▽ More

    Submitted 22 April, 2019; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: AAAI 2019

  9. arXiv:1505.03851  [pdf, other

    cs.DC

    Using Butterfly-Patterned Partial Sums to Optimize GPU Memory Accesses for Drawing from Discrete Distributions

    Authors: Guy L. Steele Jr., Jean-Baptiste Tristan

    Abstract: We describe a technique for drawing values from discrete distributions, such as sampling from the random variables of a mixture model, that avoids computing a complete table of partial sums of the relative probabilities. A table of alternate ("butterfly-patterned") form is faster to compute, making better use of coalesced memory accesses. From this table, complete partial sums are computed on the… ▽ More

    Submitted 14 May, 2015; originally announced May 2015.

    Comments: 11 pages

  10. arXiv:1312.3613  [pdf, ps, other

    stat.ML cs.AI cs.DC cs.PL

    Augur: a Modeling Language for Data-Parallel Probabilistic Inference

    Authors: Jean-Baptiste Tristan, Daniel Huang, Joseph Tassarotti, Adam Pocock, Stephen J. Green, Guy L. Steele Jr

    Abstract: It is time-consuming and error-prone to implement inference procedures for each new probabilistic model. Probabilistic programming addresses this problem by allowing a user to specify the model and having a compiler automatically generate an inference procedure for it. For this approach to be practical, it is important to generate inference code that has reasonable performance. In this paper, we p… ▽ More

    Submitted 10 June, 2014; v1 submitted 12 December, 2013; originally announced December 2013.