Showing 1–2 of 2 results for author: Letarte, G

Search v0.5.6 released 2020-02-24

arXiv:1905.10259 [pdf, other]

cs.LG stat.ML

Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural Networks

Authors: Gaël Letarte, Pascal Germain, Benjamin Guedj, François Laviolette

Abstract: We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Our results are obtained by minimizing the expected l… ▽ More We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Our results are obtained by minimizing the expected loss of an architecture-dependent aggregation of binary activated deep neural networks. Our analysis inherently overcomes the fact that binary activation function is non-differentiable. The performance of our approach is assessed on a thorough numerical experiment protocol on real-life datasets. △ Less

Submitted 4 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

Journal ref: NeurIPS 2019
arXiv:1810.12683 [pdf, other]

stat.ML cs.LG

Pseudo-Bayesian Learning with Kernel Fourier Transform as Prior

Authors: Gaël Letarte, Emilie Morvant, Pascal Germain

Abstract: We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF) method through the lens of the PAC-Bayesian theory. While the primary goal of RFF is to approximate a kernel, we look at the Fourier transform as a prior distribution over trigonometric hypotheses. It naturally suggests learning a posterior on these hypotheses. We derive generalization bounds that are optimized by learning a… ▽ More We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF) method through the lens of the PAC-Bayesian theory. While the primary goal of RFF is to approximate a kernel, we look at the Fourier transform as a prior distribution over trigonometric hypotheses. It naturally suggests learning a posterior on these hypotheses. We derive generalization bounds that are optimized by learning a pseudo-posterior obtained from a closed-form expression. Based on this study, we consider two learning strategies: The first one finds a compact landmarks-based representation of the data where each landmark is given by a distribution-tailored similarity measure, while the second one provides a PAC-Bayesian justification to the kernel alignment method of Sinha and Duchi (2016). △ Less

Submitted 27 March, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

Comments: Published at AISTATS 2019

Search v0.5.6 released 2020-02-24