Showing 1–17 of 17 results for author: Sercu, T

  1. arXiv:2005.11248  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics

    Authors: Payel Das, Tom Sercu, Kahini Wadhawan, Inkit Padhi, Sebastian Gehrmann, Flaviu Cipcigan, Vijil Chenthamarakshan, Hendrik Strobelt, Cicero dos Santos, Pin-Yu Chen, Yi Yan Yang, Jeremy Tan, James Hedrick, Jason Crain, Aleksandra Mojsilovic

    Abstract: De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. We propose CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled u…

    Submitted 25 February, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Journal ref: Nature Biomedical Engineering (2021)

  2. arXiv:1910.14212  [pdf, other]

    cs.LG stat.ML

    Sobolev Independence Criterion

    Authors: Youssef Mroueh, Tom Sercu, Mattia Rigotti, Inkit Padhi, Cicero Dos Santos

    Abstract: We propose the Sobolev Independence Criterion (SIC), an interpretable dependency measure between a high dimensional random variable X and a response variable Y. SIC decomposes to the sum of feature importance scores and hence can be used for nonlinear feature selection. SIC can be seen as a gradient regularized Integral Probability Metric (IPM) between the joint distribution of the two random var…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  3. arXiv:1907.13121  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition

    Authors: Tom Sercu, Neil Mallinar

    Abstract: We introduce Multi-Frame Cross-Entropy training (MFCE) for convolutional neural network acoustic models. Recognizing that similar to RNNs, CNNs are in nature sequence models that take variable length inputs, we propose to take as input to the CNN a part of an utterance long enough that multiple labels are predicted at once, therefore getting cross-entropy loss signal from multiple adjacent frames.…

    Submitted 29 July, 2019; originally announced July 2019.

  4. arXiv:1902.04999  [pdf, other]

    cs.LG stat.ML

    Wasserstein Barycenter Model Ensembling

    Authors: Pierre Dognin, Igor Melnyk, Youssef Mroueh, Jerret Ross, Cicero Dos Santos, Tom Sercu

    Abstract: In this paper we propose to perform model ensembling in a multiclass or a multilabel learning setting using Wasserstein (W.) barycenters. Optimal transport metrics, such as the Wasserstein distance, allow incorporating semantic side information such as word embeddings. Using W. barycenters to find the consensus between models allows us to balance confidence and semantics in finding the agreement b…

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: ICLR 2019

  5. arXiv:1810.07743  [pdf, other]

    q-bio.QM cs.LG stat.ML

    PepCVAE: Semi-Supervised Targeted Design of Antimicrobial Peptide Sequences

    Authors: Payel Das, Kahini Wadhawan, Oscar Chang, Tom Sercu, Cicero Dos Santos, Matthew Riemer, Vijil Chenthamarakshan, Inkit Padhi, Aleksandra Mojsilovic

    Abstract: Given the emerging global threat of antimicrobial resistance, new methods for next-generation antimicrobial design are urgently needed. We report a peptide generation framework PepCVAE, based on a semi-supervised variational autoencoder (VAE) model, for designing novel antimicrobial peptide (AMP) sequences. Our model learns a rich latent space of the biological peptide context by taking advantage…

    Submitted 13 November, 2018; v1 submitted 17 October, 2018; originally announced October 2018.

  6. arXiv:1807.03848  [pdf, ps, other]

    cs.CV

    Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

    Authors: Chun-Fu Chen, Quanfu Fan, Neil Mallinar, Tom Sercu, Rogerio Feris

    Abstract: In this paper, we propose a novel Convolutional Neural Network (CNN) architecture for learning multi-scale feature representations with good tradeoffs between speed and accuracy. This is achieved by using a multi-branch network, which has different computational complexity at different branches. Through frequent merging of features from branches at distinct scales, our model obtains multi-scale fe…

    Submitted 30 July, 2019; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: git repo: https://github.com/IBM/BigLittleNet

  7. arXiv:1805.12062  [pdf, other]

    cs.LG stat.ML

    Sobolev Descent

    Authors: Youssef Mroueh, Tom Sercu, Anant Raj

    Abstract: We study a simplification of GAN training: the problem of transporting particles from a source to a target distribution. Starting from the Sobolev GAN critic, part of the gradient regularized GAN family, we show a strong relation with Optimal Transport (OT). Specifically with the less popular dynamic formulation of OT that finds a path of distributions from source to target minimizing a "kinetic…

    Submitted 5 August, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: AISTATS 2019

  8. arXiv:1805.00063  [pdf, other]

    cs.LG cs.CL cs.CV stat.ML

    Adversarial Semantic Alignment for Improved Image Captions

    Authors: Pierre L. Dognin, Igor Melnyk, Youssef Mroueh, Jerret Ross, Tom Sercu

    Abstract: In this paper we study image captioning as a conditional GAN training, proposing both a context-aware LSTM captioner and co-attentive discriminator, which enforces semantic alignment between images and captions. We empirically focus on the viability of two training methods: Self-critical Sequence Training (SCST) and Gumbel Straight-Through (ST) and demonstrate that SCST shows more stable gradient…

    Submitted 6 June, 2019; v1 submitted 30 April, 2018; originally announced May 2018.

    Comments: Authors Equal Contribution, CVPR 2019

  9. arXiv:1712.02505  [pdf, other]

    cs.LG

    Semi-Supervised Learning with IPM-based GANs: an Empirical Study

    Authors: Tom Sercu, Youssef Mroueh

    Abstract: We present an empirical investigation of a recent class of Generative Adversarial Networks (GANs) using Integral Probability Metrics (IPM) and their performance for semi-supervised learning. IPM-based GANs like Wasserstein GAN, Fisher GAN and Sobolev GAN have desirable properties in terms of theoretical understanding, training stability, and a meaningful loss. In this work we investigate how the d…

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: Appeared at NIPS 2017 Workshop: Deep Learning: Bridging Theory and Practice

  10. arXiv:1711.04894  [pdf, other]

    cs.LG stat.ML

    Sobolev GAN

    Authors: Youssef Mroueh, Chun-Liang Li, Tom Sercu, Anant Raj, Yu Cheng

    Abstract: We propose a new Integral Probability Metric (IPM) between distributions: the Sobolev IPM. The Sobolev IPM compares the mean discrepancy of two distributions for functions (critic) restricted to a Sobolev ball defined with respect to a dominant measure $μ$. We show that the Sobolev IPM compares two distributions in high dimensions based on weighted conditional Cumulative Distribution Functions (CD…

    Submitted 13 November, 2017; originally announced November 2017.

  11. arXiv:1705.09675  [pdf, other]

    cs.LG stat.ML

    Fisher GAN

    Authors: Youssef Mroueh, Tom Sercu

    Abstract: Generative Adversarial Networks (GANs) are powerful models for learning complex distributions. Stable training of GANs has been addressed in many recent works which explore different metrics between distributions. In this paper we introduce Fisher GAN which fits within the Integral Probability Metrics (IPM) framework for training GANs. Fisher GAN defines a critic with a data dependent constraint o…

    Submitted 3 November, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

    Comments: Published at NIPS 2017. v2: added inception score table & plot update, relation to f-gan, illustration (Figure 1). v3: added strong SSL results for critic without batch normalization

  12. arXiv:1703.02136  [pdf, other]

    cs.CL

    English Conversational Telephone Speech Recognition by Humans and Machines

    Authors: George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall

    Abstract: One of the most difficult speech recognition tasks is accurate recognition of human to human communication. Advances in deep learning over the last few years have produced major speech recognition improvements on the representative Switchboard conversational corpus. Word error rates that just a few years ago were 14% have dropped to 8.0%, then 6.6% and most recently 5.8%, and are now believed to b…

    Submitted 6 March, 2017; originally announced March 2017.

  13. arXiv:1702.08398  [pdf, other]

    cs.LG stat.ML

    McGan: Mean and Covariance Feature Matching GAN

    Authors: Youssef Mroueh, Tom Sercu, Vaibhava Goel

    Abstract: We introduce new families of Integral Probability Metrics (IPM) for training Generative Adversarial Networks (GAN). Our IPMs are based on matching statistics of distributions embedded in a finite dimensional feature space. Mean and covariance feature matching IPMs allow for stable training of GANs, which we will call McGan. McGan minimizes a meaningful loss between distributions.

    Submitted 8 June, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: 15 pages; published at ICML 2017

  14. arXiv:1611.09288  [pdf, other]

    cs.CL cs.LG cs.NE

    Dense Prediction on Sequences with Time-Dilated Convolutions for Speech Recognition

    Authors: Tom Sercu, Vaibhava Goel

    Abstract: In computer vision pixelwise dense prediction is the task of predicting a label for each pixel in the image. Convolutional neural networks achieve good performance on this task, while being computationally efficient. In this paper we carry these ideas over to the problem of assigning a sequence of labels to a set of speech frames, a task commonly known as framewise classification. We show that den…

    Submitted 14 December, 2016; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: Appeared at NIPS 2016 End-to-end Learning for Speech and Audio Processing Workshop

  15. arXiv:1604.08242  [pdf, other]

    cs.CL

    The IBM 2016 English Conversational Telephone Speech Recognition System

    Authors: George Saon, Tom Sercu, Steven Rennie, Hong-Kwang J. Kuo

    Abstract: We describe a collection of acoustic and language modeling techniques that lowered the word error rate of our English conversational telephone LVCSR system to a record 6.6% on the Switchboard subset of the Hub5 2000 evaluation testset. On the acoustic side, we use a score fusion of three strong models: recurrent nets with maxout activations, very deep convolutional nets with 3x3 kernels, and bidir…

    Submitted 22 June, 2016; v1 submitted 27 April, 2016; originally announced April 2016.

    Comments: Submitted to Interspeech 2016

  16. arXiv:1604.01792  [pdf, other]

    cs.CL cs.LG cs.NE

    Advances in Very Deep Convolutional Neural Networks for LVCSR

    Authors: Tom Sercu, Vaibhava Goel

    Abstract: Very deep CNNs with small 3x3 kernels have recently been shown to achieve very strong performance as acoustic models in hybrid NN-HMM speech recognition systems. In this paper we investigate how to efficiently scale these models to larger datasets. Specifically, we address the design choice of pooling and padding along the time dimension which renders convolutional evaluation of sequences highly i…

    Submitted 24 June, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: Proc. Interspeech 2016

  17. arXiv:1509.08967  [pdf, other]

    cs.CL cs.NE

    Very Deep Multilingual Convolutional Neural Networks for LVCSR

    Authors: Tom Sercu, Christian Puhrsch, Brian Kingsbury, Yann LeCun

    Abstract: Convolutional neural networks (CNNs) are a standard component of many current state-of-the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However, CNNs in LVCSR have not kept pace with recent advances in other domains where deeper neural networks provide superior performance. In this paper we propose a number of architectural advances in CNNs for LVCSR. First, we introduce a v…

    Submitted 23 January, 2016; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: Accepted for publication at ICASSP 2016