Skip to main content

Showing 1–6 of 6 results for author: Jernite, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:1705.08557  [pdf, other

    stat.ML cs.CL cs.LG cs.NE

    Grounded Recurrent Neural Networks

    Authors: Ankit Vani, Yacine Jernite, David Sontag

    Abstract: In this work, we present the Grounded Recurrent Neural Network (GRNN), a recurrent neural network architecture for multi-label prediction which explicitly ties labels to specific dimensions of the recurrent hidden state (we call this process "grounding"). The approach is particularly well-suited for extracting large numbers of concepts from text. We apply the new model to address an important prob… ▽ More

    Submitted 23 May, 2017; originally announced May 2017.

  2. arXiv:1705.00557  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning

    Authors: Yacine Jernite, Samuel R. Bowman, David Sontag

    Abstract: This work presents a novel objective function for the unsupervised training of neural network sentence encoders. It exploits signals from paragraph-level discourse coherence to train these models to understand text. Our objective is purely discriminative, allowing us to train models many times faster than was possible under prior methods, and it yields models which perform well in extrinsic evalua… ▽ More

    Submitted 23 April, 2017; originally announced May 2017.

  3. arXiv:1611.06188  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    Variable Computation in Recurrent Neural Networks

    Authors: Yacine Jernite, Edouard Grave, Armand Joulin, Tomas Mikolov

    Abstract: Recurrent neural networks (RNNs) have been used extensively and with increasing success to model various types of sequential data. Much of this progress has been achieved through devising recurrent units and architectures with the flexibility to capture complex statistics in the data, such as long range dependency or localized attention phenomena. However, while many sequential data (such as video… ▽ More

    Submitted 2 March, 2017; v1 submitted 18 November, 2016; originally announced November 2016.

  4. arXiv:1610.04658  [pdf, other

    stat.ML cs.CL cs.LG

    Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation

    Authors: Yacine Jernite, Anna Choromanska, David Sontag

    Abstract: We consider multi-class classification where the predictor has a hierarchical structure that allows for a very large number of labels both at train and test time. The predictive power of such models can heavily depend on the structure of the tree, and although past work showed how to learn the tree structure, it expected that the feature vectors remained static. We provide a novel algorithm to sim… ▽ More

    Submitted 2 March, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

  5. arXiv:1508.06615  [pdf, other

    cs.CL cs.NE stat.ML

    Character-Aware Neural Language Models

    Authors: Yoon Kim, Yacine Jernite, David Sontag, Alexander M. Rush

    Abstract: We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing… ▽ More

    Submitted 1 December, 2015; v1 submitted 26 August, 2015; originally announced August 2015.

    Comments: AAAI 2016

  6. The random subgraph model for the analysis of an ecclesiastical network in Merovingian Gaul

    Authors: Yacine Jernite, Pierre Latouche, Charles Bouveyron, Patrick Rivera, Laurent Jegou, Stéphane Lamassé

    Abstract: In the last two decades many random graph models have been proposed to extract knowledge from networks. Most of them look for communities or, more generally, clusters of vertices with homogeneous connection profiles. While the first models focused on networks with binary edges only, extensions now allow to deal with valued networks. Recently, new models were also introduced in order to characteriz… ▽ More

    Submitted 5 May, 2014; v1 submitted 21 December, 2012; originally announced December 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOAS691 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS691

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 1, 377-405