Showing 1–2 of 2 results for author: Vani, A

Search v0.5.6 released 2020-02-24

arXiv:1906.08325 [pdf, other]

cs.LG stat.ML

GAIT: A Geometric Approach to Information Theory

Authors: Jose Gallego-Posada, Ankit Vani, Max Schwarzer, Simon Lacoste-Julien

Abstract: We advocate the use of a notion of entropy that reflects the relative abundances of the symbols in an alphabet, as well as the similarities between them. This concept was originally introduced in theoretical ecology to study the diversity of ecosystems. Based on this notion of entropy, we introduce geometry-aware counterparts for several concepts and theorems in information theory. Notably, our pr… ▽ More We advocate the use of a notion of entropy that reflects the relative abundances of the symbols in an alphabet, as well as the similarities between them. This concept was originally introduced in theoretical ecology to study the diversity of ecosystems. Based on this notion of entropy, we introduce geometry-aware counterparts for several concepts and theorems in information theory. Notably, our proposed divergence exhibits performance on par with state-of-the-art methods based on the Wasserstein distance, but enjoys a closed-form expression that can be computed efficiently. We demonstrate the versatility of our method via experiments on a broad range of domains: training generative models, computing image barycenters, approximating empirical measures and counting modes. △ Less

Submitted 13 October, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

Comments: Appears in: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020. 19 pages

Journal ref: PMLR (2020) 108:2601-2611
arXiv:1705.08557 [pdf, other]

stat.ML cs.CL cs.LG cs.NE

Grounded Recurrent Neural Networks

Authors: Ankit Vani, Yacine Jernite, David Sontag

Abstract: In this work, we present the Grounded Recurrent Neural Network (GRNN), a recurrent neural network architecture for multi-label prediction which explicitly ties labels to specific dimensions of the recurrent hidden state (we call this process "grounding"). The approach is particularly well-suited for extracting large numbers of concepts from text. We apply the new model to address an important prob… ▽ More In this work, we present the Grounded Recurrent Neural Network (GRNN), a recurrent neural network architecture for multi-label prediction which explicitly ties labels to specific dimensions of the recurrent hidden state (we call this process "grounding"). The approach is particularly well-suited for extracting large numbers of concepts from text. We apply the new model to address an important problem in healthcare of understanding what medical concepts are discussed in clinical text. Using a publicly available dataset derived from Intensive Care Units, we learn to label a patient's diagnoses and procedures from their discharge summary. Our evaluation shows a clear advantage to using our proposed architecture over a variety of strong baselines. △ Less

Submitted 23 May, 2017; originally announced May 2017.

Search v0.5.6 released 2020-02-24