Showing 1–11 of 11 results for author: Sankar, A

Searching in archive stat.
  1. arXiv:2404.02141

    stat.ME cs.LG econ.EM stat.CO stat.ML

    Robustly estimating heterogeneity in factorial data using Rashomon Partitions

    Authors: Aparajithan Venkateswaran, Anirudh Sankar, Arun G. Chandrasekhar, Tyler H. McCormick

    Abstract: Many statistical analyses, in both observational data and randomized control trials, ask: how does the outcome of interest vary with combinations of observable covariates? How do various drug combinations affect health outcomes, or how does technology adoption depend on incentives and demographics? Our goal is to partition this factorial space into "pools" of covariate combinations where the outco…

    Submitted 25 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  2. arXiv:2104.09645

    stat.AP q-bio.PE

    Selecting the Most Effective Nudge: Evidence from a Large-Scale Experiment on Immunization

    Authors: Abhijit Banerjee, Arun G. Chandrasekhar, Suresh Dalpath, Esther Duflo, John Floretta, Matthew O. Jackson, Harini Kannan, Francine Loza, Anirudh Sankar, Anna Schrimpf, Maheshwor Shrestha

    Abstract: Policymakers often choose a policy bundle that is a combination of different interventions in different dosages. We develop a new technique -- treatment variant aggregation (TVA) -- to select a policy from a large factorial design. TVA pools together policy variants that are not meaningfully different and prunes those deemed ineffective. This allows us to restrict attention to aggregated policy va…

    Submitted 12 September, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

  3. arXiv:2006.07630

    cs.CV stat.ML

    Equivariant Neural Rendering

    Authors: Emilien Dupont, Miguel Angel Bautista, Alex Colburn, Aditya Sankar, Carlos Guestrin, Josh Susskind, Qi Shan

    Abstract: We propose a framework for learning neural scene representations directly from images, without 3D supervision. Our key insight is that 3D structure can be imposed by ensuring that the learned representation transforms like a real 3D scene. Specifically, we introduce a loss which enforces equivariance of the scene representation with respect to 3D transformations. Our formulation allows us to infer…

    Submitted 21 December, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Add link to code

  4. arXiv:2003.08469

    cs.LG cs.CV eess.IV stat.ML

    Train, Learn, Expand, Repeat

    Authors: Abhijeet Parida, Aadhithya Sankar, Rami Eisawy, Tom Finck, Benedikt Wiestler, Franz Pfister, Julia Moosbauer

    Abstract: High-quality labeled data is essential to successfully train supervised machine learning models. Although a large amount of unlabeled data is present in the medical domain, labeling poses a major challenge: medical professionals who can expertly label the data are a scarce and expensive resource. Making matters worse, voxel-wise delineation of data (e.g. for segmentation tasks) is tedious and suff…

    Submitted 19 April, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Published as a workshop paper at AI4AH, ICLR 2020

  5. DANTE: Deep AlterNations for Training nEural networks

    Authors: Vaibhav B Sinha, Sneha Kudugunta, Adepu Ravi Sankar, Surya Teja Chavali, Purushottam Kar, Vineeth N Balasubramanian

    Abstract: We present DANTE, a novel method for training neural networks using the alternating minimization principle. DANTE provides an alternate perspective to traditional gradient-based backpropagation techniques commonly used to train deep networks. It utilizes an adaptation of quasi-convexity to cast training a neural network as a bi-quasi-convex optimization problem. We show that for neural network con…

    Submitted 9 August, 2020; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 19 pages

    Journal ref: Neural Networks 131 (2020) 127-143

  6. arXiv:1812.09430

    cs.LG cs.SI stat.ML

    Dynamic Graph Representation Learning via Self-Attention Networks

    Authors: Aravind Sankar, Yanhong Wu, Liang Gou, Wei Zhang, Hao Yang

    Abstract: Learning latent representations of nodes in graphs is an important and ubiquitous task with widespread applications such as link prediction, node classification, and graph visualization. Previous methods on graph representation learning mainly focus on static graphs, however, many real-world graphs are dynamic and evolve over time. In this paper, we present Dynamic Self-Attention Network (DySAT),…

    Submitted 15 June, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

  7. arXiv:1807.08140

    cs.LG math.OC stat.ML

    On the Analysis of Trajectories of Gradient Descent in the Optimization of Deep Neural Networks

    Authors: Adepu Ravi Sankar, Vishwak Srinivasan, Vineeth N Balasubramanian

    Abstract: Theoretical analysis of the error landscape of deep neural networks has garnered significant interest in recent years. In this work, we theoretically study the importance of noise in the trajectories of gradient descent towards optimal solutions in multi-layer neural networks. We show that adding noise (in different ways) to a neural network while training increases the rank of the product of weig…

    Submitted 21 July, 2018; originally announced July 2018.

    Comments: 4 pages + 1 figure (main, excluding references), 5 pages + 4 figures (appendix)

  8. arXiv:1712.07424

    stat.ML cs.LG

    ADINE: An Adaptive Momentum Method for Stochastic Gradient Descent

    Authors: Vishwak Srinivasan, Adepu Ravi Sankar, Vineeth N Balasubramanian

    Abstract: Two major momentum-based techniques that have achieved tremendous success in optimization are Polyak's heavy ball method and Nesterov's accelerated gradient. A crucial step in all momentum-based methods is the choice of the momentum parameter $m$ which is always suggested to be set to less than $1$. Although the choice of $m < 1$ is justified only under very strong theoretical assumptions, it work…

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: 8 + 1 pages, 12 figures, accepted at CoDS-COMAD 2018

  9. arXiv:1711.07274

    cs.CL cs.SD eess.AS stat.ML

    Speech recognition for medical conversations

    Authors: Chung-Cheng Chiu, Anshuman Tripathi, Katherine Chou, Chris Co, Navdeep Jaitly, Diana Jaunzeikare, Anjuli Kannan, Patrick Nguyen, Hasim Sak, Ananth Sankar, Justin Tansuwan, Nathan Wan, Yonghui Wu, Xuedong Zhang

    Abstract: In this work we explored building automatic speech recognition models for transcribing doctor patient conversation. We collected a large scale dataset of clinical conversations ($14,000$ hr), designed the task to represent the real world scenario, and explored several alignment approaches to iteratively improve data quality. We explored both CTC and LAS systems for building speech recognition model…

    Submitted 20 June, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: Interspeech 2018 camera ready

  10. arXiv:1706.02052

    stat.ML cs.LG cs.NE

    Are Saddles Good Enough for Deep Learning?

    Authors: Adepu Ravi Sankar, Vineeth N Balasubramanian

    Abstract: Recent years have seen a growing interest in understanding deep neural networks from an optimization perspective. It is understood now that converging to low-cost local minima is sufficient for such models to become effective in practice. However, in this work, we propose a new hypothesis based on recent theoretical findings and empirical studies that deep neural network models actually converge t…

    Submitted 7 June, 2017; originally announced June 2017.

  11. arXiv:1511.06546

    q-bio.GN q-bio.QM stat.AP

    Bayesian identification of bacterial strains from sequencing data

    Authors: Aravind Sankar, Brandon Malone, Sion Bayliss, Ben Pascoe, Guillaume Méric, Matthew D. Hitchings, Samuel K. Sheppard, Edward J. Feil, Jukka Corander, Antti Honkela

    Abstract: Rapidly assaying the diversity of a bacterial species present in a sample obtained from a hospital patient or an environmental source has become possible after recent technological advances in DNA sequencing. For several applications it is important to accurately identify the presence and estimate relative abundances of the target organisms from short sequence reads obtained from a sample. This tas…

    Submitted 17 February, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: 16 pages, 7 figures