Skip to main content

Showing 1–12 of 12 results for author: Paria, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.12558  [pdf, other

    cs.CL

    Gradient-Based Constrained Sampling from Language Models

    Authors: Sachin Kumar, Biswajit Paria, Yulia Tsvetkov

    Abstract: Large pretrained language models generate fluent text but are notoriously hard to controllably sample from. In this work, we study constrained sampling from such language models: generating text that satisfies user-defined constraints, while maintaining fluency and the model's performance in a downstream task. We propose MuCoLa -- a sampling procedure that combines the log-likelihood of the langua… ▽ More

    Submitted 8 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  2. arXiv:2204.10414  [pdf, other

    cs.LG stat.ML

    Dirichlet Proportions Model for Hierarchically Coherent Probabilistic Forecasting

    Authors: Abhimanyu Das, Weihao Kong, Biswajit Paria, Rajat Sen

    Abstract: Probabilistic, hierarchically coherent forecasting is a key problem in many practical forecasting applications -- the goal is to obtain coherent probabilistic predictions for a large number of time series arranged in a pre-specified tree hierarchy. In this paper, we present an end-to-end deep probabilistic model for hierarchical forecasting that is motivated by a classical top-down strategy. It jo… ▽ More

    Submitted 1 March, 2023; v1 submitted 21 April, 2022; originally announced April 2022.

  3. arXiv:2112.05244  [pdf, other

    cs.LG cs.AI cs.IT cs.RO stat.ML

    An Experimental Design Perspective on Model-Based Reinforcement Learning

    Authors: Viraj Mehta, Biswajit Paria, Jeff Schneider, Stefano Ermon, Willie Neiswanger

    Abstract: In many practical applications of RL, it is expensive to observe state transitions from the environment. For example, in the problem of plasma control for nuclear fusion, computing the next state for a given state-action pair requires querying an expensive transition function which can lead to many hours of computer simulation or dollars of scientific research. Such expensive data collection prohi… ▽ More

    Submitted 15 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Conference paper at ICLR 2022

  4. arXiv:2106.07630  [pdf, other

    cs.LG

    Hierarchically Regularized Deep Forecasting

    Authors: Biswajit Paria, Rajat Sen, Amr Ahmed, Abhimanyu Das

    Abstract: Hierarchical forecasting is a key problem in many practical multivariate forecasting applications - the goal is to simultaneously predict a large number of correlated time series that are arranged in a pre-specified aggregation hierarchy. The main challenge is to exploit the hierarchical correlations to simultaneously obtain good prediction accuracy for time series at different levels of the hiera… ▽ More

    Submitted 12 October, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  5. arXiv:2004.05665  [pdf, other

    cs.LG stat.ML

    Minimizing FLOPs to Learn Efficient Sparse Representations

    Authors: Biswajit Paria, Chih-Kuan Yeh, Ian E. H. Yen, Ning Xu, Pradeep Ravikumar, Barnabás Póczos

    Abstract: Deep representation learning has become one of the most widely adopted approaches for visual search, recommendation, and identification. Retrieval of such representations from a large database is however computationally challenging. Approximate methods based on learning compact representations, have been widely explored for this problem, such as locality sensitive hashing, product quantization, an… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: Published at ICLR 2020

  6. arXiv:2001.09938  [pdf, other

    physics.app-ph cs.LG

    Autonomous discovery of battery electrolytes with robotic experimentation and machine-learning

    Authors: Adarsh Dave, Jared Mitchell, Kirthevasan Kandasamy, Sven Burke, Biswajit Paria, Barnabas Poczos, Jay Whitacre, Venkatasubramanian Viswanathan

    Abstract: Innovations in batteries take years to formulate and commercialize, requiring extensive experimentation during the design and optimization phases. We approached the design and selection of a battery electrolyte through a black-box optimization algorithm directly integrated into a robotic test-stand. We report here the discovery of a novel battery electrolyte by this experiment completely guided by… ▽ More

    Submitted 22 October, 2019; originally announced January 2020.

    Comments: 23 pages, 4 figures, 10 pages of Extended Data

    Journal ref: Cell Reports Physical Science, 1, (2020) 100264

  7. arXiv:1903.06694  [pdf, other

    stat.ML cs.AI cs.LG

    Tuning Hyperparameters without Grad Students: Scalable and Robust Bayesian Optimisation with Dragonfly

    Authors: Kirthevasan Kandasamy, Karun Raju Vysyaraju, Willie Neiswanger, Biswajit Paria, Christopher R. Collins, Jeff Schneider, Barnabas Poczos, Eric P. Xing

    Abstract: Bayesian Optimisation (BO) refers to a suite of techniques for global optimisation of expensive black box functions, which use introspective Bayesian models of the function to efficiently search for the optimum. While BO has been applied successfully in many applications, modern optimisation tasks usher in new challenges where conventional methods fail spectacularly. In this work, we present Drago… ▽ More

    Submitted 19 April, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: Journal of Machine Learning Research 2020, Special Issue on Bayesian Optimization

  8. arXiv:1805.12168  [pdf, other

    cs.LG stat.ML

    A Flexible Framework for Multi-Objective Bayesian Optimization using Random Scalarizations

    Authors: Biswajit Paria, Kirthevasan Kandasamy, Barnabás Póczos

    Abstract: Many real world applications can be framed as multi-objective optimization problems, where we wish to simultaneously optimize for multiple criteria. Bayesian optimization techniques for the multi-objective setting are pertinent when the evaluation of the functions in question are expensive. Traditional methods for multi-objective optimization, both Bayesian and otherwise, are aimed at recovering t… ▽ More

    Submitted 20 June, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to UAI 2019

  9. arXiv:1701.04548  [pdf, ps, other

    cs.DM

    Analytic Connectivity in General Hypergraphs

    Authors: Ashwin Guha, Muni Sreenivas Pydi, Biswajit Paria, Ambedkar Dukkipati

    Abstract: In this paper we extend the known results of analytic connectivity to non-uniform hypergraphs. We prove a modified Cheeger's inequality and also give a bound on analytic connectivity with respect to the degree sequence and diameter of a hypergraph.

    Submitted 17 January, 2017; originally announced January 2017.

  10. arXiv:1611.04741  [pdf, other

    cs.CL

    A Neural Architecture Mimicking Humans End-to-End for Natural Language Inference

    Authors: Biswajit Paria, K. M. Annervaz, Ambedkar Dukkipati, Ankush Chatterjee, Sanjay Podder

    Abstract: In this work we use the recent advances in representation learning to propose a neural architecture for the problem of natural language inference. Our approach is aligned to mimic how a human does the natural language inference process given two statements. The model uses variants of Long Short Term Memory (LSTM), attention mechanism and composable neural networks, to carry out the task. Each part… ▽ More

    Submitted 27 January, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

    Comments: 8 pages, 2 figures

  11. arXiv:1608.01874  [pdf, other

    cs.LG

    Forward Stagewise Additive Model for Collaborative Multiview Boosting

    Authors: Avisek Lahiri, Biswajit Paria, Prabir Kumar Biswas

    Abstract: Multiview assisted learning has gained significant attention in recent years in supervised learning genre. Availability of high performance computing devices enables learning algorithms to search simultaneously over multiple views or feature spaces to obtain an optimum classification performance. The paper is a pioneering attempt of formulating a mathematical foundation for realizing a multiview a… ▽ More

    Submitted 5 August, 2016; originally announced August 2016.

    Comments: The manuscript is currently under "Minor Revision" at IEEE Transactions on Neural Networks and Learning Systems

  12. arXiv:1604.02646  [pdf, other

    cs.LG cs.CV cs.NE

    Visualization Regularizers for Neural Network based Image Recognition

    Authors: Biswajit Paria, Vikas Reddy, Anirban Santara, Pabitra Mitra

    Abstract: The success of deep neural networks is mostly due their ability to learn meaningful features from the data. Features learned in the hidden layers of deep neural networks trained in computer vision tasks have been shown to be similar to mid-level vision features. We leverage this fact in this work and propose the visualization regularizer for image tasks. The proposed regularization technique enfor… ▽ More

    Submitted 3 January, 2017; v1 submitted 10 April, 2016; originally announced April 2016.