
Showing 1–12 of 12 results for author: Banerjee, M

Searching in archive cs.
  1. arXiv:2405.16236  [pdf, ps, other]

    stat.ML cs.LG

    A statistical framework for weak-to-strong generalization

    Authors: Seamus Somerstep, Felipe Maia Polo, Moulinath Banerjee, Ya'acov Ritov, Mikhail Yurochkin, Yuekai Sun

    Abstract: Modern large language model (LLM) alignment techniques rely on human feedback, but it is unclear whether the techniques fundamentally limit the capabilities of aligned LLMs. In particular, it is unclear whether it is possible to align (stronger) LLMs with superhuman capabilities with (weaker) human feedback without degrading their capabilities. This is an instance of the weak-to-strong generalizat…

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2405.15172  [pdf, other]

    stat.ML cs.LG

    Learning the Distribution Map in Reverse Causal Performative Prediction

    Authors: Daniele Bracale, Subha Maity, Moulinath Banerjee, Yuekai Sun

    Abstract: In numerous predictive scenarios, the predictive model affects the sampling distribution; for example, job applicants often meticulously craft their resumes to navigate through a screening system. Such shifts in distribution are particularly prevalent in the realm of social computing, yet, the strategies to learn these shifts from data remain remarkably limited. Inspired by a microeconomic model…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 17 pages, 4 figures

  3. arXiv:2312.04601  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Estimating Fréchet bounds for validating programmatic weak supervision

    Authors: Felipe Maia Polo, Mikhail Yurochkin, Moulinath Banerjee, Subha Maity, Yuekai Sun

    Abstract: We develop methods for estimating Fréchet bounds on (possibly high-dimensional) distribution classes in which some variables are continuous-valued. We establish the statistical correctness of the computed bounds under uncertainty in the marginal constraints and demonstrate the usefulness of our algorithms by evaluating the performance of machine learning (ML) models trained with programmatic weak…

    Submitted 7 December, 2023; originally announced December 2023.

  4. arXiv:2307.02520  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Conditional independence testing under misspecified inductive biases

    Authors: Felipe Maia Polo, Yuekai Sun, Moulinath Banerjee

    Abstract: Conditional independence (CI) testing is a fundamental and challenging task in modern statistics and machine learning. Many modern methods for CI testing rely on powerful supervised learning methods to learn regression functions or Bayes predictors as an intermediate step; we refer to this class of tests as regression-based tests. Although these methods are guaranteed to control Type-I error when…

    Submitted 27 October, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 proceedings

  5. arXiv:2205.13577  [pdf, other]

    cs.LG stat.ME stat.ML

    Understanding new tasks through the lens of training data via exponential tilting

    Authors: Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun

    Abstract: Deploying machine learning models to new tasks is a major challenge despite the large size of the modern training datasets. However, it is conceivable that the training data can be reweighted to be more representative of the new (target) task. We consider the problem of reweighing the training samples to gain insights into the distribution of the target task. Specifically, we formulate a distribut…

    Submitted 21 February, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted in ICLR 2023

  6. arXiv:2205.13575  [pdf, other]

    cs.LG stat.CO

    Predictor-corrector algorithms for stochastic optimization under gradual distribution shift

    Authors: Subha Maity, Debarghya Mukherjee, Moulinath Banerjee, Yuekai Sun

    Abstract: Time-varying stochastic optimization problems frequently arise in machine learning practice (e.g. gradual domain shift, object tracking, strategic classification). Although most problems are solved in discrete time, the underlying process is often continuous in nature. We exploit this underlying continuity by developing predictor-corrector algorithms for time-varying stochastic optimizations. We p…

    Submitted 23 February, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted in ICLR 2023

  7. arXiv:2106.15301  [pdf, other]

    cs.CV cs.LG

    VolterraNet: A higher order convolutional network with group equivariance for homogeneous manifolds

    Authors: Monami Banerjee, Rudrasis Chakraborty, Jose Bouza, Baba C. Vemuri

    Abstract: Convolutional neural networks have been highly successful in image-based learning tasks due to their translation equivariance property. Recent work has generalized the traditional convolutional layer of a convolutional neural network to non-Euclidean spaces and shown group equivariance of the generalized convolution operation. In this paper, we present a novel higher order Volterra convolutional n…

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)

  8. arXiv:2006.11439  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Two Simple Ways to Learn Individual Fairness Metrics from Data

    Authors: Debarghya Mukherjee, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun

    Abstract: Individual fairness is an intuitive definition of algorithmic fairness that addresses some of the drawbacks of group fairness. Despite its benefits, it depends on a task specific fair metric that encodes our intuition of what is fair and unfair for the ML task at hand, and the lack of a widely accepted fair metric for many ML tasks is the main barrier to broader adoption of individual fairness. In…

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: To appear in ICML 2020

  9. arXiv:1805.11204  [pdf, other]

    cs.LG stat.ML

    A Statistical Recurrent Model on the Manifold of Symmetric Positive Definite Matrices

    Authors: Rudrasis Chakraborty, Chun-Hao Yang, Xingjian Zhen, Monami Banerjee, Derek Archer, David Vaillancourt, Vikas Singh, Baba C. Vemuri

    Abstract: In a number of disciplines, the data (e.g., graphs, manifolds) to be analyzed are non-Euclidean in nature. Geometric deep learning corresponds to techniques that generalize deep neural network models to such non-Euclidean spaces. Several recent papers have shown how convolutional neural networks (CNNs) can be extended to learn with graph-based data. In this work, we study the setting where the dat…

    Submitted 27 October, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted in Thirty-second Conference on Neural Information Processing Systems (NIPS), 2018

  10. arXiv:1805.05487  [pdf, other]

    cs.CV

    A CNN for homogeneous Riemannian manifolds with applications to Neuroimaging

    Authors: Rudrasis Chakraborty, Monami Banerjee, Baba C. Vemuri

    Abstract: Convolutional neural networks are ubiquitous in Machine Learning applications for solving a variety of problems. They however cannot be used in their native form when the domain of the data is commonly encountered manifolds such as the sphere, the special orthogonal group, the Grassmannian, the manifold of symmetric positive definite matrices and others. Most recently, generalization of CNNs to da…

    Submitted 6 August, 2018; v1 submitted 14 May, 2018; originally announced May 2018.

  11. arXiv:1805.02505  [pdf, other]

    cs.CV

    Dictionary Learning and Sparse Coding on Statistical Manifolds

    Authors: Rudrasis Chakraborty, Monami Banerjee, Baba C. Vemuri

    Abstract: In this paper, we propose a novel information theoretic framework for dictionary learning (DL) and sparse coding (SC) on a statistical manifold (the manifold of probability distributions). Unlike the traditional DL and SC framework, our new formulation does not explicitly incorporate any sparsity inducing norm in the cost function being optimized but yet yields sparse codes. Our algorithm approxim…

    Submitted 3 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1604.06939

  12. arXiv:1604.06939   

    cs.CV

    An information theoretic formulation of the Dictionary Learning and Sparse Coding Problems on Statistical Manifolds

    Authors: Rudrasis Chakraborty, Monami Banerjee, Victoria Crawford, Baba C. Vemuri

    Abstract: In this work, we propose a novel information theoretic framework for dictionary learning (DL) and sparse coding (SC) on a statistical manifold (the manifold of probability distributions). Unlike the traditional DL and SC framework, our new formulation does not explicitly incorporate any sparsity inducing norm in the cost function but yet yields SCs. Moreover, we extend this framework to the…

    Submitted 3 February, 2017; v1 submitted 23 April, 2016; originally announced April 2016.

    Comments: This paper has been withdrawn by the author due to major change