Showing 1–27 of 27 results for author: Guha, N

  1. arXiv:2402.11729  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Prospector Heads: Generalized Feature Attribution for Large Models & Data

    Authors: Gautam Machiraju, Alexander Derry, Arjun Desai, Neel Guha, Amir-Hossein Karimi, James Zou, Russ Altman, Christopher Ré, Parag Mallick

    Abstract: Feature attribution, the ability to localize regions of the input data that are relevant for classification, is an important capability for ML models in scientific and biomedical domains. Current methods for feature attribution, which rely on "explaining" the predictions of end-to-end classifiers, suffer from imprecise feature localization and are inadequate for use with small sample sizes and hig…

    Submitted 19 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 30 pages, 16 figures, 8 tables. Accepted to ICML 2024

  2. arXiv:2402.07440  [pdf, other]

    cs.IR cs.LG

    Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

    Authors: Jon Saad-Falcon, Daniel Y. Fu, Simran Arora, Neel Guha, Christopher Ré

    Abstract: Retrieval pipelines, an integral component of many machine learning systems, perform poorly in domains where documents are long (e.g., 10K tokens or more) and where identifying the relevant document requires synthesizing information across the entire text. Developing long-context retrieval encoders suitable for these domains raises three challenges: (1) how to evaluate long-context retrieval perform…

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2308.11462  [pdf, other]

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  4. arXiv:2307.11031  [pdf, ps, other]

    cs.LG cs.CL

    Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification

    Authors: Neel Guha, Mayee F. Chen, Kush Bhatia, Azalia Mirhoseini, Frederic Sala, Christopher Ré

    Abstract: Recent work has shown that language models' (LMs) prompt-based learning capabilities make them well suited for automating data labeling in domains where manual annotation is expensive. The challenge is that while writing an initial prompt is cheap, improving a prompt is costly: practitioners often require significant labeled data in order to evaluate the impact of prompt modifications. Our work…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 38 pages, 22 figures, 8 tables

  5. arXiv:2305.06848  [pdf, other]

    math.OC math.NA

    Stochastic Variance-Reduced Majorization-Minimization Algorithms

    Authors: Duy-Nhat Phan, Sedi Bartz, Nilabja Guha, Hung M. Phan

    Abstract: We study a class of nonconvex nonsmooth optimization problems in which the objective is a sum of two functions: One function is the average of a large number of differentiable functions, while the other function is proper, lower semicontinuous and has a surrogate function that satisfies standard assumptions. Such problems arise in machine learning and regularized empirical risk minimization applic…

    Submitted 11 May, 2023; originally announced May 2023.

    MSC Class: 90C26; 65K05

  6. arXiv:2211.09110  [pdf, other]

    cs.CL cs.AI cs.LG

    Holistic Evaluation of Language Models

    Authors: Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, et al. (25 additional authors not shown)

    Abstract: Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest fo…

    Submitted 1 October, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Project page: https://crfm.stanford.edu/helm/v1.0

    Journal ref: Published in Transactions on Machine Learning Research (TMLR), 2023

  7. arXiv:2210.02441  [pdf, other]

    cs.CL

    Ask Me Anything: A simple strategy for prompting language models

    Authors: Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré

    Abstract: Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect promp…

    Submitted 19 November, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  8. arXiv:2209.06120  [pdf, ps, other]

    cs.AI

    LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

    Authors: Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré

    Abstract: Can foundation models be guided to execute tasks involving legal reasoning? We believe that building a benchmark to answer this question will require sustained collaborative efforts between the computer science and legal communities. To that end, this short paper serves three purposes. First, we describe how IRAC, a framework legal scholars use to distinguish different types of legal reasoning, can…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 tables

  9. arXiv:2207.03242  [pdf, ps, other]

    stat.ME

    A Bayesian Survival Tree Partition Model Using Latent Gaussian Processes

    Authors: Richard D. Payne, Nilabja Guha, Bani K. Mallick

    Abstract: Survival models are used to analyze time-to-event data in a variety of disciplines. Proportional hazard models provide interpretable parameter estimates, but proportional hazards assumptions are not always appropriate. Non-parametric models are more flexible but often lack a clear inferential framework. We propose a Bayesian tree partition model which is both flexible and inferential. Inference is…

    Submitted 7 July, 2022; originally announced July 2022.

  10. arXiv:2207.00220  [pdf, other]

    cs.CL cs.CY

    Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

    Authors: Peter Henderson, Mark S. Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel E. Ho

    Abstract: One concern with the rise of large language models lies with their potential for significant harm, particularly from pretraining on biased, obscene, copyrighted, and private information. Emerging ethical approaches have attempted to filter pretraining material, but such approaches have been ad hoc and failed to take context into account. We offer an approach to filtering grounded in law, which has…

    Submitted 29 November, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Presented at NeurIPS Datasets & Benchmarks (2022)

  11. arXiv:2108.07258  [pdf, other]

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap…

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  12. arXiv:2104.08671  [pdf, other]

    cs.CL

    When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

    Authors: Lucia Zheng, Neel Guha, Brandon R. Anderson, Peter Henderson, Daniel E. Ho

    Abstract: While self-supervised learning has made rapid advances in natural language processing, it remains unclear when researchers should engage in resource-intensive domain-specific pretraining (domain pretraining). The law, puzzlingly, has yielded few documented instances of substantial gains to domain pretraining in spite of the fact that legal language is widely seen to be unique. We hypothesize that…

    Submitted 5 July, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: ICAIL 2021. Code & data available at https://github.com/reglab/casehold

  13. arXiv:2102.12938  [pdf, other]

    stat.ME math.ST

    On Posterior consistency of Bayesian Changepoint models

    Authors: Nilabja Guha, Jyotishka Datta

    Abstract: While there have been a lot of recent developments in the context of Bayesian model selection and variable selection for high dimensional linear models, there is not much work in the literature on models with change points, unlike the frequentist counterpart. We consider a hierarchical Bayesian linear model where the active set of covariates that affects the observations through a mean model can va…

    Submitted 25 February, 2021; originally announced February 2021.

  14. Leveraging Administrative Data for Bias Audits: Assessing Disparate Coverage with Mobility Data for COVID-19 Policy

    Authors: Amanda Coston, Neel Guha, Derek Ouyang, Lisa Lu, Alexandra Chouldechova, Daniel E. Ho

    Abstract: Anonymized smartphone-based mobility data has been widely adopted in devising and evaluating COVID-19 response strategies such as the targeting of public health resources. Yet little attention has been paid to measurement validity and demographic bias, due in part to the lack of documentation about which users are represented as well as the challenge of obtaining ground truth data on unique visits…

    Submitted 15 April, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Journal ref: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. pp. 173-184

  15. arXiv:2010.14638  [pdf, ps, other]

    stat.ME math.ST

    Bayesian Variable Selection in Multivariate Nonlinear Regression with Graph Structures

    Authors: Yabo Niu, Nilabja Guha, Debkumar De, Anindya Bhadra, Veerabhadran Baladandayuthapani, Bani K. Mallick

    Abstract: Gaussian graphical models (GGMs) are well-established tools for probabilistic exploration of dependence structures using precision matrices. We develop a Bayesian method to incorporate covariate information in this GGMs setup in a nonlinear seemingly unrelated regression framework. We propose a joint predictor and graph selection model and develop an efficient collapsed Gibbs sampler algorithm to…

    Submitted 30 July, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

  16. arXiv:2010.10363  [pdf, other]

    cs.CL cs.AI cs.LG

    Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

    Authors: Laurel Orr, Megan Leszczynski, Simran Arora, Sen Wu, Neel Guha, Xiao Ling, Christopher Re

    Abstract: A challenge for named entity disambiguation (NED), the task of mapping textual mentions to entities in a knowledge base, is how to disambiguate entities that appear rarely in the training data, termed tail entities. Humans use subtle reasoning patterns based on knowledge of entity facts, relations, and types to disambiguate unfamiliar entities. Inspired by these patterns, we introduce Bootleg, a s…

    Submitted 23 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  17. arXiv:2006.14734  [pdf, other]

    math.ST

    Stochastic Approximation Algorithm for Estimating Mixing Distribution for Dependent Observations

    Authors: Nilabja Guha, Anindya Roy

    Abstract: Estimating the mixing density of a mixture distribution remains an interesting problem in the statistics literature. Using a stochastic approximation method, Newton and Zhang (1999) introduced a fast recursive algorithm for estimating the mixing density of a mixture. Under suitably chosen weights the stochastic approximation estimator converges to the true solution. In Tokdar et al. (2009) the consis…

    Submitted 26 March, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

  18. arXiv:1910.08842  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Machine Learning for AC Optimal Power Flow

    Authors: Neel Guha, Zhecheng Wang, Matt Wytock, Arun Majumdar

    Abstract: We explore machine learning methods for AC Optimal Power Flow (ACOPF), the task of optimizing power generation in a transmission network while respecting physical and engineering constraints. We present two formulations of ACOPF as a machine learning problem: 1) an end-to-end prediction task where we directly predict the optimal generator settings, and 2) a constraint prediction task whe…

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: 3 pages, 2 tables. Presented at the Climate Change Workshop at ICML 2019

  19. arXiv:1902.11175  [pdf, other]

    cs.LG stat.ML

    One-Shot Federated Learning

    Authors: Neel Guha, Ameet Talwalkar, Virginia Smith

    Abstract: We present one-shot federated learning, where a central server learns a global model over a network of federated devices in a single round of communication. Our approach, drawing on ensemble learning and knowledge aggregation, achieves an average relative gain of 51.5% in AUC over local baselines and comes within 90.1% of the (unattainable) global ideal. We discuss these methods and identify sev…

    Submitted 5 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: 5 pages, 3 figures, 1 table. 2nd Workshop on Machine Learning on the Phone and other Consumer Devices, NeurIPS 2018

  20. arXiv:1806.05832  [pdf, other]

    math.NA

    Dynamic Data-driven Bayesian GMsFEM

    Authors: Siu Wun Cheung, Nilabja Guha

    Abstract: In this paper, we propose a Bayesian approach for multiscale problems with the availability of dynamic observational data. Our method selects important degrees of freedom probabilistically in a Generalized Multiscale Finite Element Method framework. Due to scale disparity in many multiscale applications, computational models cannot resolve all scales. Dominant modes in the Generalized Multiscale…

    Submitted 15 June, 2018; originally announced June 2018.

  21. arXiv:1805.07782  [pdf, other]

    cs.LG cs.AI stat.ML

    Model Aggregation via Good-Enough Model Spaces

    Authors: Neel Guha, Virginia Smith

    Abstract: In many applications, the training data for a machine learning task is partitioned across multiple nodes, and aggregating this data may be infeasible due to communication, privacy, or storage constraints. Existing distributed optimization methods for learning global models in these settings typically aggregate local updates from each node in an iterative fashion. However, these approaches require…

    Submitted 4 June, 2019; v1 submitted 20 May, 2018; originally announced May 2018.

    Comments: 21 pages, 6 figures, 8 tables

  22. arXiv:1704.03138  [pdf, other]

    cs.CR

    Semantic Identification of Web Browsing Sessions

    Authors: Neel Guha

    Abstract: We introduce a semantic identification attack, in which an adversary uses semantic signals about the pages visited in one browsing session to identify other browsing sessions launched by the same user. Current user fingerprinting methods fail when a single machine is used by multiple users (e.g., in cybercafes or spaces with public computers) as these methods fingerprint devices, not individuals.…

    Submitted 11 April, 2017; originally announced April 2017.

    Comments: 10 pages and Appendix. arXiv admin note: substantial text overlap with arXiv:1610.09417

  23. arXiv:1703.06978  [pdf, other]

    stat.ME

    A Conditional Density Estimation Partition Model Using Logistic Gaussian Processes

    Authors: Richard D. Payne, Nilabja Guha, Yu Ding, Bani K. Mallick

    Abstract: Conditional density estimation (density regression) estimates the distribution of a response variable y conditional on covariates x. Utilizing a partition model framework, a conditional density estimation method is proposed using logistic Gaussian processes. The partition is created using a Voronoi tessellation and is learned from the data using a reversible jump Markov chain Monte Carlo algorithm…

    Submitted 20 March, 2017; originally announced March 2017.

  24. arXiv:1702.02973  [pdf, other]

    math.NA

    Bayesian Multiscale Finite Element Methods. Modeling missing subgrid information probabilistically

    Authors: Y. Efendiev, W. T. Leung, S. W. Cheung, N. Guha, V. H. Hoang, B. Mallick

    Abstract: In this paper, we develop a Bayesian multiscale approach based on a multiscale finite element method. Because of scale disparity in many multiscale applications, computational models cannot resolve all scales. Various subgrid models are proposed to represent unresolved scales. Here, we consider a probabilistic approach for modeling unresolved scales using the Multiscale Finite Element Method (c…

    Submitted 9 February, 2017; originally announced February 2017.

  25. arXiv:1611.02480  [pdf, other]

    stat.ME

    Quantile Graphical Models: Bayesian Approaches

    Authors: Nilabja Guha, Veera Baladandayuthapani, Bani K. Mallick

    Abstract: Graphical models are ubiquitous tools to describe the interdependence between variables measured simultaneously such as large-scale gene or protein expression data. Gaussian graphical models (GGMs) are well-established tools for probabilistic exploration of dependence structures using precision matrices and they are generated under a multivariate normal joint distribution. However, they suffer fro…

    Submitted 8 January, 2020; v1 submitted 8 November, 2016; originally announced November 2016.

  26. Bayesian and Variational Bayesian approaches for flows in heterogeneous random media

    Authors: Keren Yang, Nilabja Guha, Yalchin Efendiev, Bani K. Mallick

    Abstract: In this paper, we study porous media flows in heterogeneous stochastic media. We propose an efficient forward simulation technique that is tailored for variational Bayesian inversion. As a starting point, the proposed forward simulation technique decomposes the solution into the sum of separable functions (with respect to randomness and the space), where each term is calculated based on a variatio…

    Submitted 8 February, 2018; v1 submitted 3 November, 2016; originally announced November 2016.

  27. arXiv:1610.09417  [pdf, other]

    cs.CR

    Semantic Identification Attacks on Web Browsing

    Authors: Neel Guha

    Abstract: We introduce a Semantic Identification Attack, in which an adversary uses semantic signals about the pages visited in one browsing session to identify other browsing sessions launched by the same user. This attack allows an adversary to determine if two browsing sessions originate from the same user regardless of any measures taken by the user to disguise their browser or network. We use the MSN…

    Submitted 28 October, 2016; originally announced October 2016.