Skip to main content

Showing 1–15 of 15 results for author: Madras, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.14322  [pdf, other

    cs.CV cs.CY

    Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images

    Authors: Hansa Srinivasan, Candice Schumann, Aradhana Sinha, David Madras, Gbolahan Oluwafemi Olanubi, Alex Beutel, Susanna Ricco, Jilin Chen

    Abstract: Capturing the diversity of people in images is challenging: recent literature tends to focus on diversifying one or two attributes, requiring expensive attribute labels or building classifiers. We introduce a diverse people image ranking method which more flexibly aligns with human notions of people diversity in a less prescriptive, label-free manner. The Perception-Aligned Text-derived Human repr… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  3. arXiv:2312.17463  [pdf, other

    cs.LG stat.ML

    Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift

    Authors: Benjamin Eyre, Elliot Creager, David Madras, Vardan Papyan, Richard Zemel

    Abstract: Designing deep neural network classifiers that perform robustly on distributions differing from the available training data is an active area of machine learning research. However, out-of-distribution generalization for regression-the analogous problem for modeling continuous targets-remains relatively unexplored. To tackle this problem, we return to first principles and analyze how the closed-for… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  4. arXiv:2312.12736  [pdf, other

    cs.CL cs.LG

    Learning and Forgetting Unsafe Examples in Large Language Models

    Authors: Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren

    Abstract: As the number of large language models (LLMs) released to the public grows, there is a pressing need to understand the safety implications associated with these models learning from third-party custom finetuning data. We explore the behavior of LLMs finetuned on noisy custom data containing unsafe content, represented by datasets that contain biases, toxicity, and harmfulness, finding that while a… ▽ More

    Submitted 3 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: accepted by ICML 24

  5. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  6. arXiv:2110.13223  [pdf, other

    cs.LG cs.CV

    Identifying and Benchmarking Natural Out-of-Context Prediction Problems

    Authors: David Madras, Richard Zemel

    Abstract: Deep learning systems frequently fail at out-of-context (OOC) prediction, the problem of making reliable predictions on uncommon or unusual inputs or subgroups of the training distribution. To this end, a number of benchmarks for measuring OOC performance have recently been introduced. In this work, we introduce a framework unifying the literature on OOC performance measurement, and demonstrate ho… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021

  7. arXiv:2011.06485  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification

    Authors: Robert Adragna, Elliot Creager, David Madras, Richard Zemel

    Abstract: Robustness is of central importance in machine learning and has given rise to the fields of domain generalization and invariant learning, which are concerned with improving performance on a test distribution distinct from but related to the training distribution. In light of recent work suggesting an intimate connection between fairness and robustness, we investigate whether algorithms from robust… ▽ More

    Submitted 1 December, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 12 pages, 5 figures. Appears in the NeurIPS 2020 Workshop on Algorithmic Fairness through the Lens of Causality and Interpretability

  8. arXiv:2006.10833  [pdf, other

    cs.LG stat.ML

    Amortized Causal Discovery: Learning to Infer Causal Graphs from Time-Series Data

    Authors: Sindy Löwe, David Madras, Richard Zemel, Max Welling

    Abstract: On time-series data, most causal discovery methods fit a new model whenever they encounter samples from a new underlying causal graph. However, these samples often share relevant information which is lost when following this approach. Specifically, different samples may share the dynamics which describe the effects of their causal relations. We propose Amortized Causal Discovery, a novel framework… ▽ More

    Submitted 21 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted as a conference paper at CLeaR 2022

  9. arXiv:1910.09573  [pdf, other

    cs.LG stat.ML

    Detecting Underspecification with Local Ensembles

    Authors: David Madras, James Atwood, Alex D'Amour

    Abstract: We present local ensembles, a method for detecting underspecification -- when many possible predictors are consistent with the training data and model class -- at test time in a pre-trained model. Our method uses local second-order information to approximate the variance of predictions across an ensemble of models from the same class. We compute this approximation by estimating the norm of the com… ▽ More

    Submitted 7 December, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Published as a conference paper at ICLR 2020 under the title "Detecting Extrapolation with Local Ensembles"

  10. arXiv:1909.09141  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Causal Modeling for Fairness in Dynamical Systems

    Authors: Elliot Creager, David Madras, Toniann Pitassi, Richard Zemel

    Abstract: In many application areas---lending, education, and online recommenders, for example---fairness and equity concerns emerge when a machine learning system interacts with a dynamically changing environment to produce both immediate and long-term effects for individuals and demographic groups. We discuss causal directed acyclic graphs (DAGs) as a unifying framework for the recent literature on fairne… ▽ More

    Submitted 6 July, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

  11. arXiv:1906.02589  [pdf, other

    cs.LG cs.AI stat.ML

    Flexibly Fair Representation Learning by Disentanglement

    Authors: Elliot Creager, David Madras, Jörn-Henrik Jacobsen, Marissa A. Weis, Kevin Swersky, Toniann Pitassi, Richard Zemel

    Abstract: We consider the problem of learning representations that achieve group and subgroup fairness with respect to multiple sensitive attributes. Taking inspiration from the disentangled representation learning literature, we propose an algorithm for learning compact representations of datasets that are useful for reconstruction and prediction, but are also \emph{flexibly fair}, meaning they can be easi… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of the International Conference on Machine Learning (ICML), 2019

  12. arXiv:1809.02519  [pdf, ps, other

    cs.LG stat.ML

    Fairness Through Causal Awareness: Learning Latent-Variable Models for Biased Data

    Authors: David Madras, Elliot Creager, Toniann Pitassi, Richard Zemel

    Abstract: How do we learn from biased data? Historical datasets often reflect historical prejudices; sensitive or protected attributes may affect the observed treatments and outcomes. Classification algorithms tasked with predicting outcomes accurately from these datasets tend to replicate these biases. We advocate a causal modeling approach to learning from biased data, exploring the relationship between f… ▽ More

    Submitted 2 December, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

    Comments: Accepted as a conference paper at ACM Conference on Fairness, Accountability, and Transparency (ACM FAT*) 2019

  13. arXiv:1802.06309  [pdf, other

    cs.LG stat.ML

    Learning Adversarially Fair and Transferable Representations

    Authors: David Madras, Elliot Creager, Toniann Pitassi, Richard Zemel

    Abstract: In this paper, we advocate for representation learning as the key to mitigating unfair prediction outcomes downstream. Motivated by a scenario where learned representations are used by third parties with unknown objectives, we propose and explore adversarial representation learning as a natural method of ensuring those parties act fairly. We connect group fairness (demographic parity, equalized od… ▽ More

    Submitted 22 October, 2018; v1 submitted 17 February, 2018; originally announced February 2018.

  14. arXiv:1711.06664  [pdf, other

    stat.ML cs.LG

    Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer

    Authors: David Madras, Toniann Pitassi, Richard Zemel

    Abstract: In many machine learning applications, there are multiple decision-makers involved, both automated and human. The interaction between these agents often goes unaddressed in algorithmic development. In this work, we explore a simple version of this interaction with a two-stage framework containing an automated model and an external decision-maker. The model can choose to say "Pass", and pass the de… ▽ More

    Submitted 6 September, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: Accepted as a conference paper at Neural Information Processing Systems 2018

  15. arXiv:1610.06453  [pdf, other

    cs.CV cs.LG stat.ML

    Change-point Detection Methods for Body-Worn Video

    Authors: Stephanie Allen, David Madras, Ye Ye, Greg Zanotti

    Abstract: Body-worn video (BWV) cameras are increasingly utilized by police departments to provide a record of police-public interactions. However, large-scale BWV deployment produces terabytes of data per week, necessitating the development of effective computational methods to identify salient changes in video. In work carried out at the 2016 RIPS program at IPAM, UCLA, we present a novel two-stage framew… ▽ More

    Submitted 20 October, 2016; originally announced October 2016.