Skip to main content

Showing 1–14 of 14 results for author: Sculley, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2207.07411  [pdf, other

    cs.LG stat.ML

    Plex: Towards Reliability using Pretrained Large Model Extensions

    Authors: Dustin Tran, Jeremiah Liu, Michael W. Dusenberry, Du Phan, Mark Collier, Jie Ren, Kehang Han, Zi Wang, Zelda Mariet, Huiyi Hu, Neil Band, Tim G. J. Rudner, Karan Singhal, Zachary Nado, Joost van Amersfoort, Andreas Kirsch, Rodolphe Jenatton, Nithum Thain, Honglin Yuan, Kelly Buchanan, Kevin Murphy, D. Sculley, Yarin Gal, Zoubin Ghahramani, Jasper Snoek , et al. (1 additional authors not shown)

    Abstract: A recent trend in artificial intelligence is the use of pretrained models for language and vision tasks, which have achieved extraordinary performance but also puzzling failures. Probing these models' abilities in diverse ways is therefore critical to the field. In this paper, we explore the reliability of models, where we define a reliable model as one that not only achieves strong predictive per… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Code available at https://goo.gle/plex-code

  2. arXiv:2011.13012  [pdf

    q-bio.GN stat.AP

    Large-scale machine learning-based phenoty** significantly improves genomic discovery for optic nerve head morphology

    Authors: Babak Alipanahi, Farhad Hormozdiari, Babak Behsaz, Justin Cosentino, Zachary R. McCaw, Emanuel Schorsch, D. Sculley, Elizabeth H. Dorfman, Sonia Phene, Naama Hammel, Andrew Carroll, Anthony P. Khawaja, Cory Y. McLean

    Abstract: Genome-wide association studies (GWAS) require accurate cohort phenoty**, but expert labeling can be costly, time-intensive, and variable. Here we develop a machine learning (ML) model to predict glaucomatous optic nerve head features from color fundus photographs. We used the model to predict vertical cup-to-disc ratio (VCDR), a diagnostic parameter and cardinal endophenotype for glaucoma, in 6… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Includes Supplementary Information and Tables

  3. arXiv:2011.03395  [pdf, other

    cs.LG stat.ML

    Underspecification Presents Challenges for Credibility in Modern Machine Learning

    Authors: Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne , et al. (15 additional authors not shown)

    Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predict… ▽ More

    Submitted 24 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updates: Updated statistical analysis in Section 6; Additional citations

  4. arXiv:2006.10963  [pdf, other

    cs.LG stat.ML

    Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift

    Authors: Zachary Nado, Shreyas Padhy, D. Sculley, Alexander D'Amour, Balaji Lakshminarayanan, Jasper Snoek

    Abstract: Covariate shift has been shown to sharply degrade both predictive accuracy and the calibration of uncertainty estimates for deep learning models. This is worrying, because covariate shift is prevalent in a wide range of real world deployment settings. However, in this paper, we note that frequently there exists the potential to access small unlabeled batches of the shifted data just before predict… ▽ More

    Submitted 14 January, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

  5. Population-Based Black-Box Optimization for Biological Sequence Design

    Authors: Christof Angermueller, David Belanger, Andreea Gane, Zelda Mariet, David Dohan, Kevin Murphy, Lucy Colwell, D Sculley

    Abstract: The use of black-box optimization for the design of new biological sequences is an emerging research area with potentially revolutionary impact. The cost and latency of wet-lab experiments requires methods that find good sequences in few experimental rounds of large batches of sequences--a setting that off-the-shelf black-box optimization methods are ill-equipped to handle. We find that the perfor… ▽ More

    Submitted 10 July, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 119, 2020

  6. arXiv:1911.05489  [pdf, other

    cs.SI cs.LG stat.ML

    Fair treatment allocations in social networks

    Authors: James Atwood, Hansa Srinivasan, Yoni Halpern, D Sculley

    Abstract: Simulations of infectious disease spread have long been used to understand how epidemics evolve and how to effectively treat them. However, comparatively little attention has been paid to understanding the fairness implications of different treatment strategies -- that is, how might such strategies distribute the expected disease burden differentially across various subgroups or communities in the… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: To appear in the Fair ML for Health workshop at NeurIPS 2019

  7. arXiv:1906.02530  [pdf, other

    stat.ML cs.LG

    Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift

    Authors: Yaniv Ovadia, Emily Fertig, Jie Ren, Zachary Nado, D Sculley, Sebastian Nowozin, Joshua V. Dillon, Balaji Lakshminarayanan, Jasper Snoek

    Abstract: Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive {\em uncertainty}. Quantifying uncertainty is especially critical in real-world settings, which often involve input distributions that are shifted from the training distribution due to a var… ▽ More

    Submitted 17 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Advances in Neural Information Processing Systems, 2019

  8. arXiv:1901.06246  [pdf, other

    cs.CY cs.DL cs.LG stat.ML

    Avoiding a Tragedy of the Commons in the Peer Review Process

    Authors: D Sculley, Jasper Snoek, Alex Wiltschko

    Abstract: Peer review is the foundation of scientific publication, and the task of reviewing has long been seen as a cornerstone of professional service. However, the massive growth in the field of machine learning has put this community benefit under stress, threatening both the sustainability of an effective review process and the overall progress of the field. In this position paper, we argue that a trag… ▽ More

    Submitted 18 December, 2018; originally announced January 2019.

    Comments: Appeared in the 2018 Advances in Neural Information Processing Systems Workshop on Critiquing and Correcting Trends in Machine Learning

  9. arXiv:1812.06869  [pdf, other

    cs.LG cs.CV stat.ML

    BriarPatches: Pixel-Space Interventions for Inducing Demographic Parity

    Authors: Alexey A. Gritsenko, Alex D'Amour, James Atwood, Yoni Halpern, D. Sculley

    Abstract: We introduce the BriarPatch, a pixel-space intervention that obscures sensitive attributes from representations encoded in pre-trained classifiers. The patches encourage internal model representations not to encode sensitive information, which has the effect of pushing downstream predictors towards exhibiting demographic parity with respect to the sensitive information. The net result is that thes… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 6 pages, 5 figures, NeurIPS Workshop on Ethical, Social and Governance Issues in AI

  10. arXiv:1811.08545  [pdf, other

    physics.chem-ph stat.ML

    Rapid Prediction of Electron-Ionization Mass Spectrometry using Neural Networks

    Authors: Jennifer N. Wei, David Belanger, Ryan P. Adams, D. Sculley

    Abstract: When confronted with a substance of unknown identity, researchers often perform mass spectrometry on the sample and compare the observed spectrum to a library of previously-collected spectra to identify the molecule. While popular, this approach will fail to identify molecules that are not in the existing library. In response, we propose to improve the library's coverage by augmenting it with synt… ▽ More

    Submitted 17 March, 2019; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: 12 pages, 5 figures

    Journal ref: ACS Cent. Sci. 2019 5 (4) 700-708

  11. arXiv:1810.08061  [pdf, ps, other

    cs.PL cs.LG stat.ML

    AutoGraph: Imperative-style Coding with Graph-based Performance

    Authors: Dan Moldovan, James M Decker, Fei Wang, Andrew A Johnson, Brian K Lee, Zachary Nado, D Sculley, Tiark Rompf, Alexander B Wiltschko

    Abstract: There is a perceived trade-off between machine learning code that is easy to write, and machine learning code that is scalable or fast to execute. In machine learning, imperative style libraries like Autograd and PyTorch are easy to write, but suffer from high interpretive overhead and are not easily deployable in production or mobile settings. Graph-based libraries like TensorFlow and Theano bene… ▽ More

    Submitted 26 March, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

  12. arXiv:1711.08536  [pdf, other

    stat.ML

    No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Develo** World

    Authors: Shreya Shankar, Yoni Halpern, Eric Breck, James Atwood, Jimbo Wilson, D. Sculley

    Abstract: Modern machine learning systems such as image classifiers rely heavily on large scale data sets for training. Such data sets are costly to create, thus in practice a small number of freely available, open source data sets are widely used. We suggest that examining the geo-diversity of open data sets is critical before adopting a data set for use cases in the develo** world. We analyze two large,… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Develo** World

  13. arXiv:1708.03788  [pdf, other

    cs.LG cs.HC stat.ML

    Direct-Manipulation Visualization of Deep Networks

    Authors: Daniel Smilkov, Shan Carter, D. Sculley, Fernanda B. ViƩgas, Martin Wattenberg

    Abstract: The recent successes of deep learning have led to a wave of interest from non-experts. Gaining an understanding of this technology, however, is difficult. While the theory is important, it is also helpful for novices to develop an intuitive feel for the effect of different hyperparameters and structural variations. We describe TensorFlow Playground, an interactive, open sourced visualization that… ▽ More

    Submitted 12 August, 2017; originally announced August 2017.

  14. arXiv:1611.09207  [pdf, other

    cs.CL cs.LG stat.ML

    AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech

    Authors: Brian Patton, Yannis Agiomyrgiannakis, Michael Terry, Kevin Wilson, Rif A. Saurous, D. Sculley

    Abstract: Developers of text-to-speech synthesizers (TTS) often make use of human raters to assess the quality of synthesized speech. We demonstrate that we can model human raters' mean opinion scores (MOS) of synthesized speech using a deep recurrent neural network whose inputs consist solely of a raw waveform. Our best models provide utterance-level estimates of MOS only moderately inferior to sampled hum… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: 4 pages, 2 figures, 2 tables, NIPS 2016 End-to-end Learning for Speech and Audio Processing Workshop