Skip to main content

Showing 1–6 of 6 results for author: McDermott, J H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.06979  [pdf, other

    q-bio.NC cs.LG cs.NE

    Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

    Authors: Joel Dapello, Jenelle Feather, Hang Le, Tiago Marques, David D. Cox, Josh H. McDermott, James J. DiCarlo, SueYeon Chung

    Abstract: Adversarial examples are often cited by neuroscientists and machine learning researchers as an example of how computational models diverge from biological sensory systems. Recent work has proposed adding biologically-inspired components to visual neural networks as a way to improve their adversarial robustness. One surprisingly effective component for reducing adversarial vulnerability is response… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  2. arXiv:2011.10706  [pdf, other

    eess.AS cs.SD

    Speech Denoising with Auditory Models

    Authors: Mark R. Saddler, Andrew Francl, Jenelle Feather, Kaizhi Qian, Yang Zhang, Josh H. McDermott

    Abstract: Contemporary speech enhancement predominantly relies on audio transforms that are trained to reconstruct a clean speech waveform. The development of high-performing neural network sound recognition systems has raised the possibility of using deep feature representations as 'perceptual' losses with which to train denoising systems. We explored their utility by first training deep neural networks to… ▽ More

    Submitted 12 August, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: First two authors contributed equally, 5 pages, 3 PDF figures

  3. arXiv:2007.04954  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

    Authors: Chuang Gan, Jeremy Schwartz, Seth Alter, Damian Mrowca, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Michael Lingelbach, Aidan Curtis, Kevin Feigelis, Daniel M. Bear, Dan Gutfreund, David Cox, Antonio Torralba, James J. DiCarlo, Joshua B. Tenenbaum, Josh H. McDermott, Daniel L. K. Yamins

    Abstract: We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Unique properties include: real-time near-photo-realistic image rendering; a library of objects and environments, and routines for their customization; generative procedu… ▽ More

    Submitted 28 December, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Oral Presentation at NeurIPS 21 Datasets and Benchmarks Track. Project page: http://www.threedworld.org

  4. arXiv:1712.07271  [pdf, other

    cs.CV

    Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning

    Authors: Andrew Owens, Jiajun Wu, Josh H. McDermott, William T. Freeman, Antonio Torralba

    Abstract: The sound of crashing waves, the roar of fast-moving cars -- sound conveys important information about the objects in our surroundings. In this work, we show that ambient sounds can be used as a supervisory signal for learning visual models. To demonstrate this, we train a convolutional neural network to predict a statistical summary of the sound associated with a video frame. We show that, throug… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: Journal preprint of arXiv:1608.07017 (unpublished submission to IJCV)

  5. arXiv:1701.07138  [pdf, other

    q-bio.NC cs.SD

    Learning Mid-Level Auditory Codes from Natural Sound Statistics

    Authors: Wiktor MÅ‚ynarski, Josh H. McDermott

    Abstract: Interaction with the world requires an organism to transform sensory signals into representations in which behaviorally meaningful properties of the environment are made explicit. These representations are derived through cascades of neuronal processing stages in which neurons at each stage recode the output of preceding stages. Explanations of sensory coding may thus involve understanding how low… ▽ More

    Submitted 14 October, 2017; v1 submitted 24 January, 2017; originally announced January 2017.

    Comments: 38 pages, 12 figures

  6. arXiv:1608.07017  [pdf, other

    cs.CV

    Ambient Sound Provides Supervision for Visual Learning

    Authors: Andrew Owens, Jiajun Wu, Josh H. McDermott, William T. Freeman, Antonio Torralba

    Abstract: The sound of crashing waves, the roar of fast-moving cars -- sound conveys important information about the objects in our surroundings. In this work, we show that ambient sounds can be used as a supervisory signal for learning visual models. To demonstrate this, we train a convolutional neural network to predict a statistical summary of the sound associated with a video frame. We show that, throug… ▽ More

    Submitted 5 December, 2016; v1 submitted 25 August, 2016; originally announced August 2016.

    Comments: ECCV 2016