Showing 1–8 of 8 results for author: Biscione, V

  1. arXiv:2404.05290  [pdf, other]

    cs.CV cs.AI

    MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

    Authors: Valerio Biscione, Dong Yin, Gaurav Malhotra, Marin Dujmovic, Milton L. Montero, Guillermo Puebla, Federico Adolfi, Rachel F. Heaton, John E. Hummel, Benjamin D. Evans, Karim Habashy, Jeffrey S. Bowers

    Abstract: Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbo…

    Submitted 8 April, 2024; originally announced April 2024.

  2. arXiv:2302.03992  [pdf, other]

    cs.CV

    Convolutional Neural Networks Trained to Identify Words Provide a Surprisingly Good Account of Visual Form Priming Effects

    Authors: Dong Yin, Valerio Biscione, Jeffrey Bowers

    Abstract: A wide variety of orthographic coding schemes and models of visual word identification have been developed to account for masked priming data that provide a measure of orthographic similarity between letter strings. These models tend to include hand-coded orthographic representations with single unit coding for specific forms of knowledge (e.g., units coding for a letter in a given position). Here…

    Submitted 14 March, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  3. arXiv:2203.07302  [pdf, other]

    cs.AI

    Mixed Evidence for Gestalt Grouping in Deep Neural Networks

    Authors: Valerio Biscione, Jeffrey S. Bowers

    Abstract: Gestalt psychologists have identified a range of conditions in which humans organize elements of a scene into a group or whole, and perceptual grouping principles play an essential role in scene perception and object identification. Recently, Deep Neural Networks (DNNs) trained on natural images (ImageNet) have been proposed as compelling models of human vision based on reports that they perform w…

    Submitted 20 February, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted in Computational Brain & Behaviour

  4. arXiv:2110.05861  [pdf, other]

    cs.CV cs.AI

    Convolutional Neural Networks Are Not Invariant to Translation, but They Can Learn to Be

    Authors: Valerio Biscione, Jeffrey S. Bowers

    Abstract: When seeing a new object, humans can immediately recognize it across different retinal locations: the internal object representation is invariant to translation. It is commonly believed that Convolutional Neural Networks (CNNs) are architecturally invariant to translation thanks to the convolution and/or pooling operations they are endowed with. In fact, several studies have found that these netwo…

    Submitted 12 October, 2021; originally announced October 2021.

    Journal ref: Journal of Machine Learning Research 2021 22(229) 1-28

  5. Learning Online Visual Invariances for Novel Objects via Supervised and Self-Supervised Training

    Authors: Valerio Biscione, Jeffrey S. Bowers

    Abstract: Humans can identify objects following various spatial transformations such as scale and viewpoint. This extends to novel objects, after a single presentation at a single pose, sometimes referred to as online invariance. CNNs have been proposed as a compelling model of human vision, but their ability to identify objects across transformations is typically tested on held-out samples of trained categ…

    Submitted 14 January, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Journal ref: Neural Networks, Volume 150, 2022, Pages 222-236, ISSN 0893-6080

  6. arXiv:2012.05950  [pdf, other]

    q-bio.NC

    A case for robust translation tolerance in humans and CNNs. A commentary on Han et al

    Authors: Ryan Blything, Valerio Biscione, Jeffrey Bowers

    Abstract: Han et al. (2020) reported a behavioral experiment that assessed the extent to which the human visual system can identify novel images at unseen retinal locations (what the authors call "intrinsic translation invariance") and developed a novel convolutional neural network model (an Eccentricity Dependent Network or ENN) to capture key aspects of the behavioral results. Here we show that their anal…

    Submitted 14 December, 2020; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 8 pages, 3 figures

  7. arXiv:2011.11757  [pdf, other]

    cs.CV

    Learning Translation Invariance in CNNs

    Authors: Valerio Biscione, Jeffrey Bowers

    Abstract: When seeing a new object, humans can immediately recognize it across different retinal locations: we say that the internal object representation is invariant to translation. It is commonly believed that Convolutional Neural Networks (CNNs) are architecturally invariant to translation thanks to the convolution and/or pooling operations they are endowed with. In fact, several works have found that t…

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Workshop SVRHM

  8. arXiv:2009.12855  [pdf]

    q-bio.NC

    The human visual system and CNNs can both support robust online translation tolerance following extreme displacements

    Authors: Ryan Blything, Valerio Biscione, Ivan I. Vankov, Casimir J. H. Ludwig, Jeffrey S. Bowers

    Abstract: Visual translation tolerance refers to our capacity to recognize objects over a wide range of different retinal locations. Although translation is perhaps the simplest spatial transform that the visual system needs to cope with, the extent to which the human visual system can identify objects at previously unseen locations is unclear, with some studies reporting near complete invariance over 10° a…

    Submitted 8 December, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

    Comments: Main manuscript contains 5 figures plus 2 tables. SI contains 2 tables