Search | arXiv e-print repository

Zero-shot counting with a dual-stream neural network model

Authors: Jessica A. F. Thompson, Hannah Sheahan, Tsvetomira Dumbalska, Julian Sandbrink, Manuela Piazza, Christopher Summerfield

Abstract: Deep neural networks have provided a computational framework for understanding object recognition, grounded in the neurophysiology of the primate ventral stream, but fail to account for how we process relational aspects of a scene. For example, deep neural networks fail at problems that involve enumerating the number of elements in an array, a problem that in humans relies on parietal cortex. Here… ▽ More Deep neural networks have provided a computational framework for understanding object recognition, grounded in the neurophysiology of the primate ventral stream, but fail to account for how we process relational aspects of a scene. For example, deep neural networks fail at problems that involve enumerating the number of elements in an array, a problem that in humans relies on parietal cortex. Here, we build a 'dual-stream' neural network model which, equipped with both dorsal and ventral streams, can generalise its counting ability to wholly novel items ('zero-shot' counting). In doing so, it forms spatial response fields and lognormal number codes that resemble those observed in macaque posterior parietal cortex. We use the dual-stream network to make successful predictions about behavioural studies of the human gaze during similar counting tasks. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2402.15601 [pdf, other]

Error Bounds for Compositions of Piecewise Affine Approximations

Authors: Jonah J. Glunt, Jacob A. Siefert, Andrew F. Thompson, Herschel C. Pangborn

Abstract: Nonlinear expressions are often approximated by piecewise affine (PWA) functions to simplify analysis or reduce computational costs. To reduce computational complexity, multivariate functions can be represented as compositions of functions with one or two inputs, which can be approximated individually. This paper provides efficient methods to generate PWA approximations of nonlinear functions via… ▽ More Nonlinear expressions are often approximated by piecewise affine (PWA) functions to simplify analysis or reduce computational costs. To reduce computational complexity, multivariate functions can be represented as compositions of functions with one or two inputs, which can be approximated individually. This paper provides efficient methods to generate PWA approximations of nonlinear functions via functional decomposition. The key contributions focus on intelligent placement of breakpoints for PWA approximations without requiring optimization, and on bounding the error of PWA compositions as a function of the error tolerance for each component of that composition. The proposed methods are used to systematically construct a PWA approximation for a complicated function, either to within a desired error tolerance or to a given level of complexity. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2304.07924 [pdf, ps, other]

Set-valued State Estimation for Nonlinear Systems Using Hybrid Zonotopes

Authors: Jacob A. Siefert, Andrew F. Thompson, Jonah J. Glunt, Herschel C. Pangborn

Abstract: This paper proposes a method for set-valued state estimation of nonlinear, discrete-time systems. This is achieved by combining graphs of functions representing system dynamics and measurements with the hybrid zonotope set representation that can efficiently represent nonconvex and disjoint sets. Tight over-approximations of complex nonlinear functions are efficiently produced by leveraging specia… ▽ More This paper proposes a method for set-valued state estimation of nonlinear, discrete-time systems. This is achieved by combining graphs of functions representing system dynamics and measurements with the hybrid zonotope set representation that can efficiently represent nonconvex and disjoint sets. Tight over-approximations of complex nonlinear functions are efficiently produced by leveraging special ordered sets and neural networks, which enable computation of set-valued state estimates that grow linearly in memory complexity with time. A numerical example demonstrates significant reduction of conservatism in the set-valued state estimates using the proposed method as compared to an idealized convex approach. △ Less

Submitted 16 September, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

arXiv:2304.06827 [pdf, other]

Reachability Analysis Using Hybrid Zonotopes and Functional Decomposition

Authors: Jacob A. Siefert, Trevor J. Bird, Andrew F. Thompson, Jonah J. Glunt, Justin P. Koeln, Neera Jain, Herschel C. Pangborn

Abstract: This paper proposes methods for reachability analysis of nonlinear systems in both open loop and closed loop with advanced controllers. The methods combine hybrid zonotopes, a construct called a state-update set, functional decomposition, and special ordered set approximations to enable linear growth in reachable set memory complexity with time and linear scaling in computational complexity with t… ▽ More This paper proposes methods for reachability analysis of nonlinear systems in both open loop and closed loop with advanced controllers. The methods combine hybrid zonotopes, a construct called a state-update set, functional decomposition, and special ordered set approximations to enable linear growth in reachable set memory complexity with time and linear scaling in computational complexity with the system dimension. Facilitating this combination are new identities for constructing nonconvex sets that contain nonlinear functions and for efficiently converting a collection of polytopes from vertex representation to hybrid zonotope representation. Benchmark numerical examples from the literature demonstrate the proposed methods and provide comparison to state-of-the-art techniques. △ Less

Submitted 22 February, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

arXiv:2101.11771 [pdf, ps, other]

doi 10.1103/PhysRevFluids.6.084804

Resolvent analysis of stratification effects on wall-bounded shear flows

Authors: M. A. Ahmed, H. J. Bae, A. F. Thompson, B. J. McKeon

Abstract: The interaction between shear driven turbulence and stratification is a key process in a wide array of geophysical flows with spatio-temporal scales that span many orders of magnitude. A quick numerical model prediction based on external parameters of stratified boundary layers could greatly benefit the understanding of the interaction between velocity and scalar flux at varying scales. For these… ▽ More The interaction between shear driven turbulence and stratification is a key process in a wide array of geophysical flows with spatio-temporal scales that span many orders of magnitude. A quick numerical model prediction based on external parameters of stratified boundary layers could greatly benefit the understanding of the interaction between velocity and scalar flux at varying scales. For these reasons, here, we use the resolvent framework to investigate the effects of an active scalar on incompressible wall-bounded turbulence. We obtain the state of the flow system by applying the linear resolvent operator to the nonlinear terms in the governing Navier-Stokes equations with the Boussinesq approximation. This extends the formulation to include the scalar advection equation with the scalar component acting in the wall-normal direction in the momentum equations. We use the mean velocity profiles from a DNS of a stably-stratified turbulent channel flow at varying friction Richardson number. The results obtained from the resolvent analysis are compared to the premultiplied energy spectra, auto-correlation coefficient, and the energy budget terms obtained from the DNS. It is shown that despite using only a very limited range of representative scales, the resolvent model is able to reproduce the balance of energy budget terms as well as provide meaningful insight of coherent structures occurring in the flow. Computation of the leading resolvent models, despite considering a limited range of scales, reproduces the balance of energy budget terms, provides meaningful predictions of coherent structures in the flow, and is more cost-effective than performing full-scale simulations. This quick model can provide further understanding of stratified flows with only information about the mean profile and prior knowledge of energetic scales of motion in the neutrally-buoyant boundary layers. △ Less

Submitted 27 January, 2021; originally announced January 2021.

Journal ref: Phys. Rev. Fluids 6, 084804 (2021)

arXiv:2007.06173 [pdf, other]

doi 10.1038/s41561-021-00706-3

A pole-to-equator ocean overturning circulation on Enceladus

Authors: Ana H. Lobo, Andrew F. Thompson, Steven D. Vance, Saikiran Tharimena

Abstract: Enceladus is believed to have a saltwater global ocean with a mean depth of at least 30~km, heated from below at the ocean-core interface and cooled at the top, where the ocean loses heat to the icy lithosphere above. This scenario suggests an important role for vertical convection to influence the interior properties and circulation of Enceladus' ocean. Additionally, the ice shell that encompasse… ▽ More Enceladus is believed to have a saltwater global ocean with a mean depth of at least 30~km, heated from below at the ocean-core interface and cooled at the top, where the ocean loses heat to the icy lithosphere above. This scenario suggests an important role for vertical convection to influence the interior properties and circulation of Enceladus' ocean. Additionally, the ice shell that encompasses the ocean has dramatic meridional thickness variations that, in steady state, must be sustained against processes acting to remove these ice thickness anomalies. One mechanism that would maintain variations in the ice shell thickness involves spatially-separated regions of freezing and melting at the ocean-ice interface. Here, we use an idealized, dynamical ocean model forced by an observationally-guided density forcing at the ocean-ice interface to argue that Enceladus' interior ocean should support a meridional overturning circulation. This circulation establishes an interior density structure that is more complex than in studies that have focused only on convection, including a shallow freshwater lens in the polar regions. Spatially-separated sites of ice formation and melt enable Enceladus to sustain a significant vertical and horizontal stratification, which impacts interior heat transport, and is critical for understanding the relationship between a global ocean and the planetary energy budget. The presence of low salinity layers near the polar ocean-ice interface also influences whether samples measured from the plumes are representative of the global ocean. △ Less

Submitted 12 July, 2020; originally announced July 2020.

Journal ref: This is a preprint of an article published in Nature Geoscience (2021)

arXiv:1912.02260 [pdf, other]

doi 10.32470/CCN.2019.1300-0

The effect of task and training on intermediate representations in convolutional neural networks revealed with modified RV similarity analysis

Authors: Jessica A. F. Thompson, Yoshua Bengio, Marc Schoenwiesner

Abstract: Centered Kernel Alignment (CKA) was recently proposed as a similarity metric for comparing activation patterns in deep networks. Here we experiment with the modified RV-coefficient (RV2), which has very similar properties as CKA while being less sensitive to dataset size. We compare the representations of networks that received varying amounts of training on different layers: a standard trained ne… ▽ More Centered Kernel Alignment (CKA) was recently proposed as a similarity metric for comparing activation patterns in deep networks. Here we experiment with the modified RV-coefficient (RV2), which has very similar properties as CKA while being less sensitive to dataset size. We compare the representations of networks that received varying amounts of training on different layers: a standard trained network (all parameters updated at every step), a freeze trained network (layers gradually frozen during training), random networks (only some layers trained), and a completely untrained network. We found that RV2 was able to recover expected similarity patterns and provide interpretable similarity matrices that suggested hypotheses about how representations are affected by different training recipes. We propose that the superior performance achieved by freeze training can be attributed to representational differences in the penultimate layer. Our comparisons of random networks suggest that the inputs and targets serve as anchors on the representations in the lowest and highest layers. △ Less

Submitted 4 December, 2019; originally announced December 2019.

Comments: 4 pages, 4 figures, Conference on Cognitive Computational Neuroscience 2019

arXiv:1810.08651 [pdf, ps, other]

How can deep learning advance computational modeling of sensory information processing?

Authors: Jessica A. F. Thompson, Yoshua Bengio, Elia Formisano, Marc Schönwiesner

Abstract: Deep learning, computational neuroscience, and cognitive science have overlap** goals related to understanding intelligence such that perception and behaviour can be simulated in computational systems. In neuroimaging, machine learning methods have been used to test computational models of sensory information processing. Recently, these model comparison techniques have been used to evaluate deep… ▽ More Deep learning, computational neuroscience, and cognitive science have overlap** goals related to understanding intelligence such that perception and behaviour can be simulated in computational systems. In neuroimaging, machine learning methods have been used to test computational models of sensory information processing. Recently, these model comparison techniques have been used to evaluate deep neural networks (DNNs) as models of sensory information processing. However, the interpretation of such model evaluations is muddied by imprecise statistical conclusions. Here, we make explicit the types of conclusions that can be drawn from these existing model comparison techniques and how these conclusions change when the model in question is a DNN. We discuss how DNNs are amenable to new model comparison techniques that allow for stronger conclusions to be made about the computational mechanisms underlying sensory information processing. △ Less

Submitted 25 September, 2018; originally announced October 2018.

Comments: Presented at MLINI-2016 workshop, 2016 (arXiv:1701.01437)

Report number: MLINI/2016/04

Showing 1–8 of 8 results for author: Thompson, A F