-
Super-resolution diamond magnetic microscopy of superparamagnetic nanoparticles
Authors:
Nazanin Mosavian,
Forrest Hubert,
Janis Smits,
Pauli Kehayias,
Yaser Silani,
Bryan A. Richards,
Victor M. Acosta
Abstract:
Scanning-probe and wide-field magnetic microscopes based on Nitrogen-Vacancy (NV) centers in diamond have enabled remarkable advances in the study of biology and materials, but each method has drawbacks. Here, we implement an alternative method for nanoscale magnetic microscopy based on optical control of the charge state of NV centers in a dense layer near the diamond surface. By combining a donu…
▽ More
Scanning-probe and wide-field magnetic microscopes based on Nitrogen-Vacancy (NV) centers in diamond have enabled remarkable advances in the study of biology and materials, but each method has drawbacks. Here, we implement an alternative method for nanoscale magnetic microscopy based on optical control of the charge state of NV centers in a dense layer near the diamond surface. By combining a donut-beam super-resolution technique with optically detected magnetic resonance spectroscopy, we imaged the magnetic fields produced by single 30-nm iron-oxide nanoparticles. The magnetic microscope has a lateral spatial resolution of ~100 nm, and it resolves the individual magnetic dipole features from clusters of nanoparticles with interparticle spacings down to ~190 nm. The magnetic feature amplitudes are more than an order of magnitude larger than those obtained by confocal magnetic microscopy due to the smaller characteristic NV-nanoparticle distance within nearby sensing voxels. We analyze the magnetic point-spread function and sensitivity as a function of the microscope's spatial resolution and identify sources of background fluorescence that limit the present performance, including diamond second-order Raman emission and imperfect NV charge-state control. Our method, which uses less than 10 mW laser power and can be parallelized by patterned illumination, introduces a new format for nanoscale magnetic imaging.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules
Authors:
Yuhan Helena Liu,
Arna Ghosh,
Blake A. Richards,
Eric Shea-Brown,
Guillaume Lajoie
Abstract:
To unveil how the brain learns, ongoing work seeks biologically-plausible approximations of gradient descent algorithms for training recurrent neural networks (RNNs). Yet, beyond task accuracy, it is unclear if such learning rules converge to solutions that exhibit different levels of generalization than their nonbiologically-plausible counterparts. Leveraging results from deep learning theory bas…
▽ More
To unveil how the brain learns, ongoing work seeks biologically-plausible approximations of gradient descent algorithms for training recurrent neural networks (RNNs). Yet, beyond task accuracy, it is unclear if such learning rules converge to solutions that exhibit different levels of generalization than their nonbiologically-plausible counterparts. Leveraging results from deep learning theory based on loss landscape curvature, we ask: how do biologically-plausible gradient approximations affect generalization? We first demonstrate that state-of-the-art biologically-plausible learning rules for training RNNs exhibit worse and more variable generalization performance compared to their machine learning counterparts that follow the true gradient more closely. Next, we verify that such generalization performance is correlated significantly with loss landscape curvature, and we show that biologically-plausible learning rules tend to approach high-curvature regions in synaptic weight space. Using tools from dynamical systems, we derive theoretical arguments and present a theorem explaining this phenomenon. This predicts our numerical results, and explains why biologically-plausible rules lead to worse and more variable generalization properties. Finally, we suggest potential remedies that could be used by the brain to mitigate this effect. To our knowledge, our analysis is the first to identify the reason for this generalization gap between artificial and biologically-plausible learning rules, which can help guide future investigations into how the brain learns solutions that generalize.
△ Less
Submitted 13 January, 2023; v1 submitted 1 June, 2022;
originally announced June 2022.
-
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Authors:
Anthony GX-Chen,
Veronica Chelu,
Blake A. Richards,
Joelle Pineau
Abstract:
Estimating value functions is a core component of reinforcement learning algorithms. Temporal difference (TD) learning algorithms use bootstrap**, i.e. they update the value function toward a learning target using value estimates at subsequent time-steps. Alternatively, the value function can be updated toward a learning target constructed by separately predicting successor features (SF)--a poli…
▽ More
Estimating value functions is a core component of reinforcement learning algorithms. Temporal difference (TD) learning algorithms use bootstrap**, i.e. they update the value function toward a learning target using value estimates at subsequent time-steps. Alternatively, the value function can be updated toward a learning target constructed by separately predicting successor features (SF)--a policy-dependent model--and linearly combining them with instantaneous rewards. We focus on bootstrap** targets used when estimating value functions, and propose a new backup target, the $η$-return mixture, which implicitly combines value-predictive knowledge (used by TD methods) with (successor) feature-predictive knowledge--with a parameter $η$ capturing how much to rely on each. We illustrate that incorporating predictive knowledge through an $ηγ$-discounted SF model makes more efficient use of sampled experience, compared to either extreme, i.e. bootstrap** entirely on the value function estimate, or bootstrap** on the product of separately estimated successor features and instantaneous reward models. We empirically show this approach leads to faster policy evaluation and better control performance, for tabular and nonlinear function approximations, indicating scalability and generality.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.
-
Current State and Future Directions for Learning in Biological Recurrent Neural Networks: A Perspective Piece
Authors:
Luke Y. Prince,
Roy Henha Eyono,
Ellen Boven,
Arna Ghosh,
Joe Pemberton,
Franz Scherr,
Claudia Clopath,
Rui Ponte Costa,
Wolfgang Maass,
Blake A. Richards,
Cristina Savin,
Katharina Anna Wilmes
Abstract:
We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both th…
▽ More
We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both theoretical and experimental neuroscientists when designing new studies that could help bring clarity to these issues.
△ Less
Submitted 5 January, 2022; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Adversarial Feature Desensitization
Authors:
Pouya Bashivan,
Reza Bayat,
Adam Ibrahim,
Kartik Ahuja,
Mojtaba Faramarzi,
Touraj Laleh,
Blake Aaron Richards,
Irina Rish
Abstract:
Neural networks are known to be vulnerable to adversarial attacks -- slight but carefully constructed perturbations of the inputs which can drastically impair the network's performance. Many defense methods have been proposed for improving robustness of deep networks by training them on adversarially perturbed inputs. However, these models often remain vulnerable to new types of attacks not seen d…
▽ More
Neural networks are known to be vulnerable to adversarial attacks -- slight but carefully constructed perturbations of the inputs which can drastically impair the network's performance. Many defense methods have been proposed for improving robustness of deep networks by training them on adversarially perturbed inputs. However, these models often remain vulnerable to new types of attacks not seen during training, and even to slightly stronger versions of previously seen attacks. In this work, we propose a novel approach to adversarial robustness, which builds upon the insights from the domain adaptation field. Our method, called Adversarial Feature Desensitization (AFD), aims at learning features that are invariant towards adversarial perturbations of the inputs. This is achieved through a game where we learn features that are both predictive and robust (insensitive to adversarial attacks), i.e. cannot be used to discriminate between natural and adversarial data. Empirical results on several benchmarks demonstrate the effectiveness of the proposed approach against a wide range of attack types and attack strengths. Our code is available at https://github.com/BashivanLab/afd.
△ Less
Submitted 4 January, 2022; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Spike-based causal inference for weight alignment
Authors:
Jordan Guerguiev,
Konrad P. Kording,
Blake A. Richards
Abstract:
In artificial neural networks trained with gradient descent, the weights used for processing stimuli are also used during backward passes to calculate gradients. For the real brain to approximate gradients, gradient information would have to be propagated separately, such that one set of synaptic weights is used for processing and another set is used for backward passes. This produces the so-calle…
▽ More
In artificial neural networks trained with gradient descent, the weights used for processing stimuli are also used during backward passes to calculate gradients. For the real brain to approximate gradients, gradient information would have to be propagated separately, such that one set of synaptic weights is used for processing and another set is used for backward passes. This produces the so-called "weight transport problem" for biological models of learning, where the backward weights used to calculate gradients need to mirror the forward weights used to process stimuli. This weight transport problem has been considered so hard that popular proposals for biological learning assume that the backward weights are simply random, as in the feedback alignment algorithm. However, such random weights do not appear to work well for large networks. Here we show how the discontinuity introduced in a spiking system can lead to a solution to this problem. The resulting algorithm is a special case of an estimator used for causal inference in econometrics, regression discontinuity design. We show empirically that this algorithm rapidly makes the backward weights approximate the forward weights. As the backward weights become correct, this improves learning performance over feedback alignment on tasks such as Fashion-MNIST, SVHN, CIFAR-10 and VOC. Our results demonstrate that a simple learning rule in a spiking network can allow neurons to produce the right backward connections and thus solve the weight transport problem.
△ Less
Submitted 1 February, 2020; v1 submitted 3 October, 2019;
originally announced October 2019.
-
Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures
Authors:
Sergey Bartunov,
Adam Santoro,
Blake A. Richards,
Luke Marris,
Geoffrey E. Hinton,
Timothy Lillicrap
Abstract:
The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The recent success of deep networks in machine learning and AI, however, has inspired proposals for understanding how the brain might learn across multiple layers, and hence how it might approximate BP. As of yet, none of these proposals have been rigorously evaluated on tasks where BP-guided deep learning has…
▽ More
The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The recent success of deep networks in machine learning and AI, however, has inspired proposals for understanding how the brain might learn across multiple layers, and hence how it might approximate BP. As of yet, none of these proposals have been rigorously evaluated on tasks where BP-guided deep learning has proved critical, or in architectures more structured than simple fully-connected networks. Here we present results on scaling up biologically motivated models of deep learning on datasets which need deep networks with appropriate architectures to achieve good performance. We present results on the MNIST, CIFAR-10, and ImageNet datasets and explore variants of target-propagation (TP) and feedback alignment (FA) algorithms, and explore performance in both fully- and locally-connected architectures. We also introduce weight-transport-free variants of difference target propagation (DTP) modified to remove backpropagation from the penultimate layer. Many of these algorithms perform well for MNIST, but for CIFAR and ImageNet we find that TP and FA variants perform significantly worse than BP, especially for networks composed of locally connected units, opening questions about whether new architectures and algorithms are required to scale these approaches. Our results and implementation details help establish baselines for biologically motivated deep learning schemes going forward.
△ Less
Submitted 20 November, 2018; v1 submitted 12 July, 2018;
originally announced July 2018.
-
Towards deep learning with segregated dendrites
Authors:
Jordan Guergiuev,
Timothy P. Lillicrap,
Blake A. Richards
Abstract:
Deep learning has led to significant advances in artificial intelligence, in part, by adopting strategies motivated by neurophysiology. However, it is unclear whether deep learning could occur in the real brain. Here, we show that a deep learning algorithm that utilizes multi-compartment neurons might help us to understand how the brain optimizes cost functions. Like neocortical pyramidal neurons,…
▽ More
Deep learning has led to significant advances in artificial intelligence, in part, by adopting strategies motivated by neurophysiology. However, it is unclear whether deep learning could occur in the real brain. Here, we show that a deep learning algorithm that utilizes multi-compartment neurons might help us to understand how the brain optimizes cost functions. Like neocortical pyramidal neurons, neurons in our model receive sensory information and higher-order feedback in electrotonically segregated compartments. Thanks to this segregation, the neurons in different layers of the network can coordinate synaptic weight updates. As a result, the network can learn to categorize images better than a single layer network. Furthermore, we show that our algorithm takes advantage of multilayer architectures to identify useful representations---the hallmark of deep learning. This work demonstrates that deep learning can be achieved using segregated dendritic compartments, which may help to explain the dendritic morphology of neocortical pyramidal neurons.
△ Less
Submitted 7 April, 2017; v1 submitted 1 October, 2016;
originally announced October 2016.