-
Generative AI-based closed-loop fMRI system
Authors:
Mikihiro Kasahara,
Taiki Oka,
Vincent Taschereau-Dumouchel,
Mitsuo Kawato,
Hiroki Takakura,
Aurelio Cortese
Abstract:
While generative AI is now widespread and useful in society, there are potential risks of misuse, e.g., unconsciously influencing cognitive processes or decision-making. Although this causes a security problem in the cognitive domain, there has been no research about neural and computational mechanisms counteracting the impact of malicious generative AI in humans. We propose DecNefGAN, a novel framework that combines a generative adversarial system and a neural reinforcement model. More specifically, DecNefGAN bridges human and generative AI in a closed-loop system, with the AI creating stimuli that induce specific mental states, thus exerting external control over neural activity. The objective of the human is the opposite, to compete and reach an orthogonal mental state. This framework can contribute to elucidating how the human brain responds to and counteracts the potential influence of generative AI.
Submitted 29 January, 2024;
originally announced January 2024.
-
BLAST: A Wafer-scale Transfer Process for Heterogeneous Integration of Optics and Electronics
Authors:
Yanxin Ji,
Alejandro J. Cortese,
Conrad L. Smart,
Alyosha C. Molnar,
Paul L. McEuen
Abstract:
We present a general transfer method for the heterogeneous integration of different photonic and electronic materials systems and devices onto a single substrate. Called BLAST, for Bond, Lift, Align, and Slide Transfer, the process works at wafer scale and offers precision alignment, high yield, varying topographies, and suitability for subsequent lithographic processing. We demonstrate BLAST's capabilities by integrating both GaAs and GaN microLEDs with silicon photovoltaics to fabricate optical wireless integrated circuits that up-convert photons from the red to the blue. We also show that BLAST can be applied to a variety of other devices and substrates, including CMOS electronics, vertical cavity surface emitting lasers (VCSELs), and 2D materials. BLAST further enables the modularization of optoelectronic microsystems, where optical devices fabricated on one material substrate can be lithographically integrated with electronic devices on a different substrate in a scalable process.
Submitted 11 February, 2023;
originally announced February 2023.
-
"Task-relevant autoencoding" enhances machine learning for human neuroscience
Authors:
Seyedmehdi Orouji,
Vincent Taschereau-Dumouchel,
Aurelio Cortese,
Brian Odegaard,
Cody Cushing,
Mouslim Cherkaoui,
Mitsuo Kawato,
Hakwan Lau,
Megan A. K. Peters
Abstract:
In human neuroscience, machine learning can help reveal lower-dimensional neural representations relevant to subjects' behavior. However, state-of-the-art models typically require large datasets to train, so are prone to overfitting on human neuroimaging data that often possess few samples but many input dimensions. Here, we capitalized on the fact that the features we seek in human neuroscience are precisely those relevant to subjects' behavior. We thus developed a Task-Relevant Autoencoder via Classifier Enhancement (TRACE), and tested its ability to extract behaviorally-relevant, separable representations compared to a standard autoencoder, a variational autoencoder, and principal component analysis for two severely truncated machine learning datasets. We then evaluated all models on fMRI data from 59 subjects who observed animals and objects. TRACE outperformed all models nearly unilaterally, showing up to 12% increased classification accuracy and up to 56% improvement in discovering "cleaner", task-relevant representations. These results showcase TRACE's potential for a wide variety of data related to human behavior.
Submitted 22 September, 2023; v1 submitted 17 August, 2022;
originally announced August 2022.
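The abstract above does not spell out TRACE's objective. As a minimal sketch only, one common way to make an autoencoder bottleneck task-relevant is to add a classification loss to the reconstruction loss. The function names and the `weight` parameter below are hypothetical, and this is not claimed to be the authors' formulation.

```python
import math

def mse(x, x_hat):
    """Mean squared reconstruction error between an input and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)

def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed with a stable log-sum-exp."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def joint_loss(x, x_hat, logits, label, weight=1.0):
    """Reconstruction loss plus a weighted classification loss (assumed form)."""
    return mse(x, x_hat) + weight * cross_entropy(logits, label)
```

With a perfect reconstruction and uninformative logits over two classes, the loss reduces to the classification term alone, `log(2)`.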
-
Motor neuron pathology in CANVAS due to RFC1 expansions
Authors:
Vincent Huin,
Giulia Coarelli,
Clément Guemy,
Susana Boluda,
Rabab Debs,
Fanny Mochel,
Tanya Stojkovic,
David Grabli,
Thierry Maisonobe,
Bertrand Gaymard,
Timothée Lenglet,
Céline Tard,
Jean-Baptiste Davion,
Bernard Sablonnière,
Marie-Lorraine Monin,
Claire Ewenczyk,
Karine Viala,
Perrine Charles,
Isabelle Le Ber,
Mary Reilly,
Henry Houlden,
Andrea Cortese,
Danielle Seilhean,
Alexis Brice,
Alexandra Durr
Abstract:
CANVAS caused by RFC1 biallelic expansions is a major cause of inherited sensory neuronopathy. Detection of RFC1 expansion is challenging and CANVAS can be associated with atypical features. We clinically and genetically characterized 50 patients, selected based on the presence of sensory neuronopathy confirmed by EMG. We screened RFC1 expansion by PCR, repeat-primed PCR, and Southern blotting of long-range PCR products, a newly developed method. Neuropathological characterization was performed on the brain and spinal cord of one patient. Most patients (88%) carried a biallelic (AAGGG)n expansion in RFC1. In addition to the core CANVAS phenotype (sensory neuronopathy, cerebellar syndrome, and vestibular impairment), we observed chronic cough (97%), oculomotor signs (85%), motor neuron involvement (55%), dysautonomia (50%), and parkinsonism (10%). Motor neuron involvement was found for 24 of 38 patients (63.1%). First motor neuron signs, such as brisk reflexes, extensor plantar responses, and/or spasticity, were present in 29% of patients, second motor neuron signs, such as fasciculations, wasting, weakness, or a neurogenic pattern on EMG in 18%, and both in 16%. Mixed motor and sensory neuronopathy was observed in 19% of patients. Among six non-RFC1 patients, one carried a heterozygous AAGGG expansion and a pathogenic variant in GRM1. Neuropathological examination of one RFC1 patient with an enriched phenotype, including parkinsonism, dysautonomia, and cognitive decline, showed posterior column and lumbar posterior root atrophy. Degeneration of the vestibulospinal and spinocerebellar tracts was mild. We observed marked astrocytic gliosis and axonal swelling of the synapse between first and second motor neurons in the anterior horn at the lumbar level. The cerebellum showed mild depletion of Purkinje cells, with empty baskets, torpedoes, and astrogliosis characterized by a disorganization of the Bergmann's radial glia. We found neuronal loss in the vagal nucleus. 
The pars compacta of the substantia nigra was depleted, with widespread Lewy bodies in the locus coeruleus, substantia nigra, hippocampus, entorhinal cortex, and amygdala. We propose new guidelines for the screening of RFC1 expansion, considering different expansion motifs. Here, we developed a new method to more easily detect pathogenic RFC1 expansions. We report frequent motor neuron involvement and different neuronopathy subtypes. Parkinsonism was more prevalent in this cohort than in the general population, 10% versus the expected 1% (p < .001). We describe, for the first time, the spinal cord pathology in CANVAS, showing the alteration of posterior columns and roots, astrocytic gliosis and axonal swelling, suggesting motor neuron synaptic dysfunction.
Submitted 25 January, 2022;
originally announced January 2022.
-
From internal models toward metacognitive AI
Authors:
Mitsuo Kawato,
Aurelio Cortese
Abstract:
In several papers published in Biological Cybernetics in the 1980s and 1990s, Kawato and colleagues proposed computational models explaining how internal models are acquired in the cerebellum. These models were later supported by neurophysiological experiments using monkeys and neuroimaging experiments involving humans. These early studies influenced neuroscience from basic sensory-motor control to higher cognitive functions. One of the most perplexing enigmas related to internal models is to understand the neural mechanisms that enable animals to learn large-dimensional problems with so few trials. Consciousness and metacognition (the ability to monitor one's own thoughts) may be part of the solution to this enigma. Based on literature reviews of the past 20 years, here we propose a computational neuroscience model of metacognition. The model comprises a modular hierarchical reinforcement-learning architecture of parallel and layered, generative-inverse model pairs. In the prefrontal cortex, a distributed executive network called the "cognitive reality monitoring network" (CRMN) orchestrates conscious involvement of generative-inverse model pairs in perception and action. Based on mismatches between computations by generative and inverse models, as well as reward prediction errors, CRMN computes a "responsibility signal" that gates selection and learning of pairs in perception, action, and reinforcement learning. A high responsibility signal is given to the pairs that best capture the external world, that are competent in movements (small mismatch), and that are capable of reinforcement learning (small reward prediction error). CRMN selects pairs with higher responsibility signals as objects of metacognition, and consciousness is determined by the entropy of responsibility signals across all pairs.
Submitted 20 December, 2021; v1 submitted 27 September, 2021;
originally announced September 2021.
-
How do we generalize?
Authors:
Jessica Elizabeth Taylor,
Aurelio Cortese,
Helen C. Barron,
Xiaochuan Pan,
Masamichi Sakagami,
Dagmar Zeithamova
Abstract:
Humans and animals are able to generalize or transfer information from previous experience so that they can behave appropriately in novel situations. What mechanisms--computations, representations, and neural systems--give rise to this remarkable ability? The members of this Generative Adversarial Collaboration (GAC) come from a range of academic backgrounds but are all interested in uncovering the mechanisms of generalization. We started out this GAC with the aim of arbitrating between two alternative conceptual accounts: (1) generalization stems from integration of multiple experiences into summary representations that reflect generalized knowledge, and (2) generalization is computed on-the-fly using separately stored individual memories. Across the course of this collaboration, we found that--despite using different terminology and techniques, and although some of our specific papers may provide evidence one way or the other--we in fact largely agree that both of these broad accounts (as well as several others) are likely valid. We believe that future research and theoretical synthesis across multiple lines of research is necessary to help determine the degree to which different candidate generalization mechanisms may operate simultaneously, operate on different scales, or be employed under distinct conditions. Here, as the first step, we introduce some of these candidate mechanisms and we discuss the issues currently hindering better synthesis of generalization research. Finally, we introduce some of our own research questions that have arisen over the course of this GAC, that we believe would benefit from future collaborative efforts.
Submitted 27 August, 2021; v1 submitted 2 April, 2021;
originally announced April 2021.
-
Attention or memory? Neurointerpretable agents in space and time
Authors:
Lennart Bramlage,
Aurelio Cortese
Abstract:
In neuroscience, attention has been shown to bidirectionally interact with reinforcement learning (RL) processes. This interaction is thought to support dimensionality reduction of task representations, restricting computations to relevant features. However, it remains unclear whether these properties can translate into real algorithmic advantages for artificial agents, especially in dynamic environments. We design a model incorporating a self-attention mechanism that implements task-state representations in semantic feature-space, and test it on a battery of Atari games. To evaluate the agent's selective properties, we add a large volume of task-irrelevant features to observations. In line with neuroscience predictions, self-attention leads to increased robustness to noise compared to benchmark models. Strikingly, this self-attention mechanism is general enough, such that it can be naturally extended to implement a transient working-memory, able to solve a partially observable maze task. Lastly, we highlight the predictive quality of attended stimuli. Because we use semantic observations, we can uncover not only which features the agent elects to base decisions on, but also how it chooses to compile more complex, relational features from simpler ones. These results formally illustrate the benefits of attention in deep RL and provide evidence for the interpretability of self-attention mechanisms.
Submitted 12 July, 2020; v1 submitted 9 July, 2020;
originally announced July 2020.
-
MoS$_{2}$ pixel arrays for real-time photoluminescence imaging of redox molecules
Authors:
M. F. Reynolds,
M. H. D. Guimaraes,
H. Gao,
K. Kang,
A. J. Cortese,
D. C. Ralph,
J. Park,
P. L. McEuen
Abstract:
Measuring the behavior of redox-active molecules in space and time is crucial for better understanding of chemical and biological systems and for the development of new technologies. Optical schemes are non-invasive, scalable and can be applied to many different systems, but usually have a slow response compared to electrical detection methods. Furthermore, many fluorescent molecules for redox detection degrade in brightness over long exposure times. Here we show that the photoluminescence of pixel arrays of an atomically thin two-dimensional (2D) material, a monolayer of MoS$_{2}$, can image spatial and temporal changes in redox molecule concentration in real time. Because of the strong dependence of MoS$_{2}$ photoluminescence on doping and sensitivity to surface changes characteristic of 2D materials, changes in the local chemical potential significantly modulate the photoluminescence of MoS$_{2}$, with a sensitivity of 0.9 mV/$\sqrt{\mathrm{Hz}}$ on a 5 $\mu$m by 5 $\mu$m pixel, corresponding to better than parts-per-hundred changes in redox molecule concentration down to nanomolar concentrations at 100 ms frame rates. The real-time imaging of electrochemical potentials with a fast response time provides a new strategy for visualizing chemical reactions and biomolecules with a 2D material screen.
Submitted 9 August, 2019;
originally announced August 2019.
-
The neural and cognitive architecture for learning from a small sample
Authors:
Aurelio Cortese,
Benedetto De Martino,
Mitsuo Kawato
Abstract:
Artificial intelligence algorithms are capable of fantastic exploits, yet they are still grossly inefficient compared with the brain's ability to learn from few exemplars or solve problems that have not been explicitly defined. What is the secret that the evolution of human intelligence has unlocked? Generalization is one answer, but there is more to it. The brain does not directly solve difficult problems; it is able to recast them into new and more tractable problems. Here we propose a model whereby higher cognitive functions profoundly interact with reinforcement learning to drastically reduce the degrees of freedom of the search space, simplifying complex problems and fostering more efficient learning.
Submitted 4 October, 2018;
originally announced October 2018.
-
Loading Classical Data into a Quantum Computer
Authors:
John A. Cortese,
Timothy M. Braje
Abstract:
This document describes a family of quantum circuits which load classical data into a quantum state. When loading $N$ classical bits, the resulting quantum state is of order $\log_2(N)$ qubits. Furthermore, the gate depth of the data loading circuit is of order $\log_2(N)$. Limitations to the efficiency of the data loading process, such as the Holevo bound, are discussed. Methods to improve the efficiency of the data loading procedure, such as combining classical compression techniques with quantum decompression circuitry, are also discussed. Simulations using the Quipper language were conducted to verify the circuits' behavior.
Submitted 5 March, 2018;
originally announced March 2018.
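To illustrate the $\log_2(N)$ qubit count stated in the abstract above, the following minimal sketch stores $N$ classical values as the amplitudes of a state vector. Amplitude encoding is assumed here as one standard scheme with this scaling; the abstract does not say which encoding the authors' circuits use, and the function names are hypothetical.

```python
import math

def qubits_needed(n_items: int) -> int:
    """Smallest k with 2**k >= n_items (at least one qubit)."""
    if n_items < 1:
        raise ValueError("need at least one item")
    return max(1, (n_items - 1).bit_length())

def amplitude_encode(values):
    """Return normalized amplitudes for a state holding `values`.

    The vector is zero-padded to length 2**k so it describes k qubits.
    """
    k = qubits_needed(len(values))
    padded = list(values) + [0.0] * (2 ** k - len(values))
    norm = math.sqrt(sum(v * v for v in padded))
    if norm == 0:
        raise ValueError("cannot encode the all-zero vector")
    return [v / norm for v in padded]
```

For example, `qubits_needed(1024)` gives 10, so a 1024-entry data vector fits in the amplitudes of a 10-qubit state. This classical simulation only shows the bookkeeping; it says nothing about the gate depth of an actual loading circuit.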
-
Decoded fMRI neurofeedback can induce bidirectional behavioral changes within single participants
Authors:
Aurelio Cortese,
Kaoru Amano,
Ai Koizumi,
Hakwan Lau,
Mitsuo Kawato
Abstract:
Studies using real-time functional magnetic resonance imaging (rt-fMRI) have recently incorporated the decoding approach, allowing for fMRI to be used as a tool for manipulation of fine-grained neural activity. Because of the tremendous potential for clinical applications, certain questions regarding decoded neurofeedback (DecNef) must be addressed. Neurofeedback effects can last for months, but the short- to mid-term dynamics are not known. Specifically, can the same subjects learn to induce neural patterns in two opposite directions in different sessions? This leads to a further question, whether learning to reverse a neural pattern may be less effective after training to induce it in a previous session. Here we employed a within-subjects design, with subjects undergoing DecNef training sequentially in opposite directions (up or down regulation of confidence judgements in a perceptual task), with the order counterbalanced across subjects. Behavioral results indicated that the manipulation was strongly influenced by the order and direction of neurofeedback. We therefore applied nonlinear mathematical modeling to parametrize four main consequences of DecNef: main effect of change in behavior, strength of down-regulation effect relative to up-regulation, maintenance of learning over sessions, and anterograde learning interference. Modeling results revealed that DecNef successfully induced bidirectional behavioral changes in different sessions. Furthermore, up-regulation was more sizable, and the effect was largely preserved even after an interval of one week. Lastly, the second-week effect was diminished as compared to the first-week effect, indicating strong anterograde learning interference. These results suggest reinforcement learning characteristics of DecNef, and provide important constraints on its application to basic neuroscience, occupational and sports training, and therapies.
Submitted 10 March, 2016;
originally announced March 2016.
-
The Holevo-Schumacher-Westmoreland Channel Capacity for a Class of Qudit Unital Channels
Authors:
John A. Cortese
Abstract:
The Holevo-Schumacher-Westmoreland (HSW) classical (entanglement-unassisted) channel capacity for a class of qudit unital channels is shown to be $C = \log_2(d) - S_{\min}$, where $d$ is the dimension of the qudit, and $S_{\min}$ is the minimum possible von Neumann entropy at the channel output. The HSW channel capacity for tensor products of this class of unital qudit channels is shown to obey the same formula.
Submitted 15 November, 2002;
originally announced November 2002.
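The capacity formula in the abstract above, C = log2(d) - Smin, can be evaluated directly once the eigenvalues of the minimum-entropy output state are known. The sketch below assumes those eigenvalues are supplied by the caller; finding them for a given channel is the hard part and is not attempted here. The function names are hypothetical.

```python
import math

def von_neumann_entropy(eigenvalues):
    """Entropy in bits of a density matrix given its eigenvalues (0 log 0 = 0)."""
    return -sum(p * math.log2(p) for p in eigenvalues if p > 0)

def hsw_capacity(d, min_output_eigenvalues):
    """C = log2(d) - Smin, with Smin computed from the supplied eigenvalues."""
    if len(min_output_eigenvalues) != d:
        raise ValueError("expected one eigenvalue per qudit dimension")
    if abs(sum(min_output_eigenvalues) - 1.0) > 1e-9:
        raise ValueError("eigenvalues must sum to 1")
    return math.log2(d) - von_neumann_entropy(min_output_eigenvalues)
```

Two limiting cases serve as a sanity check: a pure minimum-entropy output gives the full log2(d) bits, and a maximally mixed output gives zero capacity.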