Search | arXiv e-print repository

The Past, Present, and Future of the Brain Imaging Data Structure (BIDS)

Authors: Russell A. Poldrack, Christopher J. Markiewicz, Stefan Appelhoff, Yoni K. Ashar, Tibor Auer, Sylvain Baillet, Shashank Bansal, Leandro Beltrachini, Christian G. Benar, Giacomo Bertazzoli, Suyash Bhogawar, Ross W. Blair, Marta Bortoletto, Mathieu Boudreau, Teon L. Brooks, Vince D. Calhoun, Filippo Maria Castelli, Patricia Clement, Alexander L Cohen, Julien Cohen-Adad, Sasha D'Ambrosio, Gilles de Hollander, María de la iglesia-Vayá, Alejandro de la Vega, Arnaud Delorme , et al. (89 additional authors not shown)

Abstract: The Brain Imaging Data Structure (BIDS) is a community-driven standard for the organization of data and metadata from a growing range of neuroscience modalities. This paper is meant as a history of how the standard has developed and grown over time. We outline the principles behind the project, the mechanisms by which it has been extended, and some of the challenges being addressed as it evolves.… ▽ More The Brain Imaging Data Structure (BIDS) is a community-driven standard for the organization of data and metadata from a growing range of neuroscience modalities. This paper is meant as a history of how the standard has developed and grown over time. We outline the principles behind the project, the mechanisms by which it has been extended, and some of the challenges being addressed as it evolves. We also discuss the lessons learned through the project, with the aim of enabling researchers in other domains to learn from the success of BIDS. △ Less

Submitted 8 January, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

arXiv:2304.11490 [pdf]

Boosting Theory-of-Mind Performance in Large Language Models via Prompting

Authors: Shima Rahimi Moghaddam, Christopher J. Honey

Abstract: Large language models (LLMs) excel in many tasks in 2023, but they still face challenges in complex reasoning. Theory-of-mind (ToM) tasks, which require understanding agents' beliefs, goals, and mental states, are essential for common-sense reasoning involving humans, making it crucial to enhance LLM performance in this area. This study measures the ToM performance of GPT-4 and three GPT-3.5 varia… ▽ More Large language models (LLMs) excel in many tasks in 2023, but they still face challenges in complex reasoning. Theory-of-mind (ToM) tasks, which require understanding agents' beliefs, goals, and mental states, are essential for common-sense reasoning involving humans, making it crucial to enhance LLM performance in this area. This study measures the ToM performance of GPT-4 and three GPT-3.5 variants (Davinci-2, Davinci-3, GPT-3.5-Turbo), and investigates the effectiveness of in-context learning in improving their ToM comprehension. We evaluated prompts featuring two-shot chain of thought reasoning and step-by-step thinking instructions. We found that LLMs trained with Reinforcement Learning from Human Feedback (RLHF) (all models excluding Davinci-2) improved their ToM accuracy via in-context learning. GPT-4 performed best in zero-shot settings, reaching nearly 80% ToM accuracy, but still fell short of the 87% human accuracy on the test set. However, when supplied with prompts for in-context learning, all RLHF-trained LLMs exceeded 80% ToM accuracy, with GPT-4 reaching 100%. These results demonstrate that appropriate prompting enhances LLM ToM reasoning, and they underscore the context-dependent nature of LLM cognitive capacities. △ Less

Submitted 26 April, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

Comments: 27 pages, 4 main figures, 2 supplementary figures

arXiv:2105.05944 [pdf, other]

Slower is Better: Revisiting the Forgetting Mechanism in LSTM for Slower Information Decay

Authors: Hsiang-Yun Sherry Chien, Javier S. Turek, Nicole Beckage, Vy A. Vo, Christopher J. Honey, Ted L. Willke

Abstract: Sequential information contains short- to long-range dependencies; however, learning long-timescale information has been a challenge for recurrent neural networks. Despite improvements in long short-term memory networks (LSTMs), the forgetting mechanism results in the exponential decay of information, limiting their capacity to capture long-timescale information. Here, we propose a power law forge… ▽ More Sequential information contains short- to long-range dependencies; however, learning long-timescale information has been a challenge for recurrent neural networks. Despite improvements in long short-term memory networks (LSTMs), the forgetting mechanism results in the exponential decay of information, limiting their capacity to capture long-timescale information. Here, we propose a power law forget gate, which instead learns to forget information along a slower power law decay function. Specifically, the new gate learns to control the power law decay factor, p, allowing the network to adjust the information decay rate according to task demands. Our experiments show that an LSTM with power law forget gates (pLSTM) can effectively capture long-range dependencies beyond hundreds of elements on image classification, language modeling, and categorization tasks, improving performance over the vanilla LSTM. We also inspected the revised forget gate by varying the initialization of p, setting p to a fixed value, and ablating cells in the pLSTM network. The results show that the information decay can be controlled by the learnable decay factor p, which allows pLSTM to achieve its superior performance. Altogether, we found that LSTM with the proposed forget gate can learn long-term dependencies, outperforming other recurrent networks in multiple domains; such gating mechanism can be integrated into other architectures for improving the learning of long timescale information in recurrent neural networks. △ Less

Submitted 12 May, 2021; originally announced May 2021.

Comments: 16 pages, 10 figures

arXiv:2101.06913 [pdf, ps, other]

doi 10.1063/5.0031031

Phase and amplitude dynamics of coupled oscillator systems on complex networks

Authors: Jae Hyung Woo, Christopher J. Honey, Joon-Young Moon

Abstract: We investigated the locking behaviors of coupled limit-cycle oscillators with phase and amplitude dynamics. We focused on how the dynamics are affected by inhomogeneous coupling strength and by angular and radial shifts in the coupling function. We performed mean-field analyses of oscillator systems with inhomogeneous coupling strength, testing Gaussian, power-law, and brain-like degree distributi… ▽ More We investigated the locking behaviors of coupled limit-cycle oscillators with phase and amplitude dynamics. We focused on how the dynamics are affected by inhomogeneous coupling strength and by angular and radial shifts in the coupling function. We performed mean-field analyses of oscillator systems with inhomogeneous coupling strength, testing Gaussian, power-law, and brain-like degree distributions. Even for oscillators with identical intrinsic frequencies and intrinsic amplitudes, we found that the coupling strength distribution and coupling function generated a wide repertoire of phase and amplitude dynamics. These included fully and partially locked states in which high-degree or low-degree nodes would phase-lead the network. The mean-field analytical findings were confirmed via numerical simulations. The results suggest that, in oscillator systems in which individual nodes can independently vary their amplitude over time, qualitatively different dynamics can be produced via shifts in the coupling strength distribution and the coupling form. Of particular relevance to information flows in oscillator networks, changes in the non-specific drive to individual nodes can make high-degree nodes phase-lag or phase-lead the rest of the network. △ Less

Submitted 18 January, 2021; originally announced January 2021.

Comments: 16 pages, 9 figures

Journal ref: Chaos 30, 121102 (2020)

arXiv:2012.06717 [pdf, other]

Map** the Timescale Organization of Neural Language Models

Authors: Hsiang-Yun Sherry Chien, **han Zhang, Christopher. J. Honey

Abstract: In the human brain, sequences of language input are processed within a distributed and hierarchical architecture, in which higher stages of processing encode contextual information over longer timescales. In contrast, in recurrent neural networks which perform natural language processing, we know little about how the multiple timescales of contextual information are functionally organized. Therefo… ▽ More In the human brain, sequences of language input are processed within a distributed and hierarchical architecture, in which higher stages of processing encode contextual information over longer timescales. In contrast, in recurrent neural networks which perform natural language processing, we know little about how the multiple timescales of contextual information are functionally organized. Therefore, we applied tools developed in neuroscience to map the "processing timescales" of individual units within a word-level LSTM language model. This timescale-map** method assigned long timescales to units previously found to track long-range syntactic dependencies. Additionally, the map** revealed a small subset of the network (less than 15% of units) with long timescales and whose function had not previously been explored. We next probed the functional organization of the network by examining the relationship between the processing timescale of units and their network connectivity. We identified two classes of long-timescale units: "controller" units composed a densely interconnected subnetwork and strongly projected to the rest of the network, while "integrator" units showed the longest timescales in the network, and expressed projection profiles closer to the mean projection profile. Ablating integrator and controller units affected model performance at different positions within a sentence, suggesting distinctive functions of these two sets of units. Finally, we tested the generalization of these results to a character-level LSTM model and models with different architectures. In summary, we demonstrated a model-free technique for map** the timescale organization in recurrent neural networks, and we applied this method to reveal the timescale and functional organization of neural language models. △ Less

Submitted 17 March, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

Comments: 23 pages, 4 main figures, 10 appendix figures; published as a conference paper at ICLR 2021

arXiv:2012.06694 [pdf]

Consequences of Slow Neural Dynamics for Incremental Learning

Authors: Shima Rahimi Moghaddam, Fanjun Bu, Christopher J. Honey

Abstract: In the human brain, internal states are often correlated over time (due to local recurrence and other intrinsic circuit properties), punctuated by abrupt transitions. At first glance, temporal smoothness of internal states presents a problem for learning input-output map**s (e.g. category labels for images), because the internal representation of the input will contain a mixture of current input… ▽ More In the human brain, internal states are often correlated over time (due to local recurrence and other intrinsic circuit properties), punctuated by abrupt transitions. At first glance, temporal smoothness of internal states presents a problem for learning input-output map**s (e.g. category labels for images), because the internal representation of the input will contain a mixture of current input and prior inputs. However, when training with naturalistic data (e.g. movies) there is also temporal autocorrelation in the input. How does the temporal "smoothness" of internal states affect the efficiency of learning when the training data are also temporally smooth? How does it affect the kinds of representations that are learned? We found that, when trained with temporally smooth data, "slow" neural networks (equipped with linear recurrence and gating mechanisms) learned to categorize more efficiently than feedforward networks. Furthermore, networks with linear recurrence and multi-timescale gating could learn internal representations that "un-mixed" quickly-varying and slowly-varying data sources. Together, these findings demonstrate how a fundamental property of cortical dynamics (their temporal autocorrelation) can serve as an inductive bias, leading to more efficient category learning and to the representational separation of fast and slow sources in the environment. △ Less

Submitted 22 May, 2023; v1 submitted 11 December, 2020; originally announced December 2020.

Showing 1–6 of 6 results for author: Honey, C J