Skip to main content

Showing 1–10 of 10 results for author: Musslick, S

.
  1. arXiv:2403.03230  [pdf, other

    q-bio.NC cs.AI

    Large language models surpass human experts in predicting neuroscience results

    Authors: Xiaoliang Luo, Akilles Rechardt, Guangzhi Sun, Kevin K. Nejad, Felipe Yáñez, Bati Yilmaz, Kangjoo Lee, Alexandra O. Cohen, Valentina Borghesani, Anton Pashkov, Daniele Marinazzo, Jonathan Nicholas, Alessandro Salatiello, Ilia Sucholutsky, Pasquale Minervini, Sepehr Razavi, Roberta Rocca, Elkhan Yusifov, Tereza Okalova, Nianlong Gu, Martin Ferianc, Mikail Khona, Kaustubh R. Patil, Pui-Shee Lee, Rui Mata , et al. (14 additional authors not shown)

    Abstract: Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created Brain… ▽ More

    Submitted 21 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  2. arXiv:2312.00396  [pdf, other

    cs.LG stat.ML

    GFN-SR: Symbolic Regression with Generative Flow Networks

    Authors: Sida Li, Ioana Marinescu, Sebastian Musslick

    Abstract: Symbolic regression (SR) is an area of interpretable machine learning that aims to identify mathematical expressions, often composed of simple functions, that best fit in a given set of covariates $X$ and response $y$. In recent years, deep symbolic regression (DSR) has emerged as a popular method in the field by leveraging deep reinforcement learning to solve the complicated combinatorial search… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted by the NeurIPS 2023 AI4Science Workshop

  3. arXiv:2307.07575  [pdf, other

    cs.LG cs.NE

    A Quantitative Approach to Predicting Representational Learning and Performance in Neural Networks

    Authors: Ryan Pyle, Sebastian Musslick, Jonathan D. Cohen, Ankit B. Patel

    Abstract: A key property of neural networks (both biological and artificial) is how they learn to represent and manipulate input information in order to solve a task. Different types of representations may be suited to different types of tasks, making identifying and understanding learned representations a critical part of understanding and designing useful networks. In this paper, we introduce a new pseudo… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 30 pages, 16 figures

  4. arXiv:2206.05379  [pdf, other

    cs.CV cs.AI

    A Benchmark for Compositional Visual Reasoning

    Authors: Aimen Zerroug, Mohit Vaishnav, Julien Colin, Sebastian Musslick, Thomas Serre

    Abstract: A fundamental component of human vision is our ability to parse complex visual scenes and judge the relations between their constituent objects. AI benchmarks for visual reasoning have driven rapid progress in recent years with state-of-the-art systems now reaching human accuracy on some of these benchmarks. Yet, a major gap remains in terms of the sample efficiency with which humans and AI system… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  5. arXiv:2103.13939  [pdf, other

    cs.LG

    Recovering Quantitative Models of Human Information Processing with Differentiable Architecture Search

    Authors: Sebastian Musslick

    Abstract: The integration of behavioral phenomena into mechanistic models of cognitive function is a fundamental staple of cognitive science. Yet, researchers are beginning to accumulate increasing amounts of data without having the temporal or monetary resources to integrate these data into scientific theories. We seek to overcome these limitations by incorporating existing machine learning techniques into… ▽ More

    Submitted 17 May, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  6. arXiv:2007.10527  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Navigating the Trade-Off between Multi-Task Learning and Learning to Multitask in Deep Neural Networks

    Authors: Sachin Ravi, Sebastian Musslick, Maia Hamin, Theodore L. Willke, Jonathan D. Cohen

    Abstract: The terms multi-task learning and multitasking are easily confused. Multi-task learning refers to a paradigm in machine learning in which a network is trained on various related tasks to facilitate the acquisition of tasks. In contrast, multitasking is used to indicate, especially in the cognitive science literature, the ability to execute multiple tasks simultaneously. While multi-task learning e… ▽ More

    Submitted 5 January, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

  7. arXiv:2007.03124  [pdf, other

    q-bio.NC

    Efficiency of learning vs. processing: Towards a normative theory of multitasking

    Authors: Yotam Sagiv, Sebastian Musslick, Yael Niv, Jonathan D. Cohen

    Abstract: A striking limitation of human cognition is our inability to execute some tasks simultaneously. Recent work suggests that such limitations can arise from a fundamental tradeoff in network architectures that is driven by the sharing of representations between tasks: sharing promotes quicker learning, at the expense of interference while multitasking. From this perspective, multitasking failures mig… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  8. arXiv:1708.03263  [pdf, other

    q-bio.NC

    Topological limits to parallel processing capability of network architectures

    Authors: Giovanni Petri, Sebastian Musslick, Biswadip Dey, Kayhan Ozcimder, David Turner, Nesreen K. Ahmed, Theodore Willke, Jonathan D. Cohen

    Abstract: The ability to learn new tasks and generalize performance to others is one of the most remarkable characteristics of the human brain and of recent AI systems. The ability to perform multiple tasks simultaneously is also a signature characteristic of large-scale parallel architectures, that is evident in the human brain, and has been exploited effectively more traditional, massively parallel comput… ▽ More

    Submitted 10 November, 2020; v1 submitted 10 August, 2017; originally announced August 2017.

    Comments: version 4. Added SIs, 33 pages total, 4 figures + 14 figures in SI, major edits to text

  9. arXiv:1706.00085  [pdf, other

    q-bio.NC

    A Formal Approach to Modeling the Cost of Cognitive Control

    Authors: Kayhan Ozcimder, Biswadip Dey, Sebastian Musslick, Giovanni Petri, Nesreen K. Ahmed, Theodore L. Willke, Jonathan D. Cohen

    Abstract: This paper introduces a formal method to model the level of demand on control when executing cognitive processes. The cost of cognitive control is parsed into an intensity cost which encapsulates how much additional input information is required so as to get the specified response, and an interaction cost which encapsulates the level of interference between individual processes in a network. We de… ▽ More

    Submitted 31 May, 2017; originally announced June 2017.

    Comments: 6 pages, 3 figures, Conference paper

  10. arXiv:1611.02400  [pdf, other

    cs.DM

    A Graph-Theoretic Approach to Multitasking

    Authors: Noga Alon, Jonathan D. Cohen, Biswadip Dey, Tom Griffiths, Sebastian Musslick, Kayhan Ozcimder, Daniel Reichman, Igor Shinkar, Tal Wagner

    Abstract: A key feature of neural network architectures is their ability to support the simultaneous interaction among large numbers of units in the learning and processing of representations. However, how the richness of such interactions trades off against the ability of a network to simultaneously carry out multiple independent processes -- a salient limitation in many domains of human cognition -- remai… ▽ More

    Submitted 9 June, 2017; v1 submitted 8 November, 2016; originally announced November 2016.