Search | arXiv e-print repository

Empirical influence functions to understand the logic of fine-tuning

Authors: Jordan K. Matelsky, Lyle Ungar, Konrad P. Kording

Abstract: Understanding the process of learning in neural networks is crucial for improving their performance and interpreting their behavior. This can be approximately understood by asking how a model's output is influenced when we fine-tune on a new training sample. There are desiderata for such influences, such as decreasing influence with semantic distance, sparseness, noise invariance, transitive causality, and logical consistency. Here we use the empirical influence measured using fine-tuning to demonstrate how individual training samples affect outputs. We show that these desiderata are violated both for simple convolutional networks and for a modern LLM. We also illustrate how prompting can partially rescue this failure. Our paper presents an efficient and practical way of quantifying how well neural networks learn from fine-tuning stimuli. Our results suggest that popular models cannot generalize or perform logic in the way they appear to.

Submitted 1 June, 2024; originally announced June 2024.

arXiv:2401.15251 [pdf, other]

EM and XRM Connectomics Imaging and Experimental Metadata Standards

Authors: Miguel E. Wimbish, Nicole K. Guittari, Victoria A. Rose, Jorge L. Rivera Jr, Patricia K. Rivlin, Mark A. Hinton, Jordan K. Matelsky, Nicole E. Stock, Brock A. Wester, Erik C. Johnson, William R. Gray-Roncal

Abstract: High-resolution volumetric neuroimaging datasets from electron microscopy (EM) and x-ray micro and holographic-nano tomography (XRM/XHN) are being generated at an increasing rate and by a growing number of research teams. These datasets are derived from an increasing number of species, in an increasing number of brain regions, and with an increasing number of techniques. Each of these large-scale datasets, often surpassing petascale levels, is typically accompanied by a unique and varied set of metadata. These datasets can be used to derive connectomes, or neuron-synapse level connectivity diagrams, to investigate the fundamental organization of neural circuitry, neuronal development, and neurodegenerative disease. Standardization is essential to facilitate comparative connectomics analysis and enhance data utilization. Although the neuroinformatics community has successfully established and adopted data standards for many modalities, this effort has not yet encompassed EM and XRM/XHN connectomics data. This lack of standardization isolates these datasets, hindering their integration and comparison with other research performed in the field. Towards this end, our team formed a working group consisting of community stakeholders to develop Image and Experimental Metadata Standards for EM and XRM/XHN data to ensure the scientific impact and further motivate the generation and sharing of these data. This document addresses version 1.1 of these standards, aiming to support metadata services and future software designs for community collaboration. Standards for derived annotations are described in a companion document. Standards definitions are available on a community GitHub page. We hope these standards will enable comparative analysis, improve interoperability between connectomics software tools, and continue to be refined and improved by the neuroinformatics community.

Submitted 26 January, 2024; originally announced January 2024.

Comments: 15 Pages, 3 figures, 2 tables

arXiv:2308.02439 [pdf, other]

A large language model-assisted education tool to provide feedback on open-ended responses

Authors: Jordan K. Matelsky, Felipe Parodi, Tony Liu, Richard D. Lange, Konrad P. Kording

Abstract: Open-ended questions are a favored tool among instructors for assessing student understanding and encouraging critical exploration of course material. Providing feedback for such responses is a time-consuming task that can lead to overwhelmed instructors and decreased feedback quality. Many instructors resort to simpler question formats, like multiple-choice questions, which provide immediate feedback but at the expense of personalized and insightful comments. Here, we present a tool that uses large language models (LLMs), guided by instructor-defined criteria, to automate responses to open-ended questions. Our tool delivers rapid personalized feedback, enabling students to quickly test their knowledge and identify areas for improvement. We provide open-source reference implementations both as a web application and as a Jupyter Notebook widget that can be used with instructional coding or math notebooks. With instructor guidance, LLMs hold promise to enhance student learning outcomes and elevate instructional methodologies.

Submitted 25 July, 2023; originally announced August 2023.

arXiv:2305.17300 [pdf, other]

Exploiting Large Neuroimaging Datasets to Create Connectome-Constrained Approaches for more Robust, Efficient, and Adaptable Artificial Intelligence

Authors: Erik C. Johnson, Brian S. Robinson, Gautam K. Vallabha, Justin Joyce, Jordan K. Matelsky, Raphael Norman-Tenazas, Isaac Western, Marisel Villafañe-Delgado, Martha Cervantes, Michael S. Robinette, Arun V. Reddy, Lindsey Kitchell, Patricia K. Rivlin, Elizabeth P. Reilly, Nathan Drenkow, Matthew J. Roos, I-Jeng Wang, Brock A. Wester, William R. Gray-Roncal, Joan A. Hoffmann

Abstract: Despite the progress in deep learning networks, efficient learning at the edge (enabling adaptable, low-complexity machine learning solutions) remains a critical need for defense and commercial applications. We envision a pipeline to utilize large neuroimaging datasets, including maps of the brain which capture neuron and synapse connectivity, to improve machine learning approaches. We have pursued different approaches within this pipeline structure. First, as a demonstration of data-driven discovery, the team has developed a technique for discovery of repeated subcircuits, or motifs. These were incorporated into a neural architecture search approach to evolve network architectures. Second, we have conducted analysis of the heading direction circuit in the fruit fly, which performs fusion of visual and angular velocity features, to explore augmenting existing computational models with new insight. Our team discovered a novel pattern of connectivity, implemented a new model, and demonstrated sensor fusion on a robotic platform. Third, the team analyzed circuitry for memory formation in the fruit fly connectome, enabling the design of a novel generative replay approach. Finally, the team has begun analysis of connectivity in mammalian cortex to explore potential improvements to transformer networks. These constraints increased network robustness on the most challenging examples in the CIFAR-10-C computer vision robustness benchmark task, while reducing learnable attention parameters by over an order of magnitude. Taken together, these results demonstrate multiple potential approaches to utilize insight from neural systems for developing robust and efficient machine learning techniques.

Submitted 26 May, 2023; originally announced May 2023.

Comments: 11 pages, 4 figures

arXiv:2112.07718 [pdf, other]

Scatterbrained: A flexible and expandable pattern for decentralized machine learning

Authors: Miller Wilt, Jordan K. Matelsky, Andrew S. Gearhart

Abstract: Federated machine learning is a technique for training a model across multiple devices without exchanging data between them. Because data remains local to each compute node, federated learning is well-suited for use-cases in fields where data is carefully controlled, such as medicine, or in domains with bandwidth constraints. One weakness of this approach is that most federated learning tools rely upon a central server to perform workload delegation and to produce a single shared model. Here, we suggest a flexible framework for decentralizing the federated learning pattern, and provide an open-source reference implementation compatible with PyTorch.

Submitted 14 December, 2021; originally announced December 2021.

Comments: Code and documentation are available at https://github.com/JHUAPL/scatterbrained

Showing 1–5 of 5 results for author: Matelsky, J K