
Showing 1–14 of 14 results for author: Huth, A

Searching in archive cs.
  1. arXiv:2405.16714  [pdf, other]

    cs.CL cs.AI cs.LG q-bio.NC

    Crafting Interpretable Embeddings by Asking LLMs Questions

    Authors: Vinamra Benara, Chandan Singh, John X. Morris, Richard Antonello, Ion Stoica, Alexander G. Huth, Jianfeng Gao

    Abstract: Large language models (LLMs) have rapidly improved text embeddings for a growing array of natural-language processing tasks. However, their opaqueness and proliferation into scientific domains such as neuroscience have created a growing need for interpretability. Here, we ask whether we can obtain interpretable embeddings through LLM prompting. We introduce question-answering embeddings (QA-Emb),…

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2405.14055  [pdf, other]

    cs.CL cs.AI cs.ET

    How Many Bytes Can You Take Out Of Brain-To-Text Decoding?

    Authors: Richard Antonello, Nihita Sarma, Jerry Tang, Jiaru Song, Alexander Huth

    Abstract: Brain-computer interfaces have promising medical and scientific applications for aiding speech and studying the brain. In this work, we propose an information-based evaluation metric for brain-to-text decoders. Using this metric, we examine two methods to augment existing state-of-the-art continuous text decoders. We show that these methods, in concert, can improve brain decoding performance by up…

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2310.06408  [pdf, other]

    cs.CL

    Humans and language models diverge when predicting repeating text

    Authors: Aditya R. Vaidya, Javier Turek, Alexander G. Huth

    Abstract: Language models that are trained on the next-word prediction task have been shown to accurately model human behavior in word prediction and reading speed. In contrast with these findings, we present a scenario in which the performance of humans and LMs diverges. We collected a dataset of human next-word predictions for five stimuli that are formed by repeating spans of text. Human and GPT-2 LM pre…

    Submitted 22 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: To appear in the 26th Conference on Computational Natural Language Learning (CoNLL 2023). Code and data are available at https://github.com/HuthLab/lm-repeating-text

  4. arXiv:2305.12248  [pdf, other]

    cs.CL cs.CV

    Brain encoding models based on multimodal transformers can transfer across language and vision

    Authors: Jerry Tang, Meng Du, Vy A. Vo, Vasudev Lal, Alexander G. Huth

    Abstract: Encoding models have been used to assess how the human brain represents concepts in language and vision. While language and vision rely on similar concept representations, current encoding models are typically trained and tested on brain responses to each modality in isolation. Recent advances in multimodal pretraining have produced transformers that can extract aligned representations of concepts…

    Submitted 20 May, 2023; originally announced May 2023.

  5. arXiv:2305.11863  [pdf, other]

    cs.CL cs.AI

    Scaling laws for language encoding models in fMRI

    Authors: Richard Antonello, Aditya Vaidya, Alexander G. Huth

    Abstract: Representations from transformer-based unidirectional language models are known to be effective at predicting brain responses to natural language. However, most studies comparing language models to brains have used GPT-2 or similarly sized language models. Here we tested whether larger open-source models such as those from the OPT and LLaMA families are better at predicting brain responses recorde…

    Submitted 30 January, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS 2023). Please cite NeurIPS version

  6. arXiv:2305.09863  [pdf, other]

    cs.AI cs.CL cs.LG q-bio.NC

    Explaining black box text modules in natural language with language models

    Authors: Chandan Singh, Aliyah R. Hsu, Richard Antonello, Shailee Jain, Alexander G. Huth, Bin Yu, Jianfeng Gao

    Abstract: Large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their rapid proliferation and increasing opaqueness have created a growing need for interpretability. Here, we ask whether we can automatically obtain natural language explanations for black box text modules. A "text module" is any function that maps text to a scalar continuous v…

    Submitted 15 November, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  7. arXiv:2205.14252  [pdf, other]

    cs.CL

    Self-supervised models of audio effectively explain human cortical responses to speech

    Authors: Aditya R. Vaidya, Shailee Jain, Alexander G. Huth

    Abstract: Self-supervised language models are very effective at predicting high-level cortical responses during language comprehension. However, the best current models of lower-level auditory processing in the human brain rely on either hand-constructed acoustic filters or representations from supervised audio neural networks. In this work, we capitalize on the progress of self-supervised speech representa…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2022

  8. arXiv:2106.05426  [pdf, other]

    cs.CL cs.LG

    Low-Dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

    Authors: Richard Antonello, Javier Turek, Vy Vo, Alexander Huth

    Abstract: How related are the representations learned by neural language models, translation models, and language tagging tasks? We answer this question by adapting an encoder-decoder transfer learning method from computer vision to investigate the structure among 100 different feature spaces extracted from hidden representations of various networks trained on language tasks. This method reveals a low-dimen…

    Submitted 12 January, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted to the Advances in Neural Information Processing Systems 34 (2021)

  9. arXiv:2105.08196  [pdf, other]

    cs.CV

    Physically Plausible Pose Refinement using Fully Differentiable Forces

    Authors: Akarsh Kumar, Aditya R. Vaidya, Alexander G. Huth

    Abstract: All hand-object interaction is controlled by forces that the two bodies exert on each other, but little work has been done in modeling these underlying forces when doing pose and contact estimation from RGB/RGB-D data. Given the pose of the hand and object from any pose estimation system, we propose an end-to-end differentiable model that refines pose estimates by learning the forces experienced b…

    Submitted 25 August, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted to the EPIC@CVPR2021 workshop. Figure 2 replaced with a rasterized image

  10. arXiv:2009.12727  [pdf, other]

    cs.CL cs.LG

    Multi-timescale Representation Learning in LSTM Language Models

    Authors: Shivangi Mahto, Vy A. Vo, Javier S. Turek, Alexander G. Huth

    Abstract: Language models must capture statistical dependencies between words at timescales ranging from very short to very long. Earlier work has demonstrated that dependencies in natural language tend to decay with distance between words according to a power law. However, it is unclear how this knowledge can be used for analyzing or designing neural network language models. In this work, we derived a theo…

    Submitted 17 March, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    MSC Class: 91F20

    ACM Class: I.2.7; I.2.6

    Journal ref: International Conference on Learning Representations 2021

  11. arXiv:2005.00175  [pdf, other]

    cs.CL

    Selecting Informative Contexts Improves Language Model Finetuning

    Authors: Richard Antonello, Nicole Beckage, Javier Turek, Alexander Huth

    Abstract: Language model fine-tuning is essential for modern natural language processing, but is computationally expensive and time-consuming. Further, the effectiveness of fine-tuning is limited by the inclusion of training examples that negatively affect performance. Here we present a general fine-tuning method that we call information gain filtration for improving the overall training efficiency and fina…

    Submitted 19 May, 2022; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: Accepted submission at the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing

  12. arXiv:1909.00021  [pdf, ps, other]

    cs.LG cs.CL cs.NE stat.ML

    Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network

    Authors: Javier S. Turek, Shailee Jain, Vy Vo, Mihai Capota, Alexander G. Huth, Theodore L. Willke

    Abstract: Recent work has shown that topological enhancements to recurrent neural networks (RNNs) can increase their expressiveness and representational capacity. Two popular enhancements are stacked RNNs, which increases the capacity for learning non-linear functions, and bidirectional processing, which exploits acausal information in a sequence. In this work, we explore the delayed-RNN, which is a single-…

    Submitted 18 June, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: to be published in Proceedings of International Conference on Machine Learning 2020 (ICML)

    MSC Class: 62M45

    ACM Class: I.2.6; I.5.1

  13. arXiv:1808.02084  [pdf, other]

    cs.CV

    Deep Generative Modeling for Scene Synthesis via Hybrid Representations

    Authors: Zaiwei Zhang, Zhenpei Yang, Chongyang Ma, Linjie Luo, Alexander Huth, Etienne Vouga, Qixing Huang

    Abstract: We present a deep generative scene modeling technique for indoor environments. Our goal is to train a generative model using a feed-forward neural network that maps a prior distribution (e.g., a normal distribution) to the distribution of primary objects in indoor scenes. We introduce a 3D object arrangement representation that models the locations and orientations of objects, based on their size…

    Submitted 6 August, 2018; originally announced August 2018.

  14. arXiv:1705.10887  [pdf, other]

    stat.ML cs.CV cs.LG math.NA

    Efficient, sparse representation of manifold distance matrices for classical scaling

    Authors: Javier S. Turek, Alexander Huth

    Abstract: Geodesic distance matrices can reveal shape properties that are largely invariant to non-rigid deformations, and thus are often used to analyze and represent 3-D shapes. However, these matrices grow quadratically with the number of points. Thus for large point sets it is common to use a low-rank approximation to the distance matrix, which fits in memory and can be efficiently analyzed using method…

    Submitted 29 March, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: Conference CVPR 2018

    MSC Class: 65D05; 68T99; 65F50; 68T45

    ACM Class: I.2.10; G.1; I.4