Showing 1–9 of 9 results for author: Huth, A G

Searching in archive cs.
  1. arXiv:2405.16714  [pdf, other]

    cs.CL cs.AI cs.LG q-bio.NC

    Crafting Interpretable Embeddings by Asking LLMs Questions

    Authors: Vinamra Benara, Chandan Singh, John X. Morris, Richard Antonello, Ion Stoica, Alexander G. Huth, Jianfeng Gao

    Abstract: Large language models (LLMs) have rapidly improved text embeddings for a growing array of natural-language processing tasks. However, their opaqueness and proliferation into scientific domains such as neuroscience have created a growing need for interpretability. Here, we ask whether we can obtain interpretable embeddings through LLM prompting. We introduce question-answering embeddings (QA-Emb),…

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2310.06408  [pdf, other]

    cs.CL

    Humans and language models diverge when predicting repeating text

    Authors: Aditya R. Vaidya, Javier Turek, Alexander G. Huth

    Abstract: Language models that are trained on the next-word prediction task have been shown to accurately model human behavior in word prediction and reading speed. In contrast with these findings, we present a scenario in which the performance of humans and LMs diverges. We collected a dataset of human next-word predictions for five stimuli that are formed by repeating spans of text. Human and GPT-2 LM pre…

    Submitted 22 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: To appear in the 26th Conference on Computational Natural Language Learning (CoNLL 2023). Code and data are available at https://github.com/HuthLab/lm-repeating-text

  3. arXiv:2305.12248  [pdf, other]

    cs.CL cs.CV

    Brain encoding models based on multimodal transformers can transfer across language and vision

    Authors: Jerry Tang, Meng Du, Vy A. Vo, Vasudev Lal, Alexander G. Huth

    Abstract: Encoding models have been used to assess how the human brain represents concepts in language and vision. While language and vision rely on similar concept representations, current encoding models are typically trained and tested on brain responses to each modality in isolation. Recent advances in multimodal pretraining have produced transformers that can extract aligned representations of concepts…

    Submitted 20 May, 2023; originally announced May 2023.

  4. arXiv:2305.11863  [pdf, other]

    cs.CL cs.AI

    Scaling laws for language encoding models in fMRI

    Authors: Richard Antonello, Aditya Vaidya, Alexander G. Huth

    Abstract: Representations from transformer-based unidirectional language models are known to be effective at predicting brain responses to natural language. However, most studies comparing language models to brains have used GPT-2 or similarly sized language models. Here we tested whether larger open-source models such as those from the OPT and LLaMA families are better at predicting brain responses recorde…

    Submitted 30 January, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS 2023). Please cite NeurIPS version

  5. arXiv:2305.09863  [pdf, other]

    cs.AI cs.CL cs.LG q-bio.NC

    Explaining black box text modules in natural language with language models

    Authors: Chandan Singh, Aliyah R. Hsu, Richard Antonello, Shailee Jain, Alexander G. Huth, Bin Yu, Jianfeng Gao

    Abstract: Large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their rapid proliferation and increasing opaqueness have created a growing need for interpretability. Here, we ask whether we can automatically obtain natural language explanations for black box text modules. A "text module" is any function that maps text to a scalar continuous v…

    Submitted 15 November, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  6. arXiv:2205.14252  [pdf, other]

    cs.CL

    Self-supervised models of audio effectively explain human cortical responses to speech

    Authors: Aditya R. Vaidya, Shailee Jain, Alexander G. Huth

    Abstract: Self-supervised language models are very effective at predicting high-level cortical responses during language comprehension. However, the best current models of lower-level auditory processing in the human brain rely on either hand-constructed acoustic filters or representations from supervised audio neural networks. In this work, we capitalize on the progress of self-supervised speech representa…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2022

  7. arXiv:2105.08196  [pdf, other]

    cs.CV

    Physically Plausible Pose Refinement using Fully Differentiable Forces

    Authors: Akarsh Kumar, Aditya R. Vaidya, Alexander G. Huth

    Abstract: All hand-object interaction is controlled by forces that the two bodies exert on each other, but little work has been done in modeling these underlying forces when doing pose and contact estimation from RGB/RGB-D data. Given the pose of the hand and object from any pose estimation system, we propose an end-to-end differentiable model that refines pose estimates by learning the forces experienced b…

    Submitted 25 August, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted to the EPIC@CVPR2021 workshop. Figure 2 replaced with a rasterized image

  8. arXiv:2009.12727  [pdf, other]

    cs.CL cs.LG

    Multi-timescale Representation Learning in LSTM Language Models

    Authors: Shivangi Mahto, Vy A. Vo, Javier S. Turek, Alexander G. Huth

    Abstract: Language models must capture statistical dependencies between words at timescales ranging from very short to very long. Earlier work has demonstrated that dependencies in natural language tend to decay with distance between words according to a power law. However, it is unclear how this knowledge can be used for analyzing or designing neural network language models. In this work, we derived a theo…

    Submitted 17 March, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    MSC Class: 91F20 ACM Class: I.2.7; I.2.6

    Journal ref: International Conference on Learning Representations 2021

  9. arXiv:1909.00021  [pdf, ps, other]

    cs.LG cs.CL cs.NE stat.ML

    Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network

    Authors: Javier S. Turek, Shailee Jain, Vy Vo, Mihai Capota, Alexander G. Huth, Theodore L. Willke

    Abstract: Recent work has shown that topological enhancements to recurrent neural networks (RNNs) can increase their expressiveness and representational capacity. Two popular enhancements are stacked RNNs, which increase the capacity for learning non-linear functions, and bidirectional processing, which exploits acausal information in a sequence. In this work, we explore the delayed-RNN, which is a single-…

    Submitted 18 June, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: to be published in Proceedings of International Conference on Machine Learning 2020 (ICML)

    MSC Class: 62M45 ACM Class: I.2.6; I.5.1