
Showing 1–19 of 19 results for author: Jäger, L

  1. arXiv:2406.04988  [pdf, other]

    cs.CL

    Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences

    Authors: Patrick Haller, Lena S. Bolliger, Lena A. Jäger

    Abstract: To date, most investigations on surprisal and entropy effects in reading have been conducted on the group level, disregarding individual differences. In this work, we revisit the predictive power of surprisal and entropy measures estimated from a range of language models (LMs) on data of human reading times as a measure of processing effort by incorporating information of language users' cognitive…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024

  2. Reporting Eye-Tracking Data Quality: Towards a New Standard

    Authors: Deborah N. Jakobi, Daniel G. Krakowczyk, Lena A. Jäger

    Abstract: Eye-tracking datasets are often shared in the format used by their creators for their original analyses, usually resulting in the exclusion of data considered irrelevant to the primary purpose. In order to increase re-usability of existing eye-tracking datasets for more diverse and initially not considered use cases, this work advocates a new approach of sharing eye-tracking data. Instead of publi…

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: Proceedings of the 2024 Symposium on Eye Tracking Research and Applications (ETRA '24) Article 47 1-3

  3. arXiv:2403.00506  [pdf, ps, other]

    cs.CL

    PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus

    Authors: Deborah N. Jakobi, Thomas Kern, David R. Reich, Patrick Haller, Lena A. Jäger

    Abstract: The Potsdam Textbook Corpus (PoTeC) is a naturalistic eye-tracking-while-reading corpus containing data from 75 participants reading 12 scientific texts. PoTeC is the first naturalistic eye-tracking-while-reading corpus that contains eye-movements from domain-experts as well as novices in a within-participant manipulation: It is based on a 2x2x2 fully-crossed factorial design which includes the pa…

    Submitted 1 March, 2024; originally announced March 2024.

  4. arXiv:2310.15587  [pdf, other]

    cs.CL

    ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts

    Authors: Lena S. Bolliger, David R. Reich, Patrick Haller, Deborah N. Jakobi, Paul Prasse, Lena A. Jäger

    Abstract: Eye movements in reading play a crucial role in psycholinguistic research studying the cognitive mechanisms underlying human language processing. More recently, the tight coupling between eye movements and cognition has also been leveraged for language-related machine learning tasks such as the interpretability, enhancement, and pre-training of language models, as well as the inference of reader-…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  5. arXiv:2310.14676  [pdf, other]

    cs.CL

    Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding

    Authors: Shuwen Deng, Paul Prasse, David R. Reich, Tobias Scheffer, Lena A. Jäger

    Abstract: Human gaze data offer cognitive information that reflects natural language comprehension. Indeed, augmenting language models with human scanpaths has proven beneficial for a range of NLP tasks, including language understanding. However, the applicability of this approach is hampered because the abundance of text corpora is contrasted by a scarcity of gaze data. Although models for the generation o…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Pre-print for EMNLP 2023

  6. Bridging the Gap: Gaze Events as Interpretable Concepts to Explain Deep Neural Sequence Models

    Authors: Daniel G. Krakowczyk, Paul Prasse, David R. Reich, Sebastian Lapuschkin, Tobias Scheffer, Lena A. Jäger

    Abstract: Recent work in XAI for eye tracking data has evaluated the suitability of feature attribution methods to explain the output of deep neural sequence models for the task of oculomotric biometric identification. These methods provide saliency maps to highlight important input features of a specific eye gaze sequence. However, to date, its localization analysis has been lacking a quantitative approach…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Preprint for ETRA '23: 2023 Symposium on Eye Tracking Research and Applications

  7. arXiv:2304.10784  [pdf, other]

    cs.CL cs.HC cs.LG

    Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading

    Authors: Shuwen Deng, David R. Reich, Paul Prasse, Patrick Haller, Tobias Scheffer, Lena A. Jäger

    Abstract: Eye movements during reading offer insights into both the reader's cognitive processes and the characteristics of the text that is being read. Hence, the analysis of scanpaths in reading has attracted increasing attention across fields, ranging from cognitive science over linguistics to computer science. In particular, eye-tracking-while-reading data has been argued to bear the potential to make…

    Submitted 18 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  8. pymovements: A Python Package for Eye Movement Data Processing

    Authors: Daniel G. Krakowczyk, David R. Reich, Jakob Chwastek, Deborah N. Jakobi, Paul Prasse, Assunta Süss, Oleksii Turuta, Paweł Kasprowski, Lena A. Jäger

    Abstract: We introduce pymovements: a Python package for analyzing eye-tracking data that follows best practices in software development, including rigorous testing and adherence to coding standards. The package provides functionality for key processes along the entire preprocessing pipeline. This includes parsing of eye tracker data files, transforming positional data into velocity data, detecting gaze eve…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: Preprint for ETRA '23: 2023 Symposium on Eye Tracking Research and Applications

    MSC Class: 91; ACM Class: J.4

  9. arXiv:2210.09819  [pdf, other]

    cs.CL cs.LG

    Eye-tracking based classification of Mandarin Chinese readers with and without dyslexia using neural sequence models

    Authors: Patrick Haller, Andreas Säuberli, Sarah Elisabeth Kiener, Jinger Pan, Ming Yan, Lena Jäger

    Abstract: Eye movements are known to reflect cognitive processes in reading, and psychological reading research has shown that eye gaze patterns differ between readers with and without dyslexia. In recent years, researchers have attempted to classify readers with dyslexia based on their eye movements using Support Vector Machines (SVMs). However, these approaches (i) are based on highly aggregated features…

    Submitted 2 December, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

  10. arXiv:2207.01377  [pdf, other]

    cs.CV

    Detection of ADHD based on Eye Movements during Natural Viewing

    Authors: Shuwen Deng, Paul Prasse, David R. Reich, Sabine Dziemian, Maja Stegenwallner-Schütz, Daniel Krakowczyk, Silvia Makowski, Nicolas Langer, Tobias Scheffer, Lena A. Jäger

    Abstract: Attention-deficit/hyperactivity disorder (ADHD) is a neurodevelopmental disorder that is highly prevalent and requires clinical specialists to diagnose. It is known that an individual's viewing behavior, reflected in their eye movements, is directly related to attentional mechanisms and higher-order cognitive processes. We therefore explore whether ADHD can be detected based on recorded eye moveme…

    Submitted 14 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Pre-print for Proceedings of the European Conference on Machine Learning, 2022

  11. arXiv:2112.06310  [pdf, other]

    cs.CL

    Reading Task Classification Using EEG and Eye-Tracking Data

    Authors: Nora Hollenstein, Marius Tröndle, Martyna Plomecka, Samuel Kiegeland, Yilmazcan Özyurt, Lena A. Jäger, Nicolas Langer

    Abstract: The Zurich Cognitive Language Processing Corpus (ZuCo) provides eye-tracking and EEG signals from two reading paradigms, normal reading and task-specific reading. We analyze whether machine learning methods are able to classify these two tasks using eye-tracking and EEG features. We implement models with aggregated sentence-level features as well as fine-grained word-level features. We test the mo…

    Submitted 12 December, 2021; originally announced December 2021.

  12. arXiv:2109.11635  [pdf, other]

    cs.CL

    Revisiting the Uniform Information Density Hypothesis

    Authors: Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy

    Abstract: The uniform information density (UID) hypothesis posits a preference among language users for utterances structured such that information is distributed uniformly across a signal. While its implications on language production have been well explored, the hypothesis potentially makes predictions about language comprehension and linguistic acceptability as well. Further, it is unclear how uniformity…

    Submitted 23 September, 2021; originally announced September 2021.

    Journal ref: Proceedings of EMNLP 2021

  13. arXiv:2104.12555  [pdf, other]

    cs.CY stat.AP

    Linking open-source code commits and MOOC grades to evaluate massive online open peer review

    Authors: Siruo Wang, Leah R. Jager, Kai Kammers, Aboozar Hadavand, Jeffrey T. Leek

    Abstract: Massive Open Online Courses (MOOCs) have been used by students as a low-cost and low-touch educational credential in a variety of fields. Understanding the grading mechanisms behind these course assignments is important for evaluating MOOC credentials. A common approach to grading free-response assignments is massive scale peer-review, especially used for assignments that are not easy to grade pro…

    Submitted 15 April, 2021; originally announced April 2021.

  14. arXiv:2104.05433  [pdf, other]

    cs.CL

    Multilingual Language Models Predict Human Reading Behavior

    Authors: Nora Hollenstein, Federico Pirovano, Ce Zhang, Lena Jäger, Lisa Beinborn

    Abstract: We analyze if large language models are able to predict patterns of human reading behavior. We compare the performance of language-specific and multilingual pretrained transformer models to predict reading time measures reflecting natural human sentence processing on Dutch, English, German, and Russian texts. This results in accurate models of human reading behavior, which indicates that transform…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: accepted at NAACL 2021

  15. arXiv:2103.09673  [pdf, other]

    astro-ph.IM astro-ph.EP cs.LG

    Automatic detection of impact craters on Al foils from the Stardust interstellar dust collector using convolutional neural networks

    Authors: Logan Jaeger, Anna L. Butterworth, Zack Gainsforth, Robert Lettieri, Augusto Ardizzone, Michael Capraro, Mark Burchell, Penny Wozniakiewicz, Ryan C. Ogliore, Bradley T. De Gregorio, Rhonda M. Stroud, Andrew J. Westphal

    Abstract: NASA's Stardust mission utilized a sample collector composed of aerogel and aluminum foil to return cometary and interstellar particles to Earth. Analysis of the aluminum foil begins with locating craters produced by hypervelocity impacts of cometary and interstellar dust. Interstellar dust craters are typically less than one micrometer in size and are sparsely distributed, making them difficult t…

    Submitted 15 March, 2021; originally announced March 2021.

  16. arXiv:2003.11399  [pdf, other]

    cs.LG stat.ML

    Discriminative Viewer Identification using Generative Models of Eye Gaze

    Authors: Silvia Makowski, Lena A. Jäger, Lisa Schwetlick, Hans Trukenbrod, Ralf Engbert, Tobias Scheffer

    Abstract: We study the problem of identifying viewers of arbitrary images based on their eye gaze. Psychological research has derived generative stochastic models of eye movements. In order to exploit this background knowledge within a discriminatively trained classification model, we derive Fisher kernels from different generative models of eye gaze. Experimentally, we find that the performance of the clas…

    Submitted 25 March, 2020; originally announced March 2020.

  17. arXiv:1906.11889  [pdf, other]

    cs.CV cs.CL cs.HC cs.LG stat.ML

    Deep Eyedentification: Biometric Identification using Micro-Movements of the Eye

    Authors: Lena A. Jäger, Silvia Makowski, Paul Prasse, Sascha Liehr, Maximilian Seidler, Tobias Scheffer

    Abstract: We study involuntary micro-movements of the eye for biometric identification. While prior studies extract lower-frequency macro-movements from the output of video-based eye-tracking systems and engineer explicit features of these macro-movements, we develop a deep convolutional architecture that processes the raw eye-tracking signal. Compared to prior work, the network attains a lower error rate b…

    Submitted 5 May, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Journal ref: In: U. Brefeld et al. (Eds.): Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2019, LNCS 11907, pp. 299-314, Springer Nature, Switzerland, 2020

  18. arXiv:1809.08031  [pdf, other]

    cs.LG stat.ML

    A Discriminative Model for Identifying Readers and Assessing Text Comprehension from Eye Movements

    Authors: Silvia Makowski, Lena Jäger, Ahmed Abdelwahab, Niels Landwehr, Tobias Scheffer

    Abstract: We study the problem of inferring readers' identities and estimating their level of text comprehension from observations of their eye movements during reading. We develop a generative model of individual gaze patterns (scanpaths) that makes use of lexical features of the fixated words. Using this generative model, we derive a Fisher-score representation of eye-movement sequences. We study whether…

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: Proceedings of the European Conference on Machine Learning, 2018

  19. arXiv:1703.04081  [pdf, other]

    stat.ML cs.CL stat.AP

    Feature overwriting as a finite mixture process: Evidence from comprehension data

    Authors: Shravan Vasishth, Lena A. Jäger, Bruno Nicenboim

    Abstract: The ungrammatical sentence "The key to the cabinets are on the table" is known to lead to an illusion of grammaticality. As discussed in the meta-analysis by Jaeger et al., 2017, faster reading times are observed at the verb are in the agreement-attraction sentence above compared to the equally ungrammatical sentence "The key to the cabinet are on the table". One explanation for this facilitation…

    Submitted 20 January, 2018; v1 submitted 12 March, 2017; originally announced March 2017.

    Comments: 6 pages, 2 figures, 1 table, submitted to MathPsych/ICCM 2017, Warwick, UK