Showing 1–10 of 10 results for author: Sevastjanova, R

  1. arXiv:2405.08468  [pdf, other]

    cs.CL cs.AI

    Challenges and Opportunities in Text Generation Explainability

    Authors: Kenza Amara, Rita Sevastjanova, Mennatallah El-Assady

    Abstract: The necessity for interpretability in natural language processing (NLP) has risen alongside the growing prominence of large language models. Among the myriad tasks within NLP, text generation stands out as a primary objective of autoregressive models. The NLP community has begun to take a keen interest in gaining a deeper understanding of text generation, leading to the development of model-agnost…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures, xAI-2024 Conference, Main track

  2. arXiv:2403.07627  [pdf, other]

    cs.HC cs.LG

    generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation

    Authors: Thilo Spinner, Rebecca Kehlbeck, Rita Sevastjanova, Tobias Stähle, Daniel A. Keim, Oliver Deussen, Mennatallah El-Assady

    Abstract: Large language models (LLMs) are widely deployed in various downstream tasks, e.g., auto-completion, aided writing, or chat-based text generation. However, the considered output candidates of the underlying search algorithm are under-explored and under-explained. We tackle this shortcoming by proposing a tree-in-the-loop approach, where a visual representation of the beam search tree is the centra…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 24 pages paper, 4 pages references, 3 pages appendix, 8 figures

    ACM Class: I.2.7; H.5.2

  3. arXiv:2402.09259  [pdf, other]

    cs.CL cs.AI

    SyntaxShap: Syntax-aware Explainability Method for Text Generation

    Authors: Kenza Amara, Rita Sevastjanova, Mennatallah El-Assady

    Abstract: To harness the power of large language models in safety-critical domains, we need to ensure the explainability of their predictions. However, despite the significant attention to model interpretability, there remains an unexplored domain in explaining sequence-to-sequence tasks using methods tailored for textual data. This paper introduces SyntaxShap, a local, model-agnostic explainability method…

    Submitted 3 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  4. arXiv:2310.11252  [pdf, other]

    cs.CL cs.AI cs.HC

    Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges

    Authors: Thilo Spinner, Rebecca Kehlbeck, Rita Sevastjanova, Tobias Stähle, Daniel A. Keim, Oliver Deussen, Andreas Spitz, Mennatallah El-Assady

    Abstract: The growing popularity of generative language models has amplified interest in interactive methods to guide model outputs. Prompt refinement is considered one of the most effective means to influence output among these methods. We identify several challenges associated with prompting large language models, categorized into data- and model-specific, linguistic, and socio-linguistic challenges. A co…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 9 pages paper, 2 pages references, 7 figures

    ACM Class: H.5.2; I.2.7

  5. arXiv:2209.07836  [pdf, other]

    cs.CL cs.AI

    Negation, Coordination, and Quantifiers in Contextualized Language Models

    Authors: Aikaterini-Lida Kalouli, Rita Sevastjanova, Christin Beck, Maribel Romero

    Abstract: With the success of contextualized language models, much research explores what these models really learn and in which cases they still fail. Most of this work focuses on specific NLP tasks and on the learning outcome. Little research has attempted to decouple the models' weaknesses from specific tasks and focus on the embeddings per se and their mode of learning. In this paper, we take up this re…

    Submitted 16 September, 2022; originally announced September 2022.

  6. arXiv:2208.08176  [pdf, other]

    cs.AI

    Visual Comparison of Language Model Adaptation

    Authors: Rita Sevastjanova, Eren Cakmak, Shauli Ravfogel, Ryan Cotterell, Mennatallah El-Assady

    Abstract: Neural language models are widely used; however, their model parameters often need to be adapted to the specific domains and tasks of an application, which is time- and resource-consuming. Thus, adapters have recently been introduced as a lightweight alternative for model adaptation. They consist of a small set of task-specific parameters with a reduced training time and simple parameter compositi…

    Submitted 17 August, 2022; originally announced August 2022.

  7. arXiv:2207.06897  [pdf, other]

    cs.CL

    Beware the Rationalization Trap! When Language Model Explainability Diverges from our Mental Models of Language

    Authors: Rita Sevastjanova, Mennatallah El-Assady

    Abstract: Language models learn and represent language differently than humans; they learn the form and not the meaning. Thus, to assess the success of language model explainability, we need to consider the impact of its divergence from a user's mental model of language. In this position paper, we argue that in order to avoid harmful rationalization and achieve truthful understanding of language models, exp…

    Submitted 14 July, 2022; originally announced July 2022.

  8. CommAID: Visual Analytics for Communication Analysis through Interactive Dynamics Modeling

    Authors: Maximilian T. Fischer, Daniel Seebacher, Rita Sevastjanova, Daniel A. Keim, Mennatallah El-Assady

    Abstract: Communication consists of both meta-information as well as content. Currently, the automated analysis of such data often focuses either on the network aspects via social network analysis or on the content, utilizing methods from text-mining. However, the first category of approaches does not leverage the rich content information, while the latter ignores the conversation environment and the tempor…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: 12 pages, 7 figures, Computer Graphics Forum 2021 (pre-peer reviewed version)

    Journal ref: Computer Graphics Forum, 40(3), 2021

  9. Visual Analytics of Conversational Dynamics

    Authors: Daniel Seebacher, Maximilian T. Fischer, Rita Sevastjanova, Daniel A. Keim, Mennatallah El-Assady

    Abstract: Large-scale interaction networks of human communication are often modeled as complex graph structures, obscuring temporal patterns within individual conversations. To facilitate the understanding of such conversational dynamics, episodes with low or high communication activity as well as breaks in communication need to be detected to enable the identification of temporal interaction patterns. Trad…

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: 5 pages, 3 figures, EuroVis Workshop on Visual Analytics (EuroVA), 2019

    ACM Class: H.5.2; H.1.2

    Journal ref: EuroVis Workshop on Visual Analytics (EuroVA), 2019

  10. arXiv:1907.12413  [pdf, other]

    cs.CL cs.HC

    VIANA: Visual Interactive Annotation of Argumentation

    Authors: Fabian Sperrle, Rita Sevastjanova, Rebecca Kehlbeck, Mennatallah El-Assady

    Abstract: Argumentation Mining addresses the challenging tasks of identifying boundaries of argumentative text fragments and extracting their relationships. Fully automated solutions do not reach satisfactory accuracy due to their insufficient incorporation of semantics and domain knowledge. Therefore, experts currently rely on time-consuming manual annotations. In this paper, we present a visual analytics…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: Proceedings of IEEE Conference on Visual Analytics Science and Technology (VAST), 2019

    Journal ref: 2019 IEEE Conference on Visual Analytics Science and Technology (VAST)