
Showing 1–28 of 28 results for author: Padó, S

  1. arXiv:2403.12666

    cs.CL

    Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean

    Authors: Dojun Park, Sebastian Padó

    Abstract: Almost all frameworks for the manual or automatic evaluation of machine translation characterize the quality of an MT output with a single number. An exception is the Multidimensional Quality Metrics (MQM) framework which offers a fine-grained ontology of quality dimensions for scoring (such as style, fluency, accuracy, and terminology). Previous studies have demonstrated the feasibility of MQM an…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, accepted at LREC-COLING 2024

  2. arXiv:2403.11834

    cs.CL cs.LG

    Towards Understanding the Relationship between In-context Learning and Compositional Generalization

    Authors: Sungjun Han, Sebastian Padó

    Abstract: According to the principle of compositional generalization, the meaning of a complex expression can be understood as a function of the meaning of its parts and of how they are combined. This principle is crucial for human language processing and also, arguably, for NLP models in the face of out-of-distribution data. However, many neural network models, including Transformers, have been shown to st…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: To be published in LREC-COLING 2024

  3. arXiv:2402.17649

    cs.CL cs.CY

    Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs

    Authors: Tanise Ceron, Neele Falk, Ana Barić, Dmitry Nikolaev, Sebastian Padó

    Abstract: Due to the widespread use of large language models (LLMs) in ubiquitous systems, we need to understand whether they embed a specific worldview and what these views reflect. Recent studies report that, prompted with political questionnaires, LLMs show left-liberal leanings (Feng et al., 2023; Motoki et al., 2024). However, it is as yet unclear whether these leanings are reliable (robust to prompt v…

    Submitted 3 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 12 pages, under review

  4. arXiv:2402.02883

    cs.CL cs.LG

    Approximate Attributions for Off-the-Shelf Siamese Transformers

    Authors: Lucas Möller, Dmitry Nikolaev, Sebastian Padó

    Abstract: Siamese encoders such as sentence transformers are among the least understood deep models. Established attribution methods cannot tackle this model class since it compares two inputs rather than processing a single one. To address this gap, we have recently proposed an attribution method specifically for Siamese encoders (Möller et al., 2023). However, it requires models to be adjusted and fine-tu…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted for EACL 2024, St. Julian's, Malta

  5. arXiv:2402.00620

    cs.CL

    Actor Identification in Discourse: A Challenge for LLMs?

    Authors: Ana Barić, Sean Papay, Sebastian Padó

    Abstract: The identification of political actors who put forward claims in public debate is a crucial step in the construction of discourse networks, which are helpful to analyze societal debates. Actor identification is, however, rather challenging: Often, the locally mentioned speaker of a claim is only a pronoun ("He proposed that [claim]"), so recovering the canonical actor name requires discourse under…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the EACL 2024 workshop on Computational Models of Discourse (St. Julian's, Malta)

  6. arXiv:2310.12575

    cs.CL

    Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers

    Authors: Dmitry Nikolaev, Tanise Ceron, Sebastian Padó

    Abstract: Scaling analysis is a technique in computational political science that assigns a political actor (e.g. politician or party) a score on a predefined scale based on a (typically long) body of text (e.g. a parliamentary speech or an election manifesto). For example, political scientists have often used the left–right scale to systematically analyse political landscapes of different countries. NLP m…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  7. arXiv:2310.11923

    cs.CL

    Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: The question of what kinds of linguistic information are encoded in different layers of Transformer-based language models is of considerable interest for the NLP community. Existing work, however, has overwhelmingly focused on word-level representations and encoder-only language models with the masked-token training objective. In this paper, we present experiments with semantic structural probing,…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted to BlackboxNLP 2023

  8. arXiv:2310.09256

    cs.CL

    Political claim identification and categorization in a multilingual setting: First experiments

    Authors: Urs Zaberer, Sebastian Padó, Gabriella Lapesa

    Abstract: The identification and classification of political claims is an important step in the analysis of political newspaper reports; however, resources for this task are few and far between. This paper explores different strategies for the cross-lingual projection of political claims analysis. We conduct experiments on a German dataset, DebateNet2.0, covering the policy debate sparked by the 2015 refuge…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Presented at KONVENS 2023, Ingolstadt, Germany

  9. arXiv:2310.05703

    cs.CL cs.AI cs.LG

    An Attribution Method for Siamese Encoders

    Authors: Lucas Möller, Dmitry Nikolaev, Sebastian Padó

    Abstract: Despite the success of Siamese encoder models such as sentence transformers (ST), little is known about the aspects of inputs they pay attention to. A barrier is that their predictions cannot be attributed to individual features, as they compare two inputs rather than processing a single one. This paper derives a local attribution method for Siamese encoders by generalizing the principle of integr…

    Submitted 29 November, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP'23

  10. arXiv:2305.19650

    cs.CL

    Adverbs, Surprisingly

    Authors: Dmitry Nikolaev, Collin F. Baker, Miriam R. L. Petruck, Sebastian Padó

    Abstract: This paper begins with the premise that adverbs are neglected in computational linguistics. This view derives from two analyses: a literature review and a novel adverb dataset to probe a state-of-the-art language model, thereby uncovering systematic gaps in accounts for adverb meaning. We suggest that using Frame Semantics for characterizing word meaning, as in FrameNet, provides a promising appro…

    Submitted 31 May, 2023; originally announced May 2023.

  11. arXiv:2305.10136

    cs.CL cs.CY

    Additive manifesto decomposition: A policy domain aware method for understanding party positioning

    Authors: Tanise Ceron, Dmitry Nikolaev, Sebastian Padó

    Abstract: Automatic extraction of party (dis)similarities from texts such as party election manifestos or parliamentary speeches plays an increasing role in computational political science. However, existing approaches are fundamentally limited to targeting only global party (dis)-similarity: they condense the relationship between a pair of parties into a single figure, their similarity. In aggregating over…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  12. arXiv:2301.13039

    cs.CL

    Representation biases in sentence transformers

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: Variants of the BERT architecture specialised for producing full-sentence representations often achieve better performance on downstream tasks than sentence embeddings extracted from vanilla BERT. However, there is still little understanding of what properties of inputs determine the properties of such representations. In this study, we construct several sets of sentences with pre-defined lexical…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to EACL 2023

  13. arXiv:2210.11989

    cs.CL

    Optimizing text representations to capture (dis)similarity between political parties

    Authors: Tanise Ceron, Nico Blokker, Sebastian Padó

    Abstract: Even though fine-tuned neural language models have been pivotal in enabling "deep" automatic text analysis, optimizing text representations for specific applications remains a crucial bottleneck. In this study, we look at this problem in the context of a task from computational social science, namely modeling pairwise similarities between political parties. Our research question is what level of s…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Conference on Computational Natural Language Learning 2022

  14. arXiv:2207.14704

    cs.IR cs.CL

    Understanding the Relation of User and News Representations in Content-Based Neural News Recommendation

    Authors: Lucas Möller, Sebastian Padó

    Abstract: A number of models for neural content-based news recommendation have been proposed. However, there is limited understanding of the relative importances of the three main components of such systems (news encoder, user encoder, and scoring function) and the trade-offs involved. In this paper, we assess the hypothesis that the most widely used means of matching user and candidate news representations…

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: This work was accepted and presented at the 10th INRA workshop at SIGIR'22

  15. arXiv:2205.11987

    cs.CL

    Word-order typology in Multilingual BERT: A case study in subordinate-clause detection

    Authors: Dmitry Nikolaev, Sebastian Padó

    Abstract: The capabilities and limitations of BERT and similar models are still unclear when it comes to learning syntactic abstractions, in particular across languages. In this paper, we use the task of subordinate-clause detection within and across languages to probe these properties. We show that this task is deceptively simple, with easy gains offset by a long tail of harder cases, and that BERT's zero-…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in the proceedings of SIGTYP workshop 2022

  16. arXiv:2201.08310

    cs.LG cs.PL cs.SE

    Meta Learning for Code Summarization

    Authors: Moiz Rauf, Sebastian Padó, Michael Pradel

    Abstract: Source code summarization is the task of generating a high-level natural language description for a segment of programming language code. Current neural models for the task differ in their architecture and the aspects of code they consider. In this paper, we show that three SOTA models for code summarization work well on largely disjoint subsets of a large code-base. This complementarity motivates…

    Submitted 20 January, 2022; originally announced January 2022.

  17. arXiv:2111.10142

    cs.CL

    Between welcome culture and border fence. A dataset on the European refugee crisis in German newspaper reports

    Authors: Nico Blokker, André Blessing, Erenay Dayanik, Jonas Kuhn, Sebastian Padó, Gabriella Lapesa

    Abstract: Newspaper reports provide a rich source of information on the unfolding of public debate on specific policy fields that can serve as basis for inquiry in political science. Such debates are often triggered by critical events, which attract public attention and incite the reactions of political actors: crisis sparks the debate. However, due to the challenges of reliable annotation and modeling, few…

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Submitted to Language Resources and Evaluation. This manuscript is an extended version of https://aclanthology.org/2020.lrec-1.115

  18. arXiv:2109.10255

    cs.CL

    Multi-Task Learning with Sentiment, Emotion, and Target Detection to Recognize Hate Speech and Offensive Language

    Authors: Flor Miriam Plaza-del-Arco, Sercan Halat, Sebastian Padó, Roman Klinger

    Abstract: The recognition of hate speech and offensive language (HOF) is commonly formulated as a classification task to decide if a text contains HOF. We investigate whether HOF detection can profit by taking into account the relationships between HOF and similar concepts: (a) HOF is related to sentiment analysis because hate speech is typically a negative statement and expresses a negative opinion; (b) it…

    Submitted 11 July, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: publication at FIRE 2021 as system description paper in the HASOC-FIRE shared task on hate speech and offensive language detection. The original publication can be found at http://ceur-ws.org/Vol-3159/T1-30.pdf

  19. arXiv:2106.07306

    cs.LG cs.CL

    Constraining Linear-chain CRFs to Regular Languages

    Authors: Sean Papay, Roman Klinger, Sebastian Padó

    Abstract: A major challenge in structured prediction is to represent the interdependencies within output structures. When outputs are structured as sequences, linear-chain conditional random fields (CRFs) are a widely used model class which can learn local dependencies in the output. However, the CRF's Markov assumption makes it impossible for CRFs to represent distributions with nonlocal…

    Submitted 11 August, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  20. arXiv:2103.01667

    cs.CL

    Emotion Ratings: How Intensity, Annotation Confidence and Agreements are Entangled

    Authors: Enrica Troiano, Sebastian Padó, Roman Klinger

    Abstract: When humans judge the affective content of texts, they also implicitly assess the correctness of such judgment, that is, their confidence. We hypothesize that people's (in)confidence that they performed well in an annotation task leads to (dis)agreements among each other. If this is true, confidence may serve as a diagnostic tool for systematic differences in annotations. To probe our assumption,…

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: WASSA 2021 at EACL 2021

  21. arXiv:2010.02587

    cs.CL

    Dissecting Span Identification Tasks with Performance Prediction

    Authors: Sean Papay, Roman Klinger, Sebastian Padó

    Abstract: Span identification (in short, span ID) tasks such as chunking, NER, or code-switching detection, ask models to identify and classify relevant spans in a text. Despite being a staple of NLP, and sharing a common structure, there is little insight on how these tasks' properties influence their difficulty, and thus little guidance on what model families work well on span ID tasks, and why. We analyz…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: accepted at EMNLP 2020

  22. arXiv:1911.10422

    cs.CL cs.AI cs.IR

    SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations Between Pairs of Nominals

    Authors: Iris Hendrickx, Su Nam Kim, Zornitsa Kozareva, Preslav Nakov, Diarmuid Ó Séaghdha, Sebastian Padó, Marco Pennacchiotti, Lorenza Romano, Stan Szpakowicz

    Abstract: In response to the continuing research interest in computational semantic analysis, we have proposed a new task for SemEval-2010: multi-way classification of mutually exclusive semantic relations between pairs of nominals. The task is designed to compare different approaches to the problem and to provide a standard testbed for future research. In this paper, we define the task, describe the creati…

    Submitted 23 November, 2019; originally announced November 2019.

    Comments: semantic relations, nominals

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: SemEval-2010

  23. arXiv:1907.10449

    cs.CL

    Distributional Analysis of Polysemous Function Words

    Authors: Sebastian Pado, Daniel Hole

    Abstract: In this paper, we are concerned with the phenomenon of function word polysemy. We adopt the framework of distributional semantics, which characterizes word meaning by observing occurrence contexts in large corpora and which is in principle well situated to model polysemy. Nevertheless, function words were traditionally considered as impossible to analyze distributionally due to their highly flexib…

    Submitted 27 January, 2021; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: Extended version of paper presented at TbiLLC 2019, September 2019

  24. arXiv:1905.13618

    cs.CL cs.AI cs.HC

    Crowdsourcing and Validating Event-focused Emotion Corpora for German and English

    Authors: Enrica Troiano, Sebastian Padó, Roman Klinger

    Abstract: Sentiment analysis has a range of corpora available across multiple languages. For emotion analysis, the situation is more limited, which hinders potential research on cross-lingual modeling and the development of predictive models for other languages. In this paper, we fill this gap for German by constructing deISEAR, a corpus designed in analogy to the well-established English ISEAR emotion data…

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: 14 pages, 1 figure, accepted for publication at ACL 2019

  25. Instantiation

    Authors: Abhijeet Gupta, Gemma Boleda, Sebastian Pado

    Abstract: In computational linguistics, a large body of work exists on distributed modeling of lexical relations, focussing largely on lexical relations such as hypernymy (scientist – person) that hold between two categories, as expressed by common nouns. In contrast, computational linguistics has paid little attention to entities denoted by proper nouns (Marie Curie, Mumbai, ...). These have investigated…

    Submitted 5 August, 2018; originally announced August 2018.

    Comments: submitted to Computational Linguistics

    Journal ref: Substantially revised version published at Cognitive Science (2021)

  26. arXiv:1702.01815

    cs.CL

    Living a discrete life in a continuous world: Reference with distributed representations

    Authors: Gemma Boleda, Sebastian Padó, Nghia The Pham, Marco Baroni

    Abstract: Reference is a crucial property of language that allows us to connect linguistic expressions to the world. Modeling it requires handling both continuous and discrete aspects of meaning. Data-driven models excel at the former, but struggle with the latter, and the reverse is true for symbolic models. This paper (a) introduces a concrete referential task to test both aspects, called cross-modal en…

    Submitted 4 September, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: Accepted at IWCS 2017. Final version, 9 pages

  27. "Show me the cup": Reference with Continuous Representations

    Authors: Gemma Boleda, Sebastian Padó, Marco Baroni

    Abstract: One of the most basic functions of language is to refer to objects in a shared scene. Modeling reference with continuous representations is challenging because it requires individuation, i.e., tracking and distinguishing an arbitrary number of referents. We introduce a neural network model that, given a definite description and a set of objects represented by natural images, points to the intended…

    Submitted 28 June, 2016; originally announced June 2016.

    Journal ref: In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science, vol 10761. Springer, Cham

  28. Cross-lingual Annotation Projection for Semantic Roles

    Authors: Sebastian Pado, Mirella Lapata

    Abstract: This article considers the task of automatically inducing role-semantic annotations in the FrameNet paradigm for new languages. We propose a general framework that is based on annotation projection, phrased as a graph optimization problem. It is relatively inexpensive and has the potential to reduce the human effort involved in creating role-semantic resources. Within this framework, we present…

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 36, pages 307-340, 2009