Skip to main content

Showing 1–50 of 67 results for author: Glavaš, G

.
  1. arXiv:2407.02310  [pdf, other

    cs.CL

    Evaluating the Ability of LLMs to Solve Semantics-Aware Process Mining Tasks

    Authors: Adrian Rebmann, Fabian David Schmidt, Goran Glavaš, Han van der Aa

    Abstract: The process mining community has recently recognized the potential of large language models (LLMs) for tackling various process mining tasks. Initial studies report the capability of LLMs to support process analysis and even, to some extent, that they are able to reason about how processes work. This latter property suggests that LLMs could also be used to tackle process mining tasks that benefit… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Submitted to ICPM

  2. arXiv:2406.14496  [pdf, other

    cs.CV cs.CL

    African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification

    Authors: Gregor Geigle, Radu Timofte, Goran Glavaš

    Abstract: Recent Large Vision-Language Models (LVLMs) demonstrate impressive abilities on numerous image understanding and reasoning tasks. The task of fine-grained object classification (e.g., distinction between \textit{animal species}), however, has been probed insufficiently, despite its downstream importance. We fill this evaluation gap by creating \texttt{FOCI} (\textbf{F}ine-grained \textbf{O}bject \… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.14492  [pdf, other

    cs.CV cs.CL

    Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?

    Authors: Gregor Geigle, Radu Timofte, Goran Glavaš

    Abstract: Large vision-language models (LVLMs) have recently dramatically pushed the state of the art in image captioning and many image understanding tasks (e.g., visual question answering). LVLMs, however, often \textit{hallucinate} and produce captions that mention concepts that cannot be found in the image. These hallucinations erode the trustworthiness of LVLMs and are arguably among the main obstacles… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.12739  [pdf, other

    cs.CL

    Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

    Authors: Fabian David Schmidt, Philipp Borchert, Ivan Vulić, Goran Glavaš

    Abstract: LLMs have become a go-to solution not just for text generation, but also for natural language understanding (NLU) tasks. Acquiring extensive knowledge through language modeling on web-scale corpora, they excel on English NLU, yet struggle to extend their NLU capabilities to underrepresented languages. In contrast, machine translation models (MT) produce excellent multilingual representations, resu… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  5. arXiv:2406.12634  [pdf, other

    cs.IR cs.AI

    News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation

    Authors: Andreea Iana, Fabian David Schmidt, Goran Glavaš, Heiko Paulheim

    Abstract: Rapidly growing numbers of multilingual news consumers pose an increasing challenge to news recommender systems in terms of providing customized recommendations. First, existing neural news recommenders, even when powered by multilingual language models (LMs), suffer substantial performance losses in zero-shot cross-lingual transfer (ZS-XLT). Second, the current paradigm of fine-tuning the backbon… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; H.3.3

  6. arXiv:2404.19319  [pdf, other

    cs.CL

    Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

    Authors: Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

    Abstract: Compared to standard language model (LM) pretraining (i.e., from scratch), Knowledge Distillation (KD) entails an additional forward pass through a teacher model that is typically substantially larger than the target student model. As such, KD in LM pretraining materially slows down throughput of pretraining instances vis-a-vis pretraining from scratch. Scaling laws of LM pretraining suggest that… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024

  7. arXiv:2403.17876  [pdf, other

    cs.IR

    MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation

    Authors: Andreea Iana, Goran Glavaš, Heiko Paulheim

    Abstract: Digital news platforms use news recommenders as the main instrument to cater to the individual information needs of readers. Despite an increasingly language-diverse online community, in which many Internet users consume news in multiple languages, the majority of news recommendation focuses on major, resource-rich languages, and English in particular. Moreover, nearly all news recommendation effo… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)

    ACM Class: H.3.3

  8. arXiv:2403.03894  [pdf, other

    cs.AI cs.CL cs.PL

    IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators

    Authors: Indraneil Paul, Goran Glavaš, Iryna Gurevych

    Abstract: Code understanding and generation have fast become some of the most popular applications of language models (LMs). Nonetheless, research on multilingual aspects of Code-LMs (i.e., LMs for code generation) such as cross-lingual transfer between different programming languages, language-specific data augmentation, and post-hoc LM adaptation, alongside exploitation of data sources other than the orig… ▽ More

    Submitted 15 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2312.13772  [pdf, other

    cs.CL cs.AI cs.LG

    On Task Performance and Model Calibration with Supervised and Self-Ensembled In-Context Learning

    Authors: Chengzu Li, Han Zhou, Goran Glavaš, Anna Korhonen, Ivan Vulić

    Abstract: Following the standard supervised fine-tuning (SFT) paradigm, in-context learning (ICL) has become an efficient approach propelled by the recent advancements in large language models (LLMs), yielding promising performance across various tasks in few-shot data setups. However, both paradigms are prone to suffer from the critical problem of overconfidence (i.e., miscalibration), especially in such l… ▽ More

    Submitted 22 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 9 pages, 4 figures, 5 tables (20 pages, 5 figures, 13 tables including references and appendices)

  10. arXiv:2311.09502  [pdf, other

    cs.CL

    SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU

    Authors: Evgeniia Razumovskaia, Goran Glavaš, Anna Korhonen, Ivan Vulić

    Abstract: Task-oriented dialogue (ToD) systems help users execute well-defined tasks across a variety of domains (e.g., $\textit{flight booking}$ or $\textit{food ordering}$), with their Natural Language Understanding (NLU) components being dedicated to the analysis of user utterances, predicting users' intents ($\textit{Intent Detection}$, ID) and extracting values for informational slots (… ▽ More

    Submitted 8 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL Main 2024

  11. arXiv:2311.09404  [pdf, other

    cs.CL

    To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages

    Authors: Benedikt Ebing, Goran Glavaš

    Abstract: Perfect machine translation (MT) would render cross-lingual transfer (XLT) by means of multilingual language models (LMs) superfluous. Given, on one hand, the large body of work on improving XLT with multilingual LMs and, on the other hand, recent advances in massively multilingual MT, in this work, we systematically evaluate existing and propose new translation-based XLT approaches for transfer t… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  12. arXiv:2311.02025  [pdf, other

    cs.CL

    Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection

    Authors: Gretel Liz De la Peña Sarracén, Paolo Rosso, Robert Litschko, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: Cross-lingual transfer learning from high-resource to medium and low-resource languages has shown encouraging results. However, the scarcity of resources in target languages remains a challenge. In this work, we resort to data augmentation and continual pre-training for domain adaptation to improve cross-lingual abusive language detection. For data augmentation, we analyze two existing techniques… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 (Main Conference)

  13. arXiv:2311.00408  [pdf, other

    cs.CL

    AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification

    Authors: Yongxin Huang, Kexin Wang, Sourav Dutta, Raj Nath Patel, Goran Glavaš, Iryna Gurevych

    Abstract: Recent work has found that few-shot sentence classification based on pre-trained Sentence Encoders (SEs) is efficient, robust, and effective. In this work, we investigate strategies for domain-specialization in the context of few-shot sentence classification with SEs. We first establish that unsupervised Domain-Adaptive Pre-Training (DAPT) of a base Pre-trained Language Model (PLM) (i.e., not an S… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 Main

  14. arXiv:2310.14909  [pdf, other

    cs.CL cs.AI cs.LG

    Linking Surface Facts to Large-Scale Knowledge Graphs

    Authors: Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš

    Abstract: Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples. These facts are, however, merely surface forms, the ambiguity of which impedes their downstream usage; e.g., the surface phrase "Michael Jordan" may refer to either the former basketball player or the university professor. Knowledge Graphs (KGs), on the other… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  15. arXiv:2310.10532  [pdf, other

    cs.CL

    One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Multilingual language models enable zero-shot cross-lingual transfer (ZS-XLT): fine-tuned on sizable source-language task data, they perform the task in target languages without labeled instances. The effectiveness of ZS-XLT hinges on the linguistic proximity between languages and the amount of pretraining data for a language. Because of this, model selection based on source-language validation is… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to findings of EMNLP 2023

  16. arXiv:2310.01146  [pdf, other

    cs.IR

    NewsRecLib: A PyTorch-Lightning Library for Neural News Recommendation

    Authors: Andreea Iana, Goran Glavaš, Heiko Paulheim

    Abstract: NewsRecLib is an open-source library based on Pytorch-Lightning and Hydra developed for training and evaluating neural news recommendation models. The foremost goals of NewsRecLib are to promote reproducible research and rigorous experimental evaluation by (i) providing a unified and highly configurable framework for exhaustive experimental studies and (ii) enabling a thorough analysis of the perf… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

    ACM Class: H.3.3; D.2.13

  17. arXiv:2307.16089  [pdf, other

    cs.IR

    Train Once, Use Flexibly: A Modular Framework for Multi-Aspect Neural News Recommendation

    Authors: Andreea Iana, Goran Glavaš, Heiko Paulheim

    Abstract: Recent neural news recommenders (NNRs) extend content-based recommendation (1) by aligning additional aspects (e.g., topic, sentiment) between candidate news and user history or (2) by diversifying recommendations w.r.t. these aspects. This customization is achieved by "hardcoding" additional constraints into the NNR's architecture and/or training objectives: any change in the desired recommendati… ▽ More

    Submitted 16 April, 2024; v1 submitted 29 July, 2023; originally announced July 2023.

    ACM Class: H.3.3; I.2.7

  18. arXiv:2307.06930  [pdf, other

    cs.CV cs.CL

    mBLIP: Efficient Bootstrap** of Multilingual Vision-LLMs

    Authors: Gregor Geigle, Abhay Jain, Radu Timofte, Goran Glavaš

    Abstract: Modular vision-language models (Vision-LLMs) align pretrained image encoders with (frozen) large language models (LLMs) and post-hoc condition LLMs to `understand' the image input. With the abundance of readily available high-quality English image-text data as well as strong monolingual English LLMs, the research focus has been on English-only Vision-LLMs. Multilingual vision-language models are s… ▽ More

    Submitted 20 June, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: ALVR Workshop 2024

  19. arXiv:2306.08658  [pdf, other

    cs.CL cs.CV

    Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations

    Authors: Gregor Geigle, Radu Timofte, Goran Glavaš

    Abstract: Vision-and-language (VL) models with separate encoders for each modality (e.g., CLIP) have become the go-to models for zero-shot image classification and image-text retrieval. They are, however, mostly evaluated in English as multilingual benchmarks are limited in availability. We introduce Babel-ImageNet, a massively multilingual benchmark that offers (partial) translations of ImageNet labels to… ▽ More

    Submitted 12 June, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2024

  20. arXiv:2305.16834  [pdf, other

    cs.CL

    Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Massively multilingual language models have displayed strong performance in zero-shot (ZS-XLT) and few-shot (FS-XLT) cross-lingual transfer setups, where models fine-tuned on task data in a source language are transferred without any or with only a few annotated instances to the target language(s). However, current work typically overestimates model performance as fine-tuned models are frequently… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted To Appear In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics

  21. arXiv:2305.14163  [pdf, other

    cs.CL cs.LG

    Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection

    Authors: David Dukić, Kiril Gashteovski, Goran Glavaš, Jan Šnajder

    Abstract: Event detection is a crucial information extraction task in many domains, such as Wikipedia or news. The task typically relies on trigger detection (TD) -- identifying token spans in the text that evoke specific events. While the notion of triggers should ideally be universal across domains, domain transfer for TD from high- to low-resource domains results in significant performance drops. We addr… ▽ More

    Submitted 1 February, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at EACL 2024 Findings

  22. arXiv:2305.07016  [pdf, other

    cs.CL

    A General-Purpose Multilingual Document Encoder

    Authors: Onur Galoğlu, Robert Litschko, Goran Glavaš

    Abstract: Massively multilingual pretrained transformers (MMTs) have tremendously pushed the state of the art on multilingual NLP and cross-lingual transfer of NLP models in particular. While a large body of work leveraged MMTs to mine parallel data and induce bilingual document embeddings, much less effort has been devoted to training general-purpose (massively) multilingual document encoder that can be us… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  23. arXiv:2304.08823  [pdf, other

    cs.CL

    Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese

    Authors: Vésteinn Snæbjarnarson, Annika Simonsen, Goran Glavaš, Ivan Vulić

    Abstract: Multilingual language models have pushed state-of-the-art in cross-lingual NLP transfer. The majority of zero-shot cross-lingual transfer, however, use one and the same massively multilingual transformer (e.g., mBERT or XLM-R) to transfer to all target languages, irrespective of their typological, etymological, and phylogenetic relations to other languages. In particular, readily available data an… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  24. arXiv:2304.03112  [pdf, other

    cs.IR

    Simplifying Content-Based Neural News Recommendation: On User Modeling and Training Objectives

    Authors: Andreea Iana, Goran Glavaš, Heiko Paulheim

    Abstract: The advent of personalized news recommendation has given rise to increasingly complex recommender architectures. Most neural news recommenders rely on user click behavior and typically introduce dedicated user encoders that aggregate the content of clicked news into user embeddings (early fusion). These models are predominantly trained with standard point-wise classification objectives. The existi… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

    ACM Class: H.3.3

  25. arXiv:2210.07362  [pdf, other

    cs.CL

    Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can consistently improve performance for various NLP tasks with traditional NLP models. In this work, we investigate whether these previous findings still hold with state-of-the-art pretrained Transformer-based language models (PLMs). We use three common specialization methods… ▽ More

    Submitted 9 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Findings of EACL 2023. arXiv admin note: text overlap with arXiv:2208.01029

  26. arXiv:2210.06440  [pdf, other

    cs.CL

    The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification

    Authors: Mohsen Mesgar, Thy Thy Tran, Goran Glavas, Iryna Gurevych

    Abstract: Few-shot Intent Classification (FSIC) is one of the key challenges in modular task-oriented dialog systems. While advanced FSIC methods are similar in using pretrained language models to encode texts and nearest neighbour-based inference for classification, these methods differ in details. They start from different pretrained text encoders, use different encoding architectures with varying similar… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  27. arXiv:2208.01029  [pdf, other

    cs.CL

    On the Limitations of Sociodemographic Adaptation with Transformers

    Authors: Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for inco… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  28. arXiv:2208.01018  [pdf, other

    cs.CL

    Massively Multilingual Lexical Specialization of Multilingual Transformers

    Authors: Tommaso Green, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: While pretrained language models (PLMs) primarily serve as general-purpose text encoders that can be fine-tuned for a wide variety of downstream tasks, recent work has shown that they can also be rewired to produce high-quality word representations (i.e., static word embeddings) and yield good performance in type-level lexical tasks. While existing work primarily focused on the lexical specializat… ▽ More

    Submitted 29 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted in ACL 2023

  29. arXiv:2205.14981  [pdf, other

    cs.CL

    ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

    Authors: Chia-Chien Hung, Tommaso Green, Robert Litschko, Tornike Tsereteli, Sotaro Takeshita, Marco Bombieri, Goran Glavaš, Simone Paolo Ponzetto

    Abstract: This paper introduces our proposed system for the MIA Shared Task on Cross-lingual Open-retrieval Question Answering (COQA). In this challenging scenario, given an input question the system has to gather evidence documents from a multilingual pool and generate from them an answer in the language of the question. We devised several approaches combining different model variants for three main compon… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  30. arXiv:2205.10400  [pdf, other

    cs.CL

    Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Research on (multi-domain) task-oriented dialog (TOD) has predominantly focused on the English language, primarily due to the shortage of robust TOD datasets in other languages, preventing the systematic investigation of cross-lingual transfer for this crucial NLP application area. In this work, we introduce Multi2WOZ, a new multilingual multi-domain TOD dataset, derived from the well-established… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  31. arXiv:2205.00267  [pdf, other

    cs.CL

    Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders

    Authors: Ivan Vulić, Goran Glavaš, Fangyu Liu, Nigel Collier, Edoardo Maria Ponti, Anna Korhonen

    Abstract: Pretrained multilingual language models (LMs) can be successfully transformed into multilingual sentence encoders (SEs; e.g., LaBSE, xMPNet) via additional fine-tuning or model distillation with parallel data. However, it remains unclear how to best leverage them to represent sub-sentence lexical items (i.e., words and phrases) in cross-lingual lexical tasks. In this work, we probe SEs for the amo… ▽ More

    Submitted 13 October, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

  32. arXiv:2204.02292  [pdf, other

    cs.CL cs.IR

    Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval

    Authors: Robert Litschko, Ivan Vulić, Goran Glavaš

    Abstract: State-of-the-art neural (re)rankers are notoriously data-hungry which -- given the lack of large-scale training data in languages other than English -- makes them rarely used in multilingual and cross-lingual retrieval settings. Current approaches therefore commonly transfer rankers trained on English data to other languages and cross-lingual setups by means of multilingual encoders: they fine-tun… ▽ More

    Submitted 16 September, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: COLING 2022

    ACM Class: H.3.3; I.2.7

  33. arXiv:2203.08565  [pdf, other

    cs.CL

    Geographic Adaptation of Pretrained Language Models

    Authors: Valentin Hofmann, Goran Glavaš, Nikola Ljubešić, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: While pretrained language models (PLMs) have been shown to possess a plethora of linguistic knowledge, the existing body of research has largely neglected extralinguistic knowledge, which is generally difficult to obtain by pretraining on text alone. Here, we contribute to closing this gap by examining geolinguistic knowledge, i.e., knowledge about geographic variation in language. We introduce ge… ▽ More

    Submitted 28 January, 2024; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: TACL 2024 (pre-MIT Press publication version)

  34. arXiv:2112.11031  [pdf, other

    cs.CL cs.IR

    On Cross-Lingual Retrieval with Multilingual Text Encoders

    Authors: Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: In this work we present a systematic empirical study focused on the suitability of the state-of-the-art multilingual encoders for cross-lingual document and sentence retrieval tasks across a number of diverse language pairs. We first treat these models as multilingual text encoders and benchmark their performance in unsupervised ad-hoc sentence- and document-level CLIR. In contrast to supervised l… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: to appear in IRJ ECIR 2021 Special Issue. arXiv admin note: substantial text overlap with arXiv:2101.08370

    ACM Class: H.3.3; I.2.7

  35. arXiv:2110.08395  [pdf, other

    cs.CL

    DS-TOD: Efficient Domain Specialization for Task Oriented Dialog

    Authors: Chia-Chien Hung, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Recent work has shown that self-supervised dialog-specific pretraining on large conversational datasets yields substantial gains over traditional language modeling (LM) pretraining in downstream task-oriented dialog (TOD). These approaches, however, exploit general dialogic corpora (e.g., Reddit) and thus presumably fail to reliably embed domain-specific knowledge useful for concrete downstream TO… ▽ More

    Submitted 20 May, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Findings of ACL 2022

  36. arXiv:2109.07464  [pdf, other

    cs.CL

    AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark

    Authors: Niklas Friedrich, Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Open Information Extraction (OIE) is the task of extracting facts from sentences in the form of relations and their corresponding arguments in schema-free manner. Intrinsic performance of OIE systems is difficult to measure due to the incompleteness of existing OIE benchmarks: the ground truth extractions do not group all acceptable surface realizations of the same fact that can be extracted from… ▽ More

    Submitted 13 April, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

  37. arXiv:2109.06850  [pdf, other

    cs.CL cs.AI

    BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction Evaluation

    Authors: Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Intrinsic evaluations of OIE systems are carried out either manually -- with human evaluators judging the correctness of extractions -- or automatically, on standardized benchmarks. The latter, while much more cost-effective, is less reliable, primarily because of the incompleteness of the existing OIE benchmarks: the ground truth extractions do not include all acceptable variants of the same fact… ▽ More

    Submitted 13 April, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

  38. arXiv:2109.03646  [pdf, other

    cs.CL

    Sustainable Modular Debiasing of Language Models

    Authors: Anne Lauscher, Tobias Lüken, Goran Glavaš

    Abstract: Unfair stereotypical biases (e.g., gender, racial, or religious biases) encoded in modern pretrained language models (PLMs) have negative ethical implications for widespread adoption of state-of-the-art language technology. To remedy for this, a wide range of debiasing techniques have recently been introduced to remove such stereotypical biases from PLMs. Existing debiasing methods, however, direc… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted for EMNLP-Findings 2021

  39. arXiv:2108.06295  [pdf, other

    cs.CL cs.DL

    Diachronic Analysis of German Parliamentary Proceedings: Ideological Shifts through the Lens of Political Biases

    Authors: Tobias Walter, Celina Kirschner, Steffen Eger, Goran Glavaš, Anne Lauscher, Simone Paolo Ponzetto

    Abstract: We analyze bias in historical corpora as encoded in diachronic distributional semantic models by focusing on two specific forms of bias, namely a political (i.e., anti-communism) and racist (i.e., antisemitism) one. For this, we use a new corpus of German parliamentary proceedings, DeuPARL, spanning the period 1867--2020. We complement this analysis of historical biases in diachronic word embeddin… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: Accepted for JCDL2021

  40. arXiv:2107.00281  [pdf, other

    cs.CL

    Scientia Potentia Est -- On the Role of Knowledge in Computational Argumentation

    Authors: Anne Lauscher, Henning Wachsmuth, Iryna Gurevych, Goran Glavaš

    Abstract: Despite extensive research efforts in recent years, computational argumentation (CA) remains one of the most challenging areas of natural language processing. The reason for this is the inherent complexity of the cognitive processes behind human argumentation, which integrate a plethora of different types of knowledge, ranging from topic-specific facts and common sense to rhetorical knowledge. The… ▽ More

    Submitted 8 November, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in TACL

  41. arXiv:2106.03521  [pdf, other

    cs.CL

    RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

    Authors: Soumya Barikeri, Anne Lauscher, Ivan Vulić, Goran Glavaš

    Abstract: Text representation models are prone to exhibit a range of societal biases, reflecting the non-controlled and biased nature of the underlying pretraining data, which consequently leads to severe ethical issues and even bias amplification. Recent work has predominantly focused on measuring and mitigating bias in pretrained language models. Surprisingly, the landscape of bias measurements and mitiga… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted for ACL21

  42. arXiv:2104.08570  [pdf, other

    cs.CL

    Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems

    Authors: Evgeniia Razumovskaia, Goran Glavaš, Olga Majewska, Edoardo M. Ponti, Anna Korhonen, Ivan Vulić

    Abstract: In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent to complete a concrete task. Although this technology represents one of the central objectives of AI and has been the focus of ever more intense research and development efforts, it is currently limited to a few narrow domains (e.g., food ordering, ticket booking) and a handful of languages (e.g., English, Chines… ▽ More

    Submitted 25 May, 2022; v1 submitted 17 April, 2021; originally announced April 2021.

  43. arXiv:2103.06598  [pdf, other

    cs.CL

    DebIE: A Platform for Implicit and Explicit Debiasing of Word Embedding Spaces

    Authors: Niklas Friedrich, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Recent research efforts in NLP have demonstrated that distributional word vector spaces often encode stereotypical human biases, such as racism and sexism. With word representations ubiquitously used in NLP models and pipelines, this raises ethical issues and jeopardizes the fairness of language technologies. While there exists a large body of work on bias measures and debiasing methods, to date,… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted as EACL21 Demo

  44. arXiv:2101.08370  [pdf, other

    cs.CL cs.IR

    Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval

    Authors: Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Pretrained multilingual text encoders based on neural Transformer architectures, such as multilingual BERT (mBERT) and XLM, have achieved strong performance on a myriad of language understanding tasks. Consequently, they have been adopted as a go-to paradigm for multilingual and cross-lingual representation learning and transfer, rendering cross-lingual word embeddings (CLWEs) effectively obsolete… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: accepted at ECIR'21 (preprint)

    ACM Class: H.3.3; I.2.7

  45. arXiv:2012.15421  [pdf, other

    cs.CL

    Verb Knowledge Injection for Multilingual Event Processing

    Authors: Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo M. Ponti, Anna Korhonen

    Abstract: In parallel to their overwhelming success across NLP tasks, language ability of deep Transformer networks, pretrained via language modeling (LM) objectives has undergone extensive scrutiny. While probing revealed that these models encode a range of syntactic and semantic properties of a language, they are still prone to fall back on superficial cues and simple heuristics to solve downstream tasks,… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: 19 pages, 1 figure, 8 tables

    Journal ref: Proceedings of ACL-IJCNLP 2021 Volume 1 Long Papers 6952-6969

  46. arXiv:2012.11213  [pdf, ps, other

    cs.IR cs.CL

    Self-Supervised Learning for Visual Summary Identification in Scientific Publications

    Authors: Shintaro Yamamoto, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš, Shigeo Morishima

    Abstract: Providing visual summaries of scientific publications can increase information access for readers and thereby help deal with the exponential growth in the number of scientific publications. Nonetheless, efforts in providing visual publication summaries have been few and far apart, primarily focusing on the biomedical domain. This is primarily because of the limited availability of annotated gold s… ▽ More

    Submitted 14 January, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

  47. arXiv:2012.06460  [pdf, other

    cs.CL

    Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer

    Authors: Marko Vidoni, Ivan Vulić, Goran Glavaš

    Abstract: Adapter modules, additional trainable parameters that enable efficient fine-tuning of pretrained transformers, have recently been used for language specialization of multilingual transformers, improving downstream zero-shot cross-lingual transfer. In this work, we propose orthogonal language and task adapters (dubbed orthoadapters) for cross-lingual transfer. They are trained to encode language- a… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  48. arXiv:2011.01575  [pdf, ps, other

    cs.CL

    AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings

    Authors: Anne Lauscher, Rafik Takieddin, Simone Paolo Ponzetto, Goran Glavaš

    Abstract: Recent work has shown that distributional word vector spaces often encode human biases like sexism or racism. In this work, we conduct an extensive analysis of biases in Arabic word embeddings by applying a range of recently introduced bias tests on a variety of embedding spaces induced from corpora in Arabic. We measure the presence of biases across several dimensions, namely: embedding models (S… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: accepted for WANLP 20

  49. arXiv:2010.05731  [pdf, other

    cs.CL

    Probing Pretrained Language Models for Lexical Semantics

    Authors: Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, Anna Korhonen

    Abstract: The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on morphosyntactic, semantic, and world knowledge, it remains unclear to which extent LMs also derive lexical type-level knowledge from words in context. In this work, w… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020: Long paper

  50. arXiv:2008.06788  [pdf, other

    cs.CL

    Is Supervised Syntactic Parsing Beneficial for Language Understanding? An Empirical Investigation

    Authors: Goran Glavaš, Ivan Vulić

    Abstract: Traditional NLP has long held (supervised) syntactic parsing necessary for successful higher-level semantic language understanding (LU). The recent advent of end-to-end neural models, self-supervised via language modeling (LM), and their success on a wide range of LU tasks, however, questions this belief. In this work, we empirically investigate the usefulness of supervised parsing for semantic LU… ▽ More

    Submitted 28 April, 2021; v1 submitted 15 August, 2020; originally announced August 2020.

    Comments: EACL 2021