Skip to main content

Showing 1–10 of 10 results for author: Mesgar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01771  [pdf, other

    cs.CL

    LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback

    Authors: Wen Lai, Mohsen Mesgar, Alexander Fraser

    Abstract: To democratize large language models (LLMs) to most natural languages, it is imperative to make these models capable of understanding and generating texts in many languages, in particular low-resource ones. While recent multilingual LLMs demonstrate remarkable performance in such capabilities, these LLMs still support a limited number of human languages due to the lack of training data for low-res… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to Findings of ACL 2024. The code, datasets, and models are publicly available at https://github.com/boschresearch/ACL24-MLLM

  2. arXiv:2404.18585  [pdf, other

    cs.CL

    FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering

    Authors: Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich

    Abstract: Table Question Answering (TQA) aims at composing an answer to a question based on tabular data. While prior research has shown that TQA models lack robustness, understanding the underlying cause and nature of this issue remains predominantly unclear, posing a significant obstacle to the development of robust TQA systems. In this paper, we formalize three major desiderata for a fine-grained evaluat… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

  3. arXiv:2212.09885  [pdf, other

    cs.CL

    Python Code Generation by Asking Clarification Questions

    Authors: Haau-Sing Li, Mohsen Mesgar, André F. T. Martins, Iryna Gurevych

    Abstract: Code generation from text requires understanding the user's intent from a natural language description and generating an executable code snippet that satisfies this intent. While recent pretrained language models demonstrate remarkable performance for this task, these models fail when the given natural language description is under-specified. In this work, we introduce a novel and more realistic s… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages (excluding Limitations and Ethics Concerns)

  4. arXiv:2210.06440  [pdf, other

    cs.CL

    The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification

    Authors: Mohsen Mesgar, Thy Thy Tran, Goran Glavas, Iryna Gurevych

    Abstract: Few-shot Intent Classification (FSIC) is one of the key challenges in modular task-oriented dialog systems. While advanced FSIC methods are similar in using pretrained language models to encode texts and nearest neighbour-based inference for classification, these methods differ in details. They start from different pretrained text encoders, use different encoding architectures with varying similar… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  5. arXiv:2202.06690  [pdf, other

    cs.CL

    ArgSciChat: A Dataset for Argumentative Dialogues on Scientific Papers

    Authors: Federico Ruggeri, Mohsen Mesgar, Iryna Gurevych

    Abstract: The applications of conversational agents for scientific disciplines (as expert domains) are understudied due to the lack of dialogue data to train such agents. While most data collection frameworks, such as Amazon Mechanical Turk, foster data collection for generic domains by connecting crowd workers and task designers, these frameworks are not much optimized for data collection in expert domains… ▽ More

    Submitted 12 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  6. arXiv:2005.00036  [pdf, other

    cs.CL

    Improving Factual Consistency Between a Response and Persona Facts

    Authors: Mohsen Mesgar, Edwin Simpson, Iryna Gurevych

    Abstract: Neural models for response generation produce responses that are semantically plausible but not necessarily factually consistent with facts describing the speaker's persona. These models are trained with fully supervised learning where the objective function barely captures factual consistency. We propose to fine-tune these models by reinforcement learning and an efficient reward function that exp… ▽ More

    Submitted 14 February, 2021; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: Accepted in EACL'21 (https://www.aclweb.org/anthology/)

  7. arXiv:1911.10392  [pdf, other

    cs.CL

    When is ACL's Deadline? A Scientific Conversational Agent

    Authors: Mohsen Mesgar, Paul Youssef, Lin Li, Dominik Bierwirth, Yihao Li, Christian M. Meyer, Iryna Gurevych

    Abstract: Our conversational agent UKP-ATHENA assists NLP researchers in finding and exploring scientific literature, identifying relevant authors, planning or post-processing conference visits, and preparing paper submissions using a unified interface based on natural language inputs and responses. UKP-ATHENA enables new access paths to our swiftly evolving research area with its massive amounts of scienti… ▽ More

    Submitted 23 November, 2019; originally announced November 2019.

  8. arXiv:1908.08486  [pdf, ps, other

    cs.CL

    Dialogue Coherence Assessment Without Explicit Dialogue Act Labels

    Authors: Mohsen Mesgar, Sebastian Bücker, Iryna Gurevych

    Abstract: Recent dialogue coherence models use the coherence features designed for monologue texts, e.g. nominal entities, to represent utterances and then explicitly augment them with dialogue-relevant features, e.g., dialogue act labels. It indicates two drawbacks, (a) semantics of utterances is limited to entity mentions, and (b) the performance of coherence models strongly relies on the quality of the i… ▽ More

    Submitted 3 June, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: Accepted at ACL 2020

  9. arXiv:1907.12894  [pdf, other

    cs.CL cs.LG

    Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation

    Authors: Yang Gao, Christian M. Meyer, Mohsen Mesgar, Iryna Gurevych

    Abstract: Document summarisation can be formulated as a sequential decision-making problem, which can be solved by Reinforcement Learning (RL) algorithms. The predominant RL paradigm for summarisation learns a cross-input policy, which requires considerable time, data and parameter tuning due to the huge search spaces and the delayed rewards. Learning input-specific RL policies is a more efficient alternati… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: Accepted to IJCAI 2019

  10. arXiv:1903.11508  [pdf, other

    cs.CL cs.CR cs.CV cs.LG

    Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems

    Authors: Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych

    Abstract: Visual modifications to text are often used to obfuscate offensive comments in social media (e.g., "!d10t") or as a writing style ("1337" in "leet speak"), among other scenarios. We consider this as a new type of adversarial attack in NLP, a setting to which humans are very robust, as our experiments with both simple and more difficult visual input perturbations demonstrate. We then investigate th… ▽ More

    Submitted 10 June, 2020; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: Accepted as long paper at NAACL-2019; fixed one ungrammatical sentence