Skip to main content

Showing 1–50 of 118 results for author: Haffari, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17430  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights

    Authors: Hao Yang, Lizhen Qu, Ehsan Shareghi, Gholamreza Haffari

    Abstract: Large Multimodal Models (LMMs) have achieved great success recently, demonstrating a strong capability to understand multimodal information and to interact with human users. Despite the progress made, the challenge of detecting high-risk interactions in multimodal settings, and in particular in speech modality, remains largely unexplored. Conventional research on risk for speech modality primarily… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.17300  [pdf, other

    cs.CL

    CausalScore: An Automatic Reference-Free Metric for Assessing Response Relevance in Open-Domain Dialogue Systems

    Authors: Tao Feng, Lizhen Qu, Xiaoxi Kang, Gholamreza Haffari

    Abstract: Automatically evaluating the quality of responses in open-domain dialogue systems is a challenging but crucial task. Current evaluation metrics often fail to align with human judgments, especially when assessing responses that are grammatically correct. To address this issue, we propose a novel metric, called CausalScore, which assesses the relevance of responses by measuring the causal strength b… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.15490  [pdf, other

    cs.CL cs.AI cs.LG

    Causal Discovery Inspired Unsupervised Domain Adaptation for Emotion-Cause Pair Extraction

    Authors: Yuncheng Hua, Yu** Huang, Shuo Huang, Tao Feng, Lizhen Qu, Chris Bain, Richard Bassed, Gholamreza Haffari

    Abstract: This paper tackles the task of emotion-cause pair extraction in the unsupervised domain adaptation setting. The problem is challenging as the distributions of the events causing emotions in target domains are dramatically different than those in source domains, despite the distributions of emotional expressions between domains are overlapped. Inspired by causal discovery, we propose a novel deep l… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, 4 tables; Under Review in EMNLP 2024

    ACM Class: I.2.4

  4. arXiv:2406.10882  [pdf, other

    cs.CL

    SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking

    Authors: Zhuang Li, Yuncheng Hua, Thuy-Trang Vu, Haolan Zhan, Lizhen Qu, Gholamreza Haffari

    Abstract: Recent studies have shown that maintaining a consistent response style by human experts and enhancing data quality in training sets can significantly improve the performance of fine-tuned Large Language Models (LLMs) while reducing the number of training examples needed. However, the precise definition of style and the relationship between style, data quality, and LLM performance remains unclear.… ▽ More

    Submitted 1 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 21 pages

  5. arXiv:2406.10880  [pdf, other

    cs.CL

    Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR

    Authors: Minghan Wang, Yuxia Wang, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

    Abstract: Recent advancements in multimodal large language models (MLLMs) have made significant progress in integrating information across various modalities, yet real-world applications in educational and scientific domains remain challenging. This paper introduces the Multimodal Scientific ASR (MS-ASR) task, which focuses on transcribing scientific conference videos by leveraging visual information from s… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  6. arXiv:2406.08811  [pdf, other

    cs.CL

    Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models

    Authors: Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Gholamreza Haffari

    Abstract: Large language models (LLMs) are typically fine-tuned on diverse and extensive datasets sourced from various origins to develop a comprehensive range of skills, such as writing, reasoning, chatting, coding, and more. Each skill has unique characteristics, and these datasets are often heterogeneous and imbalanced, making the fine-tuning process highly challenging. Balancing the development of each… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Work in progress; 15 pages, 7 tables, 4 figures

  7. arXiv:2406.07411  [pdf, other

    cs.SE cs.CL

    VersiCode: Towards Version-controllable Code Generation

    Authors: Tongtong Wu, Weigang Wu, Xingyu Wang, Kang Xu, Suyu Ma, Bo Jiang, ** Yang, Zhenchang Xing, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Significant research has focused on improving the performance of large language model on code-related tasks due to their practical importance. Although performance is typically evaluated using public benchmark datasets, the existing datasets do not account for the concept of \emph{version}, which is crucial in professional software development. In this paper, we introduce VersiCode, the first comp… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  8. arXiv:2406.03749  [pdf, other

    cs.CL

    NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human

    Authors: Shuo Huang, William MacLean, Xiaoxi Kang, Anqi Wu, Lizhen Qu, Qiongkai Xu, Zhuang Li, Xingliang Yuan, Gholamreza Haffari

    Abstract: Increasing concerns about privacy leakage issues in academia and industry arise when employing NLP models from third-party providers to process sensitive texts. To protect privacy before sending sensitive data to those models, we suggest sanitizing sensitive text using two common strategies used by humans: i) deleting sensitive expressions, and ii) obscuring sensitive details by abstracting them.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2406.01045  [pdf, other

    cs.CL cs.AI

    Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs

    Authors: Fatemeh Shiri, Van Nguyen, Farhad Moghimifar, John Yoo, Gholamreza Haffari, Yuan-Fang Li

    Abstract: Large Language Models (LLMs) demonstrate significant capabilities in processing natural language data, promising efficient knowledge extraction from diverse textual sources to enhance situational awareness and support decision-making. However, concerns arise due to their susceptibility to hallucination, resulting in contextually inaccurate content. This work focuses on harnessing LLMs for automate… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2405.14366  [pdf, other

    cs.CL cs.AI cs.LG

    MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

    Authors: Akide Liu, **g Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang

    Abstract: A critical approach for efficiently deploying computationally demanding large language models (LLMs) is Key-Value (KV) caching. The KV cache stores key-value states of previously generated tokens, significantly reducing the need for repetitive computations and thereby lowering latency in autoregressive generation. However, the size of the KV cache grows linearly with sequence length, posing challe… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Tech report

  11. arXiv:2405.11804  [pdf, other

    cs.CL

    (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

    Authors: Minghao Wu, Yulin Yuan, Gholamreza Haffari, Longyue Wang

    Abstract: Recent advancements in machine translation (MT) have significantly enhanced translation quality across various domains. However, the translation of literary texts remains a formidable challenge due to their complex language, figurative expressions, and cultural nuances. In this work, we introduce a novel multi-agent framework based on large language models (LLMs) for literary translation, implemen… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: work in progress

  12. arXiv:2404.13504  [pdf, other

    cs.CL

    IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models

    Authors: Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, Yuncheng Hua, Gholamreza Haffari

    Abstract: Machine learning models have made incredible progress, but they still struggle when applied to examples from unseen domains. This study focuses on a specific problem of domain generalization, where a model is trained on one source domain and tested on multiple target domains that are unseen during training. We propose IMO: Invariant features Masks for Out-of-Distribution text classification, to ac… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  13. arXiv:2404.13289  [pdf, other

    cs.CL cs.MM cs.SD eess.AS

    Double Mixture: Towards Continual Event Detection from Speech

    Authors: **gqi Kang, Tongtong Wu, **ming Zhao, Guitao Wang, Yinwei Wei, Hao Yang, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Speech event detection is crucial for multimedia retrieval, involving the tagging of both semantic and acoustic events. Traditional ASR systems often overlook the interplay between these events, focusing solely on content, even though the interpretation of dialogue can vary with environmental context. This paper tackles two primary challenges in speech event detection: the continual integration of… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  14. arXiv:2402.11712  [pdf, other

    cs.CL

    Modelling Political Coalition Negotiations Using LLM-based Agents

    Authors: Farhad Moghimifar, Yuan-Fang Li, Robert Thomson, Gholamreza Haffari

    Abstract: Coalition negotiations are a cornerstone of parliamentary democracies, characterised by complex interactions and strategic communications among political parties. Despite its significance, the modelling of these negotiations has remained unexplored with the domain of Natural Language Processing (NLP), mostly due to lack of proper data. In this paper, we introduce coalition negotiations as a novel… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  15. arXiv:2402.11199  [pdf, other

    cs.CL

    Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs

    Authors: Minh-Vuong Nguyen, Linhao Luo, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Thuy-Trang Vu, Gholamreza Haffari

    Abstract: Large language models (LLMs) demonstrate strong reasoning abilities when prompted to generate chain-of-thought (CoT) explanations alongside answers. However, previous research on evaluating LLMs has solely focused on answer accuracy, neglecting the correctness of the generated CoT. In this paper, we delve deeper into the CoT reasoning capabilities of LLMs in multi-hop question answering by utilizi… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Minh-Vuong Nguyen and Linhao Luo are co-first authors and contributed equally to the preparation of this manuscript. Accepted to ACL24-Findings

  16. arXiv:2402.11178  [pdf, other

    cs.CL

    RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

    Authors: Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Lay-Ki Soon, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Norm violations occur when individuals fail to conform to culturally accepted behaviors, which may lead to potential conflicts. Remediating norm violations requires social awareness and cultural sensitivity of the nuances at play. To equip interactive AI systems with a remediation ability, we offer ReNoVi - a large-scale corpus of 9,258 multi-turn dialogues annotated with social norms, as well as… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: work in progress. 15 pages, 7 figures

  17. arXiv:2402.10552  [pdf, other

    cs.CL

    Conversational SimulMT: Efficient Simultaneous Translation with Large Language Models

    Authors: Minghan Wang, Thuy-Trang Vu, Yuxia Wang, Ehsan Shareghi, Gholamreza Haffari

    Abstract: Simultaneous machine translation (SimulMT) presents a challenging trade-off between translation quality and latency. Recent studies have shown that LLMs can achieve good performance in SimulMT tasks. However, this often comes at the expense of high inference cost and latency. In this paper, we propose a conversational SimulMT framework to enhance the inference efficiency of LLM-based SimulMT throu… ▽ More

    Submitted 21 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  18. arXiv:2402.04609  [pdf, other

    cs.CL

    Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach

    Authors: Zhuang Li, Levon Haroutunian, Raj Tumuluri, Philip Cohen, Gholamreza Haffari

    Abstract: Post-editing has proven effective in improving the quality of text generated by large language models (LLMs) such as GPT-3.5 or GPT-4, particularly when direct updating of their parameters to enhance text quality is infeasible or expensive. However, relying solely on smaller language models for post-editing can limit the LLMs' ability to generalize across domains. Moreover, the editing strategies… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: EACL 2024 (findings), short paper, 5 pages

  19. arXiv:2402.01737  [pdf, other

    cs.CL cs.AI

    Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues

    Authors: Yuncheng Hua, Lizhen Qu, Gholamreza Haffari

    Abstract: We develop assistive agents based on Large Language Models (LLMs) that aid interlocutors in business negotiations. Specifically, we simulate business negotiations by letting two LLM-based agents engage in role play. A third LLM acts as a remediator agent to rewrite utterances violating norms for improving negotiation outcomes. We introduce a simple tuning-free and label-free In-Context Learning (I… ▽ More

    Submitted 18 June, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

    Comments: 25 pages, 3 figures, 13 tables; Under review in EMNLP 2024

    ACM Class: I.2.7

  20. arXiv:2402.01736  [pdf, other

    cs.CL cs.AI

    SADAS: A Dialogue Assistant System Towards Remediating Norm Violations in Bilingual Socio-Cultural Conversations

    Authors: Yuncheng Hua, Zhuang Li, Linhao Luo, Kadek Ananta Satriadi, Tao Feng, Haolan Zhan, Lizhen Qu, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari

    Abstract: In today's globalized world, bridging the cultural divide is more critical than ever for forging meaningful connections. The Socially-Aware Dialogue Assistant System (SADAS) is our answer to this global challenge, and it's designed to ensure that conversations between individuals from diverse cultural backgrounds unfold with respect and understanding. Our system's novel architecture includes: (1)… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

    ACM Class: I.2.7

  21. arXiv:2402.01364  [pdf, other

    cs.CL cs.LG

    Continual Learning for Large Language Models: A Survey

    Authors: Tongtong Wu, Linhao Luo, Yuan-Fang Li, Shirui Pan, Thuy-Trang Vu, Gholamreza Haffari

    Abstract: Large language models (LLMs) are not amenable to frequent re-training, due to high training costs arising from their massive scale. However, updates are necessary to endow LLMs with new skills and keep them up-to-date with rapidly evolving human knowledge. This paper surveys recent works on continual learning for LLMs. Due to the unique nature of LLMs, we catalog continue learning techniques in a… ▽ More

    Submitted 7 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  22. arXiv:2402.01097  [pdf, other

    cs.CL

    Let's Negotiate! A Survey of Negotiation Dialogue Systems

    Authors: Haolan Zhan, Yufei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Negotiation is a crucial ability in human communication. Recently, there has been a resurgent research interest in negotiation dialogue systems, whose goal is to create intelligent agents that can assist people in resolving conflicts or reaching agreements. Although there have been many explorations into negotiation dialogue systems, a systematic review of this task has not been performed to date.… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by EACL 2024 (findings). arXiv admin note: substantial text overlap with arXiv:2212.09072

  23. arXiv:2401.15385  [pdf, other

    cs.CL cs.MM

    Towards Event Extraction from Speech with Contextual Clues

    Authors: **gqi Kang, Tongtong Wu, **ming Zhao, Guitao Wang, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari

    Abstract: While text-based event extraction has been an active research area and has seen successful application in many domains, extracting semantic events from speech directly is an under-explored problem. In this paper, we introduce the Speech Event Extraction (SpeechEE) task and construct three synthetic training sets and one human-spoken test set. Compared to event extraction from text, SpeechEE poses… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Under Review

  24. arXiv:2401.15360  [pdf, other

    cs.CL

    Importance-Aware Data Augmentation for Document-Level Neural Machine Translation

    Authors: Minghao Wu, Yufei Wang, George Foster, Lizhen Qu, Gholamreza Haffari

    Abstract: Document-level neural machine translation (DocNMT) aims to generate translations that are both coherent and cohesive, in contrast to its sentence-level counterpart. However, due to its longer input length and limited availability of training data, DocNMT often faces the challenge of data sparsity. To overcome this issue, we propose a novel Importance-Aware Data Augmentation (IADA) algorithm for Do… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 13 pages, 4 figures, 7 tables, accepted by EACL2024 main conference

  25. arXiv:2401.06468  [pdf, other

    cs.CL

    Adapting Large Language Models for Document-Level Machine Translation

    Authors: Minghao Wu, Thuy-Trang Vu, Lizhen Qu, George Foster, Gholamreza Haffari

    Abstract: Large language models (LLMs) have significantly advanced various natural language processing (NLP) tasks. Recent research indicates that moderately-sized LLMs often outperform larger ones after task-specific fine-tuning. This study focuses on adapting LLMs for document-level machine translation (DocMT) for specific language pairs. We first investigate the impact of prompt strategies on translation… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: work in progress; 23 pages, 19 tables, 7 figures

  26. arXiv:2401.05632  [pdf, other

    cs.CL

    Natural Language Processing for Dialects of a Language: A Survey

    Authors: Aditya Joshi, Raj Dabre, Diptesh Kanojia, Zhuang Li, Haolan Zhan, Gholamreza Haffari, Doris Dippold

    Abstract: State-of-the-art natural language processing (NLP) models are trained on massive training corpora, and report a superlative performance on evaluation datasets. This survey delves into an important attribute of these datasets: the dialect of a language. Motivated by the performance degradation of NLP models for dialectic datasets and its implications for the equity of language technologies, we surv… ▽ More

    Submitted 28 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: The paper is under review at ACM Computing Surveys. Please reach out to the authors in the case of feedback

  27. arXiv:2310.18359  [pdf, other

    cs.CL cs.AI

    DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding

    Authors: Xiao-Yu Guo, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Social intelligence is essential for understanding and reasoning about human expressions, intents and interactions. One representative benchmark for its study is Social Intelligence Queries (Social-IQ), a dataset of multiple-choice questions on videos of complex social interactions. We define a comprehensive methodology to study the soundness of Social-IQ, as the soundness of such benchmark datase… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 12 pages, 5 figures, EMNLP 2023 Long Paper

  28. arXiv:2310.11638  [pdf, other

    cs.CL

    Systematic Assessment of Factual Knowledge in Large Language Models

    Authors: Linhao Luo, Thuy-Trang Vu, Dinh Phung, Gholamreza Haffari

    Abstract: Previous studies have relied on existing question-answering benchmarks to evaluate the knowledge stored in large language models (LLMs). However, this approach has limitations regarding factual knowledge coverage, as it mostly focuses on generic domains which may overlap with the pretraining data. This paper proposes a framework to systematically assess the factual knowledge of LLMs by leveraging… ▽ More

    Submitted 30 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 Findings

  29. arXiv:2310.01061  [pdf, other

    cs.CL cs.AI

    Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

    Authors: Linhao Luo, Yuan-Fang Li, Gholamreza Haffari, Shirui Pan

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning abilities in complex tasks. However, they lack up-to-date knowledge and experience hallucinations during reasoning, which can lead to incorrect reasoning processes and diminish their performance and trustworthiness. Knowledge graphs (KGs), which capture vast amounts of facts in a structured format, offer a reliable source of knowl… ▽ More

    Submitted 23 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  30. arXiv:2309.12294  [pdf, other

    cs.CL

    Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models

    Authors: Levon Haroutunian, Zhuang Li, Lucian Galescu, Philip Cohen, Raj Tumuluri, Gholamreza Haffari

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language generation. However, their output quality can be inconsistent, posing challenges for generating natural language from logical forms (LFs). This task requires the generated outputs to embody the exact semantics of LFs, without missing any LF semantics or creating any hallucinations. In this work, we tackle th… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: IJCNLP-AACL 2023

  31. arXiv:2309.06706  [pdf, other

    cs.CL

    Simultaneous Machine Translation with Large Language Models

    Authors: Minghan Wang, **ming Zhao, Thuy-Trang Vu, Fatemeh Shiri, Ehsan Shareghi, Gholamreza Haffari

    Abstract: Real-world simultaneous machine translation (SimulMT) systems face more challenges than just the quality-latency trade-off. They also need to address issues related to robustness with noisy input, processing long contexts, and flexibility for knowledge injection. These challenges demand models with strong language understanding and generation capabilities which may not often equipped by dedicated… ▽ More

    Submitted 15 February, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  32. arXiv:2309.01538  [pdf, other

    cs.AI cs.CL

    ChatRule: Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning

    Authors: Linhao Luo, Jiaxin Ju, Bo Xiong, Yuan-Fang Li, Gholamreza Haffari, Shirui Pan

    Abstract: Logical rules are essential for uncovering the logical connections between relations, which could improve reasoning performance and provide interpretable results on knowledge graphs (KGs). Although there have been many efforts to mine meaningful logical rules over KGs, existing methods suffer from computationally intensive searches over the rule space and a lack of scalability for large-scale KGs.… ▽ More

    Submitted 21 January, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures

  33. arXiv:2306.04996  [pdf, other

    cs.CL

    T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification

    Authors: Inigo Jauregi Unanue, Gholamreza Haffari, Massimo Piccardi

    Abstract: Cross-lingual text classification leverages text classifiers trained in a high-resource language to perform text classification in other languages with no or minimal fine-tuning (zero/few-shots cross-lingual transfer). Nowadays, cross-lingual text classifiers are typically built on large-scale, multilingual language models (LMs) pretrained on a variety of languages of interest. However, the perfor… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted by the Transactions of the Association for Computational Linguistics (TACL), pre-MIT Press publication version

  34. arXiv:2305.17733  [pdf, other

    cs.CL cs.SD eess.AS

    Investigating Pre-trained Audio Encoders in the Low-Resource Condition

    Authors: Hao Yang, **ming Zhao, Gholamreza Haffari, Ehsan Shareghi

    Abstract: Pre-trained speech encoders have been central to pushing state-of-the-art results across various speech understanding and generation tasks. Nonetheless, the capabilities of these encoders in low-resource settings are yet to be thoroughly explored. To address this, we conduct a comprehensive set of experiments using a representative set of 3 state-of-the-art encoders (Wav2vec2, WavLM, Whisper) in t… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023

  35. arXiv:2305.17497  [pdf, other

    cs.CL

    FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

    Authors: Zhuang Li, Yuyang Chai, Terry Yue Zhuo, Lizhen Qu, Gholamreza Haffari, Fei Li, Donghong Ji, Quan Hung Tran

    Abstract: Textual scene graph parsing has become increasingly important in various vision-language applications, including image caption evaluation and image retrieval. However, existing scene graph parsers that convert image captions into scene graphs often suffer from two types of errors. First, the generated scene graphs fail to capture the true semantics of the captions or the corresponding images, resu… ▽ More

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (findings)

  36. arXiv:2305.16598  [pdf, other

    cs.CL

    NormMark: A Weakly Supervised Markov Model for Socio-cultural Norm Discovery

    Authors: Farhad Moghimifar, Shilin Qu, Tongtong Wu, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Norms, which are culturally accepted guidelines for behaviours, can be integrated into conversational models to generate utterances that are appropriate for the socio-cultural context. Existing methods for norm recognition tend to focus only on surface-level features of dialogues and do not take into account the interactions within a conversation. To address this issue, we propose NormMark, a prob… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  37. arXiv:2305.12737  [pdf, other

    cs.CL cs.AI

    The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning

    Authors: Zhuang Li, Lizhen Qu, Philip R. Cohen, Raj V. Tumuluri, Gholamreza Haffari

    Abstract: Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by either humans or machines to alleviate such issues. However, human translations are expensive, while machine translations are cheap but prone to error and bias… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  38. Voicify Your UI: Towards Android App Control with Voice Commands

    Authors: Minh Duc Vu, Han Wang, Zhuang Li, Gholamreza Haffari, Zhenchang Xing, Chunyang Chen

    Abstract: Nowadays, voice assistants help users complete tasks on the smartphone with voice commands, replacing traditional touchscreen interactions when such interactions are inhibited. However, the usability of those tools remains moderate due to the problems in understanding rich language variations in human commands, along with efficiency and comprehensibility issues. Therefore, we introduce Voicify, an… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 7, no. 1 (2023): 1-22

  39. arXiv:2305.04082  [pdf, other

    cs.LG cs.CL

    A Minimal Approach for Natural Language Action Space in Text-based Games

    Authors: Dongwon Kelvin Ryu, Meng Fang, Shirui Pan, Gholamreza Haffari, Ehsan Shareghi

    Abstract: Text-based games (TGs) are language-based interactive environments for reinforcement learning. While language models (LMs) and knowledge graphs (KGs) are commonly used for handling large action space in TGs, it is unclear whether these techniques are necessary or overused. In this paper, we revisit the challenge of exploring the action space in TGs and propose $ ε$-admissible exploration, a minima… ▽ More

    Submitted 29 November, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  40. arXiv:2305.03923  [pdf, other

    cs.LG cs.CL

    Active Continual Learning: On Balancing Knowledge Retention and Learnability

    Authors: Thuy-Trang Vu, Shahram Khadivi, Mahsa Ghorbanali, Dinh Phung, Gholamreza Haffari

    Abstract: Acquiring new knowledge without forgetting what has been learned in a sequence of tasks is the central focus of continual learning (CL). While tasks arrive sequentially, the training data are often prepared and annotated independently, leading to the CL of incoming supervised learning tasks. This paper considers the under-explored problem of active continual learning (ACL) for a sequence of active… ▽ More

    Submitted 30 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

  41. arXiv:2305.01323  [pdf, other

    cs.CL

    Turning Flowchart into Dialog: Augmenting Flowchart-grounded Troubleshooting Dialogs via Synthetic Data Generation

    Authors: Haolan Zhan, Sameen Maruf, Lizhen Qu, Yufei Wang, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Flowchart-grounded troubleshooting dialogue (FTD) systems, which follow the instructions of a flowchart to diagnose users' problems in specific domains (e.g., vehicle, laptop), have been gaining research interest in recent years. However, collecting sufficient dialogues that are naturally grounded on flowcharts is costly, thus FTD systems are impeded by scarce training data. To mitigate the data s… ▽ More

    Submitted 29 October, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted by ALTA 2023

  42. arXiv:2304.12026  [pdf, other

    cs.CL

    SocialDial: A Benchmark for Socially-Aware Dialogue Systems

    Authors: Haolan Zhan, Zhuang Li, Yufei Wang, Linhao Luo, Tao Feng, Xiaoxi Kang, Yuncheng Hua, Lizhen Qu, Lay-Ki Soon, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari

    Abstract: Dialogue systems have been widely applied in many scenarios and are now more powerful and ubiquitous than ever before. With large neural models and massive available data, current dialogue systems have access to more knowledge than any people in their life. However, current dialogue systems still do not perform at a human level. One major gap between conversational agents and humans lies in their… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR 2023

  43. Normalizing Flow-based Neural Process for Few-Shot Knowledge Graph Completion

    Authors: Linhao Luo, Yuan-Fang Li, Gholamreza Haffari, Shirui Pan

    Abstract: Knowledge graphs (KGs), as a structured form of knowledge representation, have been widely applied in the real world. Recently, few-shot knowledge graph completion (FKGC), which aims to predict missing facts for unseen relations with few-shot associated facts, has attracted increasing attention from practitioners and researchers. However, existing FKGC methods are based on metric learning or meta-… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR2023

    Journal ref: SIGIR 2023

  44. arXiv:2303.14770  [pdf, other

    cs.CL cs.LG

    Koala: An Index for Quantifying Overlaps with Pre-training Corpora

    Authors: Thuy-Trang Vu, Xuanli He, Gholamreza Haffari, Ehsan Shareghi

    Abstract: In very recent years more attention has been placed on probing the role of pre-training data in Large Language Models (LLMs) downstream behaviour. Despite the importance, there is no public tool that supports such analysis of pre-training corpora at large scale. To help research in this space, we launch Koala, a searchable index over large pre-training corpora using compressed suffix arrays with h… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Available here: https://koala-index.erc.monash.edu/

  45. arXiv:2303.13556  [pdf, other

    cs.CV

    ProtoCon: Pseudo-label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-supervised Learning

    Authors: Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Gholamreza Haffari

    Abstract: Confidence-based pseudo-labeling is among the dominant approaches in semi-supervised learning (SSL). It relies on including high-confidence predictions made on unlabeled data as additional targets to train the model. We propose ProtoCon, a novel SSL method aimed at the less-explored label-scarce SSL where such methods usually underperform. ProtoCon refines the pseudo-labels by leveraging their nea… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR2023 (highlight)

  46. arXiv:2303.01962  [pdf, other

    cs.CL cs.AI

    Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue Response Generation Models by Causal Discovery

    Authors: Tao Feng, Lizhen Qu, Gholamreza Haffari

    Abstract: In this paper, we conduct the first study on spurious correlations for open-domain response generation models based on a corpus CGDIALOG curated in our work. The cur rent models indeed suffer from spurious correlations and have a tendency of generating irrelevant and generic responses. Inspired by causal discovery algorithms, we propose a novel model-agnostic method for training and inference of r… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  47. arXiv:2302.08079  [pdf, other

    cs.CL

    Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation

    Authors: Minghao Wu, George Foster, Lizhen Qu, Gholamreza Haffari

    Abstract: Existing work in document-level neural machine translation commonly concatenates several consecutive sentences as a pseudo-document, and then learns inter-sentential dependencies. This strategy limits the model's ability to leverage information from distant context. We overcome this limitation with a novel Document Flattening (DocFlat) technique that integrates Flat-Batch Attention (FBA) and Neura… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 15 pages, 8 figures, accepted by EACL 2023

  48. arXiv:2301.12920  [pdf, other

    cs.CL

    Active Learning for Multilingual Semantic Parser

    Authors: Zhuang Li, Gholamreza Haffari

    Abstract: Current multilingual semantic parsing (MSP) datasets are almost all collected by translating the utterances in the existing datasets from the resource-rich language to the target language. However, manual translation is costly. To reduce the translation effort, this paper proposes the first active learning procedure for MSP (AL-MSP). AL-MSP selects only a subset from the existing datasets to be tr… ▽ More

    Submitted 11 October, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: EACL 2023 (findings)

  49. arXiv:2301.12868  [pdf, other

    cs.CL

    On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex

    Authors: Terry Yue Zhuo, Zhuang Li, Yu** Huang, Fatemeh Shiri, Weiqing Wang, Gholamreza Haffari, Yuan-Fang Li

    Abstract: Semantic parsing is a technique aimed at constructing a structured representation of the meaning of a natural-language question. Recent advancements in few-shot language models trained on code have demonstrated superior performance in generating these representations compared to traditional unimodal language models, which are trained on downstream tasks. Despite these advancements, existing fine-t… ▽ More

    Submitted 9 March, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted at EACL2023 (main)

  50. arXiv:2212.09072  [pdf, other

    cs.CL

    Let's Negotiate! A Survey of Negotiation Dialogue Systems

    Authors: Haolan Zhan, Yufei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Gholamreza Haffari

    Abstract: Negotiation is one of the crucial abilities in human communication, and there has been a resurgent research interest in negotiation dialogue systems recently, which goal is to empower intelligent agents with such ability that can efficiently help humans resolve conflicts or reach beneficial agreements. Although there have been many explorations in negotiation dialogue systems, a systematic review… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: An early version, work in progress