Skip to main content

Showing 1–14 of 14 results for author: Aw, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16020  [pdf, other

    cs.SD cs.CL eess.AS

    AudioBench: A Universal Benchmark for Audio Large Language Models

    Authors: Bin Wang, Xunlong Zou, Geyu Lin, Shuo Sun, Zhuohan Liu, Wenyu Zhang, Zhengyuan Liu, AiTi Aw, Nancy F. Chen

    Abstract: We introduce AudioBench, a new benchmark designed to evaluate audio large language models (AudioLLMs). AudioBench encompasses 8 distinct tasks and 26 carefully selected or newly curated datasets, focusing on speech understanding, voice interpretation, and audio scene understanding. Despite the rapid advancement of large language models, including multimodal versions, a significant gap exists in co… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: 20 pages; v2 - typo update; Code: https://github.com/AudioLLMs/AudioBench

  2. arXiv:2312.05834  [pdf, other

    cs.CL cs.AI

    Evidence-based Interpretable Open-domain Fact-checking with Large Language Models

    Authors: Xin Tan, Bowei Zou, Ai Ti Aw

    Abstract: Universal fact-checking systems for real-world claims face significant challenges in gathering valid and sufficient real-time evidence and making reasoned decisions. In this work, we introduce the Open-domain Explainable Fact-checking (OE-Fact) system for claim-checking in real-world scenarios. The OE-Fact system can leverage the powerful understanding and reasoning capabilities of large language… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  3. arXiv:2309.04766  [pdf, other

    cs.CL cs.AI

    SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning

    Authors: Bin Wang, Zhengyuan Liu, Xin Huang, Fangkai Jiao, Yang Ding, AiTi Aw, Nancy F. Chen

    Abstract: We present SeaEval, a benchmark for multilingual foundation models. In addition to characterizing how these models understand and reason with natural language, we also investigate how well they comprehend cultural practices, nuances, and values. Alongside standard accuracy metrics, we investigate the brittleness of foundation models in the dimensions of semantics and multilinguality. Our analyses… ▽ More

    Submitted 31 March, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: To appear in NAACL 2024. 20 pages. More datasets (2 on Cross-Lingual Consistency and 4 on Cultural Understanding) and more supported languages. Code: https://seaeval.github.io/

    Journal ref: NAACL 2024

  4. arXiv:2305.13085  [pdf, other

    cs.CL

    Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models

    Authors: Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy F. Chen

    Abstract: This study investigates machine translation between related languages i.e., languages within the same family that share linguistic characteristics such as word order and lexical similarity. Machine translation through few-shot prompting leverages a small set of translation pair examples to generate translations for test sentences. This procedure requires the model to learn how to generate translat… ▽ More

    Submitted 22 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (Main, Long paper)

  5. arXiv:2305.03088  [pdf, other

    cs.CL cs.AI

    Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

    Authors: Xuan Long Do, Bowei Zou, Shafiq Joty, Anh Tai Tran, Liangming Pan, Nancy F. Chen, Ai Ti Aw

    Abstract: Conversational Question Generation (CQG) is a critical task for machines to assist humans in fulfilling their information needs through conversations. The task is generally cast into two different settings: answer-aware and answer-unaware. While the former facilitates the models by exposing the expected answer, the latter is more realistic and receiving growing attentions recently. What-to-ask and… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 17 pages, ACL 2023

  6. arXiv:2209.06652  [pdf, other

    cs.CL

    CoHS-CQG: Context and History Selection for Conversational Question Generation

    Authors: Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw

    Abstract: Conversational question generation (CQG) serves as a vital task for machines to assist humans, such as interactive reading comprehension, through conversations. Compared to traditional single-turn question generation (SQG), CQG is more challenging in the sense that the generated question is required not only to be meaningful, but also to align with the occurred conversation history. While previous… ▽ More

    Submitted 10 October, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted by 29th International Conference on Computational Linguistics (COLING 2022)

  7. arXiv:2205.15544  [pdf, other

    cs.CL cs.AI

    Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

    Authors: Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

    Abstract: Numerous recent work on unsupervised machine translation (UMT) implies that competent unsupervised translations of low-resource and unrelated languages, such as Nepali or Sinhala, are only possible if the model is trained in a massive multilingual environment, where these low-resource languages are mixed with high-resource counterparts. Nonetheless, while the high-resource languages greatly help k… ▽ More

    Submitted 1 October, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: Published in NeurIPS 2022

  8. arXiv:2202.13636  [pdf, other

    cs.CL

    Improving Lexical Embeddings for Robust Question Answering

    Authors: Weiwen Xu, Bowei Zou, Wai Lam, Ai Ti Aw

    Abstract: Recent techniques in Question Answering (QA) have gained remarkable performance improvement with some QA models even surpassed human performance. However, the ability of these models in truly understanding the language still remains dubious and the models are revealing limitations when facing adversarial examples. To strengthen the robustness of QA models and their generalization ability, we propo… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: 7 pages, 3 tables

  9. arXiv:2104.09810  [pdf, other

    cs.CL

    Addressing the Vulnerability of NMT in Input Perturbations

    Authors: Weiwen Xu, Ai Ti Aw, Yang Ding, Kui Wu, Shafiq Joty

    Abstract: Neural Machine Translation (NMT) has achieved significant breakthrough in performance but is known to suffer vulnerability to input perturbations. As real input noise is difficult to predict during training, robustness is a big issue for system deployment. In this paper, we improve the robustness of NMT models by reducing the effect of noisy words through a Context-Enhanced Reconstruction (CER) ap… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted by NAACL 2021 Industry Track

  10. arXiv:2010.05523  [pdf, other

    cs.CL cs.AI cs.LG

    FILM: A Fast, Interpretable, and Low-rank Metric Learning Approach for Sentence Matching

    Authors: Xiangru Tang, Alan Aw

    Abstract: Detection of semantic similarity plays a vital role in sentence matching. It requires to learn discriminative representations of natural language. Recently, owing to more and more sophisticated model architecture, impressive progress has been made, along with a time-consuming training process and not-interpretable inference. To alleviate this problem, we explore a metric learning approach, named F… ▽ More

    Submitted 12 October, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

  11. arXiv:2006.02163  [pdf, other

    cs.CL cs.LG

    Cross-model Back-translated Distillation for Unsupervised Machine Translation

    Authors: Xuan-Phi Nguyen, Shafiq Joty, Thanh-Tung Nguyen, Wu Kui, Ai Ti Aw

    Abstract: Recent unsupervised machine translation (UMT) systems usually employ three main principles: initialization, language modeling and iterative back-translation, though they may apply them differently. Crucially, iterative back-translation and denoising auto-encoding for language modeling provide data diversity to train the UMT systems. However, the gains from these diversification processes has seeme… ▽ More

    Submitted 24 May, 2021; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted to a conference paper at ICML 2021

  12. arXiv:1911.01986  [pdf, other

    cs.CL cs.LG

    Data Diversification: A Simple Strategy For Neural Machine Translation

    Authors: Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

    Abstract: We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging them with the original dataset on which the final NMT model is trained. Our method is applicable to all NMT models. It does not require extra monolingual data like… ▽ More

    Submitted 4 October, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: Accepted as a conference paper at NeurIPS 2020

  13. arXiv:1910.01335  [pdf, other

    cs.CL

    Topic-aware Pointer-Generator Networks for Summarizing Spoken Conversations

    Authors: Zhengyuan Liu, Angela Ng, Sheldon Lee, Ai Ti Aw, Nancy F. Chen

    Abstract: Due to the lack of publicly available resources, conversation summarization has received far less attention than text summarization. As the purpose of conversations is to exchange information between at least two interlocutors, key information about a certain topic is often scattered and spanned across multiple utterances and turns from different speakers. This phenomenon is more pronounced during… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: To appear in ASRU2019

  14. The multicovering radius problem for some types of discrete structures

    Authors: Alan J. Aw

    Abstract: The covering radius problem is a question in coding theory concerned with finding the minimum radius $r$ such that, given a code that is a subset of an underlying metric space, balls of radius $r$ over its code words cover the entire metric space. Klapper introduced a code parameter, called the multicovering radius, which is a generalization of the covering radius. In this paper, we introduce an a… ▽ More

    Submitted 26 October, 2012; v1 submitted 8 December, 2011; originally announced December 2011.

    Comments: To appear in Designs, Codes and Cryptography (2012)

    MSC Class: 05D40; 94B99

    Journal ref: Designs, Codes and Cryptography 72 (2) 2014