Skip to main content

Showing 1–47 of 47 results for author: Luu, A T

.
  1. arXiv:2405.19723  [pdf, other

    cs.CV cs.AI

    Encoding and Controlling Global Semantics for Long-form Video Question Answering

    Authors: Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Seeking answers effectively for long videos is essential to build video question answering (videoQA) systems. Previous methods adaptively select frames and regions from long videos to save computations. However, this fails to reason over the whole sequence of video, leading to sub-optimal performance. To address this problem, we introduce a state space layer (SSL) into multi-modal Transformer to e… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Work in progress

  2. arXiv:2405.17978  [pdf, other

    cs.CL cs.AI

    FASTopic: A Fast, Adaptive, Stable, and Transferable Topic Modeling Paradigm

    Authors: Xiaobao Wu, Thong Nguyen, Delvin Ce Zhang, William Yang Wang, Anh Tuan Luu

    Abstract: Topic models have been evolving rapidly over the years, from conventional to recent neural models. However, existing topic models generally struggle with either effectiveness, efficiency, or stability, highly impeding their practical applications. In this paper, we propose FASTopic, a fast, adaptive, stable, and transferable topic model. FASTopic follows a new paradigm: Dual Semantic-relation Reco… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.17957  [pdf, other

    cs.CL cs.AI

    Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion

    Authors: Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Nguyen, Anh Tuan Luu

    Abstract: Dynamic topic models track the evolution of topics in sequential documents, which have derived various applications like trend analysis and opinion mining. However, existing models suffer from repetitive topic and unassociated topic issues, failing to reveal the evolution and hindering further applications. To address these issues, we break the tradition of simply chaining topics in existing work… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 Findings

  4. arXiv:2403.17486  [pdf, other

    cs.CL

    KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning

    Authors: Cong-Duy Nguyen, Thong Nguyen, Xiaobao Wu, Anh Tuan Luu

    Abstract: Previous work on multimodal sentence embedding has proposed multimodal contrastive learning and achieved promising results. However, by taking the rest of the batch as negative samples without reviewing when forming contrastive pairs, those studies encountered many suspicious and noisy negative examples, significantly affecting the methods' overall performance. In this work, we propose KDMCSE (Kno… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  5. arXiv:2403.10258  [pdf, other

    cs.CL

    Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Authors: Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated multilingual capabilities; yet, they are mostly English-centric due to the imbalanced training corpora. Existing works leverage this phenomenon to improve their multilingual performances through translation, primarily on natural language processing (NLP) tasks. This work extends the evaluation from NLP tasks to real user queries and from English-centr… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 19 pages

  6. arXiv:2403.02990  [pdf, other

    cs.CL cs.AI

    Data Augmentation using Large Language Models: Data Perspectives, Learning Paradigms and Challenges

    Authors: Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty

    Abstract: In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has emerged as a pivotal technique for enhancing model performance by diversifying training examples without the need for additional data collection. This survey explores the transformative impact of LLMs on DA, particularly addressing the unique challenges and opportunities they present in the context of natural… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2402.18909  [pdf, other

    cs.CL cs.AI

    Updating Language Models with Unstructured Facts: Towards Practical Knowledge Editing

    Authors: Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu

    Abstract: Knowledge editing aims to inject knowledge updates into language models to keep them correct and up-to-date. However, its current evaluation strategies are notably impractical: they solely update with well-curated structured facts (triplets with subjects, relations, and objects), whereas real-world knowledge updates commonly emerge in unstructured texts like news articles. In this paper, we propos… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  8. arXiv:2402.16030  [pdf, other

    cs.CL cs.AI

    Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration

    Authors: Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Anh Tuan Luu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) significantly enhances the generation quality of Large Language Models (LLMs), recent studies have raised concerns regarding the complexity and instability associated with the Proximal Policy Optimization (PPO) algorithm, proposing a series of order-based calibration methods as viable alternatives. This paper delves further into current order… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 19 pages, Under review

  9. arXiv:2402.07844  [pdf, other

    cs.SE cs.CL

    Mercury: A Code Efficiency Benchmark for Code Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

    Abstract: Amidst the recent strides in evaluating Large Language Models for Code (Code LLMs), existing benchmarks have mainly focused on the functional correctness of generated code, neglecting the importance of their computational efficiency. To fill the gap, we present Mercury, the first code efficiency benchmark for Code LLMs. It comprises 1,889 Python tasks, each accompanied by adequate solutions that s… ▽ More

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  10. arXiv:2402.07577  [pdf, other

    cs.CL

    Topic Modeling as Multi-Objective Contrastive Optimization

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning objective that contrasts pairs of input documents. However, document-level contrastive learning might capture low-level mutual information, such as word ratio, which disturbs topic modeling. Moreove… ▽ More

    Submitted 9 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (poster)

  11. arXiv:2402.03271  [pdf, other

    cs.CL cs.AI cs.LG

    Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

    Authors: Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi

    Abstract: In the face of uncertainty, the ability to *seek information* is of fundamental importance. In many practical applications, such as medical diagnosis and troubleshooting, the information needed to solve the task is not initially given and has to be actively sought by asking follow-up questions (for example, a doctor asking a patient for more details about their symptoms). In this work, we introduc… ▽ More

    Submitted 30 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Update Results

  12. A Survey on Neural Topic Models: Methods, Applications, and Challenges

    Authors: Xiaobao Wu, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades to discover latent topics and infer topic proportions of documents in an unsupervised fashion. They have been widely used in various applications like text analysis and context recommendation. Recently, the rise of neural networks has facilitated the emergence of a new research field -- Neural Topic Models (NTMs). Different from conventional topic model… ▽ More

    Submitted 24 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted to Artificial Intelligence Review. See https://doi.org/10.1007/s10462-023-10661-7 and a paper list at https://github.com/BobXWu/Paper-Neural-Topic-Models

  13. arXiv:2401.14113  [pdf, other

    cs.CL

    On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

    Authors: Xiaobao Wu, Fengjun Pan, Thong Nguyen, Yichao Feng, Chaoqun Liu, Cong-Duy Nguyen, Anh Tuan Luu

    Abstract: Hierarchical topic modeling aims to discover latent topics from a corpus and organize them into a hierarchy to understand documents with desirable semantic granularity. However, existing work struggles with producing topic hierarchies of low affinity, rationality, and diversity, which hampers document understanding. To overcome these challenges, we in this paper propose Transport Plan and Context-… ▽ More

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024 conference. Our code is available at https://github.com/bobxwu/TraCo

  14. LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

    Authors: Khoi M. Le, Trinh Pham, Tho Quan, Anh Tuan Luu

    Abstract: Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge fro… ▽ More

    Submitted 23 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: First two authors contribute equally. Accepted at AAAI 2024

  15. arXiv:2312.11109  [pdf, other

    cs.LG

    Graph Transformers for Large Graphs

    Authors: Vijay Prakash Dwivedi, Yozen Liu, Anh Tuan Luu, Xavier Bresson, Neil Shah, Tong Zhao

    Abstract: Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the computational feasibility of the global attention mechanism is possible. The next goal is to scale up these architectures to handle very large graphs on the sc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  16. arXiv:2312.01661  [pdf, other

    cs.CL cs.AI

    ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

    Authors: Phuoc Pham Van Long, Duc Anh Vu, Nhat M. Hoang, Xuan Long Do, Anh Tuan Luu

    Abstract: Mathematical questioning is crucial for assessing students problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models(LLMs)… ▽ More

    Submitted 27 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted at the 39th ACM/SIGAPP Symposium On Applied Computing (SAC 2024), Main Conference

  17. arXiv:2311.03970  [pdf, other

    cs.CV

    Bias and Diversity in Synthetic-based Face Recognition

    Authors: Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer

    Abstract: Synthetic data is emerging as a substitute for authentic data to solve ethical and legal challenges in handling authentic face data. The current models can create real-looking face images of people who do not exist. However, it is a known and sensitive problem that face recognition systems are susceptible to bias, i.e. performance differences between different demographic and non-demographics attr… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted for presentation at WACV2024

  18. arXiv:2310.14248  [pdf, other

    cs.CL

    From Static to Dynamic: A Continual Learning Framework for Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, See-kiong Ng

    Abstract: The vast number of parameters in large language models (LLMs) endows them with remarkable capabilities, allowing them to excel in a variety of natural language processing tasks. However, this complexity also presents challenges, making LLMs difficult to train and inhibiting their ability to continuously assimilate new knowledge, which may lead to inaccuracies in their outputs. To mitigate these is… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  19. arXiv:2309.08949  [pdf, other

    cs.CL

    Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

    Authors: Zhiyuan Hu, Yue Feng, Yang Deng, Zekun Li, See-Kiong Ng, Anh Tuan Luu, Bryan Hooi

    Abstract: Recently, the development of large language models (LLMs) has been significantly enhanced the question answering and dialogue generation, and makes them become increasingly popular in current practical scenarios. While unlike the general dialogue system which emphasizes the semantic performance, the task-oriented dialogue (ToD) systems aim to achieve the dialogue goal efficiently and successfully… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: 7 Pages

  20. arXiv:2309.06908  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Towards the TopMost: A Topic Modeling System Toolkit

    Authors: Xiaobao Wu, Fengjun Pan, Anh Tuan Luu

    Abstract: Topic models have a rich history with various applications and have recently been reinvigorated by neural topic modeling. However, these numerous topic models adopt totally distinct datasets, implementations, and evaluations. This impedes quick utilization and fair comparisons, and thereby hinders their research progress and applications. To tackle this challenge, we in this paper propose a Topic… ▽ More

    Submitted 14 June, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to ACL 2024 System Demonstrations Track

  21. arXiv:2309.01219  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Authors: Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge… ▽ More

    Submitted 24 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: work in progress; 32 pages

  22. Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

    Authors: Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, Aldo Lipani

    Abstract: Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated b… ▽ More

    Submitted 19 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted by CIKM 2023

  23. arXiv:2306.08456  [pdf, other

    cs.CL

    PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

    Authors: Zhiyuan Hu, Chumin Liu, Yue Feng, Anh Tuan Luu, Bryan Hooi

    Abstract: Controllable text generation is a challenging and meaningful field in natural language generation (NLG). Especially, poetry generation is a typical one with well-defined and strict conditions for text generation which is an ideal playground for the assessment of current methodologies. While prior works succeeded in controlling either semantic or metrical aspects of poetry generation, simultaneousl… ▽ More

    Submitted 19 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI2024

  24. arXiv:2306.04217  [pdf, other

    cs.CL

    Effective Neural Topic Modeling with Embedding Clustering Regularization

    Authors: Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades with various applications. However, existing topic models commonly suffer from the notorious topic collapsing: discovered topics semantically collapse towards each other, leading to highly repetitive topics, insufficient topic discovery, and damaged model interpretability. In this paper, we propose a new neural topic model, Embedding Clustering Regulari… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023 conference

  25. arXiv:2305.15872  [pdf, other

    cs.CL cs.AI

    Jointprop: Joint Semi-supervised Learning for Entity and Relation Extraction with Heterogeneous Graph-based Propagation

    Authors: Yandan Zheng, Anran Hao, Anh Tuan Luu

    Abstract: Semi-supervised learning has been an important approach to address challenges in extracting entities and relations from limited data. However, current semi-supervised works handle the two tasks (i.e., Named Entity Recognition and Relation Extraction) separately and ignore the cross-correlation of entity and relation instances as well as the existence of similar instances across unlabeled data. To… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  26. arXiv:2305.12744  [pdf, other

    cs.CL cs.AI

    Fact-Checking Complex Claims with Program-Guided Reasoning

    Authors: Liangming Pan, Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan, Preslav Nakov

    Abstract: Fact-checking real-world claims often requires collecting multiple pieces of evidence and applying complex multi-step reasoning. In this paper, we present Program-Guided Fact-Checking (ProgramFC), a novel fact-checking model that decomposes complex claims into simpler sub-tasks that can be solved using a shared library of specialized functions. We first leverage the in-context learning ability of… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (main conference, long paper)

  27. arXiv:2305.12678  [pdf, other

    cs.CL

    Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Anh Tuan Luu, Cong-Duy Nguyen, Zhen Hai, Lidong Bing

    Abstract: Multimodal Review Helpfulness Prediction (MRHP) aims to rank product reviews based on predicted helpfulness scores and has been widely applied in e-commerce via presenting customers with useful reviews. Previous studies commonly employ fully-connected neural networks (FCNNs) as the final score predictor and pairwise loss as the training objective. However, FCNNs have been shown to perform ineffici… ▽ More

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Published in ACL 2023 (Findings)

  28. arXiv:2305.11442  [pdf, other

    cs.CL cs.AI cs.LG

    Zero-Shot Text Classification via Self-Supervised Tuning

    Authors: Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

    Abstract: Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data… ▽ More

    Submitted 25 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  29. arXiv:2304.13409  [pdf, other

    cs.CV

    Efficient Explainable Face Verification based on Similarity Score Argument Backpropagation

    Authors: Marco Huber, Anh Thi Luu, Philipp Terhörst, Naser Damer

    Abstract: Explainable Face Recognition is gaining growing attention as the use of the technology is gaining ground in security-critical applications. Understanding why two faces images are matched or not matched by a given face recognition system is important to operators, users, anddevelopers to increase trust, accountability, develop better systems, and highlight unfair behavior. In this work, we propose… ▽ More

    Submitted 7 November, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at WACV 2024

  30. arXiv:2304.03544  [pdf, other

    cs.CL

    InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling

    Authors: Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Chaoqun Liu, Liangming Pan, Anh Tuan Luu

    Abstract: Cross-lingual topic models have been prevalent for cross-lingual text analysis by revealing aligned latent topics. However, most existing methods suffer from producing repetitive topics that hinder further analysis and performance decline caused by low-coverage dictionaries. In this paper, we propose the Cross-lingual Topic Modeling with Mutual Information (InfoCTM). Instead of the direct alignmen… ▽ More

    Submitted 27 March, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted to AAAI2023 conference. Code is available at https://github.com/BobXWu/InfoCTM

  31. arXiv:2211.12878  [pdf, other

    cs.CL

    Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning

    Authors: Xiaobao Wu, Anh Tuan Luu, Xinshuai Dong

    Abstract: To overcome the data sparsity issue in short text topic modeling, existing methods commonly rely on data augmentation or the data characteristic of short texts to introduce more word co-occurrence information. However, most of them do not make full use of the augmented data or the data characteristic: they insufficiently learn the relations among samples in data, leading to dissimilar topic distri… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted to EMNLP2022 main conference

  32. arXiv:2211.10065  [pdf, other

    cs.LG

    How to train your draGAN: A task oriented solution to imbalanced classification

    Authors: Leon O. Guertler, Andri Ashfahani, Anh Tuan Luu

    Abstract: The long-standing challenge of building effective classification models for small and imbalanced datasets has seen little improvement since the creation of the Synthetic Minority Over-sampling Technique (SMOTE) over 20 years ago. Though GAN based models seem promising, there has been a lack of purpose built architectures for solving the aforementioned problem, as most previous studies focus on app… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 94 Datasets; under review (Elsevier Neural Networks)

  33. arXiv:2208.07337  [pdf, other

    cs.CV

    SYN-MAD 2022: Competition on Face Morphing Attack Detection Based on Privacy-aware Synthetic Training Data

    Authors: Marco Huber, Fadi Boutros, Anh Thi Luu, Kiran Raja, Raghavendra Ramachandra, Naser Damer, Pedro C. Neto, Tiago Gonçalves, Ana F. Sequeira, Jaime S. Cardoso, João Tremoço, Miguel Lourenço, Sergio Serra, Eduardo Cermeño, Marija Ivanovska, Borut Batagelj, Andrej Kronovšek, Peter Peer, Vitomir Štruc

    Abstract: This paper presents a summary of the Competition on Face Morphing Attack Detection Based on Privacy-aware Synthetic Training Data (SYN-MAD) held at the 2022 International Joint Conference on Biometrics (IJCB 2022). The competition attracted a total of 12 participating teams, both from academia and industry and present in 11 different countries. In the end, seven valid submissions were submitted by… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted at International Joint Conference on Biometrics (IJCB) 2022

  34. arXiv:2207.01772  [pdf, other

    cs.CL

    Vision-and-Language Pretraining

    Authors: Thong Nguyen, Cong-Duy Nguyen, Xiaobao Wu, See-Kiong Ng, Anh Tuan Luu

    Abstract: With the burgeoning amount of data of image-text pairs and diversity of Vision-and-Language (V\&L) tasks, scholars have introduced an abundance of deep learning models in this research domain. Furthermore, in recent years, transfer learning has also shown tremendous success in Computer Vision for tasks such as Image Classification, Object Detection, etc., and in Natural Language Processing for Que… ▽ More

    Submitted 23 June, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 46 pages, 2 figures

  35. arXiv:2206.08164  [pdf, other

    cs.LG

    Long Range Graph Benchmark

    Authors: Vijay Prakash Dwivedi, Ladislav Rampášek, Mikhail Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, Dominique Beaini

    Abstract: Graph Neural Networks (GNNs) that are based on the message passing (MP) paradigm generally exchange information between 1-hop neighbors to build node representations at each layer. In principle, such networks are not able to capture long-range interactions (LRI) that may be desired or necessary for learning a given task on graphs. Recently, there has been an increasing interest in development of T… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Added reference to Tönshoff et al., 2023 in Sec. 4.1; NeurIPS 2022 Track on D&B; Open-sourced at: https://github.com/vijaydwivedi75/lrgb

  36. arXiv:2205.12454  [pdf, other

    cs.LG

    Recipe for a General, Powerful, Scalable Graph Transformer

    Authors: Ladislav Rampášek, Mikhail Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, Dominique Beaini

    Abstract: We propose a recipe on how to build a general, powerful, scalable (GPS) graph Transformer with linear complexity and state-of-the-art results on a diverse set of benchmarks. Graph Transformers (GTs) have gained popularity in the field of graph representation learning with a variety of recent publications but they lack a common foundation about what constitutes a good positional or structural encod… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NeurIPS 2022

  37. arXiv:2205.12331  [pdf, other

    cs.LG cs.CL cs.CR

    Certified Robustness Against Natural Language Attacks by Causal Intervention

    Authors: Haiteng Zhao, Chang Ma, Xinshuai Dong, Anh Tuan Luu, Zhi-Hong Deng, Hanwang Zhang

    Abstract: Deep learning models have achieved great success in many fields, yet they are vulnerable to adversarial examples. This paper follows a causal perspective to look into the adversarial vulnerability and proposes Causal Intervention by Semantic Smoothing (CISS), a novel framework towards robustness against natural language attacks. Instead of merely fitting observational data, CISS learns causal effe… ▽ More

    Submitted 14 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Journal ref: International Conference on Machine International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  38. arXiv:2110.12764  [pdf, other

    cs.CL

    Contrastive Learning for Neural Topic Model

    Authors: Thong Nguyen, Anh Tuan Luu

    Abstract: Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample. However, utilizing that discriminative-generative architecture has two important drawbacks: (1) the architecture does not relate similar documents, which has the same document-word distribution of salient words; (… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 15 pages, 8 tables, 5 figures, published at Advances in Neural Information Processing Systems (NeurIPS), 2021

  39. arXiv:2110.07875  [pdf, other

    cs.LG

    Graph Neural Networks with Learnable Structural and Positional Representations

    Authors: Vijay Prakash Dwivedi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: Graph neural networks (GNNs) have become the standard learning architectures for graphs. GNNs have been applied to numerous domains ranging from quantum chemistry, recommender systems to knowledge graphs and natural language processing. A major issue with arbitrary graphs is the absence of canonical positional information of nodes, which decreases the representation power of GNNs to distinguish e.… ▽ More

    Submitted 10 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Code at https://github.com/vijaydwivedi75/gnn-lspe

    Journal ref: ICLR 2022 (https://openreview.net/pdf?id=wTTjnvGphYj)

  40. arXiv:2109.10616  [pdf, other

    cs.CL

    Enriching and Controlling Global Semantics for Text Summarization

    Authors: Thong Nguyen, Anh Tuan Luu, Truc Lu, Tho Quan

    Abstract: Recently, Transformer-based models have been proven effective in the abstractive summarization task by creating fluent and informative summaries. Nevertheless, these models still suffer from the short-range dependency problem, causing them to produce summaries that miss the key points of document. In this paper, we attempt to address this issue by introducing a neural topic model empowered with no… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted to the main EMNLP 2021 conference

  41. arXiv:2107.13541  [pdf, other

    cs.CL

    Towards Robustness Against Natural Language Word Substitutions

    Authors: Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu

    Abstract: Robustness against word substitutions has a well-defined and widely acceptable form, i.e., using semantically similar words as substitutions, and thus it is considered as a fundamental step**-stone towards broader robustness in natural language processing. Previous defense methods capture word substitutions in vector space by using either $l_2$-ball or hyper-rectangle, which results in perturbat… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Conference paper ICLR 2021

  42. arXiv:2102.08597  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters

    Authors: Aston Zhang, Yi Tay, Shuai Zhang, Alvin Chan, Anh Tuan Luu, Siu Cheung Hui, Jie Fu

    Abstract: Recent works have demonstrated reasonable success of representation learning in hypercomplex space. Specifically, "fully-connected layers with Quaternions" (4D hypercomplex numbers), which replace real-valued matrix multiplications in fully-connected layers with Hamilton products of Quaternions, both enjoy parameter savings with only 1/4 learnable parameters and achieve comparable performance in v… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Published as a conference paper at the 9th International Conference on Learning Representations (ICLR 2021)

  43. arXiv:2003.00982  [pdf, other

    cs.LG stat.ML

    Benchmarking Graph Neural Networks

    Authors: Vijay Prakash Dwivedi, Chaitanya K. Joshi, Anh Tuan Luu, Thomas Laurent, Yoshua Bengio, Xavier Bresson

    Abstract: In the last few years, graph neural networks (GNNs) have become the standard toolkit for analyzing and learning from data on graphs. This emerging field has witnessed an extensive growth of promising techniques that have been applied with success to computer science, mathematics, biology, physics and chemistry. But for any successful field to become mainstream and reliable, benchmarks must be deve… ▽ More

    Submitted 27 December, 2022; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Benchmarking framework on GitHub at https://github.com/graphdeeplearning/benchmarking-gnns

    Journal ref: Journal of Machine Learning Research (JMLR), 2022

  44. arXiv:1811.04595  [pdf, other

    cs.CV

    Holistic Multi-modal Memory Network for Movie Question Answering

    Authors: Anran Wang, Anh Tuan Luu, Chuan-Sheng Foo, Hongyuan Zhu, Yi Tay, Vijay Chandrasekhar

    Abstract: Answering questions according to multi-modal context is a challenging problem as it requires a deep integration of different data sources. Existing approaches only employ partial interactions among data sources in one attention hop. In this paper, we present the Holistic Multi-modal Memory Network (HMMN) framework which fully considers the interactions between different input sources (multi-modal… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

  45. arXiv:1805.11535  [pdf, other

    cs.CL cs.AI cs.IR cs.NE

    CoupleNet: Paying Attention to Couples with Coupled Attention for Relationship Recommendation

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Dating and romantic relationships not only play a huge role in our personal lives but also collectively influence and shape society. Today, many romantic partnerships originate from the Internet, signifying the importance of technology and the web in modern dating. In this paper, we present a text-based computational approach for estimating the relationship compatibility of two users on social med… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: Accepted at ICWSM 2018

  46. arXiv:1712.05403  [pdf, other

    cs.CL cs.AI cs.IR

    Learning to Attend via Word-Aspect Associative Fusion for Aspect-based Sentiment Analysis

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Aspect-based sentiment analysis (ABSA) tries to predict the polarity of a given document with respect to a given aspect entity. While neural network architectures have been successful in predicting the overall polarity of sentences, aspect-specific sentiment analysis still remains as an open problem. In this paper, we propose a novel method for integrating aspect information into the neural model.… ▽ More

    Submitted 14 December, 2017; originally announced December 2017.

    Comments: Accepted to AAAI2018

  47. Latent Relational Metric Learning via Memory-based Attention for Collaborative Ranking

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: This paper proposes a new neural architecture for collaborative ranking with implicit feedback. Our model, LRML (\textit{Latent Relational Metric Learning}) is a novel metric learning approach for recommendation. More specifically, instead of simple push-pull mechanisms between user and item pairs, we propose to learn latent relations that describe each user item interaction. This helps to allevia… ▽ More

    Submitted 13 February, 2018; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: WWW 2018