Skip to main content

Showing 1–50 of 201 results for author: Phung, D

.
  1. arXiv:2407.02721  [pdf, ps, other

    cs.LG cs.CV

    Model and Feature Diversity for Bayesian Neural Networks in Mutual Learning

    Authors: Cuong Pham, Cuong C. Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Bayesian Neural Networks (BNNs) offer probability distributions for model parameters, enabling uncertainty quantification in predictions. However, they often underperform compared to deterministic neural networks. Utilizing mutual learning can effectively enhance the performance of peer BNNs. In this paper, we propose a novel approach to improve BNNs performance through deep mutual learning. The p… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to NeurIPS 2023

  2. arXiv:2406.09008  [pdf, other

    cs.CL

    LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models

    Authors: Xiaohao Yang, He Zhao, Dinh Phung, Wray Buntine, Lan Du

    Abstract: Topic modeling has been a widely used tool for unsupervised text analysis. However, comprehensive evaluations of a topic model remain challenging. Existing evaluation methods are either less comparable across different models (e.g., perplexity) or focus on only one specific aspect of a model (e.g., topic quality or document representation quality) at a time, which is insufficient to reflect the ov… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.07107  [pdf, other

    cs.LG

    Agnostic Sharpness-Aware Minimization

    Authors: Van-Anh Nguyen, Quyen Tran, Tuan Truong, Thanh-Toan Do, Dinh Phung, Trung Le

    Abstract: Sharpness-aware minimization (SAM) has been instrumental in improving deep neural network training by minimizing both the training loss and the sharpness of the loss landscape, leading the model into flatter minima that are associated with better generalization properties. In another aspect, Model-Agnostic Meta-Learning (MAML) is a framework designed to improve the adaptability of models. MAML opt… ▽ More

    Submitted 11 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2406.05641  [pdf, other

    cs.CV

    PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction

    Authors: Shangyu Chen, Zizheng Pan, Jianfei Cai, Dinh Phung

    Abstract: Personalizing a large-scale pretrained Text-to-Image (T2I) diffusion model is challenging as it typically struggles to make an appropriate trade-off between its training data distribution and the target distribution, i.e., learning a novel concept with only a few target images to achieve personalization (aligning with the personalized target) while preserving text editability (aligning with divers… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  5. arXiv:2405.12578  [pdf, ps, other

    math.AP

    Trend to equilibrium for degenerate reaction-diffusion systems coming out of chemistry

    Authors: Laurent Desvillettes, Kim Dang Phung, Bao Quoc Tang

    Abstract: The trend to equilibrium for reaction-diffusion systems coming out of chemistry is investigated, in the case when reaction processes might happen only on some open subsets of the domain. A special case has been studied recently in [Desvillettes, L., \\& Phung, K. D. (2022). Journal of Differential Equations, 338, 227-255] using log convexity technique from controllability theory, which in turn req… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.10084  [pdf, other

    eess.AS cs.AI cs.SD

    Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

    Authors: Manh Luong, Khai Nguyen, Nhat Ho, Reza Haf, Dinh Phung, Lizhen Qu

    Abstract: The Learning-to-match (LTM) framework proves to be an effective inverse optimal transport approach for learning the underlying ground metric between two sources of data, facilitating subsequent matching. However, the conventional LTM framework faces scalability challenges, necessitating the use of the entire dataset each time the parameters of the ground metric are updated. In adapting LTM to the… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  7. Radial Basis Function Neural Networks for Formation Control of Unmanned Aerial Vehicles

    Authors: Duy-Nam Bui, Manh Duong Phung

    Abstract: This paper addresses the problem of controlling multiple unmanned aerial vehicles (UAVs) cooperating in a formation to carry out a complex task such as surface inspection. We first use the virtual leader-follower model to determine the topology and trajectory of the formation. A double-loop control system combining backstep** and sliding mode control techniques is then designed for the UAVs to t… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Journal ref: Robotica, 2024

  8. arXiv:2404.07949  [pdf, other

    cs.CV

    Taming Stable Diffusion for Text to 360° Panorama Image Generation

    Authors: Cheng Zhang, Qianyi Wu, Camilo Cruz Gambardella, Xiaoshui Huang, Dinh Phung, Wanli Ouyang, Jianfei Cai

    Abstract: Generative models, e.g., Stable Diffusion, have enabled the creation of photorealistic images from text prompts. Yet, the generation of 360-degree panorama images from text remains a challenge, particularly due to the dearth of paired text-panorama data and the domain gap between panorama and perspective images. In this paper, we introduce a novel dual-branch diffusion model named PanFusion to gen… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project Page: https://chengzhag.github.io/publication/panfusion Code: https://github.com/chengzhag/PanFusion

  9. arXiv:2404.01226  [pdf, other

    cs.CL

    Stable Code Technical Report

    Authors: Nikhil Pinnaparaju, Reshinth Adithyan, Duy Phung, Jonathan Tow, James Baicoianu, Ashish Datta, Maksym Zhuravinskyi, Dakota Mahan, Marco Bellagente, Carlos Riquelme, Nathan Cooper

    Abstract: We introduce Stable Code, the first in our new-generation of code language models series, which serves as a general-purpose base code language model targeting code completion, reasoning, math, and other software engineering-based tasks. Additionally, we introduce an instruction variant named Stable Code Instruct that allows conversing with the model in a natural chat interface for performing quest… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2403.14101  [pdf, other

    cs.CV cs.CL cs.LG

    Text-Enhanced Data-free Approach for Federated Class-Incremental Learning

    Authors: Minh-Tuan Tran, Trung Le, Xuan-May Le, Mehrtash Harandi, Dinh Phung

    Abstract: Federated Class-Incremental Learning (FCIL) is an underexplored yet pivotal issue, involving the dynamic addition of new classes in the context of federated learning. In this field, Data-Free Knowledge Transfer (DFKT) plays a crucial role in addressing catastrophic forgetting and data privacy problems. However, prior approaches lack the crucial synergy between DFKT and the model training phases, c… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  11. arXiv:2403.13204  [pdf, other

    cs.LG cs.CV stat.ML

    Diversity-Aware Agnostic Ensemble of Sharpness Minimizers

    Authors: Anh Bui, Vy Vo, Tung Pham, Dinh Phung, Trung Le

    Abstract: There has long been plenty of theoretical and empirical evidence supporting the success of ensemble learning. Deep ensembles in particular take advantage of training randomness and expressivity of individual neural networks to gain prediction diversity, ultimately leading to better generalization, robustness and uncertainty estimation. In respect of generalization, it is found that pursuing wider… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  12. arXiv:2403.12326  [pdf, other

    cs.LG cs.CV

    Removing Undesirable Concepts in Text-to-Image Generative Models with Learnable Prompts

    Authors: Anh Bui, Khanh Doan, Trung Le, Paul Montague, Tamas Abraham, Dinh Phung

    Abstract: Generative models have demonstrated remarkable potential in generating visually impressive content from textual descriptions. However, training these models on unfiltered internet data poses the risk of learning and subsequently propagating undesirable concepts, such as copyrighted or unethical content. In this paper, we propose a novel method to remove undesirable concepts from text-to-image gene… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  13. arXiv:2403.05894  [pdf, other

    cs.CV

    Frequency Attention for Knowledge Distillation

    Authors: Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Knowledge distillation is an attractive approach for learning compact deep neural networks, which learns a lightweight student model by distilling knowledge from a complex teacher model. Attention-based knowledge distillation is a specific form of intermediate feature-based knowledge distillation that uses attention mechanisms to encourage the student to better mimic the teacher. However, most of… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Appear to WACV 2024

  14. arXiv:2402.17834  [pdf, other

    cs.CL stat.ML

    Stable LM 2 1.6B Technical Report

    Authors: Marco Bellagente, Jonathan Tow, Dakota Mahan, Duy Phung, Maksym Zhuravinskyi, Reshinth Adithyan, James Baicoianu, Ben Brooks, Nathan Cooper, Ashish Datta, Meng Lee, Emad Mostaque, Michael Pieler, Nikhil Pinnaparju, Paulo Rocha, Harry Saini, Hannah Teufel, Niccolo Zanichelli, Carlos Riquelme

    Abstract: We introduce StableLM 2 1.6B, the first in a new generation of our language model series. In this technical report, we present in detail the data and training procedure leading to the base and instruction-tuned versions of StableLM 2 1.6B. The weights for both models are available via Hugging Face for anyone to download and use. The report contains thorough evaluations of these models, including z… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 23 pages, 6 figures

  15. arXiv:2402.15255  [pdf, other

    cs.LG cs.AI

    Optimal Transport for Structure Learning Under Missing Data

    Authors: Vy Vo, He Zhao, Trung Le, Edwin V. Bonilla, Dinh Phung

    Abstract: Causal discovery in the presence of missing data introduces a chicken-and-egg dilemma. While the goal is to recover the true causal structure, robust imputation requires considering the dependencies or, preferably, causal relations among variables. Merely filling in missing values with existing imputation methods and subsequently applying structure learning on the complete data is empirically show… ▽ More

    Submitted 1 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  16. arXiv:2402.11199  [pdf, other

    cs.CL

    Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs

    Authors: Minh-Vuong Nguyen, Linhao Luo, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Thuy-Trang Vu, Gholamreza Haffari

    Abstract: Large language models (LLMs) demonstrate strong reasoning abilities when prompted to generate chain-of-thought (CoT) explanations alongside answers. However, previous research on evaluating LLMs has solely focused on answer accuracy, neglecting the correctness of the generated CoT. In this paper, we delve deeper into the CoT reasoning capabilities of LLMs in multi-hop question answering by utilizi… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Minh-Vuong Nguyen and Linhao Luo are co-first authors and contributed equally to the preparation of this manuscript. Accepted to ACL24-Findings

  17. Object Detection in Thermal Images Using Deep Learning for Unmanned Aerial Vehicles

    Authors: Minh Dang Tu, Kieu Trang Le, Manh Duong Phung

    Abstract: This work presents a neural network model capable of recognizing small and tiny objects in thermal images collected by unmanned aerial vehicles. Our model consists of three parts, the backbone, the neck, and the prediction head. The backbone is developed based on the structure of YOLOv5 combined with the use of a transformer encoder at the end. The neck includes a BI-FPN block combined with the us… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

  18. Ant Colony Optimization for Cooperative Inspection Path Planning Using Multiple Unmanned Aerial Vehicles

    Authors: Duy Nam Bui, Thuy Ngan Duong, Manh Duong Phung

    Abstract: This paper presents a new swarm intelligence-based approach to deal with the cooperative path planning problem of unmanned aerial vehicles (UAVs), which is essential for the automatic inspection of infrastructure. The approach uses a 3D model of the structure to generate viewpoints for the UAVs. The calculation of the viewpoints considers the constraints related to the UAV formation model, camera… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

  19. Self-Reconfigurable V-shape Formation of Multiple UAVs in Narrow Space Environments

    Authors: Duy Nam Bui, Manh Duong Phung, Hung Pham Duy

    Abstract: This paper presents the design and implementation of a self-reconfigurable V-shape formation controller for multiple unmanned aerial vehicles (UAVs) navigating through narrow spaces in a dense obstacle environment. The selection of the V-shape formation is motivated by its maneuverability and visibility advantages. The main objective is to develop an effective formation control strategy that allow… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

  20. arXiv:2401.15952  [pdf, other

    cs.LG cs.AI cs.CV

    A Class-aware Optimal Transport Approach with Higher-Order Moment Matching for Unsupervised Domain Adaptation

    Authors: Tuan Nguyen, Van Nguyen, Trung Le, He Zhao, Quan Hung Tran, Dinh Phung

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain. In this paper, we introduce a novel approach called class-aware optimal transport (OT), which measures the OT distance between a distribution over the source class-conditional distributions and a mixture of source and target data distribution. Our class-aware OT leverages a c… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 18 pages

  21. arXiv:2312.06710  [pdf, other

    cs.LG

    Class-Prototype Conditional Diffusion Model with Gradient Projection for Continual Learning

    Authors: Khanh Doan, Quyen Tran, Tung Lam Tran, Tuan Nguyen, Dinh Phung, Trung Le

    Abstract: Mitigating catastrophic forgetting is a key hurdle in continual learning. Deep Generative Replay (GR) provides techniques focused on generating samples from prior tasks to enhance the model's memory capabilities using generative AI models ranging from Generative Adversarial Networks (GANs) to the more recent Diffusion Models (DMs). A major issue is the deterioration in the quality of generated dat… ▽ More

    Submitted 21 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  22. arXiv:2311.15414  [pdf, other

    cs.LG cs.CV

    KOPPA: Improving Prompt-based Continual Learning with Key-Query Orthogonal Projection and Prototype-based One-Versus-All

    Authors: Quyen Tran, Lam Tran, Khoat Than, Toan Tran, Dinh Phung, Trung Le

    Abstract: Drawing inspiration from prompt tuning techniques applied to Large Language Models, recent methods based on pre-trained ViT networks have achieved remarkable results in the field of Continual Learning. Specifically, these approaches propose to maintain a set of prompts and allocate a subset of them to learn each task using a key-query matching strategy. However, they may encounter limitations when… ▽ More

    Submitted 30 November, 2023; v1 submitted 26 November, 2023; originally announced November 2023.

  23. arXiv:2311.09671  [pdf, ps, other

    cs.LG cs.CV

    Robust Contrastive Learning With Theory Guarantee

    Authors: Ngoc N. Tran, Lam Tran, Hoang Phan, Anh Bui, Tung Pham, Toan Tran, Dinh Phung, Trung Le

    Abstract: Contrastive learning (CL) is a self-supervised training paradigm that allows us to extract meaningful features without any label information. A typical CL framework is divided into two phases, where it first tries to learn the features from unlabelled data, and then uses those features to train a linear classifier with the labeled data. While a fair amount of existing theoretical works have analyz… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 27 pages, 0 figures. arXiv admin note: text overlap with arXiv:2305.10252

  24. arXiv:2311.02945  [pdf, ps, other

    cs.CL

    PhoGPT: Generative Pre-training for Vietnamese

    Authors: Dat Quoc Nguyen, Linh The Nguyen, Chi Tran, Dung Ngoc Nguyen, Dinh Phung, Hung Bui

    Abstract: We open-source a state-of-the-art 4B-parameter generative model series for Vietnamese, which includes the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. The base model, PhoGPT-4B, with exactly 3.7B parameters, is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length, employing a vocabulary of 20480 token types. The chat vari… ▽ More

    Submitted 22 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: PhoGPT-4B Technical Report - 5 pages

  25. arXiv:2310.11638  [pdf, other

    cs.CL

    Systematic Assessment of Factual Knowledge in Large Language Models

    Authors: Linhao Luo, Thuy-Trang Vu, Dinh Phung, Gholamreza Haffari

    Abstract: Previous studies have relied on existing question-answering benchmarks to evaluate the knowledge stored in large language models (LLMs). However, this approach has limitations regarding factual knowledge coverage, as it mostly focuses on generic domains which may overlap with the pretraining data. This paper proposes a framework to systematically assess the factual knowledge of LLMs by leveraging… ▽ More

    Submitted 30 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 Findings

  26. arXiv:2310.01176  [pdf, other

    eess.IV cs.CV

    Cross-adversarial local distribution regularization for semi-supervised medical image segmentation

    Authors: Thanh Nguyen-Duc, Trung Le, Roland Bammer, He Zhao, Jianfei Cai, Dinh Phung

    Abstract: Medical semi-supervised segmentation is a technique where a model is trained to segment objects of interest in medical images with limited annotated data. Existing semi-supervised segmentation methods are usually based on the smoothness assumption. This assumption implies that the model output distributions of two similar data samples are encouraged to be invariant. In other words, the smoothness… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: MICCAI 2023

  27. arXiv:2310.00258  [pdf, other

    cs.CV

    NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation

    Authors: Minh-Tuan Tran, Trung Le, Xuan-May Le, Mehrtash Harandi, Quan Hung Tran, Dinh Phung

    Abstract: Data-Free Knowledge Distillation (DFKD) has made significant recent strides by transferring knowledge from a teacher neural network to a student neural network without accessing the original data. Nonetheless, existing approaches encounter a significant challenge when attempting to generate samples from random noise inputs, which inherently lack meaningful information. Consequently, these models s… ▽ More

    Submitted 21 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: Accepted at CVPR 2024

  28. arXiv:2309.17215  [pdf, other

    cs.LG cs.AI

    RSAM: Learning on manifolds with Riemannian Sharpness-aware Minimization

    Authors: Tuan Truong, Hoang-Phi Nguyen, Tung Pham, Minh-Tuan Tran, Mehrtash Harandi, Dinh Phung, Trung Le

    Abstract: Nowadays, understanding the geometry of the loss landscape shows promise in enhancing a model's generalization ability. In this work, we draw upon prior works that apply geometric principles to optimization and present a novel approach to improve robustness and generalization ability for constrained optimization problems. Indeed, this paper aims to generalize the Sharpness-Aware Minimization (SAM)… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  29. arXiv:2309.15491  [pdf, ps, other

    math.AP

    An optimal spectral inequality for degenerate operators

    Authors: Rémi Buffe, Kim Dang Phung, Amine Slimani

    Abstract: In this paper we establish a Lebeau-Robbiano spectral inequality for a degenerate one dimensional elliptic operator. Carleman techniques and moment method are combined. Application to null controllability on a measurable set in time for the degenerated heat equation is described.

    Submitted 27 September, 2023; originally announced September 2023.

  30. arXiv:2307.12564  [pdf, other

    cs.CL cs.LG

    Towards Generalising Neural Topical Representations

    Authors: Xiaohao Yang, He Zhao, Dinh Phung, Lan Du

    Abstract: Topic models have evolved from conventional Bayesian probabilistic models to recent Neural Topic Models (NTMs). Although NTMs have shown promising performance when trained and tested on a specific corpus, their generalisation ability across corpora has yet to be studied. In practice, we often expect that an NTM trained on a source corpus can still produce quality topical representation (i.e., late… ▽ More

    Submitted 12 June, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

  31. arXiv:2306.06109  [pdf, other

    cs.CR cs.AI cs.LG

    Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level Vulnerabilities

    Authors: Michael Fu, Trung Le, Van Nguyen, Chakkrit Tantithamthavorn, Dinh Phung

    Abstract: Deep learning (DL) models have become increasingly popular in identifying software vulnerabilities. Prior studies found that vulnerabilities across different vulnerable programs may exhibit similar vulnerable scopes, implicitly forming discernible vulnerability patterns that can be learned by DL models through supervised training. However, vulnerable scopes still manifest in various spatial locati… ▽ More

    Submitted 26 May, 2023; originally announced June 2023.

  32. arXiv:2306.04178  [pdf, other

    cs.LG cs.CG

    Optimal Transport Model Distributional Robustness

    Authors: Van-Anh Nguyen, Trung Le, Anh Tuan Bui, Thanh-Toan Do, Dinh Phung

    Abstract: Distributional robustness is a promising framework for training deep learning models that are less vulnerable to adversarial examples and data distribution shifts. Previous works have mainly focused on exploiting distributional robustness in the data space. In this work, we explore an optimal transport-based distributional robustness framework in model spaces. Specifically, we examine a model dist… ▽ More

    Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPs 2023

    Journal ref: Advances in Neural Information Processing Systems, 2023

  33. arXiv:2305.16615  [pdf, other

    cs.SE cs.CR

    AIBugHunter: A Practical Tool for Predicting, Classifying and Repairing Software Vulnerabilities

    Authors: Michael Fu, Chakkrit Tantithamthavorn, Trung Le, Yuki Kume, Van Nguyen, Dinh Phung, John Grundy

    Abstract: Many ML-based approaches have been proposed to automatically detect, localize, and repair software vulnerabilities. While ML-based methods are more effective than program analysis-based vulnerability analysis tools, few have been integrated into modern IDEs, hindering practical adoption. To bridge this critical gap, we propose AIBugHunter, a novel ML-based software vulnerability analysis tool for… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 34 pages, Accepted at Empirical Software Engineering Journal

    Journal ref: Empirical Software Engineering (EMSE), 2023

  34. arXiv:2305.15927  [pdf, other

    cs.LG cs.SI

    Parameter Estimation in DAGs from Incomplete Data via Optimal Transport

    Authors: Vy Vo, Trung Le, Tung-Long Vuong, He Zhao, Edwin Bonilla, Dinh Phung

    Abstract: Estimating the parameters of a probabilistic directed graphical model from incomplete data is a long-standing challenge. This is because, in the presence of latent variables, both the likelihood function and posterior distribution are intractable without assumptions about structural dependencies or model classes. While existing learning methods are fundamentally based on likelihood maximization, h… ▽ More

    Submitted 1 June, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  35. arXiv:2305.10252  [pdf, other

    cs.LG cs.CV

    Sharpness & Shift-Aware Self-Supervised Learning

    Authors: Ngoc N. Tran, Son Duong, Hoang Phan, Tung Pham, Dinh Phung, Trung Le

    Abstract: Self-supervised learning aims to extract meaningful features from unlabeled data for further downstream tasks. In this paper, we consider classification as a downstream task in phase 2 and develop rigorous theories to realize the factors that implicitly influence the general loss of this classification task. Our theories signify that sharpness-aware feature extractors benefit the classification ta… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  36. arXiv:2305.03923  [pdf, other

    cs.LG cs.CL

    Active Continual Learning: On Balancing Knowledge Retention and Learnability

    Authors: Thuy-Trang Vu, Shahram Khadivi, Mahsa Ghorbanali, Dinh Phung, Gholamreza Haffari

    Abstract: Acquiring new knowledge without forgetting what has been learned in a sequence of tasks is the central focus of continual learning (CL). While tasks arrive sequentially, the training data are often prepared and annotated independently, leading to the CL of incoming supervised learning tasks. This paper considers the under-explored problem of active continual learning (ACL) for a sequence of active… ▽ More

    Submitted 30 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

  37. arXiv:2304.13229  [pdf, other

    cs.LG cs.CV

    Generating Adversarial Examples with Task Oriented Multi-Objective Optimization

    Authors: Anh Bui, Trung Le, He Zhao, Quan Tran, Paul Montague, Dinh Phung

    Abstract: Deep learning models, even the-state-of-the-art ones, are highly vulnerable to adversarial examples. Adversarial training is one of the most efficient methods to improve the model's robustness. The key factor for the success of adversarial training is the capability to generate qualified and divergent adversarial examples which satisfy some objectives/goals (e.g., finding adversarial examples that… ▽ More

    Submitted 1 June, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  38. arXiv:2304.10764  [pdf, other

    cs.CV

    Hyperbolic Geometry in Computer Vision: A Survey

    Authors: Pengfei Fang, Mehrtash Harandi, Trung Le, Dinh Phung

    Abstract: Hyperbolic geometry, a Riemannian manifold endowed with constant sectional negative curvature, has been considered an alternative embedding space in many learning scenarios, \eg, natural language processing, graph learning, \etc, as a result of its intriguing property of encoding the data's hierarchical structure (like irregular graph or tree-likeness data). Recent studies prove that such data hie… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: First survey paper for the hyperbolic geometry in CV applications

  39. A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets

    Authors: Iva Bojic, Josef Halim, Verena Suharman, Sreeja Tar, Qi Chwen Ong, Duy Phung, Mathieu Ravaut, Shafiq Joty, Josip Car

    Abstract: Low-quality data can cause downstream problems in high-stakes applications. Data-centric approach emphasizes on improving dataset quality to enhance model performance. High-quality datasets are needed for general-purpose Large Language Models (LLMs) training, as well as for domain-specific models, which are usually small in size as it is costly to engage a large number of domain experts for their… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Journal ref: 2023.In The Fourth Workshop on Insights from Negative Results in NLP, pages 19-32, Dubrovnik, Croatia. Association for Computational Linguistics

  40. arXiv:2302.05917  [pdf, other

    cs.LG

    Vector Quantized Wasserstein Auto-Encoder

    Authors: Tung-Long Vuong, Trung Le, He Zhao, Chuanxia Zheng, Mehrtash Harandi, Jianfei Cai, Dinh Phung

    Abstract: Learning deep discrete latent presentations offers a promise of better symbolic and summarized abstractions that are more useful to subsequent downstream tasks. Inspired by the seminal Vector Quantized Variational Auto-Encoder (VQ-VAE), most of work in learning deep discrete representations has mainly focused on improving the original VQ-VAE form and none of them has studied learning deep discrete… ▽ More

    Submitted 17 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

  41. arXiv:2302.02713  [pdf, other

    cs.LG cs.IT

    Flat Seeking Bayesian Neural Networks

    Authors: Van-Anh Nguyen, Tung-Long Vuong, Hoang Phan, Thanh-Toan Do, Dinh Phung, Trung Le

    Abstract: Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with l… ▽ More

    Submitted 6 November, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at NeurIPS 2023

    Journal ref: Advances in Neural Information Processing Systems, 2023

  42. arXiv:2301.07336  [pdf, other

    cs.CV

    Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation

    Authors: Son Duy Dao, Hengcan Shi, Dinh Phung, Jianfei Cai

    Abstract: Recent mask proposal models have significantly improved the performance of zero-shot semantic segmentation. However, the use of a `background' embedding during training in these methods is problematic as the resulting model tends to over-learn and assign all unseen classes as the background class instead of their correct labels. Furthermore, they ignore the semantic relationship of text embeddings… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  43. arXiv:2212.13218  [pdf, other

    cs.RO

    Multisensor Data Fusion for Reliable Obstacle Avoidance

    Authors: Thanh Nguyen Canh, Truong Son Nguyen, Cong Hoang Quach, Xiem HoangVan, Manh Duong Phung

    Abstract: In this work, we propose a new approach that combines data from multiple sensors for reliable obstacle avoidance. The sensors include two depth cameras and a LiDAR arranged so that they can capture the whole 3D area in front of the robot and a 2D slide around it. To fuse the data from these sensors, we first use an external camera as a reference to combine data from two depth cameras. A projection… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: In the 11th International Conference on Control, Automation and Information Sciences (ICCAIS 2022), Hanoi, Vietnam

  44. arXiv:2212.13209  [pdf, other

    cs.RO

    Deployment of UAVs for Optimal Multihop Ad-hoc Networks Using Particle Swarm Optimization and Behavior-based Control

    Authors: Ngan Duong Thi Thuy, Duy Nam Bui, Manh Duong Phung, Hung Pham Duy

    Abstract: This study proposes an approach for establishing an optimal multihop ad-hoc network using multiple unmanned aerial vehicles (UAVs) to provide emergency communication in disaster areas. The approach includes two stages, one uses particle swarm optimization (PSO) to find optimal positions to deploy UAVs, and the other uses a behavior-based controller to navigate the UAVs to their assigned positions… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: In the 11th International Conference on Control, Automation and Information Sciences (ICCAIS 2022), Hanoi, Vietnam

  45. arXiv:2212.03069  [pdf, other

    cs.CV cs.CR cs.LG

    Multiple Perturbation Attack: Attack Pixelwise Under Different $\ell_p$-norms For Better Adversarial Performance

    Authors: Ngoc N. Tran, Anh Tuan Bui, Dinh Phung, Trung Le

    Abstract: Adversarial machine learning has been both a major concern and a hot topic recently, especially with the ubiquitous use of deep neural networks in the current landscape. Adversarial attacks and defenses are usually likened to a cat-and-mouse game in which defenders and attackers evolve over the time. On one hand, the goal is to develop strong and robust deep networks that are resistant to maliciou… ▽ More

    Submitted 7 December, 2022; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 18 pages, 8 figures, 7 tables

  46. arXiv:2211.16780  [pdf, other

    cs.LG cs.CV

    Continual Learning with Optimal Transport based Mixture Model

    Authors: Quyen Tran, Hoang Phan, Khoat Than, Dinh Phung, Trung Le

    Abstract: Online Class Incremental learning (CIL) is a challenging setting in Continual Learning (CL), wherein data of new tasks arrive in incoming streams and online learning models need to handle incoming data streams without revisiting previous ones. Existing works used a single centroid adapted with incoming data streams to characterize a class. This approach possibly exposes limitations when the incomi… ▽ More

    Submitted 5 December, 2022; v1 submitted 30 November, 2022; originally announced November 2022.

  47. arXiv:2211.13723  [pdf, other

    cs.LG cs.AI

    Improving Multi-task Learning via Seeking Task-based Flat Regions

    Authors: Hoang Phan, Lam Tran, Ngoc N. Tran, Nhat Ho, Dinh Phung, Trung Le

    Abstract: Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a… ▽ More

    Submitted 29 September, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 35 pages, 17 figures, 7 tables

  48. Lyapunov-based Nonlinear Model Predictive Control for Attitude Trajectory Tracking of Unmanned Aerial Vehicles

    Authors: Duy Nam Bui, Thi Thanh Van Nguyen, Manh Duong Phung

    Abstract: This paper presents a new Lyapunov-based nonlinear model predictive controller (LNMPC) for the attitude control problem of unmanned aerial vehicles (UAVs), which is essential for their functioning operation. The controller is designed based on a quadratic cost function integrating UAV dynamics and system constraints. An additional contraction constraint is then introduced to ensure closed-loop sys… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Journal ref: International Journal of Aeronautical and Space Sciences, 2022

  49. arXiv:2210.11628  [pdf, other

    cs.CL

    Can Domains Be Transferred Across Languages in Multi-Domain Multilingual Neural Machine Translation?

    Authors: Thuy-Trang Vu, Shahram Khadivi, Xuanli He, Dinh Phung, Gholamreza Haffari

    Abstract: Previous works mostly focus on either multilingual or multi-domain aspects of neural machine translation (NMT). This paper investigates whether the domain information can be transferred across languages on the composition of multi-domain and multilingual NMT, particularly for the incomplete data condition where in-domain bitext is missing for some language pairs. Our results in the curated leave-o… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: WMT2022

  50. arXiv:2210.07646  [pdf, other

    cs.CV cs.LG

    Vision Transformer Visualization: What Neurons Tell and How Neurons Behave?

    Authors: Van-Anh Nguyen, Khanh Pham Dinh, Long Tung Vuong, Thanh-Toan Do, Quan Hung Tran, Dinh Phung, Trung Le

    Abstract: Recently vision transformers (ViT) have been applied successfully for various tasks in computer vision. However, important questions such as why they work or how they behave still remain largely unknown. In this paper, we propose an effective visualization technique, to assist us in exposing the information carried in neurons and feature embeddings across the ViT's layers. Our approach departs fro… ▽ More

    Submitted 17 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: The first two authors contributed equally to this work. Our code is available at https://github.com/byM1902/ViT_visualization