
Showing 51–100 of 201 results for author: Phung, D

  1. Feature-based Learning for Diverse and Privacy-Preserving Counterfactual Explanations

    Authors: Vy Vo, Trung Le, Van Nguyen, He Zhao, Edwin Bonilla, Gholamreza Haffari, Dinh Phung

    Abstract: Interpretable machine learning seeks to understand the reasoning process of complex black-box systems that are long notorious for lack of explainability. One flourishing approach is through counterfactual explanations, which provide suggestions on what a user can do to alter an outcome. Not only must a counterfactual example counter the original prediction from the black-box classifier but it shou…

    Submitted 31 May, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Journal ref: In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 6-10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 18 pages

  2. arXiv:2209.10414

    cs.CR cs.AI cs.LG

    Statement-Level Vulnerability Detection: Learning Vulnerability Patterns Through Information Theory and Contrastive Learning

    Authors: Van Nguyen, Trung Le, Chakkrit Tantithamthavorn, Michael Fu, John Grundy, Hung Nguyen, Seyit Camtepe, Paul Quirk, Dinh Phung

    Abstract: Software vulnerabilities are a serious and crucial concern. Typically, in a program or function consisting of hundreds or thousands of source code statements, there are only a few statements causing the corresponding vulnerabilities. Most current approaches to vulnerability labelling are done on a function or program level by experts with the assistance of machine learning tools. Extending this ap…

    Submitted 11 June, 2024; v1 submitted 19 September, 2022; originally announced September 2022.

  3. arXiv:2209.10406

    cs.CR cs.AI cs.LG

    Cross Project Software Vulnerability Detection via Domain Adaptation and Max-Margin Principle

    Authors: Van Nguyen, Trung Le, Chakkrit Tantithamthavorn, John Grundy, Hung Nguyen, Dinh Phung

    Abstract: Software vulnerabilities (SVs) have become a common, serious and crucial concern due to the ubiquity of computer software. Many machine learning-based approaches have been proposed to solve the software vulnerability detection (SVD) problem. However, there are still two open and significant issues for SVD in terms of i) learning automatic representations to improve the predictive performance of SV…

    Submitted 19 September, 2022; originally announced September 2022.

  4. arXiv:2209.09002

    cs.CV

    MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation

    Authors: Chuanxia Zheng, Long Tung Vuong, Jianfei Cai, Dinh Phung

    Abstract: Although two-stage Vector Quantized (VQ) generative models allow for synthesizing high-fidelity and high-resolution images, their quantization operator encodes similar patches within an image into the same index, resulting in a repeated artifact for similar adjacent regions using existing decoder architectures. To address this issue, we propose to incorporate the spatially conditional normalizatio…

    Submitted 19 September, 2022; originally announced September 2022.

  5. Stag hunt game-based approach for cooperative UAVs

    Authors: L. V. Nguyen, I. Torres Herrera, T. H. Le, M. D. Phung, R. P. Aguilera, Q. P. Ha

    Abstract: Unmanned aerial vehicles (UAVs) are being employed in many areas such as photography, emergency, entertainment, defence, agriculture, forestry, mining and construction. Over the last decade, UAV technology has found applications in numerous construction project phases, ranging from site mapping, progress monitoring, building inspection, damage assessments, and material delivery. While extensive st…

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: in 2022 Proceedings of 39th International Symposium on Automation and Robotics in Construction, Pages 367-374, Bogotá, Colombia, ISBN 978-952-69524-2-0, ISSN 2413-5844

  6. arXiv:2207.13906

    cond-mat.dis-nn physics.optics

    Quasi-resonant diffusion of wave packets in one-dimensional disordered mosaic lattices

    Authors: Ba Phi Nguyen, Duy Khuong Phung, Kihong Kim

    Abstract: We investigate numerically the time evolution of wave packets incident on one-dimensional semi-infinite lattices with mosaic modulated random on-site potentials, which are characterized by the integer-valued modulation period $\kappa$ and the disorder strength $W$. For Gaussian wave packets with the central energy $E_0$ and a small spectral width, we perform extensive numerical calculations of the diso…

    Submitted 15 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 12 pages, 9 figures

    Journal ref: Physical Review B 106, 134204 (2022)

  7. arXiv:2207.03113

    cs.LG cs.AI

    An Additive Instance-Wise Approach to Multi-class Model Interpretation

    Authors: Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung

    Abstract: Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main categories: attribution and selection. A popular attribution-based approach is to exploit local neighborhoods for learning instance-specific explainers in an addi…

    Submitted 9 February, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Journal ref: In The Eleventh International Conference on Learning Representations, 2023

  8. arXiv:2206.07655

    eess.SP cs.AI cs.LG q-bio.NC

    Classification of EEG Motor Imagery Using Deep Learning for Brain-Computer Interface Systems

    Authors: Alessandro Gallo, Manh Duong Phung

    Abstract: A trained T1 class Convolutional Neural Network (CNN) model will be used to examine its ability to successfully identify motor imagery when fed pre-processed electroencephalography (EEG) data. In theory, and if the model has been trained accurately, it should be able to identify a class and label it accordingly. The CNN model will then be restored and used to try and identify the same class of mot…

    Submitted 31 May, 2022; originally announced June 2022.

  9. arXiv:2206.01934

    cs.LG cs.AI stat.ML

    Stochastic Multiple Target Sampling Gradient Descent

    Authors: Hoang Phan, Ngoc Tran, Trung Le, Toan Tran, Nhat Ho, Dinh Phung

    Abstract: Sampling from an unnormalized target distribution is an essential problem with many applications in probabilistic inference. Stein Variational Gradient Descent (SVGD) has been shown to be a powerful method that iteratively updates a set of particles to approximate the distribution of interest. Furthermore, when analysing its asymptotic properties, SVGD reduces exactly to a single-objective optimiz…

    Submitted 10 February, 2023; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: Accepted to Advances in Neural Information Processing Systems (NeurIPS) 2022. 27 pages, 10 figures, 5 tables

  10. Enhanced Teaching-Learning-based Optimization for 3D Path Planning of Multicopter UAVs

    Authors: Van Truong Hoang, Manh Duong Phung

    Abstract: This paper introduces a new path planning algorithm for unmanned aerial vehicles (UAVs) based on the teaching-learning-based optimization (TLBO) technique. We first define an objective function that incorporates requirements on the path length and constraints on the movement and safe operation of UAVs to convert the path planning into an optimization problem. The optimization algorithm named Multi…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Proceedings of the International Conference on Advanced Mechanical Engineering, Automation, and Sustainable Development 2021 (AMAS2021)

    Journal ref: Lecture Notes in Mechanical Engineering, 2022

  11. arXiv:2204.01931

    cs.CV

    High-Quality Pluralistic Image Completion via Code Shared VQGAN

    Authors: Chuanxia Zheng, Guoxian Song, Tat-Jen Cham, Jianfei Cai, Dinh Phung, Linjie Luo

    Abstract: PICNet pioneered the generation of multiple and diverse results for image completion task, but it required a careful balance between $\mathcal{KL}$ loss (diversity) and reconstruction loss (quality), resulting in a limited diversity and quality. Separately, iGPT-based architecture has been employed to infer distributions in a discrete space derived from a pixel-level pre-clustered palette, which…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 12 pages, 15 figures

  12. arXiv:2203.00553

    cs.LG cs.AI

    Global-Local Regularization Via Distributional Robustness

    Authors: Hoang Phan, Trung Le, Trung Phung, Tuan Anh Bui, Nhat Ho, Dinh Phung

    Abstract: Despite superior performance in many situations, deep neural networks are often vulnerable to adversarial examples and distribution shifts, limiting model generalization ability in real-world applications. To alleviate these problems, recent approaches leverage distributional robustness optimization (DRO) to find the most challenging distribution, and then minimize loss function over this most cha…

    Submitted 12 February, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  13. arXiv:2202.13437

    cs.LG cs.CV

    A Unified Wasserstein Distributional Robustness Framework for Adversarial Training

    Authors: Tuan Anh Bui, Trung Le, Quan Tran, He Zhao, Dinh Phung

    Abstract: It is well-known that deep neural networks (DNNs) are susceptible to adversarial attacks, exposing a severe fragility of deep learning systems. As the result, adversarial training (AT) method, by incorporating adversarial examples during training, represents a natural and effective approach to strengthen the robustness of a DNN-based classifier. However, most AT-based methods, notably PGD-AT and T…

    Submitted 27 February, 2022; originally announced February 2022.

  14. arXiv:2202.10723

    cs.LG cs.AI stat.ML

    Sobolev Transport: A Scalable Metric for Probability Measures with Graph Metrics

    Authors: Tam Le, Truyen Nguyen, Dinh Phung, Viet Anh Nguyen

    Abstract: Optimal transport (OT) is a popular measure to compare probability distributions. However, OT suffers a few drawbacks such as (i) a high complexity for computation, (ii) indefiniteness which limits its applicability to kernel machines. In this work, we consider probability measures supported on a graph metric space and propose a novel Sobolev transport metric. We show that the Sobolev transport me…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022

  15. arXiv:2112.09231

    cs.CL cs.AI cs.LG

    Two-view Graph Neural Networks for Knowledge Graph Completion

    Authors: Vinh Tong, Dai Quoc Nguyen, Dinh Phung, Dat Quoc Nguyen

    Abstract: We present an effective graph neural network (GNN)-based knowledge graph embedding model, which we name WGE, to capture entity- and relation-focused graph structures. Given a knowledge graph, WGE builds a single undirected entity-focused graph that views entities as nodes. WGE also constructs another single undirected graph from relation-focused constraints, which views entities and relations as n…

    Submitted 11 March, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: To appear in Proceedings of ESWC 2023; 17 pages; 4 tables; 4 figures

  16. arXiv:2111.13822

    cs.LG cs.AI stat.ML

    On Learning Domain-Invariant Representations for Transfer Learning with Multiple Sources

    Authors: Trung Phung, Trung Le, Long Vuong, Toan Tran, Anh Tran, Hung Bui, Dinh Phung

    Abstract: Domain adaptation (DA) benefits from the rigorous theoretical works that study its insightful characteristics and various aspects, e.g., learning domain-invariant representations and its trade-off. However, it seems not the case for the multiple source DA and domain generalization (DG) settings which are remarkably more complicated and sophisticated due to the involvement of multiple source domain…

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

    Journal ref: Proceedings of Advances in Neural Information Processing Systems (2021) 27720-27733

  17. arXiv:2110.15538

    cs.LG cs.CV

    On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural Networks

    Authors: Dang Nguyen, Trang Nguyen, Khai Nguyen, Dinh Phung, Hung Bui, Nhat Ho

    Abstract: Layer-wise model fusion via optimal transport, named OTFusion, applies soft neuron association for unifying different pre-trained networks to save computational resources. While enjoying its success, OTFusion requires the input networks to have the same number of layers. To address this issue, we propose a novel model fusion framework, named CLAFusion, to fuse neural networks with a different numb…

    Submitted 19 February, 2023; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted to ICASSP 2023, 30 pages, 4 figures, 21 tables

  18. arXiv:2110.15520

    cs.LG stat.ME stat.ML

    On Label Shift in Domain Adaptation via Wasserstein Distance

    Authors: Trung Le, Dat Do, Tuan Nguyen, Huy Nguyen, Hung Bui, Nhat Ho, Dinh Phung

    Abstract: We study the label shift problem between the source and target domains in general domain adaptation (DA) settings. We consider transformations transporting the target to source domains, which enable us to align the source and target examples. Through those transformations, we define the label shift between two domains via optimal transport and develop theory to investigate the properties of DA und…

    Submitted 1 March, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: 35 pages, 7 figures, 6 tables

  19. arXiv:2110.09410

    cs.LG

    Exploiting Domain-Specific Features to Enhance Domain Generalization

    Authors: Manh-Ha Bui, Toan Tran, Anh Tuan Tran, Dinh Phung

    Abstract: Domain Generalization (DG) aims to train a model, from multiple observed source domains, in order to perform well on unseen target domains. To obtain the generalization capability, prior DG approaches have focused on extracting domain-invariant information across sources to generalize on target domains, while useful domain-specific information which strongly correlates with labels in individual do…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 25 pages, 6 tables, 11 figures, published at Advances in Neural Information Processing Systems (NeurIPS), 2021

  20. arXiv:2110.07317

    cs.LG cs.CR

    ReGVD: Revisiting Graph Neural Networks for Vulnerability Detection

    Authors: Van-Anh Nguyen, Dai Quoc Nguyen, Van Nguyen, Trung Le, Quan Hung Tran, Dinh Phung

    Abstract: Identifying vulnerabilities in the source code is essential to protect the software systems from cyber security attacks. It, however, is also a challenging step that requires specialized expertise in security and code representation. To this end, we aim to develop a general, practical, and programming language-independent model capable of running on various source codes and libraries without diffi…

    Submitted 4 February, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted to ICSE 2022 (Demonstrations). The first two authors contributed equally to this work

  21. arXiv:2109.04292

    cs.CL

    Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection

    Authors: Thuy-Trang Vu, Xuanli He, Dinh Phung, Gholamreza Haffari

    Abstract: This paper considers the unsupervised domain adaptation problem for neural machine translation (NMT), where we assume the access to only monolingual text in either the source or target language in the new domain. We propose a cross-lingual data selection method to extract in-domain sentences in the missing language side from a large generic monolingual corpus. Our proposed method trains an adaptiv…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP2021

  22. arXiv:2108.13215

    math.AP

    Exponential decay toward equilibrium via log convexity for a degenerate reaction-diffusion system

    Authors: Laurent Desvillettes, Kim Dang Phung

    Abstract: We consider a system of two reaction-diffusion equations coming out of reversible chemistry. When the reaction happens on the totality of the domain, it is known that exponential convergence to equilibrium holds. We show in this paper that this exponential convergence also holds when the reaction holds only on a given open set of a ball, thanks to an observation estimate deduced by logarithmic con…

    Submitted 30 August, 2021; originally announced August 2021.

  23. arXiv:2108.01224

    cs.CV cs.LG

    Rapid Elastic Architecture Search under Specialized Classes and Resource Constraints

    Authors: Jing Liu, Bohan Zhuang, Mingkui Tan, Xu Liu, Dinh Phung, Yuanqing Li, Jianfei Cai

    Abstract: In many real-world applications, we often need to handle various deployment scenarios, where the resource constraint and the superclass of interest corresponding to a group of classes are dynamically specified. How to efficiently deploy deep models for diverse deployment scenarios is a new challenge. Previous NAS approaches seek to design architectures for all classes simultaneously, which may not…

    Submitted 15 March, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Tech report

  24. arXiv:2107.11626

    cs.CV

    Multi-Label Image Classification with Contrastive Learning

    Authors: Son D. Dao, Ethan Zhao, Dinh Phung, Jianfei Cai

    Abstract: Recently, as an effective way of learning latent representations, contrastive learning has been increasingly popular and successful in various domains. The success of contrastive learning in single-label classifications motivates us to leverage this learning framework to enhance distinctiveness for better performance in multi-label image classification. In this paper, we show that a direct applic…

    Submitted 24 July, 2021; originally announced July 2021.

  25. arXiv:2105.12977

    math.AP

    Observation estimate for the heat equations with Neumann boundary condition via logarithmic convexity

    Authors: Rémi Buffe, Kim Dang Phung

    Abstract: We prove an inequality of Hölder type traducing the unique continuation property at one time for the heat equation with a potential and Neumann boundary condition. The main feature of the proof is to overcome the propagation of smallness by a global approach using a refined parabolic frequency function method. It relies with a Carleman commutator estimate to obtain the logarithmic convexity proper…

    Submitted 27 May, 2021; originally announced May 2021.

  26. arXiv:2105.02706

    cs.RO eess.SY

    Mobile Robot Localization Using Fuzzy Neural Network Based Extended Kalman Filter

    Authors: Thi Thanh Van Nguyen, Manh Duong Phung, Thuan Hoang Tran, Quang Vinh Tran

    Abstract: This paper proposes a novel approach to improve the performance of the extended Kalman filter (EKF) for the problem of mobile robot localization. A fuzzy logic system is employed to continuously adjust the noise covariance matrices of the filter. A neural network is implemented to regulate the membership functions of the antecedent and consequent parts of the fuzzy rules. The aim is to gain the a…

    Submitted 6 May, 2021; originally announced May 2021.

  27. arXiv:2105.02460

    cs.CV cs.HC

    Development of a Fast and Robust Gaze Tracking System for Game Applications

    Authors: Manh Duong Phung, Cong Hoang Quach, Quang Vinh Tran

    Abstract: In this study, a novel eye tracking system using a visual camera is developed to extract human's gaze, and it can be used in modern game machines to bring new and innovative interactive experience to players. Central to the components of the system, is a robust iris-center and eye-corner detection algorithm basing on it the gaze is continuously and adaptively extracted. Evaluation tests were appli…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1611.09427

  28. arXiv:2104.13488

    cs.LG cs.AI cs.CL

    Text Generation with Deep Variational GAN

    Authors: Mahmoud Hossam, Trung Le, Michael Papasimeon, Viet Huynh, Dinh Phung

    Abstract: Generating realistic sequences is a central task in many machine learning applications. There has been considerable recent progress on building deep generative models for sequence generation tasks. However, the issue of mode-collapsing remains a main issue for the current models. In this paper we propose a GAN-based generic framework to address the problem of mode-collapse in a principled approach…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted in the Third Workshop on Bayesian Deep Learning (NIPS / NeurIPS 2018)

    ACM Class: I.2.0; I.2.7; I.5.0

  29. arXiv:2104.13484

    cs.LG cs.AI cs.CL cs.CR

    Improved and Efficient Text Adversarial Attacks using Target Information

    Authors: Mahmoud Hossam, Trung Le, He Zhao, Viet Huynh, Dinh Phung

    Abstract: There has been recently a growing interest in studying adversarial examples on natural language models in the black-box setting. These methods attack natural language classifiers by perturbing certain important words until the classifier label is changed. In order to find these important words, these methods rank all words by importance by querying the target model word by word for each input sent…

    Submitted 2 May, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted in the International Conference on Learning Representations (ICLR) workshop on Robust and Reliable Machine Learning in the Real World (RobustML)

    MSC Class: I.5.0; I.2.0

  30. Hierarchical Convolutional Neural Network with Feature Preservation and Autotuned Thresholding for Crack Detection

    Authors: Qiuchen Zhu, Tran Hiep Dinh, Manh Duong Phung, Quang Phuc Ha

    Abstract: Drone imagery is increasingly used in automated inspection for infrastructure surface defects, especially in hazardous or unreachable environments. In machine vision, the key to crack detection rests with robust and accurate algorithms for image processing. To this end, this paper proposes a deep learning approach using hierarchical convolutional neural networks with feature preservation (HCNNFP)…

    Submitted 21 April, 2021; originally announced April 2021.

    Journal ref: IEEE Access, 2021

  31. arXiv:2104.10033

    cs.NE cs.AI cs.RO eess.SY

    Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization

    Authors: Manh Duong Phung, Quang Phuc Ha

    Abstract: This paper presents a new algorithm named spherical vector-based particle swarm optimization (SPSO) to deal with the problem of path planning for unmanned aerial vehicles (UAVs) in complicated environments subjected to multiple threats. A cost function is first formulated to convert the path planning into an optimization problem that incorporates requirements and constraints for the feasible and s…

    Submitted 13 April, 2021; originally announced April 2021.

    Journal ref: Applied Soft Computing, Volume 107, August 2021, 107376

  32. arXiv:2104.07396

    cs.CL cs.LG

    Node Co-occurrence based Graph Neural Networks for Knowledge Graph Link Prediction

    Authors: Dai Quoc Nguyen, Vinh Tong, Dinh Phung, Dat Quoc Nguyen

    Abstract: We introduce a novel embedding model, named NoGE, which aims to integrate co-occurrence among entities and relations into graph neural networks to improve knowledge graph completion (i.e., link prediction). Given a knowledge graph, NoGE constructs a single graph considering entities and relations as individual nodes. NoGE then computes weights for edges among nodes based on the co-occurrence of en…

    Submitted 25 December, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: To appear in Proceedings of WSDM 2022. The first two authors contributed equally to this work

  33. arXiv:2104.00845

    cs.CV

    Bridging Global Context Interactions for High-Fidelity Image Completion

    Authors: Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai, Dinh Phung

    Abstract: Bridging global context interactions correctly is important for high-fidelity image completion with large masks. Previous methods attempting this via deep or large receptive field (RF) convolutions cannot escape from the dominance of nearby interactions, which may be inferior. In this paper, we propose to treat image completion as a directionless sequence-to-sequence prediction task, and deploy a…

    Submitted 22 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

  34. arXiv:2103.00498

    cs.LG cs.CL cs.IR

    Topic Modelling Meets Deep Neural Networks: A Survey

    Authors: He Zhao, Dinh Phung, Viet Huynh, Yuan Jin, Lan Du, Wray Buntine

    Abstract: Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: A review on Neural Topic Models

  35. arXiv:2102.05912

    stat.ML cs.LG

    On Transportation of Mini-batches: A Hierarchical Approach

    Authors: Khai Nguyen, Dang Nguyen, Quoc Nguyen, Tung Pham, Hung Bui, Dinh Phung, Trung Le, Nhat Ho

    Abstract: Mini-batch optimal transport (m-OT) has been successfully used in practical applications that involve probability measures with a very high number of supports. The m-OT solves several smaller optimal transport problems and then returns the average of their costs and transportation plans. Despite its scalability advantage, the m-OT does not consider the relationship between mini-batches which leads…

    Submitted 6 June, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted to ICML 2022, 34 pages, 16 figures, 9 tables

  36. arXiv:2101.10027

    cs.LG cs.AI cs.CV

    Understanding and Achieving Efficient Robustness with Adversarial Supervised Contrastive Learning

    Authors: Anh Bui, Trung Le, He Zhao, Paul Montague, Seyit Camtepe, Dinh Phung

    Abstract: Contrastive learning (CL) has recently emerged as an effective approach to learning representation in a range of downstream tasks. Central to this approach is the selection of positive (similar) and negative (dissimilar) sets to provide the model the opportunity to `contrast' between data and class representation in the latent space. In this paper, we investigate CL for improving model robustness…

    Submitted 22 October, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

  37. arXiv:2011.08543

    cs.LG cs.CL cs.CV

    Structural and Functional Decomposition for Personality Image Captioning in a Communication Game

    Authors: Thu Nguyen, Duy Phung, Minh Hoai, Thien Huu Nguyen

    Abstract: Personality image captioning (PIC) aims to describe an image with a natural language caption given a personality trait. In this work, we introduce a novel formulation for PIC based on a communication game between a speaker and a listener. The speaker attempts to generate natural language captions while the listener encourages the generated captions to contain discriminative information about the i…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 10 pages, EMNLP-Findings 2020

    Journal ref: EMNLP-Findings 2020

  38. arXiv:2011.06344

    math.CV

    Remarks on results by Müger and Tuset on the moments of polynomials

    Authors: Greg Markowsky, Dylan Phung

    Abstract: Let $f(x)$ be a non-zero polynomial with complex coefficients, and $M_p = \int_{0}^1 f(x)^p dx$ for $p$ a positive integer. In a recent paper, Müger and Tuset showed that $\limsup_{p \to \infty} |M_p|^{1/p} > 0$, and conjectured that this limit is equal to the maximum amongst the critical values of $f$ together with the values $|f(0)|$ and $|f(1)|$. We give an example that shows that this conjectu…

    Submitted 17 November, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: The first version mischaracterized the conjecture by Müger and Tuset

  39. arXiv:2011.03096

    cs.CL cs.LG

    Explain by Evidence: An Explainable Memory-based Neural Network for Question Answering

    Authors: Quan Tran, Nhan Dam, Tuan Lai, Franck Dernoncourt, Trung Le, Nham Le, Dinh Phung

    Abstract: Interpretability and explainability of deep neural networks are challenging due to their scale, complexity, and the agreeable notions on which the explaining process rests. Previous work, in particular, has focused on representing internal components of neural networks through human-friendly visuals and concepts. On the other hand, in real life, when making a decision, human tends to rely on simil…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted to COLING 2020

  40. arXiv:2010.06812

    cs.LG cs.CR

    Explain2Attack: Text Adversarial Attacks via Cross-Domain Interpretability

    Authors: Mahmoud Hossam, Trung Le, He Zhao, Dinh Phung

    Abstract: Training robust deep learning models for down-stream tasks is a critical challenge. Research has shown that down-stream models can be easily fooled with adversarial inputs that look like the training data, but slightly perturbed, in a way imperceptible to humans. Understanding the behavior of natural language models under these attacks is crucial to better defend these models against such attacks.…

    Submitted 16 January, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Preprint for accepted paper at 25th International Conference on Pattern Recognition (ICPR 2020)

    ACM Class: I.5.0; I.2.0

  41. arXiv:2010.06131

    cs.CV cs.CR cs.LG

    Learning to Attack with Fewer Pixels: A Probabilistic Post-hoc Framework for Refining Arbitrary Dense Adversarial Attacks

    Authors: He Zhao, Thanh Nguyen, Trung Le, Paul Montague, Olivier De Vel, Tamas Abraham, Dinh Phung

    Abstract: Deep neural network image classifiers are reported to be susceptible to adversarial evasion attacks, which use carefully crafted images created to mislead a classifier. Many adversarial attacks belong to the category of dense attacks, which generate adversarial examples by perturbing all the pixels of a natural image. To generate sparse perturbations, sparse attacks have been recently developed, w…

    Submitted 21 February, 2022; v1 submitted 12 October, 2020; originally announced October 2020.

  42. Motion-Encoded Particle Swarm Optimization for Moving Target Search Using UAVs

    Authors: Manh Duong Phung, Quang Phuc Ha

    Abstract: This paper presents a novel algorithm named the motion-encoded particle swarm optimization (MPSO) for finding a moving target with unmanned aerial vehicles (UAVs). From the Bayesian theory, the search problem can be converted to the optimization of a cost function that represents the probability of detecting the target. Here, the proposed MPSO is developed to solve that problem by encoding the sea…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Applied Soft Computing, 2020

  43. arXiv:2010.01739  [pdf, other]

    cs.CL

    Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models

    Authors: Thuy-Trang Vu, Dinh Phung, Gholamreza Haffari

    Abstract: Recent work has shown the importance of adaptation of broad-coverage contextualised embedding models on the domain of the target task of interest. Current self-supervised adaptation methods are simplistic, as the training signal comes from a small percentage of randomly masked-out tokens. In this paper, we show that careful masking strategies can bridge the knowledge gap of masked language…

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  44. arXiv:2009.12517  [pdf, other]

    cs.CL cs.AI cs.LG

    QuatRE: Relation-Aware Quaternions for Knowledge Graph Embeddings

    Authors: Dai Quoc Nguyen, Thanh Vu, Tu Dinh Nguyen, Dinh Phung

    Abstract: We propose a simple yet effective embedding model to learn quaternion embeddings for entities and relations in knowledge graphs. Our model aims to enhance correlations between head and tail entities given a relation within the Quaternion space with Hamilton product. The model achieves this goal by further associating each relation with two relation-aware rotations, which are used to rotate quatern…

    Submitted 8 March, 2022; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: Accepted to The ACM Web Conference 2022 (WWW '22) (Poster and Demo Track)

  45. arXiv:2009.09612  [pdf, other]

    cs.CV cs.LG

    Improving Ensemble Robustness by Collaboratively Promoting and Demoting Adversarial Robustness

    Authors: Anh Bui, Trung Le, He Zhao, Paul Montague, Olivier deVel, Tamas Abraham, Dinh Phung

    Abstract: Ensemble-based adversarial training is a principled approach to achieve robustness against adversarial attacks. An important technique of this approach is to control the transferability of adversarial examples among ensemble members. We propose in this work a simple yet effective strategy to collaborate among committee models of an ensemble model. This is achieved via the secure and insecure sets…

    Submitted 4 February, 2022; v1 submitted 21 September, 2020; originally announced September 2020.

  46. arXiv:2008.13537  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    Neural Topic Model via Optimal Transport

    Authors: He Zhao, Dinh Phung, Viet Huynh, Trung Le, Wray Buntine

    Abstract: Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained increasing research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, they often degrade their performance severely on short documents. The requirement of repar…

    Submitted 31 May, 2022; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: Published in ICLR 2021, link: https://openreview.net/forum?id=Oos98K9Lv-k, code: https://github.com/ethanhezhao/NeuralSinkhornTopicModel

  47. arXiv:2008.05089  [pdf, other]

    cs.LG stat.ML

    Quaternion Graph Neural Networks

    Authors: Dai Quoc Nguyen, Tu Dinh Nguyen, Dinh Phung

    Abstract: Recently, graph neural networks (GNNs) have become an important and active research direction in deep learning. It is worth noting that most of the existing GNN-based methods learn graph representations within the Euclidean vector space. Beyond the Euclidean space, learning representation and embeddings in hyper-complex space have also been shown to be a promising and effective approach. To this end, w…

    Submitted 6 October, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

    Comments: Camera-ready for ACML 2021. Additional implementations for Gated QGNNs, Dual QGNNs, Simplifying QGNNs

  48. arXiv:2008.02593  [pdf, other]

    cs.CV cs.LG eess.IV

    MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models

    Authors: Thanh Nguyen-Duc, He Zhao, Jianfei Cai, Dinh Phung

    Abstract: Deep learning methods usually require a large amount of training data and lack interpretability. In this paper, we propose a novel knowledge distillation and model interpretation framework for medical image classification that jointly solves the above two issues. Specifically, to address the data-hungry issue, a small student model is learned with less data by distilling knowledge from a cumbersom…

    Submitted 12 January, 2022; v1 submitted 6 August, 2020; originally announced August 2020.

    Journal ref: International Symposium on Biomedical Imaging (ISBI, 2022)

  49. arXiv:2007.05123  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Improving Adversarial Robustness by Enforcing Local and Global Compactness

    Authors: Anh Bui, Trung Le, He Zhao, Paul Montague, Olivier deVel, Tamas Abraham, Dinh Phung

    Abstract: The fact that deep neural networks are susceptible to crafted perturbations severely impacts the use of deep learning in certain domains of application. Among many developed defense models against such attacks, adversarial training emerges as the most successful method that consistently resists a wide range of attacks. In this work, based on an observation from a previous study that the representa…

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Proceedings of the European Conference on Computer Vision (ECCV) 2020

  50. arXiv:2006.12100  [pdf, other]

    cs.LG cs.CL cs.SI stat.ML

    A Self-Attention Network based Node Embedding Model

    Authors: Dai Quoc Nguyen, Tu Dinh Nguyen, Dinh Phung

    Abstract: Although several signs of progress have been made recently, limited research has been conducted for an inductive setting where embeddings are required for newly unseen nodes -- a setting encountered commonly in practical applications of deep learning for graph networks. This significantly affects the performance of downstream tasks such as node classification, link prediction or community extracti…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted version, ECML-PKDD 2020