Showing 1–29 of 29 results for author: Duong, C N

  1. arXiv:2304.07408

    cs.CV cs.LG

    Fairness in Visual Clustering: A Novel Transformer Clustering Approach

    Authors: Xuan-Bac Nguyen, Chi Nhan Duong, Marios Savvides, Kaushik Roy, Hugh Churchill, Khoa Luu

    Abstract: Promoting fairness for deep clustering models in unsupervised clustering settings to reduce demographic bias is a challenging goal. This is because of the limitation of large-scale balanced data with well-annotated labels for sensitive or protected attributes. In this paper, we first evaluate demographic bias in deep clustering models from the perspective of cluster purity, which is measured by th…

    Submitted 18 September, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  2. arXiv:2304.07372

    cs.CV

    CoMaL: Conditional Maximum Likelihood Approach to Self-supervised Domain Adaptation in Long-tail Semantic Segmentation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Pierce Helton, Ashley Dowling, Xin Li, Khoa Luu

    Abstract: The research in self-supervised domain adaptation in semantic segmentation has recently received considerable attention. Although GAN-based methods have become one of the most popular approaches to domain adaptation, they have suffered from some limitations. They are insufficient to model both global and local structures of a given image, especially in small regions of tail classes. Moreover, they…

    Submitted 14 April, 2023; originally announced April 2023.

  3. arXiv:2304.07199

    cs.CV

    CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ashley Dowling, Son Lam Phung, Jackson Cothren, Khoa Luu

    Abstract: Understanding semantic scene segmentation of urban scenes captured from the Unmanned Aerial Vehicles (UAV) perspective plays a vital role in building a perception model for UAV. With the limitations of large-scale densely labeled data, semantic scene segmentation for UAV views requires a broad understanding of an object from both its top and side views. Adapting from well-annotated autonomous driv…

    Submitted 14 April, 2023; originally announced April 2023.

  4. arXiv:2304.03195

    cs.CV

    Micron-BERT: BERT-based Facial Micro-Expression Recognition

    Authors: Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu

    Abstract: Micro-expression recognition is one of the most challenging topics in affective computing. It aims to recognize tiny facial movements difficult for humans to perceive in a brief period, i.e., 0.25 to 0.5 seconds. Recent advances in pre-training deep Bidirectional Transformers (BERT) have significantly improved self-supervised learning tasks in computer vision. However, the standard BERT in vision…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR2023

  5. arXiv:2211.09663

    cs.CV

    Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach

    Authors: Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Son Lam Phung, Ngan Le, Khoa Luu

    Abstract: The development of autonomous vehicles generates a tremendous demand for a low-cost solution with a complete set of camera sensors capturing the environment around the car. It is essential for object detection and tracking to address these new challenges in multi-camera settings. In order to address these challenges, this work introduces novel Single-Stage Global Association Tracking approaches to…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: In review PR journal. arXiv admin note: text overlap with arXiv:2204.09151

  6. arXiv:2209.04920

    cs.CV

    Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Marios Savvides, Khoa Luu

    Abstract: In this work, we investigate the problem of face reconstruction given a facial feature representation extracted from a blackbox face recognition engine. Indeed, it is a very challenging problem in practice due to the limitations of abstracted information from the engine. We, therefore, introduce a new method named Attention-based Bijective Generative Adversarial Networks in a Distillation framewor…

    Submitted 1 September, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2003.06958

  7. arXiv:2207.04551

    cs.CV

    Depth Perspective-aware Multiple Object Tracking

    Authors: Kha Gia Quach, Huu Le, Pha Nguyen, Chi Nhan Duong, Tien Dai Bui, Khoa Luu

    Abstract: This paper aims to tackle Multiple Object Tracking (MOT), an important problem in computer vision but remains challenging due to many practical issues, especially occlusions. Indeed, we propose a new real-time Depth Perspective-aware Multiple Object Tracking (DP-MOT) approach to tackle the occlusion problem in MOT. A simple yet efficient Subject-Ordered Depth Estimation (SODE) is first proposed to…

    Submitted 27 February, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: In review PR journal

  8. arXiv:2204.09151

    cs.CV

    Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles

    Authors: Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Ngan Le, Xuan-Bac Nguyen, Khoa Luu

    Abstract: The development of autonomous vehicles provides an opportunity to have a complete set of camera sensors capturing the environment around the car. Thus, it is important for object detection and tracking to address new challenges, such as achieving consistent results across views of cameras. To address these challenges, this work presents a new Global Association Graph Model with Link Prediction app…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPRW 2022

  9. arXiv:2203.10233

    cs.CV

    DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

    Authors: Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu

    Abstract: Human action recognition has recently become one of the popular research topics in the computer vision community. Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results. However, these methods have suffered some fundamental limitations such as lack of robustness and generalization, e.g., h…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  10. arXiv:2108.03267

    cs.CV

    BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu

    Abstract: Semantic segmentation aims to predict pixel-level labels. It has become a popular task in various computer vision applications. While fully supervised segmentation methods have achieved high accuracy on large-scale vision datasets, they are unable to generalize on a new test environment or a new domain well. In this work, we first introduce a new Un-aligned Domain Score to measure the efficiency o…

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  11. arXiv:2108.03256

    cs.CV cs.SD eess.AS

    The Right to Talk: An Audio-Visual Transformer Approach

    Authors: Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu

    Abstract: Turn-taking has played an essential role in structuring the regulation of a conversation. The task of identifying the main speaker (who is properly taking his/her turn of speaking) and the interrupters (who are interrupting or reacting to the main speaker's utterances) remains a challenging task. Although some prior methods have partially addressed this task, there still remain some limitations. F…

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  12. arXiv:2106.06856

    cs.CV

    DyGLIP: A Dynamic Graph Model with Link Prediction for Accurate Multi-Camera Multiple Object Tracking

    Authors: Kha Gia Quach, Pha Nguyen, Huu Le, Thanh-Dat Truong, Chi Nhan Duong, Minh-Triet Tran, Khoa Luu

    Abstract: Multi-Camera Multiple Object Tracking (MC-MOT) is a significant computer vision problem due to its emerging applicability in several real-world applications. Despite a large number of existing works, solving the data association problem in any MC-MOT pipeline is arguably one of the most challenging tasks. Developing a robust MC-MOT system, however, is still highly challenging due to many practical…

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: accepted at CVPR 2021

  13. arXiv:2004.05085

    cs.CV

    LIAAD: Lightweight Attentive Angular Distillation for Large-scale Age-Invariant Face Recognition

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Kha Gia Quach, Ngan Le, Tien D. Bui, Khoa Luu

    Abstract: Disentangled representations have been commonly adopted to Age-invariant Face Recognition (AiFR) tasks. However, these methods have reached some limitations with (1) the requirement of large-scale face recognition (FR) training data with age labels, which is limited in practice; (2) heavy deep network architectures for high performance; and (3) their evaluations are usually taken place on age-rela…

    Submitted 11 September, 2022; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1905.10620

  14. arXiv:2003.06958

    cs.CV

    Vec2Face: Unveil Human Faces from their Blackbox Features in Face Recognition

    Authors: Chi Nhan Duong, Thanh-Dat Truong, Kha Gia Quach, Hung Bui, Kaushik Roy, Khoa Luu

    Abstract: Unveiling face images of a subject given his/her high-level representations extracted from a blackbox Face Recognition engine is extremely challenging. It is because the limitations of accessible information from that engine including its structure and uninterpretable extracted features. This paper presents a novel generative structure with Bijective Metric Learning, namely Bijective Generative Ad…

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  15. arXiv:1905.13040

    cs.CV

    Domain Generalization via Universal Non-volume Preserving Models

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Khoa Luu, Minh-Triet Tran, Ngan Le

    Abstract: Recognition across domains has recently become an active topic in the research community. However, it has been largely overlooked in the problem of recognition in new unseen domains. Under this condition, the delivered deep network models are unable to be updated, adapted, or fine-tuned. Therefore, recent deep learning techniques, such as domain adaptation, feature transferring, and fine-tuning, c…

    Submitted 10 April, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted to Computer and Robot Vision 2020. arXiv admin note: substantial text overlap with arXiv:1812.03407

  16. arXiv:1905.12028

    cs.CV

    Image Alignment in Unseen Domains via Domain Deep Generalization

    Authors: Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

    Abstract: Image alignment across domains has recently become one of the realistic and popular topics in the research community. In this problem, a deep learning-based image alignment method is usually trained on an available largescale database. During the testing steps, this trained model is deployed on unseen images collected under different camera conditions and modalities. The delivered deep network mod…

    Submitted 31 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  17. arXiv:1905.10620

    cs.CV

    ShrinkTeaNet: Million-scale Lightweight Face Recognition via Shrinking Teacher-Student Networks

    Authors: Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Ngan Le

    Abstract: Large-scale face recognition in-the-wild has been recently achieved matured performance in many real work applications. However, such systems are built on GPU platforms and mostly deploy heavy deep network architectures. Given a high-performance heavy network as a teacher, this work presents a simple and elegant teacher-student learning paradigm, namely ShrinkTeaNet, to train a portable student ne…

    Submitted 25 May, 2019; originally announced May 2019.

  18. arXiv:1905.10170

    cs.CV

    Fast Flow Reconstruction via Robust Invertible nxn Convolution

    Authors: Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

    Abstract: Flow-based generative models have recently become one of the most efficient approaches to model data generation. Indeed, they are constructed with a sequence of invertible and tractable transformations. Glow first introduced a simple type of generative flow using an invertible $1 \times 1$ convolution. However, the $1 \times 1$ convolution suffers from limited flexibility compared to the standard…

    Submitted 6 August, 2022; v1 submitted 24 May, 2019; originally announced May 2019.

  19. arXiv:1812.03407

    cs.CV

    Beyond Domain Adaptation: Unseen Domain Encapsulation via Universal Non-volume Preserving Models

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Khoa Luu, Minh-Triet Tran, Minh Do

    Abstract: Recognition across domains has recently become an active topic in the research community. However, it has been largely overlooked in the problem of recognition in new unseen domains. Under this condition, the delivered deep network models are unable to be updated, adapted or fine-tuned. Therefore, recent deep learning techniques, such as: domain adaptation, feature transferring, and fine-tuning, c…

    Submitted 8 December, 2018; originally announced December 2018.

  20. arXiv:1811.11849

    cs.CV

    Non-Volume Preserving-based Fusion to Group-Level Emotion Recognition on Crowd Videos

    Authors: Kha Gia Quach, Ngan Le, Chi Nhan Duong, Ibsa Jalata, Kaushik Roy, Khoa Luu

    Abstract: Group-level emotion recognition (ER) is a growing research area as the demands for assessing crowds of all sizes are becoming an interest in both the security arena as well as social media. This work extends the earlier ER investigations, which focused on either group-level ER on single images or within a video, by fully investigating group-level expression recognition on crowd videos. In this pap…

    Submitted 23 March, 2022; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: In press at Pattern Recognition Journal

  21. arXiv:1811.11082

    cs.CV

    Automatic Face Aging in Videos via Deep Reinforcement Learning

    Authors: Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Nghia Nguyen, Eric Patterson, Tien D. Bui, Ngan Le

    Abstract: This paper presents a novel approach to synthesize automatically age-progressed facial images in video sequences using Deep Reinforcement Learning. The proposed method models facial structures and the longitudinal face-aging process of given subjects coherently across video frames. The approach is optimized using a long-term reward, Reinforcement Learning function with deep feature extraction from…

    Submitted 24 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: CVPR2019 Camera Ready, https://face-aging.github.io/RL-VAP/

  22. arXiv:1811.11080

    cs.CV

    MobiFace: A Lightweight Deep Learning Face Recognition on Mobile Devices

    Authors: Chi Nhan Duong, Kha Gia Quach, Ibsa Jalata, Ngan Le, Khoa Luu

    Abstract: Deep neural networks have been widely used in numerous computer vision applications, particularly in face recognition. However, deploying deep neural network face recognition on mobile devices has recently become a trend but still limited since most high-accuracy deep models are both time and GPU consumption in the inference stage. Therefore, developing a lightweight deep neural network is one of…

    Submitted 17 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

  23. arXiv:1802.08726

    cs.CV

    Longitudinal Face Aging in the Wild - Recent Deep Learning Approaches

    Authors: Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

    Abstract: Face Aging has raised considerable attentions and interest from the computer vision community in recent years. Numerous approaches ranging from purely image processing techniques to deep learning structures have been proposed in literature. In this paper, we aim to give a review of recent developments of modern deep learning based approaches, i.e. Deep Generative Models, for Face Aging task. Their…

    Submitted 23 February, 2018; originally announced February 2018.

  24. arXiv:1711.10520

    cs.CV

    Learning from Longitudinal Face Demonstration - Where Tractable Deep Modeling Meets Inverse Reinforcement Learning

    Authors: Chi Nhan Duong, Kha Gia Quach, Khoa Luu, T. Hoang Ngan Le, Marios Savvides, Tien D. Bui

    Abstract: This paper presents a novel Subject-dependent Deep Aging Path (SDAP), which inherits the merits of both Generative Probabilistic Modeling and Inverse Reinforcement Learning to model the facial structures and the longitudinal face aging process of a given subject. The proposed SDAP is optimized using tractable log-likelihood objective functions with Convolutional Neural Networks (CNNs) based deep f…

    Submitted 2 February, 2019; v1 submitted 28 November, 2017; originally announced November 2017.

  25. arXiv:1704.03594

    cs.CV

    Deep Contextual Recurrent Residual Networks for Scene Labeling

    Authors: T. Hoang Ngan Le, Chi Nhan Duong, Ligong Han, Khoa Luu, Marios Savvides, Dipan Pal

    Abstract: Designed as extremely deep architectures, deep residual networks which provide a rich visual representation and offer robust convergence behaviors have recently achieved exceptional performance in numerous computer vision problems. Being directly applied to a scene labeling problem, however, they were limited to capture long-range contextual dependence, which is a critical aspect. To address this…

    Submitted 11 April, 2017; originally announced April 2017.

  26. arXiv:1703.08617

    cs.CV

    Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition

    Authors: Chi Nhan Duong, Kha Gia Quach, Khoa Luu, T. Hoang Ngan Le, Marios Savvides

    Abstract: Modeling the long-term facial aging process is extremely challenging due to the presence of large and non-linear variations during the face development stages. In order to efficiently address the problem, this work first decomposes the aging process into multiple short-term stages. Then, a novel generative probabilistic model, named Temporal Non-Volume Preserving (TNVP) transformation, is presente…

    Submitted 24 March, 2017; originally announced March 2017.

  27. Deep Appearance Models: A Deep Boltzmann Machine Approach for Face Modeling

    Authors: Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

    Abstract: The "interpretation through synthesis" approach to analyze face images, particularly Active Appearance Models (AAMs) method, has become one of the most successful face modeling approaches over the last two decades. AAM models have ability to represent face images through synthesis using a controllable parameterized Principal Component Analysis (PCA) model. However, the accuracy and robustness of t…

    Submitted 21 December, 2017; v1 submitted 22 July, 2016; originally announced July 2016.

  28. arXiv:1607.00659

    cs.CV

    Robust Deep Appearance Models

    Authors: Kha Gia Quach, Chi Nhan Duong, Khoa Luu, Tien D. Bui

    Abstract: This paper presents a novel Robust Deep Appearance Models to learn the non-linear correlation between shape and texture of face images. In this approach, two crucial components of face images, i.e. shape and texture, are represented by Deep Boltzmann Machines and Robust Deep Boltzmann Machines (RDBM), respectively. The RDBM, an alternative form of Robust Boltzmann Machines, can separate corrupted/…

    Submitted 3 July, 2016; originally announced July 2016.

    Comments: 6 pages, 8 figures, submitted to ICPR 2016

  29. Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines

    Authors: Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

    Abstract: Modeling the face aging process is a challenging task due to large and non-linear variations present in different stages of face development. This paper presents a deep model approach for face age progression that can efficiently capture the non-linear aging process and automatically synthesize a series of age-progressed faces in various age ranges. In this approach, we first decompose the long-te…

    Submitted 7 June, 2016; originally announced June 2016.

    Comments: in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016