Skip to main content

Showing 1–50 of 442 results for author: Tran, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15883  [pdf, other

    cs.CL cs.AI

    SimSMoE: Solving Representational Collapse via Similarity Measure

    Authors: Giang Do, Hung Le, Truyen Tran

    Abstract: Sparse mixture of experts (SMoE) have emerged as an effective approach for scaling large language models while kee** a constant computational cost. Regardless of several notable successes of SMoE, effective training such architecture remains elusive due to the representation collapse problem, which in turn harms model performance and causes parameter redundancy. In this work, we present Similari… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2406.14220  [pdf

    cs.CV cs.LG

    Evaluation of Deep Learning Semantic Segmentation for Land Cover Map** on Multispectral, Hyperspectral and High Spatial Aerial Imagery

    Authors: Ilham Adi Panuntun, Ying-Nong Chen, Ilham Jamaluddin, Thi Linh Chi Tran

    Abstract: In the rise of climate change, land cover map** has become such an urgent need in environmental monitoring. The accuracy of land cover classification has gotten increasingly based on the improvement of remote sensing data. Land cover classification using satellite imageries has been explored and become more prevalent in recent years, but the methodologies remain some drawbacks of subjective and… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: conference, This preprint is based on the following published conference article: Panuntun, I. A., Chen, Y.-N., Jamaluddin, I., & Tran, T. L. C., 2023. Evaluation of Deep Learning Semantic Segmentation for Land Cover Map** on Multispectral, Hyperspectral and High Spatial Aerial Imagery. 44th Asian Conference on Remote Sensing, ACRS 2023. Code 198676

    Journal ref: 44th Asian Conference on Remote Sensing, ACRS 2023. Code 198676

  3. arXiv:2406.13725  [pdf, other

    cs.LG cs.AI stat.ML

    Tree-Sliced Wasserstein Distance on a System of Lines

    Authors: Viet-Hoang Tran, Trang Pham, Tho Tran, Tam Le, Tan M. Nguyen

    Abstract: Sliced Wasserstein (SW) distance in Optimal Transport (OT) is widely used in various applications thanks to its statistical effectiveness and computational efficiency. On the other hand, Tree Wassenstein (TW) and Tree-sliced Wassenstein (TSW) are instances of OT for probability measures where its ground cost is a tree metric. TSW also has a low computational complexity, i.e. linear to the number o… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 33 pages, 6 figures, 2 tables, 4 algorithms

  4. arXiv:2406.11146  [pdf, other

    cs.HC

    Designing Interactions with Autonomous Physical Systems

    Authors: Marius Hoggenmueller, Tram Thi Minh Tran, Luke Hespanhol, Martin Tomitsch

    Abstract: In this position paper, we present a collection of four different prototy** approaches which we have developed and applied to prototype and evaluate interfaces for and interactions around autonomous physical systems. Further, we provide a classification of our approaches aiming to support other researchers and designers in choosing appropriate prototy** platforms and representations.

    Submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.09128  [pdf, other

    cs.CL

    CoastTerm: a Corpus for Multidisciplinary Term Extraction in Coastal Scientific Literature

    Authors: Julien Delaunay, Hanh Thi Hong Tran, Carlos-Emiliano González-Gallardo, Georgeta Bordea, Mathilde Ducos, Nicolas Sidere, Antoine Doucet, Senja Pollak, Olivier De Viron

    Abstract: The growing impact of climate change on coastal areas, particularly active but fragile regions, necessitates collaboration among diverse stakeholders and disciplines to formulate effective environmental protection policies. We introduce a novel specialized corpus comprising 2,491 sentences from 410 scientific abstracts concerning coastal areas, for the Automatic Term Extraction (ATE) and Classific… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  6. Context-Based Interface Prototy**: Understanding the Effect of Prototype Representation on User Feedback

    Authors: Marius Hoggenmueller, Martin Tomitsch, Luke Hespanhol, Tram Thi Minh Tran, Stewart Worrall, Eduardo Nebot

    Abstract: The rise of autonomous systems in cities, such as automated vehicles (AVs), requires new approaches for prototy** and evaluating how people interact with those systems through context-based user interfaces, such as external human-machine interfaces (eHMIs). In this paper, we present a comparative study of three prototype representations (real-world VR, computer-generated VR, real-world video) of… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.01457  [pdf, other

    cs.LG cs.CL cs.CR

    Differentially Private Tabular Data Synthesis using Large Language Models

    Authors: Toan V. Tran, Li Xiong

    Abstract: Synthetic tabular data generation with differential privacy is a crucial problem to enable data sharing with formal privacy. Despite a rich history of methodological research and development, develo** differentially private tabular data generators that can provide realistic synthetic datasets remains challenging. This paper introduces DP-LLMTGen -- a novel framework for differentially private ta… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2406.00837  [pdf, other

    cs.RO

    Arena 3.0: Advancing Social Navigation in Collaborative and Highly Dynamic Environments

    Authors: Linh Kästner, Volodymyir Shcherbyna, Huajian Zeng, Tuan Anh Le, Maximilian Ho-Kyoung Schreff, Halid Osmaev, Nam Truong Tran, Diego Diaz, Jan Golebiowski, Harold Soh, Jens Lambrecht

    Abstract: Building upon our previous contributions, this paper introduces Arena 3.0, an extension of Arena-Bench, Arena 1.0, and Arena 2.0. Arena 3.0 is a comprehensive software stack containing multiple modules and simulation environments focusing on the development, simulation, and benchmarking of social navigation approaches in collaborative environments. We significantly enhance the realism of human beh… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 11 pages, 6 figures

    Journal ref: Robotics Science and Systems 2024, Delft Netherlands

  9. Building a temperature forecasting model for the city with the regression neural network (RNN)

    Authors: Nguyen Phuc Tran, Duy Thanh Tran, Thi Thuy Nga Duong

    Abstract: In recent years, a study by environmental organizations in the world and Vietnam shows that weather change is quite complex. global warming has become a serious problem in the modern world, which is a concern for scientists. last century, it was difficult to forecast the weather due to missing weather monitoring stations and technological limitations. this made it hard to collect data for building… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 6 pages

    Journal ref: The 6th International Conference for Small & Medium Business in 2020 (ICSMB 2020)

  10. arXiv:2405.16204  [pdf, other

    cs.CV cs.AI cs.GR

    VOODOO XP: Expressive One-Shot Head Reenactment for VR Telepresence

    Authors: Phong Tran, Egor Zakharov, Long-Nhat Ho, Liwen Hu, Adilbek Karmanov, Aviral Agarwal, McLean Goldwhite, Ariana Bermudez Venegas, Anh Tuan Tran, Hao Li

    Abstract: We introduce VOODOO XP: a 3D-aware one-shot head reenactment method that can generate highly expressive facial expressions from any input driver video and a single 2D portrait. Our solution is real-time, view-consistent, and can be instantly used without calibration or fine-tuning. We demonstrate our solution on a monocular video setting and an end-to-end VR telepresence system for two-way communi… ▽ More

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  11. arXiv:2405.15779  [pdf

    eess.IV cs.AI cs.CV

    LiteNeXt: A Novel Lightweight ConvMixer-based Model with Self-embedding Representation Parallel for Medical Image Segmentation

    Authors: Ngoc-Du Tran, Thi-Thao Tran, Quang-Huy Nguyen, Manh-Hung Vu, Van-Truong Pham

    Abstract: The emergence of deep learning techniques has advanced the image segmentation task, especially for medical images. Many neural network models have been introduced in the last decade bringing the automated segmentation accuracy close to manual segmentation. However, cutting-edge models like Transformer-based architectures rely on large scale annotated training data, and are generally designed with… ▽ More

    Submitted 3 April, 2024; originally announced May 2024.

    Comments: 35 pages, 9 figures, 10 tables

  12. arXiv:2405.03011  [pdf

    cs.CV cs.AI

    AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation

    Authors: Viet-Thanh Nguyen, Van-Truong Pham, Thi-Thao Tran

    Abstract: Skin lesion segmentation is a critical task in computer-aided diagnosis systems for dermatological diseases. Accurate segmentation of skin lesions from medical images is essential for early detection, diagnosis, and treatment planning. In this paper, we propose a new model for skin lesion segmentation namely AC-MambaSeg, an enhanced model that has the hybrid CNN-Mamba backbone, and integrates adva… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 15 pages, 7 figures, 4 tables

  13. arXiv:2405.01815  [pdf, other

    cs.SD cs.AI eess.AS

    Toward end-to-end interpretable convolutional neural networks for waveform signals

    Authors: Linh Vu, Thu Tran, Wern-Han Lim, Raphael Phan

    Abstract: This paper introduces a novel convolutional neural networks (CNN) framework tailored for end-to-end audio deep learning models, presenting advancements in efficiency and explainability. By benchmarking experiments on three standard speech emotion recognition datasets with five-fold cross-validation, our framework outperforms Mel spectrogram features by up to seven percent. It can potentially repla… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  14. arXiv:2404.11870  [pdf, ps, other

    cs.LG cs.CL

    Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory

    Authors: Hung Le, Dung Nguyen, Kien Do, Svetha Venkatesh, Truyen Tran

    Abstract: We propose Pointer-Augmented Neural Memory (PANM) to help neural networks understand and apply symbol processing to new, longer sequences of data. PANM integrates an external neural memory that uses novel physical addresses and pointer manipulation techniques to mimic human and computer symbol processing abilities. PANM facilitates pointer assignment, dereference, and arithmetic by explicitly usin… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Preprint

  15. arXiv:2404.11049  [pdf, other

    cs.LG cs.AI cs.CL

    Stepwise Alignment for Constrained Language Model Policy Optimization

    Authors: Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto

    Abstract: Safety and trustworthiness are indispensable requirements for real-world applications of AI systems using large language models (LLMs). This paper formulates human value alignment as an optimization problem of the language model policy to maximize reward under a safety constraint, and then proposes an algorithm, Stepwise Alignment for Constrained Policy Optimization (SACPO). One key idea behind SA… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  16. arXiv:2404.03414  [pdf, other

    cs.CL cs.AI

    Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought

    Authors: Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut, Kai-Wei Chang, Chengwei Su

    Abstract: We introduce a novel framework, LM-Guided CoT, that leverages a lightweight (i.e., <1B) language model (LM) for guiding a black-box large (i.e., >10B) LM in reasoning tasks. Specifically, the lightweight LM first generates a rationale for each input instance. The Frozen large LM is then prompted to predict a task output based on the rationale generated by the lightweight LM. Our approach is resour… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: This paper is accepted to LREC-COLING 2024

  17. Exploring Holistic HMI Design for Automated Vehicles: Insights from a Participatory Workshop to Bridge In-Vehicle and External Communication

    Authors: Haoyu Dong, Tram Thi Minh Tran, Rutger Verstegen, Silvia Cazacu, Ruolin Gao, Marius Hoggenmüller, Debargha Dey, Mervyn Franssen, Markus Sasalovici, Pavlo Bazilinskyy, Marieke Martens

    Abstract: Human-Machine Interfaces (HMIs) for automated vehicles (AVs) are typically divided into two categories: internal HMIs for interactions within the vehicle, and external HMIs for communication with other road users. In this work, we examine the prospects of bridging these two seemingly distinct domains. Through a participatory workshop with automotive user interface researchers and practitioners, we… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  18. arXiv:2403.18871  [pdf

    cs.CV cs.AI cs.LG

    Clinical Domain Knowledge-Derived Template Improves Post Hoc AI Explanations in Pneumothorax Classification

    Authors: Han Yuan, Chuan Hong, Pengtao Jiang, Gangming Zhao, Nguyen Tuan Anh Tran, Xinxing Xu, Yet Yen Yan, Nan Liu

    Abstract: Background: Pneumothorax is an acute thoracic disease caused by abnormal air collection between the lungs and chest wall. To address the opaqueness often associated with deep learning (DL) models, explainable artificial intelligence (XAI) methods have been introduced to outline regions related to pneumothorax diagnoses made by DL models. However, these explanations sometimes diverge from actual le… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  19. arXiv:2403.16016  [pdf, other

    cs.CV cs.AI

    Fill in the ____ (a Diffusion-based Image Inpainting Pipeline)

    Authors: Eyoel Gebre, Krishna Saxena, Timothy Tran

    Abstract: Image inpainting is the process of taking an image and generating lost or intentionally occluded portions. Inpainting has countless applications including restoring previously damaged pictures, restoring the quality of images that have been degraded due to compression, and removing unwanted objects/text. Modern inpainting techniques have shown remarkable ability in generating sensible completions… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  20. Holistic HMI Design for Automated Vehicles: Bridging In-Vehicle and External Communication

    Authors: Haoyu Dong, Tram Thi Minh Tran, Pavlo Bazilinskyy, Marius Hoggenmüller, Debargha Dey, Silvia Cazacu, Mervyn Franssen, Ruolin Gao

    Abstract: As the field of automated vehicles (AVs) advances, it has become increasingly critical to develop human-machine interfaces (HMI) for both internal and external communication. Critical dialogue is emerging around the potential necessity for a holistic approach to HMI designs, which promotes the integration of both in-vehicle user and external road user perspectives. This approach aims to create a u… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  21. A Review of Virtual Reality Studies on Autonomous Vehicle--Pedestrian Interaction

    Authors: Tram Thi Minh Tran, Callum Parker, Martin Tomitsch

    Abstract: An increasing number of studies employ virtual reality (VR) to evaluate interactions between autonomous vehicles (AVs) and pedestrians. VR simulators are valued for their cost-effectiveness, flexibility in develo** various traffic scenarios, safe conduct of user studies, and acceptable ecological validity. Reviewing the literature between 2010 and 2020, we found 31 empirical studies using VR as… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  22. Simulating Wearable Urban Augmented Reality Experiences in VR: Lessons Learnt from Designing Two Future Urban Interfaces

    Authors: Tram Thi Minh Tran, Callum Parker, Marius Hoggenmüller, Luke Hespanhol, Martin Tomitsch

    Abstract: Augmented reality (AR) has the potential to fundamentally change how people engage with increasingly interactive urban environments. However, many challenges exist in designing and evaluating these new urban AR experiences, such as technical constraints and safety concerns associated with outdoor AR. We contribute to this domain by assessing the use of virtual reality (VR) for simulating wearable… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  23. arXiv:2403.07027  [pdf, ps, other

    cs.LG

    FWin transformer for dengue prediction under climate and ocean influence

    Authors: Nhat Thanh Tran, Jack Xin, Guofa Zhou

    Abstract: Dengue fever is one of the most deadly mosquito-born tropical infectious diseases. Detailed long range forecast model is vital in controlling the spread of disease and making mitigation efforts. In this study, we examine methods used to forecast dengue cases for long range predictions. The dataset consists of local climate/weather in addition to global climate indicators of Singapore from 2000 to… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  24. Designing Wearable Augmented Reality Concepts to Support Scalability in Autonomous Vehicle-Pedestrian Interaction

    Authors: Tram Thi Minh Tran, Callum Parker, Yiyuan Wang, Martin Tomitsch

    Abstract: Wearable augmented reality (AR) offers new ways for supporting the interaction between autonomous vehicles (AVs) and pedestrians due to its ability to integrate timely and contextually relevant data into the user's field of view. This article presents novel wearable AR concepts that assist crossing pedestrians in multi-vehicle scenarios where several AVs frequent the road from both directions. Thr… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  25. arXiv:2403.05727  [pdf, other

    cs.HC

    Sco** Out the Scalability Issues of Autonomous Vehicle-Pedestrian Interaction

    Authors: Tram Thi Minh Tran, Callum Parker, Martin Tomitsch

    Abstract: Autonomous vehicles (AVs) may use external interfaces, such as LED light bands, to communicate with pedestrians safely and intuitively. While previous research has demonstrated the effectiveness of these interfaces in simple traffic scenarios involving one pedestrian and one vehicle, their performance in more complex scenarios with multiple road users remains unclear. The scalability of AV externa… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  26. Exploring the Impact of Interconnected External Interfaces in Autonomous Vehicleson Pedestrian Safety and Experience

    Authors: Tram Thi Minh Tran, Callum Parker, Marius Hoggenmuller, Yiyuan Wang, Martin Tomitsch

    Abstract: Policymakers advocate for the use of external Human-Machine Interfaces (eHMIs) to allow autonomous vehicles (AVs) to communicate their intentions or status. Nonetheless, scalability concerns in complex traffic scenarios arise, such as potentially increasing pedestrian cognitive load or conveying contradictory signals. Building upon precursory works, our study explores 'interconnected eHMIs,' where… ▽ More

    Submitted 17 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  27. arXiv:2403.03180  [pdf, other

    math.OC cs.LG

    Shuffling Momentum Gradient Algorithm for Convex Optimization

    Authors: Trang H. Tran, Quoc Tran-Dinh, Lam M. Nguyen

    Abstract: The Stochastic Gradient Descent method (SGD) and its stochastic variants have become methods of choice for solving finite-sum optimization problems arising from machine learning and data science thanks to their ability to handle large-scale applications and big datasets. In the last decades, researchers have made substantial effort to study the theoretical performance of SGD and its shuffling vari… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Vietnam Journal of Mathematics (VJOM), Special issue dedicated to Dr. Tamás Terlaky on the occasion of his 70th birthday, 2024

  28. arXiv:2403.00752  [pdf, other

    cs.MM cs.PF

    An Experimental Study of Low-Latency Video Streaming over 5G

    Authors: Imran Khan, Tuyen X. Tran, Matti Hiltunen, Theodore Karagioules, Dimitrios Koutsonikolas

    Abstract: Low-latency video streaming over 5G has become rapidly popular over the last few years due to its increased usage in hosting virtual events, online education, webinars, and all-hands meetings. Our work aims to address the absence of studies that reveal the real-world behavior of low-latency video streaming. To that end, we provide an experimental methodology and measurements, collected in a US met… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 6 Pages

  29. arXiv:2402.14124  [pdf, other

    cs.CR

    Fake Resume Attacks: Data Poisoning on Online Job Platforms

    Authors: Michiharu Yamashita, Thanh Tran, Dongwon Lee

    Abstract: While recent studies have exposed various vulnerabilities incurred from data poisoning attacks in many web services, little is known about the vulnerability on online professional job platforms (e.g., LinkedIn and Indeed). In this work, first time, we demonstrate the critical vulnerabilities found in the common Human Resources (HR) task of matching job seekers and companies on online job platforms… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted at The Web Conference 2024 (WWW'24)

  30. arXiv:2402.10765  [pdf, other

    cs.LG cs.AI

    Policy Learning for Off-Dynamics RL with Deficient Support

    Authors: Linh Le Pham Van, Hung The Tran, Sunil Gupta

    Abstract: Reinforcement Learning (RL) can effectively learn complex policies. However, learning these policies often demands extensive trial-and-error interactions with the environment. In many real-world scenarios, this approach is not practical due to the high costs of data collection and safety concerns. As a result, a common strategy is to transfer a policy trained in a low-cost, rapid source simulator… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted by AAMAS 2024 as a full paper

  31. arXiv:2402.05484  [pdf

    cs.SE cs.AI

    Leveraging AI for Enhanced Software Effort Estimation: A Comprehensive Study and Framework Proposal

    Authors: Nhi Tran, Tan Tran, Nam Nguyen

    Abstract: This paper presents an extensive study on the application of AI techniques for software effort estimation in the past five years from 2017 to 2023. By overcoming the limitations of traditional methods, the study aims to improve accuracy and reliability. Through performance evaluation and comparison with diverse Machine Learning models, including Artificial Neural Network (ANN), Support Vector Mach… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  32. arXiv:2402.04209  [pdf

    cs.LG cs.AI

    Acute kidney injury prediction for non-critical care patients: a retrospective external and internal validation study

    Authors: Esra Adiyeke, Yuanfang Ren, Benjamin Shickel, Matthew M. Ruppert, Ziyuan Guan, Sandra L. Kane-Gill, Raghavan Murugan, Nabihah Amatullah, Britney A. Stottlemyer, Tiffany L. Tran, Dan Ricketts, Christopher M Horvat, Parisa Rashidi, Azra Bihorac, Tezcan Ozrazgat-Baslanti

    Abstract: Background: Acute kidney injury (AKI), the decline of kidney excretory function, occurs in up to 18% of hospitalized admissions. Progression of AKI may lead to irreversible kidney damage. Methods: This retrospective cohort study includes adult patients admitted to a non-intensive care unit at the University of Pittsburgh Medical Center (UPMC) (n = 46,815) and University of Florida Health (UFH) (n… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  33. arXiv:2402.03577  [pdf, other

    cs.LG

    Revisiting the Dataset Bias Problem from a Statistical Perspective

    Authors: Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

    Abstract: In this paper, we study the "dataset bias" problem from a statistical standpoint, and identify the main cause of the problem as the strong correlation between a class attribute u and a non-class attribute b in the input x, represented by p(u|b) differing significantly from p(u). Since p(u|b) appears as part of the sampling distributions in the standard maximum log-likelihood (MLL) objective, a mod… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  34. arXiv:2402.03243  [pdf, other

    cs.LG

    PINN-BO: A Black-box Optimization Algorithm using Physics-Informed Neural Networks

    Authors: Dat Phan-Trong, Hung The Tran, Alistair Shilton, Sunil Gupta

    Abstract: Black-box optimization is a powerful approach for discovering global optima in noisy and expensive black-box functions, a problem widely encountered in real-world scenarios. Recently, there has been a growing interest in leveraging domain knowledge to enhance the efficacy of machine learning methods. Partial Differential Equations (PDEs) often provide an effective means for elucidating the fundame… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  35. arXiv:2402.01198  [pdf, other

    cs.IT eess.SP

    Physical Layer Location Privacy in SIMO Communication Using Fake Paths Injection

    Authors: Trong Duy Tran, Maxime Ferreira Da Costa, Linh Trung Nguyen

    Abstract: Fake path injection is an emerging paradigm for inducing privacy over wireless networks. In this paper, fake paths are injected by the transmitter into a SIMO multipath communication channel to preserve her physical location from an eavesdropper. A novel statistical privacy metric is defined as the ratio between the largest (resp. smallest) eigenvalues of Bob's (resp. Eve's) Cramér-Rao lower bound… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  36. arXiv:2401.17264  [pdf, other

    cs.SD cs.AI cs.CR

    Proactive Detection of Voice Cloning with Localized Watermarking

    Authors: Robin San Roman, Pierre Fernandez, Alexandre Défossez, Teddy Furon, Tuan Tran, Hady Elsahar

    Abstract: In the rapidly evolving field of speech generative models, there is a pressing need to ensure audio authenticity against the risks of voice cloning. We present AudioSeal, the first audio watermarking technique designed specifically for localized detection of AI-generated speech. AudioSeal employs a generator/detector architecture trained jointly with a localization loss to enable localized waterma… ▽ More

    Submitted 6 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Published at ICML 2024. Code at https://github.com/facebookresearch/audioseal - webpage at https://pierrefdz.github.io/publications/audioseal/

  37. arXiv:2401.11062  [pdf

    cs.CV

    Learned Image resizing with efficient training (LRET) facilitates improved performance of large-scale digital histopathology image classification models

    Authors: Md Zahangir Alom, Quynh T. Tran, Brent A. Orr

    Abstract: Histologic examination plays a crucial role in oncology research and diagnostics. The adoption of digital scanning of whole slide images (WSI) has created an opportunity to leverage deep learning-based image classification methods to enhance diagnosis and risk stratification. Technical limitations of current approaches to training deep convolutional neural networks (DCNN) result in suboptimal mode… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 30 pages, 6 figures, 1 table

  38. arXiv:2401.09248  [pdf, other

    cs.CL cs.HC

    Learning from Emotions, Demographic Information and Implicit User Feedback in Task-Oriented Document-Grounded Dialogues

    Authors: Dominic Petrak, Thy Thy Tran, Iryna Gurevych

    Abstract: The success of task-oriented and document-grounded dialogue systems depends on users accepting and enjoying using them. To achieve this, recently published work in the field of Human-Computer Interaction suggests that the combination of considering demographic information, user emotions and learning from the implicit feedback in their utterances, is particularly important. However, these findings… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  39. arXiv:2401.03551  [pdf, other

    cs.CL cs.IR

    CAPTAIN at COLIEE 2023: Efficient Methods for Legal Information Retrieval and Entailment Tasks

    Authors: Chau Nguyen, Phuong Nguyen, Thanh Tran, Dat Nguyen, An Trieu, Tin Pham, Anh Dang, Le-Minh Nguyen

    Abstract: The Competition on Legal Information Extraction/Entailment (COLIEE) is held annually to encourage advancements in the automatic processing of legal texts. Processing legal documents is challenging due to the intricate structure and meaning of legal language. In this paper, we outline our strategies for tackling Task 2, Task 3, and Task 4 in the COLIEE 2023 competition. Our approach involved utiliz… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  40. arXiv:2401.02058  [pdf, other

    cs.LG stat.ML

    Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model

    Authors: Hien Dang, Tho Tran, Tan Nguyen, Nhat Ho

    Abstract: The current paradigm of training deep neural networks for classification tasks includes minimizing the empirical risk that pushes the training loss value towards zero, even after the training error has been vanished. In this terminal phase of training, it has been observed that the last-layer features collapse to their class-means and these class-means converge to the vertices of a simplex Equiang… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 2024 International Conference on Machine Learning

  41. arXiv:2312.12431  [pdf, other

    cs.CV

    On Inference Stability for Diffusion Models

    Authors: Viet Nguyen, Giang Vu, Tung Nguyen Thanh, Khoat Than, Toan Tran

    Abstract: Denoising Probabilistic Models (DPMs) represent an emerging domain of generative models that excel in generating diverse and high-quality images. However, most current training methods for DPMs often neglect the correlation between timesteps, limiting the model's performance in generating images effectively. Notably, we theoretically point out that this issue can be caused by the cumulative estima… ▽ More

    Submitted 31 January, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Oral presentation at AAAI 2024

  42. arXiv:2312.11818  [pdf, other

    cs.AI cs.LG stat.ML

    Root Cause Explanation of Outliers under Noisy Mechanisms

    Authors: Phuoc Nguyen, Truyen Tran, Sunil Gupta, Thin Nguyen, Svetha Venkatesh

    Abstract: Identifying root causes of anomalies in causal processes is vital across disciplines. Once identified, one can isolate the root causes and implement necessary measures to restore the normal operation. Causal processes are often modelled as graphs with entities being nodes and their paths/interconnections as edge. Existing work only consider the contribution of nodes in the generative process, thus… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted AAAI 2024

  43. arXiv:2312.06710  [pdf, other

    cs.LG

    Class-Prototype Conditional Diffusion Model with Gradient Projection for Continual Learning

    Authors: Khanh Doan, Quyen Tran, Tung Lam Tran, Tuan Nguyen, Dinh Phung, Trung Le

    Abstract: Mitigating catastrophic forgetting is a key hurdle in continual learning. Deep Generative Replay (GR) provides techniques focused on generating samples from prior tasks to enhance the model's memory capabilities using generative AI models ranging from Generative Adversarial Networks (GANs) to the more recent Diffusion Models (DMs). A major issue is the deterioration in the quality of generated dat… ▽ More

    Submitted 21 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  44. arXiv:2312.05187  [pdf, other

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  45. arXiv:2312.04651  [pdf, other

    cs.CV

    VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment

    Authors: Phong Tran, Egor Zakharov, Long-Nhat Ho, Anh Tuan Tran, Liwen Hu, Hao Li

    Abstract: We present a 3D-aware one-shot head reenactment method based on a fully volumetric neural disentanglement framework for source appearance and driver expressions. Our method is real-time and produces high-fidelity and view-consistent output, suitable for 3D teleconferencing systems based on holographic displays. Existing cutting-edge 3D-aware reenactment methods often use neural radiance fields or… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  46. arXiv:2312.03785  [pdf, ps, other

    cs.IR cs.AI

    Sports Recommender Systems: Overview and Research Issues

    Authors: Alexander Felfernig, Manfred Wundara, Thi Ngoc Trang Tran, Viet-Man Le, Sebastian Lubos, Seda Polat-Erdeniz

    Abstract: Sports recommender systems receive an increasing attention due to their potential of fostering healthy living, improving personal well-being, and increasing performances in sport. These systems support people in sports, for example, by the recommendation of healthy and performance boosting food items, the recommendation of training practices, talent and team recommendation, and the recommendation… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Article under review in the Journal of Intelligent Information Systems (Springer JIIS)

    ACM Class: I.2; J.3

  47. arXiv:2312.03419  [pdf, other

    cs.CR

    Synthesizing Physical Backdoor Datasets: An Automated Framework Leveraging Deep Generative Models

    Authors: Sze Jue Yang, Chinh D. La, Quang H. Nguyen, Kok-Seng Wong, Anh Tuan Tran, Chee Seng Chan, Khoa D. Doan

    Abstract: Backdoor attacks, representing an emerging threat to the integrity of deep neural networks, have garnered significant attention due to their ability to compromise deep learning systems clandestinely. While numerous backdoor attacks occur within the digital realm, their practical implementation in real-world prediction systems remains limited and vulnerable to disturbances in the physical world. Co… ▽ More

    Submitted 15 March, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  48. arXiv:2312.01970  [pdf, other

    cs.NI eess.SY

    CaRL: Cascade Reinforcement Learning with State Space Splitting for O-RAN based Traffic Steering

    Authors: Chuanneng Sun, Yu Zhou, Gueyoung Jung, Tuyen Xuan Tran, Dario Pompili

    Abstract: The Open Radio Access Network (O-RAN) architecture empowers intelligent and automated optimization of the RAN through applications deployed on the RAN Intelligent Controller (RIC) platform, enabling capabilities beyond what is achievable with traditional RAN solutions. Within this paradigm, Traffic Steering (TS) emerges as a pivotal RIC application that focuses on optimizing cell-level mobility se… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 14 pages, 10 figures

    ACM Class: C.2.3; I.2.8

  49. arXiv:2312.00656  [pdf, other

    cs.LG cs.AI stat.ML

    Simple Transferability Estimation for Regression Tasks

    Authors: Cuong N. Nguyen, Phong Tran, Lam Si Tung Ho, Vu Dinh, Anh T. Tran, Tal Hassner, Cuong V. Nguyen

    Abstract: We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel… ▽ More

    Submitted 3 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Paper published at The 39th Conference on Uncertainty in Artificial Intelligence (UAI) 2023

  50. arXiv:2312.00640  [pdf, ps, other

    math.OC cs.LG stat.ML

    One to beat them all: "RYU'' -- a unifying framework for the construction of safe balls

    Authors: Thu-Le Tran, Clément Elvira, Hong-Phuong Dang, Cédric Herzet

    Abstract: In this paper, we put forth a novel framework (named ``RYU'') for the construction of ``safe'' balls, i.e. regions that provably contain the dual solution of a target optimization problem. We concentrate on the standard setup where the cost function is the sum of two terms: a closed, proper, convex Lipschitz-smooth function and a closed, proper, convex function. The RYU framework is shown to gener… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 19 pages, 1 table