Skip to main content

Showing 1–50 of 113 results for author: NGuyen, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03144  [pdf, other

    cs.CV

    Venomancer: Towards Imperceptible and Target-on-Demand Backdoor Attacks in Federated Learning

    Authors: Son Nguyen, Thinh Nguyen, Khoa Doan, Kok-Seng Wong

    Abstract: Federated Learning (FL) is a distributed machine learning approach that maintains data privacy by training on decentralized data sources. Similar to centralized machine learning, FL is also susceptible to backdoor attacks. Most backdoor attacks in FL assume a predefined target class and require control over a large number of clients or knowledge of benign clients' information. Furthermore, they ar… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.00521  [pdf, other

    cs.LG cs.CV

    A Medical Low-Back Pain Physical Rehabilitation Dataset for Human Body Movement Analysis

    Authors: Sao Mai Nguyen, Maxime Devanne, Olivier Remy-Neris, Mathieu Lempereur, André Thepaut

    Abstract: While automatic monitoring and coaching of exercises are showing encouraging results in non-medical applications, they still have limitations such as errors and limited use contexts. To allow the development and assessment of physical rehabilitation by an intelligent tutoring system, we identify in this article four challenges to address and propose a medical dataset of clinical patients carrying… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    ACM Class: I.5.4; I.4.8

    Journal ref: IJCNN 2024

  3. arXiv:2406.09958  [pdf, other

    cs.LG

    H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent

    Authors: Son Nguyen, Lizhang Chen, Bo Liu, Qiang Liu

    Abstract: In this study, we introduce a novel adaptive optimizer, H-Fac, which incorporates a factorized approach to momentum and scaling parameters. Our algorithm demonstrates competitive performances on both ResNets and Vision Transformers, while achieving sublinear memory costs through the use of rank-1 parameterizations for moment estimators. We develop our algorithms based on principles derived from Ha… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 21 pages, 4 figures

  4. arXiv:2405.19206  [pdf, other

    stat.ML cs.LG

    Matrix Manifold Neural Networks++

    Authors: Xuan Son Nguyen, Shuo Yang, Aymeric Histace

    Abstract: Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and nature language processing tasks. One of the key factors that contribute to the success of these networks is that spherical and hyperbolic manifolds have the rich alge… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, **-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  6. arXiv:2404.12076  [pdf, other

    cs.AI cs.NE

    Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams

    Authors: Pivithuru Thejan Amarasinghe, Diem Pham, Binh Tran, Su Nguyen, Yuan Sun, Damminda Alahakoon

    Abstract: This paper introduces a novel approach, evolutionary multi-objective optimisation for fairness-aware self-adjusting memory classifiers, designed to enhance fairness in machine learning algorithms applied to data stream classification. With the growing concern over discrimination in algorithmic decision-making, particularly in dynamic data stream environments, there is a need for methods that ensur… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by GECCO 2024

  7. arXiv:2404.00204  [pdf

    cs.RO cs.LG eess.SY

    A PPO-based DRL Auto-Tuning Nonlinear PID Drone Controller for Robust Autonomous Flights

    Authors: Junyang Zhang, Cristian Emanuel Ocampo Rivera, Kyle Tyni, Steven Nguyen

    Abstract: This project aims to revolutionize drone flight control by implementing a nonlinear Deep Reinforcement Learning (DRL) agent as a replacement for traditional linear Proportional Integral Derivative (PID) controllers. The primary objective is to seamlessly transition drones between manual and autonomous modes, enhancing responsiveness and stability. We utilize the Proximal Policy Optimization (PPO)… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 9 pages, 12 figures

  8. arXiv:2403.02745  [pdf, other

    cs.AI cs.CL

    CURATRON: Complete Robust Preference Data for Robust Alignment of Large Language Models

    Authors: Son The Nguyen, Niranjan Uma Naresh, Theja Tulabandhula

    Abstract: This paper addresses the challenges of aligning large language models (LLMs) with human values via preference learning (PL), with a focus on the issues of incomplete and corrupted data in preference datasets. We propose a novel method for robustly and completely recalibrating values within these datasets to enhance LLMs resilience against the issues. In particular, we devise a guaranteed polynomia… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  9. arXiv:2402.05115  [pdf, other

    cs.RO cs.AI cs.LG

    Unsupervised Motion Retargeting for Human-Robot Imitation

    Authors: Louis Annabi, Ziqi Ma, Sao Mai Nguyen

    Abstract: This early-stage research work aims to improve online human-robot imitation by translating sequences of joint positions from the domain of human motions to a domain of motions achievable by a given robot, thus constrained by its embodiment. Leveraging the generalization capabilities of deep learning methods, we address this problem by proposing an encoder-decoder neural network model performing do… ▽ More

    Submitted 18 January, 2024; originally announced February 2024.

    Comments: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interactio, Mar 2024, Boulder (CO), United States

  10. arXiv:2402.03805  [pdf, other

    cs.SE

    Automated Description Generation for Software Patches

    Authors: Thanh Trong Vu, Tuan-Dung Bui, Thanh-Dat Do, Thu-Trang Nguyen, Hieu Dinh Vo, Son Nguyen

    Abstract: Software patches are pivotal in refining and evolving codebases, addressing bugs, vulnerabilities, and optimizations. Patch descriptions provide detailed accounts of changes, aiding comprehension and collaboration among developers. However, manual description creation poses challenges in terms of time consumption and variations in quality and detail. In this paper, we propose PATCHEXPLAINER, an ap… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Pre-print version of PATCHEXPLAINER

  11. arXiv:2402.02675  [pdf, other

    cs.LG cs.AI cs.CR

    Verifiable evaluations of machine learning models using zkSNARKs

    Authors: Tobin South, Alexander Camuto, Shrey Jain, Shayla Nguyen, Robert Mahari, Christian Paquin, Jason Morton, Alex 'Sandy' Pentland

    Abstract: In a world of increasing closed-source commercial machine learning models, model evaluations from developers must be taken at face value. These benchmark results-whether over task accuracy, bias evaluations, or safety checks-are traditionally impossible to verify by a model end-user without the costly or impossible process of re-performing the benchmark on black-box model outputs. This work presen… ▽ More

    Submitted 22 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    MSC Class: 68T01

  12. Prerequisite Structure Discovery in Intelligent Tutoring Systems

    Authors: Louis Annabi, Sao Mai Nguyen

    Abstract: This paper addresses the importance of Knowledge Structure (KS) and Knowledge Tracing (KT) in improving the recommendation of educational content in intelligent tutoring systems. The KS represents the relations between different Knowledge Components (KCs), while KT predicts a learner's success based on her past history. The contribution of this research includes proposing a KT model that incorpora… ▽ More

    Submitted 18 January, 2024; originally announced February 2024.

    Journal ref: 2023 IEEE International Conference on Development and Learning (ICDL), Nov 2023, Macau, China. pp.176-181

  13. arXiv:2402.00459  [pdf, other

    cs.NE cs.AI

    Genetic-based Constraint Programming for Resource Constrained Job Scheduling

    Authors: Su Nguyen, Dhananjay Thiruvady, Yuan Sun, Mengjie Zhang

    Abstract: Resource constrained job scheduling is a hard combinatorial optimisation problem that originates in the mining industry. Off-the-shelf solvers cannot solve this problem satisfactorily in reasonable timeframes, while other solution methods such as many evolutionary computation methods and matheuristics cannot guarantee optimality and require low-level customisation and specialised heuristics to be… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  14. arXiv:2401.17184  [pdf, other

    cs.MS math.NA

    Rigorous Error Analysis for Logarithmic Number Systems

    Authors: Thanh Son Nguyen, Alexey Solovyev, Ganesh Gopalakrishnan

    Abstract: Logarithmic Number Systems (LNS) hold considerable promise in hel** reduce the number of bits needed to represent a high dynamic range of real-numbers with finite precision, and also efficiently support multiplication and division. However, under LNS, addition and subtraction turn into non-linear functions that must be approximated - typically using precomputed table-based functions. Additionall… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 42 pages, 14 figures, 6 tables

    MSC Class: 65G50 ACM Class: G.1

  15. arXiv:2401.15232  [pdf, other

    cs.HC

    How Beginning Programmers and Code LLMs (Mis)read Each Other

    Authors: Sydney Nguyen, Hannah McLean Babe, Yangtian Zi, Arjun Guha, Carolyn Jane Anderson, Molly Q Feldman

    Abstract: Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluat… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Conditionally Accepted to CHI 2024

  16. arXiv:2401.09870  [pdf, other

    cs.LG cs.AI

    Reconciling Spatial and Temporal Abstractions for Goal Representation

    Authors: Mehdi Zadem, Sergio Mover, Sao Mai Nguyen

    Abstract: Goal representation affects the performance of Hierarchical Reinforcement Learning (HRL) algorithms by decomposing the complex learning problem into easier subtasks. Recent studies show that representations that preserve temporally abstract environment dynamics are successful in solving difficult problems and provide theoretical guarantees for optimality. These methods however cannot scale to task… ▽ More

    Submitted 30 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Journal ref: ICLR 2024

  17. arXiv:2401.05290  [pdf, other

    cs.RO cs.HC

    Analysis and Perspectives on the ANA Avatar XPRIZE Competition

    Authors: Kris Hauser, Eleanor Watson, Joonbum Bae, Josh Bankston, Sven Behnke, Bill Borgia, Manuel G. Catalano, Stefano Dafarra, Jan B. F. van Erp, Thomas Ferris, Jeremy Fishel, Guy Hoffman, Serena Ivaldi, Fumio Kanehiro, Abderrahmane Kheddar, Gaelle Lannuzel, Jacqueline Ford Morie, Patrick Naughton, Steve NGuyen, Paul Oh, Taskin Padir, Jim Pippine, Jaeheung Park, Daniele Pucci, Jean Vaz , et al. (3 additional authors not shown)

    Abstract: The ANA Avatar XPRIZE was a four-year competition to develop a robotic "avatar" system to allow a human operator to sense, communicate, and act in a remote environment as though physically present. The competition featured a unique requirement that judges would operate the avatars after less than one hour of training on the human-machine interfaces, and avatar systems were judged on both objective… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 26 pages, preprint of article appearing in International Journal of Social Robotics

  18. arXiv:2312.12441  [pdf, other

    cs.CV cs.LG

    DiffSpectralNet : Unveiling the Potential of Diffusion Models for Hyperspectral Image Classification

    Authors: Neetu Sigger, Tuan Thanh Nguyen, Gianluca Tozzi, Quoc-Tuan Vien, Sinh Van Nguyen

    Abstract: Hyperspectral images (HSI) have become popular for analysing remotely sensed images in multiple domain like agriculture, medical. However, existing models struggle with complex relationships and characteristics of spectral-spatial data due to the multi-band nature and data redundancy of hyperspectral data. To address this limitation, we propose a new network called DiffSpectralNet, which combines… ▽ More

    Submitted 29 October, 2023; originally announced December 2023.

    Comments: 18 pages

  19. arXiv:2312.06826  [pdf, other

    cs.AI cs.HC

    User Friendly and Adaptable Discriminative AI: Using the Lessons from the Success of LLMs and Image Generation Models

    Authors: Son The Nguyen, Theja Tulabandhula, Mary Beth Watson-Manheim

    Abstract: While there is significant interest in using generative AI tools as general-purpose models for specific ML applications, discriminative models are much more widely deployed currently. One of the key shortcomings of these discriminative AI tools that have been already deployed is that they are not adaptable and user-friendly compared to generative AI tools (e.g., GPT4, Stable Diffusion, Bard, etc.)… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  20. arXiv:2311.02872  [pdf, other

    cs.CV

    FocusTune: Tuning Visual Localization through Focus-Guided Sampling

    Authors: Son Tung Nguyen, Alejandro Fontan, Michael Milford, Tobias Fischer

    Abstract: We propose FocusTune, a focus-guided sampling technique to improve the performance of visual localization algorithms. FocusTune directs a scene coordinate regression model towards regions critical for 3D point triangulation by exploiting key geometric constraints. Specifically, rather than uniformly sampling points across the image for training the scene coordinate regression model, we instead re-… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  21. arXiv:2310.17949  [pdf, other

    cs.CV

    Instance Segmentation under Occlusions via Location-aware Copy-Paste Data Augmentation

    Authors: Son Nguyen, Mikel Lainsa, Hung Dao, Daeyoung Kim, Giang Nguyen

    Abstract: Occlusion is a long-standing problem in computer vision, particularly in instance segmentation. ACM MMSports 2023 DeepSportRadar has introduced a dataset that focuses on segmenting human subjects within a basketball context and a specialized evaluation metric for occlusion scenarios. Given the modest size of the dataset and the highly deformable nature of the objects to be segmented, this challeng… ▽ More

    Submitted 21 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  22. arXiv:2310.10875  [pdf, other

    cs.CV cs.CG

    Filling the Holes on 3D Heritage Object Surface based on Automatic Segmentation Algorithm

    Authors: Sinh Van Nguyen, Son Thanh Le, Minh Khai Tran, Le Thanh Sach

    Abstract: Reconstructing and processing the 3D objects are popular activities in the research field of computer graphics, image processing and computer vision. The 3D objects are processed based on the methods like geometric modeling, a branch of applied mathematics and computational geometry, or the machine learning algorithms based on image processing. The computation of geometrical objects includes proce… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 20 pages, 11 figures, 37 references

  23. arXiv:2310.03146  [pdf, ps, other

    cs.LG

    Fairness-enhancing mixed effects deep learning improves fairness on in- and out-of-distribution clustered (non-iid) data

    Authors: Adam Wang, Son Nguyen, Albert Montillo

    Abstract: Traditional deep learning (DL) suffers from two core problems. Firstly, it assumes training samples are independent and identically distributed. However, numerous real-world datasets group samples by shared measurements (e.g., study participants or cells), violating this assumption. In these scenarios, DL can show compromised performance, limited generalization, and interpretability issues, couple… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  24. arXiv:2309.13218  [pdf, other

    cs.AI

    AI-Copilot for Business Optimisation: A Framework and A Case Study in Production Scheduling

    Authors: Pivithuru Thejan Amarasinghe, Su Nguyen, Yuan Sun, Damminda Alahakoon

    Abstract: Business optimisation refers to the process of finding and implementing efficient and cost-effective means of operation to bring a competitive advantage for businesses. Synthesizing problem formulations is an integral part of business optimisation, which relies on human expertise to construct problem formulations using optimisation languages. Interestingly, with advancements in Large Language Mode… ▽ More

    Submitted 18 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  25. arXiv:2309.11069  [pdf, other

    cs.CV cs.AI

    Dynamic Tiling: A Model-Agnostic, Adaptive, Scalable, and Inference-Data-Centric Approach for Efficient and Accurate Small Object Detection

    Authors: Son The Nguyen, Theja Tulabandhula, Duy Nguyen

    Abstract: We introduce Dynamic Tiling, a model-agnostic, adaptive, and scalable approach for small object detection, anchored in our inference-data-centric philosophy. Dynamic Tiling starts with non-overlap** tiles for initial detections and utilizes dynamic overlap** rates along with a tile minimizer. This dual approach effectively resolves fragmented objects, improves detection accuracy, and minimizes… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  26. arXiv:2309.08225  [pdf, other

    cs.SE

    Silent Vulnerability-fixing Commit Identification Based on Graph Neural Networks

    Authors: Hieu Dinh Vo, Thanh Trong Vu, Son Nguyen

    Abstract: The growing dependence of software projects on external libraries has generated apprehensions regarding the security of these libraries because of concealed vulnerabilities. Handling these vulnerabilities presents difficulties due to the temporal delay between remediation and public exposure. Furthermore, a substantial fraction of open-source projects covertly address vulnerabilities without any f… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2304.08396, arXiv:2309.01971

  27. Goal Space Abstraction in Hierarchical Reinforcement Learning via Set-Based Reachability Analysis

    Authors: Mehdi Zadem, Sergio Mover, Sao Mai Nguyen

    Abstract: Open-ended learning benefits immensely from the use of symbolic methods for goal representation as they offer ways to structure knowledge for efficient and transferable learning. However, the existing Hierarchical Reinforcement Learning (HRL) approaches relying on symbolic reasoning are often limited as they require a manual goal representation. The challenge in autonomously discovering a symbolic… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    ACM Class: K.3.2

  28. arXiv:2309.07168  [pdf, other

    cs.LG cs.AI cs.FL cs.RO

    Goal Space Abstraction in Hierarchical Reinforcement Learning via Reachability Analysis

    Authors: Mehdi Zadem, Sergio Mover, Sao Mai Nguyen

    Abstract: Open-ended learning benefits immensely from the use of symbolic methods for goal representation as they offer ways to structure knowledge for efficient and transferable learning. However, the existing Hierarchical Reinforcement Learning (HRL) approaches relying on symbolic reasoning are often limited as they require a manual goal representation. The challenge in autonomously discovering a symbolic… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Journal ref: Intrinsically Motivated Open-ended Learning IMOL 2023, Sep 2023, Paris, France

  29. arXiv:2309.03219  [pdf, other

    cs.AI cs.CL cs.LG

    Companion Animal Disease Diagnostics based on Literal-aware Medical Knowledge Graph Representation Learning

    Authors: Van Thuy Hoang, Sang Thanh Nguyen, Sangmyeong Lee, Jooho Lee, Luong Vuong Nguyen, O-Joun Lee

    Abstract: Knowledge graph (KG) embedding has been used to benefit the diagnosis of animal diseases by analyzing electronic medical records (EMRs), such as notes and veterinary records. However, learning representations to capture entities and relations with literal information in KGs is challenging as the KGs show heterogeneous properties and various types of literal information. Meanwhile, the existing met… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: 16 pages

  30. arXiv:2309.01971  [pdf, other

    cs.SE

    VFFINDER: A Graph-based Approach for Automated Silent Vulnerability-Fix Identification

    Authors: Son Nguyen, Thanh Trong Vu, Hieu Dinh Vo

    Abstract: The increasing reliance of software projects on third-party libraries has raised concerns about the security of these libraries due to hidden vulnerabilities. Managing these vulnerabilities is challenging due to the time gap between fixes and public disclosures. Moreover, a significant portion of open-source projects silently fix vulnerabilities without disclosure, impacting vulnerability manageme… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE KSE 2023

  31. arXiv:2308.14182  [pdf, other

    cs.CL

    Generative AI for Business Strategy: Using Foundation Models to Create Business Strategy Tools

    Authors: Son The Nguyen, Theja Tulabandhula

    Abstract: Generative models (foundation models) such as LLMs (large language models) are having a large impact on multiple fields. In this work, we propose the use of such models for business decision making. In particular, we combine unstructured textual data sources (e.g., news data) with multiple foundation models (namely, GPT4, transformer-based Named Entity Recognition (NER) models and Entailment-based… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  32. arXiv:2308.11621  [pdf, other

    cs.NI cs.AI

    Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH

    Authors: Nghia T. Nguyen, Long Luu, Phuong L. Vo, Thi Thanh Sang Nguyen, Cuong T. Do, Ngoc-thanh Nguyen

    Abstract: Dynamic adaptive streaming over HTTP (DASH) has been widely used in video streaming recently. In DASH, the client downloads video chunks in order from a server. The rate adaptation function at the video client enhances the user's quality-of-experience (QoE) by choosing a suitable quality level for each video chunk to download based on the network condition. Today networks such as content delivery… ▽ More

    Submitted 25 July, 2023; originally announced August 2023.

    Comments: 19 pages

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: C.2.4; I.2.11

  33. arXiv:2306.14726  [pdf, other

    cs.SE

    Can An Old Fashioned Feature Extraction and A Light-weight Model Improve Vulnerability Type Identification Performance?

    Authors: Hieu Dinh Vo, Son Nguyen

    Abstract: Recent advances in automated vulnerability detection have achieved potential results in hel** developers determine vulnerable components. However, after detecting vulnerabilities, investigating to fix vulnerable code is a non-trivial task. In fact, the types of vulnerability, such as buffer overflow or memory corruption, could help developers quickly understand the nature of the weaknesses and l… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  34. arXiv:2306.06620  [pdf, other

    cs.SE cs.AI

    ARIST: An Effective API Argument Recommendation Approach

    Authors: Son Nguyen, Cuong Tran Manh, Kien T. Tran, Tan M. Nguyen, Thu-Trang Nguyen, Kien-Tuan Ngo, Hieu Dinh Vo

    Abstract: Learning and remembering to use APIs are difficult. Several techniques have been proposed to assist developers in using APIs. Most existing techniques focus on recommending the right API methods to call, but very few techniques focus on recommending API arguments. In this paper, we propose ARIST, a novel automated argument recommendation approach which suggests arguments by predicting developers'… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

  35. arXiv:2306.04556  [pdf, other

    cs.LG cs.HC cs.SE

    StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code

    Authors: Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson

    Abstract: Code LLMs are being rapidly deployed and there is evidence that they can make professional programmers more productive. Current benchmarks for code generation measure whether models generate correct programs given an expert prompt. In this paper, we present a new benchmark containing multiple prompts per problem, written by a specific population of non-expert prompters: beginning programmers. Stud… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  36. arXiv:2305.04560  [pdf, other

    stat.ML cs.LG

    Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach

    Authors: Xuan Son Nguyen, Shuo Yang

    Abstract: Matrix manifolds, such as manifolds of Symmetric Positive Definite (SPD) matrices and Grassmann manifolds, appear in many applications. Recently, by applying the theory of gyrogroups and gyrovector spaces that is a powerful framework for studying hyperbolic geometry, some works have attempted to build principled generalizations of Euclidean neural networks on matrix manifolds. However, due to the… ▽ More

    Submitted 5 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

  37. arXiv:2304.13195  [pdf, other

    cs.LG cs.SI

    Connector 0.5: A unified framework for graph representation learning

    Authors: Thanh Sang Nguyen, Jooho Lee, Van Thuy Hoang, O-Joun Lee

    Abstract: Graph representation learning models aim to represent the graph structure and its features into low-dimensional vectors in a latent space, which can benefit various downstream tasks, such as node classification and link prediction. Due to its powerful graph data modelling capabilities, various graph embedding models and libraries have been proposed to learn embeddings and help researchers ease con… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: An unified framework for graph representation learning

  38. arXiv:2304.08396  [pdf, other

    cs.SE

    Code-centric Learning-based Just-In-Time Vulnerability Detection

    Authors: Son Nguyen, Thu-Trang Nguyen, Thanh Trong Vu, Thanh-Dat Do, Kien-Tuan Ngo, Hieu Dinh Vo

    Abstract: Attacks against computer systems exploiting software vulnerabilities can cause substantial damage to the cyber-infrastructure of our modern society and economy. To minimize the consequences, it is vital to detect and fix vulnerabilities as soon as possible. Just-in-time vulnerability detection (JIT-VD) discovers vulnerability-prone ("dangerous") commits to prevent them from being merged into sourc… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  39. arXiv:2212.13218  [pdf, other

    cs.RO

    Multisensor Data Fusion for Reliable Obstacle Avoidance

    Authors: Thanh Nguyen Canh, Truong Son Nguyen, Cong Hoang Quach, Xiem HoangVan, Manh Duong Phung

    Abstract: In this work, we propose a new approach that combines data from multiple sensors for reliable obstacle avoidance. The sensors include two depth cameras and a LiDAR arranged so that they can capture the whole 3D area in front of the robot and a 2D slide around it. To fuse the data from these sensors, we first use an external camera as a reference to combine data from two depth cameras. A projection… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: In the 11th International Conference on Control, Automation and Information Sciences (ICCAIS 2022), Hanoi, Vietnam

  40. arXiv:2211.14492  [pdf, other

    cs.AI

    Enhancing Constraint Programming via Supervised Learning for Job Shop Scheduling

    Authors: Yuan Sun, Su Nguyen, Dhananjay Thiruvady, Xiaodong Li, Andreas T. Ernst, Uwe Aickelin

    Abstract: Constraint programming (CP) is a powerful technique for solving constraint satisfaction and optimization problems. In CP solvers, the variable ordering strategy used to select which variable to explore first in the solving process has a significant impact on solver effectiveness. To address this issue, we propose a novel variable ordering strategy based on supervised learning, which we evaluate in… ▽ More

    Submitted 12 April, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

  41. Speeding Up Recommender Systems Using Association Rules

    Authors: Eyad Kannout, Hung Son Nguyen, Marek Grzegorowski

    Abstract: Recommender systems are considered one of the most rapidly growing branches of Artificial Intelligence. The demand for finding more efficient techniques to generate recommendations becomes urgent. However, many recommendations become useless if there is a delay in generating and showing them to the user. Therefore, we focus on improving the speed of recommendation systems without impacting the acc… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 13 pages, 3 figures, 1 table, 14th Asian Conference on Intelligent Information and Database Systems (ACIIDS)

  42. Adaptive Population-based Simulated Annealing for Uncertain Resource Constrained Job Scheduling

    Authors: Dhananjay Thiruvady, Su Nguyen, Yuan Sun, Fatemeh Shiri, Nayyar Zaidi, Xiaodong Li

    Abstract: Transporting ore from mines to ports is of significant interest in mining supply chains. These operations are commonly associated with growing costs and a lack of resources. Large mining companies are interested in optimally allocating their resources to reduce operational costs. This problem has been previously investigated in the literature as resource constrained job scheduling (RCJS). While a… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Journal ref: International Journal of Production Research, 1 - 24 (2024)

  43. Pedestrian Emergency Braking in Ten Weeks

    Authors: Steven Nguyen, Zillur Rahman, Brendan Tan Morris

    Abstract: In the last decade, research in the field of autonomous vehicles has grown immensely, and there is a wealth of information available for researchers to rapidly establish an autonomous vehicle platform for basic maneuvers. In this paper, we design, implement, and test, in ten weeks, a PD approach to longitudinal control for pedestrian emergency braking. We also propose a lateral controller with a s… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted for publication, 6 pages

    Journal ref: 2022 IEEE International Conference on Vehicular Electronics and Safety (ICVES)

  44. arXiv:2210.11260  [pdf, other

    cs.NE math.OC

    An Efficient Merge Search Matheuristic for Maximising the Net Present Value of Project Schedules

    Authors: Dhananjay R. Thiruvady, Su Nguyen, Christian Blum, Andreas T. Ernst

    Abstract: Resource constrained project scheduling is an important combinatorial optimisation problem with many practical applications. With complex requirements such as precedence constraints, limited resources, and finance-based objectives, finding optimal solutions for large problem instances is very challenging even with well-customised meta-heuristics and matheuristics. To address this challenge, we pro… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  45. arXiv:2210.05187  [pdf, other

    cs.AI cs.LG cs.RO

    Broad-persistent Advice for Interactive Reinforcement Learning Scenarios

    Authors: Francisco Cruz, Adam Bignold, Hung Son Nguyen, Richard Dazeley, Peter Vamplew

    Abstract: The use of interactive advice in reinforcement learning scenarios allows for speeding up the learning process for autonomous agents. Current interactive reinforcement learning research has been limited to real-time interactions that offer relevant user advice to the current state only. Moreover, the information provided by each interaction is not retained and instead discarded by the agent after a… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Extended abstract accepted at the 2nd RL-CONFORM Workshop at IEEE/RSJ IROS'22 Conference. 5 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:2102.02441, arXiv:2110.08003

  46. arXiv:2208.08227  [pdf, other

    cs.LG cs.PL

    MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

    Authors: Federico Cassano, John Gouwar, Daniel Nguyen, Sydney Nguyen, Luna Phipps-Costin, Donald Pinckney, Ming-Ho Yee, Yangtian Zi, Carolyn Jane Anderson, Molly Q Feldman, Arjun Guha, Michael Greenberg, Abhinav Jangda

    Abstract: Large language models have demonstrated the ability to generate both natural language and programming language text. Such models open up the possibility of multi-language code generation: could code generation models generalize knowledge from one language to another? Although contemporary code generation models can generate semantically correct Python code, little is known about their abilities wi… ▽ More

    Submitted 19 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  47. arXiv:2207.05422  [pdf, other

    cs.CV

    Improving Domain Generalization by Learning without Forgetting: Application in Retail Checkout

    Authors: Thuy C. Nguyen, Nam LH. Phan, Son T. Nguyen

    Abstract: Designing an automatic checkout system for retail stores at the human level accuracy is challenging due to similar appearance products and their various poses. This paper addresses the problem by proposing a method with a two-stage pipeline. The first stage detects class-agnostic items, and the second one is dedicated to classify product categories. We also track the objects across video frames to… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  48. arXiv:2206.08513  [pdf

    cs.LG cs.DC cs.IR

    TLETA: Deep Transfer Learning and Integrated Cellular Knowledge for Estimated Time of Arrival Prediction

    Authors: Hieu Tran, Son Nguyen, I-Ling Yen, Farokh Bastani

    Abstract: Vehicle arrival time prediction has been studied widely. With the emergence of IoT devices and deep learning techniques, estimated time of arrival (ETA) has become a critical component in intelligent transportation systems. Though many tools exist for ETA, ETA for special vehicles, such as ambulances, fire engines, etc., is still challenging due to the limited amount of traffic data for special ve… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 8 pages, 3 figures, 3 tables. The 25th IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC 2022)

  49. arXiv:2204.12000  [pdf, other

    cs.CL cs.AI

    Estimating the Personality of White-Box Language Models

    Authors: Saketh Reddy Karra, Son The Nguyen, Theja Tulabandhula

    Abstract: Technology for open-ended language generation, a key application of artificial intelligence, has advanced to a great extent in recent years. Large-scale language models, which are trained on large corpora of text, are being used in a wide range of applications everywhere, from virtual assistants to conversational bots. While these language models output fluent text, existing research shows that th… ▽ More

    Submitted 10 May, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

  50. arXiv:2203.10791  [pdf

    cs.NI cs.DC cs.IR

    IoT Data Discovery: Routing Table and Summarization Techniques

    Authors: Hieu Tran, Son Nguyen, I-Ling Yen, Farokh Bastani

    Abstract: In this paper, we consider the IoT data discovery problem in very large and growing scale networks. Through analysis, examples, and experimental studies, we show the importance of peer-to-peer, unstructured routing for IoT data discovery and point out the space efficiency issue that has been overlooked in keyword-based routing algorithms in unstructured networks. Specifically, as the first in the… ▽ More

    Submitted 6 May, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: 18 pages, 23 figures, 1 table, 3 algorithms. arXiv admin note: substantial text overlap with arXiv:2107.09558