-
Gameplay Filters: Safe Robot Walking through Adversarial Imagination
Authors:
Duy P. Nguyen,
Kai-Chieh Hsu,
Wenhao Yu,
Jie Tan,
Jaime F. Fisac
Abstract:
Ensuring the safe operation of legged robots in uncertain, novel environments is crucial to their widespread adoption. Despite recent advances in safety filters that can keep arbitrary task-driven policies from incurring safety failures, existing solutions for legged robot locomotion still rely on simplified dynamics and may fail when the robot is perturbed away from predefined stable gaits. This paper presents a general approach that leverages offline game-theoretic reinforcement learning to synthesize a highly robust safety filter for high-order nonlinear dynamics. This gameplay filter then maintains runtime safety by continually simulating adversarial futures and precluding task-driven actions that would cause it to lose future games (and thereby violate safety). Validated on a 36-dimensional quadruped robot locomotion task, the gameplay safety filter exhibits inherent robustness to the sim-to-real gap without manual tuning or heuristic designs. Physical experiments demonstrate the effectiveness of the gameplay safety filter under perturbations, such as tugging and unmodeled irregular terrains, while simulation studies shed light on how to trade off computation and conservativeness without compromising safety.
Submitted 31 May, 2024; v1 submitted 1 May, 2024;
originally announced May 2024.
-
FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning
Authors:
Duy Phuong Nguyen,
J. Pablo Munoz,
Ali Jannesari
Abstract:
In the rapidly evolving field of artificial intelligence, multimodal models, e.g., integrating vision and language into visual-language models (VLMs), have become pivotal for many applications, ranging from image captioning to multimodal search engines. Among these models, the Contrastive Language-Image Pre-training (CLIP) model has demonstrated remarkable performance in understanding and generating nuanced relationships between text and images. However, the conventional training of such models often requires centralized aggregation of vast datasets, posing significant privacy and data governance challenges. To address these concerns, this paper proposes a novel approach that leverages Federated Learning and parameter-efficient adapters, i.e., Low-Rank Adaptation (LoRA), to train VLMs. This methodology preserves data privacy by training models across decentralized data sources and ensures model adaptability and efficiency through LoRA's parameter-efficient fine-tuning. Our approach accelerates training time by up to 34.72 times and requires 2.47 times less memory usage than full fine-tuning.
Submitted 11 April, 2024;
originally announced April 2024.
-
TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars
Authors:
Quang Huy Che,
Dinh Phuc Nguyen,
Minh Quan Pham,
Duc Khai Lam
Abstract:
Semantic segmentation is a common task in autonomous driving to understand the surrounding environment. Driveable Area Segmentation and Lane Detection are particularly important for safe and efficient navigation on the road. However, original semantic segmentation models are computationally expensive and require high-end hardware, which is not feasible for embedded systems in autonomous vehicles. This paper proposes a lightweight model for driveable area and lane line segmentation. TwinLiteNet is designed to be computationally cheap yet achieves accurate and efficient segmentation results. We evaluate TwinLiteNet on the BDD100K dataset and compare it with modern models. Experimental results show that TwinLiteNet performs similarly to existing approaches while requiring significantly fewer computational resources. Specifically, TwinLiteNet achieves an mIoU score of 91.3% for the Drivable Area task and 31.08% IoU for the Lane Detection task with only 0.4 million parameters, and achieves 415 FPS on an RTX A5000 GPU. Furthermore, TwinLiteNet can run in real time on embedded devices with limited computing power, achieving 60 FPS on Jetson Xavier NX, making it an ideal solution for self-driving vehicles. Code is available at https://github.com/chequanghuy/TwinLiteNet.
Submitted 13 December, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Retrieval of material properties of monolayer transition-metal dichalcogenides from magnetoexciton energy spectra
Authors:
Duy-Nhat Ly,
Dai-Nam Le,
Duy-Anh P. Nguyen,
Ngoc-Tram D. Hoang,
Ngoc-Hung Phan,
Hoang-Minh L. Nguyen,
Van-Hoang Le
Abstract:
Reduced exciton mass, polarizability, and dielectric constant of the surrounding medium are essential properties for semiconducting materials, and they have been extracted recently from the magnetoexciton energies. However, the acceptable accuracy of the suggested method requires very high magnetic intensity. Therefore, in the present paper, we propose an alternative method of extracting these material properties from recently available experimental magnetoexciton s-state energies in monolayer transition-metal dichalcogenides (TMDCs). The method is based on the high sensitivity of exciton energies to the material parameters in the Rytova-Keldysh model. It allows us to vary the considered material parameters to get the best fit of the theoretical calculation to the experimental exciton energies for the $1s$, $2s$, and $3s$ states. This procedure gives values of the exciton reduced mass and $2D$ polarizability. Then, the experimental magnetoexciton spectra compared to the theoretical calculation also determine the average dielectric constant. Concrete applications are presented only for monolayers WSe$_2$ and WS$_2$ from the recently available experimental data; however, the presented approach is universal and can be applied to other monolayer TMDCs. The mentioned fitting procedure requires a fast and effective method of solving the Schrödinger equation of an exciton in monolayer TMDCs with a magnetic field. Therefore, we also develop such a method in this paper for highly accurate magnetoexciton energies.
Submitted 24 April, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
Authors:
Kai-Chieh Hsu,
Duy Phuong Nguyen,
Jaime Fernández Fisac
Abstract:
The deployment of robots in uncontrolled environments requires them to operate robustly under previously unseen scenarios, like irregular terrain and wind conditions. Unfortunately, while rigorous safety frameworks from robust optimal control theory scale poorly to high-dimensional nonlinear dynamics, control policies computed by more tractable "deep" methods lack guarantees and tend to exhibit little robustness to uncertain operating conditions. This work introduces a novel approach enabling scalable synthesis of robust safety-preserving controllers for robotic systems with general nonlinear dynamics subject to bounded modeling error by combining game-theoretic safety analysis with adversarial reinforcement learning in simulation. Following a soft actor-critic scheme, a safety-seeking fallback policy is co-trained with an adversarial "disturbance" agent that aims to invoke the worst-case realization of model error and training-to-deployment discrepancy allowed by the designer's uncertainty. While the learned control policy does not intrinsically guarantee safety, it is used to construct a real-time safety filter (or shield) with robust safety guarantees based on forward reachability rollouts. This shield can be used in conjunction with a safety-agnostic control policy, precluding any task-driven actions that could result in loss of safety. We evaluate our learning-based safety approach in a 5D race car simulator, compare the learned safety policy to the numerically obtained optimal solution, and empirically validate the robust safety guarantee of our proposed safety shield against worst-case model discrepancy.
Submitted 7 June, 2024; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Enhancing Heterogeneous Federated Learning with Knowledge Extraction and Multi-Model Fusion
Authors:
Duy Phuong Nguyen,
Sixing Yu,
J. Pablo Muñoz,
Ali Jannesari
Abstract:
Concerned with user data privacy, this paper presents a new federated learning (FL) method that trains machine learning models on edge devices without accessing sensitive data. Traditional FL methods, although privacy-protective, fail to manage model heterogeneity and incur high communication costs due to their reliance on aggregation methods. To address this limitation, we propose a resource-aware FL method that aggregates local knowledge from edge models and distills it into robust global knowledge through knowledge distillation. This method allows efficient multi-model knowledge fusion and the deployment of resource-aware models while preserving model heterogeneity. Our method improves communication cost and performance in heterogeneous data and models compared to existing FL algorithms. Notably, it reduces the communication cost of ResNet-32 by up to 50% and VGG-11 by up to 10$\times$ while delivering superior performance.
Submitted 30 September, 2023; v1 submitted 16 August, 2022;
originally announced August 2022.
-
Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees
Authors:
Kai-Chieh Hsu,
Allen Z. Ren,
Duy Phuong Nguyen,
Anirudha Majumdar,
Jaime F. Fisac
Abstract:
Safety is a critical component of autonomous systems and remains a challenge for learning-based policies to be utilized in the real world. In particular, policies learned using reinforcement learning often fail to generalize to novel environments due to unsafe behavior. In this paper, we propose Sim-to-Lab-to-Real to bridge the reality gap with a probabilistically guaranteed safety-aware policy distribution. To improve safety, we apply a dual policy setup where a performance policy is trained using the cumulative task reward and a backup (safety) policy is trained by solving the Safety Bellman Equation based on Hamilton-Jacobi (HJ) reachability analysis. In Sim-to-Lab transfer, we apply a supervisory control scheme to shield unsafe actions during exploration; in Lab-to-Real transfer, we leverage the Probably Approximately Correct (PAC)-Bayes framework to provide lower bounds on the expected performance and safety of policies in unseen environments. Additionally, inheriting from the HJ reachability analysis, the bound accounts for the expectation over the worst-case safety in each environment. We empirically study the proposed framework for ego-vision navigation in two types of indoor environments with varying degrees of photorealism. We also demonstrate strong generalization performance through hardware experiments in real indoor spaces with a quadrupedal robot. See https://sites.google.com/princeton.edu/sim-to-lab-to-real for supplementary material.
Submitted 1 April, 2023; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Parisian ruin with random deficit-dependent delays for spectrally negative Lévy processes
Authors:
Duy Phat Nguyen,
Konstantin Borovkov
Abstract:
We consider an interesting natural extension to the Parisian ruin problem under the assumption that the risk reserve dynamics are given by a spectrally negative Lévy process. The distinctive feature of this extension is that the distribution of the random implementation delay windows' lengths can depend on the deficit at the epochs when the risk reserve process turns negative, starting a new negative excursion. This includes the possibility of an immediate ruin when the deficit hits a certain subset. In this general setting, we derive a closed-form expression for the Parisian ruin probability and the joint Laplace transform of the Parisian ruin time and the deficit at ruin.
Submitted 4 November, 2021;
originally announced November 2021.
-
Back to the Future: Efficient, Time-Consistent Solutions in Reach-Avoid Games
Authors:
Dennis R. Anthony,
Duy P. Nguyen,
David Fridovich-Keil,
Jaime F. Fisac
Abstract:
We study the class of reach-avoid dynamic games in which multiple agents interact noncooperatively, and each wishes to satisfy a distinct target criterion while avoiding a failure criterion. Reach-avoid games are commonly used to express safety-critical optimal control problems found in mobile robot motion planning. Here, we focus on finding time-consistent solutions, in which future motion plans remain optimal even when a robot diverges from the plan early on due to, e.g., intrinsic dynamic uncertainty or extrinsic environment disturbances. Our main contribution is a computationally-efficient algorithm for multi-agent reach-avoid games which renders time-consistent solutions for all players. We demonstrate our approach in two- and three-player simulated driving scenarios, in which our method provides safe control strategies for all agents.
Submitted 2 March, 2022; v1 submitted 15 September, 2021;
originally announced September 2021.
-
PCA Reduced Gaussian Mixture Models with Applications in Superresolution
Authors:
Johannes Hertrich,
Dang Phoung Lan Nguyen,
Jean-Fancois Aujol,
Dominique Bernard,
Yannick Berthoumieu,
Abdellatif Saadaldin,
Gabriele Steidl
Abstract:
Despite the rapid development of computational hardware, the treatment of large and high dimensional data sets is still a challenging problem. This paper provides a twofold contribution to the topic. First, we propose a Gaussian Mixture Model in conjunction with a reduction of the dimensionality of the data in each component of the model by principal component analysis, called PCA-GMM. To learn the (low dimensional) parameters of the mixture model we propose an EM algorithm whose M-step requires the solution of constrained optimization problems. Fortunately, these constrained problems do not depend on the usually large number of samples and can be solved efficiently by an (inertial) proximal alternating linearized minimization algorithm. Second, we apply our PCA-GMM for the superresolution of 2D and 3D material images based on the approach of Sandeep and Jacob. Numerical results confirm the moderate influence of the dimensionality reduction on the overall superresolution result.
Submitted 6 May, 2021; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Machine Learning and Control Theory
Authors:
Alain Bensoussan,
Yiqun Li,
Dinh Phan Cao Nguyen,
Minh-Binh Tran,
Sheung Chi Phillip Yam,
Xiang Zhou
Abstract:
We survey in this article the connections between Machine Learning and Control Theory. Control Theory provides useful concepts and tools for Machine Learning. Conversely, Machine Learning can be used to solve large control problems. In the first part of the paper, we develop the connections between reinforcement learning and Markov Decision Processes, which are discrete-time control problems. In the second part, we review the concept of supervised learning and its relation to static optimization. Deep learning, which extends supervised learning, can be viewed as a control problem. In the third part, we present the links between stochastic gradient descent and mean-field theory. Conversely, in the fourth and fifth parts, we review machine learning approaches to stochastic control problems, and focus on the deterministic case to explain the numerical algorithms more easily.
Submitted 9 June, 2020;
originally announced June 2020.
-
SCALPEL3: a scalable open-source library for healthcare claims databases
Authors:
Emmanuel Bacry,
Stéphane Gaïffas,
Fanny Leroy,
Maryan Morel,
Dinh Phong Nguyen,
Youcef Sebiat,
Dian Sun
Abstract:
This article introduces SCALPEL3, a scalable open-source framework for studies involving Large Observational Databases (LODs). Its design eases medical observational studies thanks to abstractions allowing concept extraction, high-level cohort manipulation, and production of data formats compatible with machine learning libraries. SCALPEL3 has successfully been used on the SNDS database (see Tuppin et al. (2017)), a huge healthcare claims database that handles the reimbursement of almost all French citizens.
SCALPEL3 focuses on scalability, easy interactive analysis and helpers for data flow analysis to accelerate studies performed on LODs. It consists of three open-source libraries based on Apache Spark. SCALPEL-Flattening allows denormalization of the LOD (only SNDS for now) by joining tables sequentially in a big table. SCALPEL-Extraction provides fast concept extraction from a big table such as the one produced by SCALPEL-Flattening. Finally, SCALPEL-Analysis allows interactive cohort manipulations, monitoring statistics of cohort flows and building datasets to be used with machine learning libraries. The first two provide a Scala API while the last one provides a Python API that can be used in an interactive environment. Our code is available on GitHub.
SCALPEL3 successfully extracted complex concepts for studies such as Morel et al. (2017), or studies with 14.5 million patients observed over three years (corresponding to more than 15 billion healthcare events and roughly 15 terabytes of data), in less than 49 minutes on a small 15-node HDFS cluster. SCALPEL3 provides sharp interactive control of data processing through legible code, which helps to build studies with full reproducibility, leading to improved maintainability and audit of studies performed on LODs.
Submitted 26 August, 2020; v1 submitted 15 October, 2019;
originally announced October 2019.
-
Electronic continuum states and far infrared absorption of InAs/GaAs quantum dots
Authors:
Duc Phuong Nguyen,
Nicolas Regnault,
Robson Ferreira,
Gerald Bastard
Abstract:
The electronic continuum states of InAs/GaAs semiconductor quantum dots embedded in a GaAs/AlAs superlattice are theoretically investigated and the far infrared absorption spectra are calculated for a variety of structures and polarizations. The effect of a strong magnetic field applied parallel to the growth direction is also investigated. We predict that the flatness of the InAs/GaAs dots leads to a far infrared absorption which is almost insensitive to the magnetic field, in spite of the reorganization of the continuum into series of quasi-Landau states. We also predict that it is possible to design InAs/GaAs photoconductors which display very strong in-plane absorption.
Submitted 11 July, 2005; v1 submitted 21 February, 2005;
originally announced February 2005.
-
Bound-to-bound and bound-to-continuum optical transitions in combined quantum dot - superlattice systems
Authors:
F. F. Schrey,
L. Rebohle,
T. Mueller,
G. Strasser,
K. Unterrainer,
D. P. Nguyen,
N. Regnault,
R. Ferreira,
G. Bastard
Abstract:
By combining band gap engineering with the self-organized growth of quantum dots, we present a scheme for adjusting the mid-infrared absorption properties to desired energy transitions in quantum-dot-based photodetectors. Embedding the self-organized InAs quantum dots into an AlAs/GaAs superlattice enables us to tune the optical transition energy by changing the superlattice period as well as by changing the growth conditions of the dots. Using a one-band envelope function framework we are able, in a fully three-dimensional calculation, to predict the photocurrent spectra of these devices as well as their polarization properties. The calculations further predict a strong impact of the dots on the superlattice minibands. The impact of vertical dot alignment or misalignment on the absorption properties of this dot/superlattice structure is investigated. The observed photocurrent spectra of vertically coupled quantum dot stacks show very good agreement with the calculations. In these experiments, vertically coupled quantum dot stacks show the best performance in the desired photodetector application.
Submitted 19 July, 2004;
originally announced July 2004.
-
Alloy effects in GaInN/GaN heterostructures
Authors:
Duc Phuong Nguyen,
Nicolas Regnault,
Robson Ferreira,
Gerald Bastard
Abstract:
We show that the large band offsets between GaN and InN and the heavy carrier effective masses preclude the use of the Virtual Crystal Approximation to describe the electronic structure of Ga_(1-x)In_(x)N/GaN heterostructures while this approximation works very well for the Ga_(1-x)In_(x)As/GaAs heterostructures.
Submitted 6 November, 2003;
originally announced November 2003.