Search | arXiv e-print repository

arXiv:2405.19206 [pdf, other]

Matrix Manifold Neural Networks++

Authors: Xuan Son Nguyen, Shuo Yang, Aymeric Histace

Abstract: Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and nature language processing tasks. One of the key factors that contribute to the success of these networks is that spherical and hyperbolic manifolds have the rich alge… ▽ More Deep neural networks (DNNs) on Riemannian manifolds have garnered increasing interest in various applied areas. For instance, DNNs on spherical and hyperbolic manifolds have been designed to solve a wide range of computer vision and nature language processing tasks. One of the key factors that contribute to the success of these networks is that spherical and hyperbolic manifolds have the rich algebraic structures of gyrogroups and gyrovector spaces. This enables principled and effective generalizations of the most successful DNNs to these manifolds. Recently, some works have shown that many concepts in the theory of gyrogroups and gyrovector spaces can also be generalized to matrix manifolds such as Symmetric Positive Definite (SPD) and Grassmann manifolds. As a result, some building blocks for SPD and Grassmann neural networks, e.g., isometric models and multinomial logistic regression (MLR) can be derived in a way that is fully analogous to their spherical and hyperbolic counterparts. Building upon these works, we design fully-connected (FC) and convolutional layers for SPD neural networks. We also develop MLR on Symmetric Positive Semi-definite (SPSD) manifolds, and propose a method for performing backpropagation with the Grassmann logarithmic map in the projector perspective. We demonstrate the effectiveness of the proposed approach in the human action recognition and node classification tasks. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2305.04560 [pdf, other]

Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach

Authors: Xuan Son Nguyen, Shuo Yang

Abstract: Matrix manifolds, such as manifolds of Symmetric Positive Definite (SPD) matrices and Grassmann manifolds, appear in many applications. Recently, by applying the theory of gyrogroups and gyrovector spaces that is a powerful framework for studying hyperbolic geometry, some works have attempted to build principled generalizations of Euclidean neural networks on matrix manifolds. However, due to the… ▽ More Matrix manifolds, such as manifolds of Symmetric Positive Definite (SPD) matrices and Grassmann manifolds, appear in many applications. Recently, by applying the theory of gyrogroups and gyrovector spaces that is a powerful framework for studying hyperbolic geometry, some works have attempted to build principled generalizations of Euclidean neural networks on matrix manifolds. However, due to the lack of many concepts in gyrovector spaces for the considered manifolds, e.g., the inner product and gyroangles, techniques and mathematical tools provided by these works are still limited compared to those developed for studying hyperbolic geometry. In this paper, we generalize some notions in gyrovector spaces for SPD and Grassmann manifolds, and propose new models and layers for building neural networks on these manifolds. We show the effectiveness of our approach in two applications, i.e., human action recognition and knowledge graph completion. △ Less

Submitted 5 June, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

arXiv:2111.13089 [pdf, other]

GeomNet: A Neural Network Based on Riemannian Geometries of SPD Matrix Space and Cholesky Space for 3D Skeleton-Based Interaction Recognition

Authors: Xuan Son Nguyen

Abstract: In this paper, we propose a novel method for representation and classification of two-person interactions from 3D skeleton sequences. The key idea of our approach is to use Gaussian distributions to capture statistics on R n and those on the space of symmetric positive definite (SPD) matrices. The main challenge is how to parametrize those distributions. Towards this end, we develop methods for em… ▽ More In this paper, we propose a novel method for representation and classification of two-person interactions from 3D skeleton sequences. The key idea of our approach is to use Gaussian distributions to capture statistics on R n and those on the space of symmetric positive definite (SPD) matrices. The main challenge is how to parametrize those distributions. Towards this end, we develop methods for embedding Gaussian distributions in matrix groups based on the theory of Lie groups and Riemannian symmetric spaces. Our method relies on the Riemannian geometry of the underlying manifolds and has the advantage of encoding high-order statistics from 3D joint positions. We show that the proposed method achieves competitive results in two-person interaction recognition on three benchmarks for 3D human activity understanding. △ Less

Submitted 25 November, 2021; originally announced November 2021.

Comments: Accepted in ICCV 2021

arXiv:2102.09854 [pdf, other]

doi 10.3390/app11030975

Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy

Authors: Nicolas Duminy, Sao Mai Nguyen, Junshuai Zhu, Dominique Duhaut, Jerome Kerdreux

Abstract: In open-ended continuous environments, robots need to learn multiple parameterised control tasks in hierarchical reinforcement learning. We hypothesise that the most complex tasks can be learned more easily by transferring knowledge from simpler tasks, and faster by adapting the complexity of the actions to the task. We propose a task-oriented representation of complex actions, called procedures,… ▽ More In open-ended continuous environments, robots need to learn multiple parameterised control tasks in hierarchical reinforcement learning. We hypothesise that the most complex tasks can be learned more easily by transferring knowledge from simpler tasks, and faster by adapting the complexity of the actions to the task. We propose a task-oriented representation of complex actions, called procedures, to learn online task relationships and unbounded sequences of action primitives to control the different observables of the environment. Combining both goal-babbling with imitation learning, and active learning with transfer of knowledge based on intrinsic motivation, our algorithm self-organises its learning process. It chooses at any given time a task to focus on; and what, how, when and from whom to transfer knowledge. We show with a simulation and a real industrial robot arm, in cross-task and cross-learner transfer settings, that task composition is key to tackle highly complex tasks. Task decomposition is also efficiently transferred across different embodied learners and by active imitation, where the robot requests just a small amount of demonstrations and the adequate type of information. The robot learns and exploits task dependencies so as to learn tasks of every complexity. △ Less

Submitted 19 February, 2021; originally announced February 2021.

Journal ref: Applied Sciences, MDPI, 2021, 11 (3), pp.975

arXiv:2102.07927 [pdf, other]

Structured Dropout Variational Inference for Bayesian Neural Networks

Authors: Son Nguyen, Duong Nguyen, Khai Nguyen, Khoat Than, Hung Bui, Nhat Ho

Abstract: Approximate inference in Bayesian deep networks exhibits a dilemma of how to yield high fidelity posterior approximations while maintaining computational efficiency and scalability. We tackle this challenge by introducing a novel variational structured approximation inspired by the Bayesian interpretation of Dropout regularization. Concretely, we focus on the inflexibility of the factorized struct… ▽ More Approximate inference in Bayesian deep networks exhibits a dilemma of how to yield high fidelity posterior approximations while maintaining computational efficiency and scalability. We tackle this challenge by introducing a novel variational structured approximation inspired by the Bayesian interpretation of Dropout regularization. Concretely, we focus on the inflexibility of the factorized structure in Dropout posterior and then propose an improved method called Variational Structured Dropout (VSD). VSD employs an orthogonal transformation to learn a structured representation on the variational Gaussian noise with plausible complexity, and consequently induces statistical dependencies in the approximate posterior. Theoretically, VSD successfully addresses the pathologies of previous Variational Dropout methods and thus offers a standard Bayesian justification. We further show that VSD induces an adaptive regularization term with several desirable properties which contribute to better generalization. Finally, we conduct extensive experiments on standard benchmarks to demonstrate the effectiveness of VSD over state-of-the-art variational methods on predictive accuracy, uncertainty estimation, and out-of-distribution detection. △ Less

Submitted 28 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

Comments: 45 pages, 9 figures

arXiv:2010.01787 [pdf, other]

Improving Relational Regularized Autoencoders with Spherical Sliced Fused Gromov Wasserstein

Authors: Khai Nguyen, Son Nguyen, Nhat Ho, Tung Pham, Hung Bui

Abstract: Relational regularized autoencoder (RAE) is a framework to learn the distribution of data by minimizing a reconstruction loss together with a relational regularization on the latent space. A recent attempt to reduce the inner discrepancy between the prior and aggregated posterior distributions is to incorporate sliced fused Gromov-Wasserstein (SFG) between these distributions. That approach has a… ▽ More Relational regularized autoencoder (RAE) is a framework to learn the distribution of data by minimizing a reconstruction loss together with a relational regularization on the latent space. A recent attempt to reduce the inner discrepancy between the prior and aggregated posterior distributions is to incorporate sliced fused Gromov-Wasserstein (SFG) between these distributions. That approach has a weakness since it treats every slicing direction similarly, meanwhile several directions are not useful for the discriminative task. To improve the discrepancy and consequently the relational regularization, we propose a new relational discrepancy, named spherical sliced fused Gromov Wasserstein (SSFG), that can find an important area of projections characterized by a von Mises-Fisher distribution. Then, we introduce two variants of SSFG to improve its performance. The first variant, named mixture spherical sliced fused Gromov Wasserstein (MSSFG), replaces the vMF distribution by a mixture of von Mises-Fisher distributions to capture multiple important areas of directions that are far from each other. The second variant, named power spherical sliced fused Gromov Wasserstein (PSSFG), replaces the vMF distribution by a power spherical distribution to improve the sampling time in high dimension settings. We then apply the new discrepancies to the RAE framework to achieve its new variants. Finally, we conduct extensive experiments to show that the new proposed autoencoders have favorable performance in learning latent manifold structure, image generation, and reconstruction. △ Less

Submitted 5 October, 2020; originally announced October 2020.

Comments: 39 pages, 19 figures

arXiv:2007.05394 [pdf]

How An Automated Gesture Imitation Game Can Improve Social Interactions With Teenagers With ASD

Authors: Linda Nanan Vallée, Sao Mai Nguyen, Christophe Lohr, Ioannis Kanellos, Olivier Asseu

Abstract: With the outlook of improving communication and social abilities of people with ASD, we propose to extend the paradigm of robot-based imitation games to ASD teenagers. In this paper, we present an interaction scenario adapted to ASD teenagers, propose a computational architecture using the latest machine learning algorithm Openpose for human pose detection, and present the results of our basic tes… ▽ More With the outlook of improving communication and social abilities of people with ASD, we propose to extend the paradigm of robot-based imitation games to ASD teenagers. In this paper, we present an interaction scenario adapted to ASD teenagers, propose a computational architecture using the latest machine learning algorithm Openpose for human pose detection, and present the results of our basic testing of the scenario with human caregivers. These results are preliminary due to the number of session (1) and participants (4). They include a technical assessment of the performance of Openpose, as well as a preliminary user study to confirm our game scenario could elicit the expected response from subjects. △ Less

Submitted 10 July, 2020; originally announced July 2020.

Journal ref: IEEE ICRA Workshop on Social Robotics for Neurodevelopmental Disorders, Jun 2020, Paris, France

arXiv:2007.04604 [pdf]

doi 10.17654/EC023010001

Building an Automated Gesture Imitation Game for Teenagers with ASD

Authors: Linda Nanan Vallée, Christophe Lohr, Sao Mai Nguyen, Ioannis Kanellos, O. Asseu

Abstract: Autism spectrum disorder is a neurodevelopmental condition that includes issues with communication and social interactions. People with ASD also often have restricted interests and repetitive behaviors. In this paper we build preliminary bricks of an automated gesture imitation game that will aim at improving social interactions with teenagers with ASD. The structure of the game is presented, as w… ▽ More Autism spectrum disorder is a neurodevelopmental condition that includes issues with communication and social interactions. People with ASD also often have restricted interests and repetitive behaviors. In this paper we build preliminary bricks of an automated gesture imitation game that will aim at improving social interactions with teenagers with ASD. The structure of the game is presented, as well as support tools and methods for skeleton detection and imitation learning. The game shall later be implemented using an interactive robot. △ Less

Submitted 9 July, 2020; originally announced July 2020.

Journal ref: Far East Journal of Electronics and Communications, 2019, 22, pp.19 - 28

arXiv:1804.06819 [pdf, other]

doi 10.2478/s13230-013-0110-z

Active choice of teachers, learning strategies and goals for a socially guided intrinsic motivation learner

Authors: Sao Mai Nguyen, Pierre-Yves Oudeyer

Abstract: We present an active learning architecture that allows a robot to actively learn which data collection strategy is most efficient for acquiring motor skills to achieve multiple outcomes, and generalise over its experience to achieve new outcomes. The robot explores its environment both via interactive learning and goal-babbling. It learns at the same time when, who and what to actively imitate fro… ▽ More We present an active learning architecture that allows a robot to actively learn which data collection strategy is most efficient for acquiring motor skills to achieve multiple outcomes, and generalise over its experience to achieve new outcomes. The robot explores its environment both via interactive learning and goal-babbling. It learns at the same time when, who and what to actively imitate from several available teachers, and learns when not to use social guidance but use active goal-oriented self-exploration. This is formalised in the framework of life-long strategic learning. The proposed architecture, called Socially Guided Intrinsic Motivation with Active Choice of Teacher and Strategy (SGIM-ACTS), relies on hierarchical active decisions of what and how to learn driven by empirical evaluation of learning progress for each learning strategy. We illustrate with an experiment where a simulated robot learns to control its arm for realising two kinds of different outcomes. It has to choose actively and hierarchically at each learning episode: 1) what to learn: which outcome is most interesting to select as a goal to focus on for goal-directed exploration; 2) how to learn: which data collection strategy to use among self-exploration, mimicry and emulation; 3) once he has decided when and what to imitate by choosing mimicry or emulation, then he has to choose who to imitate, from a set of different teachers. We show that SGIM-ACTS learns significantly more efficiently than using single learning strategies, and coherently selects the best strategy with respect to the chosen outcome, taking advantage of the available teachers (with different levels of skills). △ Less

Submitted 18 April, 2018; originally announced April 2018.

Journal ref: Paladyn, Springer Verlag, 2012, 3 (3), pp.136-146

arXiv:1002.2749 [pdf]

Risk Quantification Associated with Wind Energy Intermittency in California

Authors: Sam O. George, H. Bola George, Scott V. Nguyen

Abstract: As compared to load demand, frequent wind energy intermittencies produce large short-term (sub 1-hr to 3-hr) deficits (and surpluses) in the energy supply. These intermittent deficits pose systemic and structural risks that will likely lead to energy deficits that have significant reliability implications for energy system operators and consumers. This work provides a toolset to help policy make… ▽ More As compared to load demand, frequent wind energy intermittencies produce large short-term (sub 1-hr to 3-hr) deficits (and surpluses) in the energy supply. These intermittent deficits pose systemic and structural risks that will likely lead to energy deficits that have significant reliability implications for energy system operators and consumers. This work provides a toolset to help policy makers quantify these first-order risks. The thinking methodology / framework shows that increasing wind energy penetration significantly increases the risk of loss in California. In addition, the work presents holistic risk tables as a general innovation to help decision makers quickly grasp the full impact of risk. △ Less

Submitted 13 February, 2010; originally announced February 2010.

Comments: 8 pages, 6 figures, 3 tables

arXiv:1002.2243 [pdf]

Effect of Wind Intermittency on the Electric Grid: Mitigating the Risk of Energy Deficits

Authors: Sam O. George, H. Bola George, Scott V. Nguyen

Abstract: Successful implementation of California's Renewable Portfolio Standard (RPS) mandating 33 percent renewable energy generation by 2020 requires inclusion of a robust strategy to mitigate increased risk of energy deficits (blackouts) due to short time-scale (sub 1 hour) intermittencies in renewable energy sources. Of these RPS sources, wind energy has the fastest growth rate--over 25% year-over-ye… ▽ More Successful implementation of California's Renewable Portfolio Standard (RPS) mandating 33 percent renewable energy generation by 2020 requires inclusion of a robust strategy to mitigate increased risk of energy deficits (blackouts) due to short time-scale (sub 1 hour) intermittencies in renewable energy sources. Of these RPS sources, wind energy has the fastest growth rate--over 25% year-over-year. If these growth trends continue, wind energy could make up 15 percent of California's energy portfolio by 2016 (wRPS15). However, the hour-to-hour variations in wind energy (speed) will create large hourly energy deficits that require installation of other, more predictable, compensation generation capacity and infrastructure. Compensating for the energy deficits of wRPS15 could potentially cost tens of billions in additional dollar-expenditure for fossil and / or nuclear generation capacity. There is a real possibility that carbon dioxide and other greenhouse gas (GHG) emission reductions will miss the California Assembly Bill 32 (CA AB 32) target by a wide margin once the wRPS15 compensation system is in place. This work presents a set of analytics tools that show the impact of short-term intermittencies to help policy makers understand and plan for wRPS15 integration. What are the right policy choices for RPS that include wind energy? △ Less

Submitted 10 February, 2010; originally announced February 2010.

Comments: 8 pages, 12 figures

Showing 1–11 of 11 results for author: Nguyen, S