-
Kolmogorov-Smirnov GAN
Authors:
Maciej Falkiewicz,
Naoya Takeishi,
Alexandros Kalousis
Abstract:
We propose a novel deep generative model, the Kolmogorov-Smirnov Generative Adversarial Network (KSGAN). Unlike existing approaches, KSGAN formulates the learning process as a minimization of the Kolmogorov-Smirnov (KS) distance, generalized to handle multivariate distributions. This distance is calculated using the quantile function, which acts as the critic in the adversarial training process. W…
▽ More
We propose a novel deep generative model, the Kolmogorov-Smirnov Generative Adversarial Network (KSGAN). Unlike existing approaches, KSGAN formulates the learning process as a minimization of the Kolmogorov-Smirnov (KS) distance, generalized to handle multivariate distributions. This distance is calculated using the quantile function, which acts as the critic in the adversarial training process. We formally demonstrate that minimizing the KS distance leads to the trained approximate distribution aligning with the target distribution. We propose an efficient implementation and evaluate its effectiveness through experiments. The results show that KSGAN performs on par with existing adversarial methods, exhibiting stability during training, resistance to mode drop** and collapse, and tolerance to variations in hyperparameter settings. Additionally, we review the literature on the Generalized KS test and discuss the connections between KSGAN and existing adversarial generative models.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Numerical-experimental estimation of the deformability of human red blood cells from rheometrical data
Authors:
Naoki Takeishi,
Tomohiro Nishiyama,
Kodai Nagaishi,
Takeshi Nashima,
Masako Sugihara-Seki
Abstract:
he deformability of human red blood cells (RBCs), which comprise almost 99% of the cells in whole blood, is largely related not only to pathophysiological blood flow but also to the levels of intracellular compounds. Therefore, statistical estimates of the deformability of individual RBCs are of paramount importance in the clinical diagnosis of blood diseases. Although the micro-scale hydrodynamic…
▽ More
he deformability of human red blood cells (RBCs), which comprise almost 99% of the cells in whole blood, is largely related not only to pathophysiological blood flow but also to the levels of intracellular compounds. Therefore, statistical estimates of the deformability of individual RBCs are of paramount importance in the clinical diagnosis of blood diseases. Although the micro-scale hydrodynamic interactions of individual RBCs lead to non-Newtonian blood rheology, there is no established method to estimate individual RBC deformability from the rheological data of RBC suspensions, and the possibility of this estimation has not been proven. To address this issue, we conducted an integrated analysis of a model of the rheology of RBC suspensions, coupled with macro-rheological data of human RBCs suspended in plasma. Assuming a non-linear curve of the relative viscosity of the suspensions as a function of the cell volume fraction, the statistical average of the membrane shear elasticity was estimated for individual intact RBCs or hardened RBCs. Both estimated values reproduced well the experimentally observed shear-thinning non-Newtonian behavior in these suspensions. We hereby conclude that our complementary approach makes it possible to estimate the statistical average of individual RBC deformability from macro-rheological data obtained with usual rheometric tests.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Calibrating Neural Simulation-Based Inference with Differentiable Coverage Probability
Authors:
Maciej Falkiewicz,
Naoya Takeishi,
Imahn Shekhzadeh,
Antoine Wehenkel,
Arnaud Delaunoy,
Gilles Louppe,
Alexandros Kalousis
Abstract:
Bayesian inference allows expressing the uncertainty of posterior belief under a probabilistic model given prior information and the likelihood of the evidence. Predominantly, the likelihood function is only implicitly established by a simulator posing the need for simulation-based inference (SBI). However, the existing algorithms can yield overconfident posteriors (Hermans *et al.*, 2022) defeati…
▽ More
Bayesian inference allows expressing the uncertainty of posterior belief under a probabilistic model given prior information and the likelihood of the evidence. Predominantly, the likelihood function is only implicitly established by a simulator posing the need for simulation-based inference (SBI). However, the existing algorithms can yield overconfident posteriors (Hermans *et al.*, 2022) defeating the whole purpose of credibility if the uncertainty quantification is inaccurate. We propose to include a calibration term directly into the training objective of the neural model in selected amortized SBI techniques. By introducing a relaxation of the classical formulation of calibration error we enable end-to-end backpropagation. The proposed method is not tied to any particular neural model and brings moderate computational overhead compared to the profits it introduces. It is directly applicable to existing computational pipelines allowing reliable black-box posterior inference. We empirically show on six benchmark problems that the proposed method achieves competitive or better results in terms of coverage and expected posterior density than the previously existing approaches.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Mimicking Better by Matching the Approximate Action Distribution
Authors:
João A. Cândido Ramos,
Lionel Blondé,
Naoya Takeishi,
Alexandros Kalousis
Abstract:
In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations. MAAD utilizes a surrogate reward signal, which can be derived from various sources such as adversarial games, trajectory matching objectives, or optimal transport criteria. To compensate for the non-availability of expert actions, we rely on an inverse dynamics model that infer…
▽ More
In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations. MAAD utilizes a surrogate reward signal, which can be derived from various sources such as adversarial games, trajectory matching objectives, or optimal transport criteria. To compensate for the non-availability of expert actions, we rely on an inverse dynamics model that infers plausible actions distribution given the expert's state-state transitions; we regularize the imitator's policy by aligning it to the inferred action distribution. MAAD leads to significantly improved sample efficiency and stability. We demonstrate its effectiveness in a number of MuJoCo environments, both int the OpenAI Gym and the DeepMind Control Suite. We show that it requires considerable fewer interactions to achieve expert performance, outperforming current state-of-the-art on-policy methods. Remarkably, MAAD often stands out as the sole method capable of attaining expert performance levels, underscoring its simplicity and efficacy.
△ Less
Submitted 9 February, 2024; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations
Authors:
Keisuke Fujii,
Kazushi Tsutsui,
Atom Scott,
Hiroshi Nakahara,
Naoya Takeishi,
Yoshinobu Kawahara
Abstract:
Modeling of real-world biological multi-agents is a fundamental problem in various scientific and engineering fields. Reinforcement learning (RL) is a powerful framework to generate flexible and diverse behaviors in cyberspace; however, when modeling real-world biological multi-agents, there is a domain gap between behaviors in the source (i.e., real-world data) and the target (i.e., cyberspace fo…
▽ More
Modeling of real-world biological multi-agents is a fundamental problem in various scientific and engineering fields. Reinforcement learning (RL) is a powerful framework to generate flexible and diverse behaviors in cyberspace; however, when modeling real-world biological multi-agents, there is a domain gap between behaviors in the source (i.e., real-world data) and the target (i.e., cyberspace for RL), and the source environment parameters are usually unknown. In this paper, we propose a method for adaptive action supervision in RL from real-world demonstrations in multi-agent scenarios. We adopt an approach that combines RL and supervised learning by selecting actions of demonstrations in RL based on the minimum distance of dynamic time war** for utilizing the information of the unknown source dynamics. This approach can be easily applied to many existing neural network architectures and provide us with an RL model balanced between reproducibility as imitation and generalization ability to obtain rewards in cyberspace. In the experiments, using chase-and-escape and football tasks with the different dynamics between the unknown source and target environments, we show that our approach achieved a balance between the reproducibility and the generalization ability compared with the baselines. In particular, we used the tracking data of professional football players as expert demonstrations in football and show successful performances despite the larger gap between behaviors in the source and target environments than the chase-and-escape task.
△ Less
Submitted 19 December, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Enhanced axial migration of a deformable capsule in pulsatile channel flows
Authors:
Naoki Takeishi,
Marco Edoardo Rosti
Abstract:
We present numerical analysis of the lateral movement of a deformable spherical capsule in a pulsatile channel flow, with a Newtonian fluid in almost inertialess condition and at a small confinement ratio $a_0/R = 0.4$, where $R$ and $a$ are the channel and capsule radius. We find that the speed of the axial migration of the capsule can be accelerated by the flow pulsation at a specific frequency.…
▽ More
We present numerical analysis of the lateral movement of a deformable spherical capsule in a pulsatile channel flow, with a Newtonian fluid in almost inertialess condition and at a small confinement ratio $a_0/R = 0.4$, where $R$ and $a$ are the channel and capsule radius. We find that the speed of the axial migration of the capsule can be accelerated by the flow pulsation at a specific frequency. The migration speed increases with the oscillatory amplitude, while the most effective frequency remains basically unchanged and independent of the amplitude. Our numerical results form a fundamental basis for further studies on cellular flow mechanics, since pulsatile flows are physiologically relevant in human circulation, potentially affecting the dynamics of deformable particles and red blood cells (RBCs), and can also be potentially exploited in cell focusing techniques.
△ Less
Submitted 5 May, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Deep Grey-Box Modeling With Adaptive Data-Driven Models Toward Trustworthy Estimation of Theory-Driven Models
Authors:
Naoya Takeishi,
Alexandros Kalousis
Abstract:
The combination of deep neural nets and theory-driven models, which we call deep grey-box modeling, can be inherently interpretable to some extent thanks to the theory backbone. Deep grey-box models are usually learned with a regularized risk minimization to prevent a theory-driven part from being overwritten and ignored by a deep neural net. However, an estimation of the theory-driven part obtain…
▽ More
The combination of deep neural nets and theory-driven models, which we call deep grey-box modeling, can be inherently interpretable to some extent thanks to the theory backbone. Deep grey-box models are usually learned with a regularized risk minimization to prevent a theory-driven part from being overwritten and ignored by a deep neural net. However, an estimation of the theory-driven part obtained by uncritically optimizing a regularizer can hardly be trustworthy when we are not sure what regularizer is suitable for the given data, which may harm the interpretability. Toward a trustworthy estimation of the theory-driven part, we should analyze regularizers' behavior to compare different candidates and to justify a specific choice. In this paper, we present a framework that enables us to analyze a regularizer's behavior empirically with a slight change in the neural net's architecture and the training objective.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Numerical analysis of viscoelasticity of two-dimensional fluid membranes under oscillatory loadings
Authors:
Naoki Takeishi,
Masaya Santo,
Naoto Yokoyama,
Shigeo Wada
Abstract:
Biomembranes consisting of two opposing phospholipid monolayers, which comprise the so-called lipid bilayer, are largely responsible for the dual solid-fluid behavior of individual cells and viruses. Quantifying the mechanical characteristics of biomembrane, including the dynamics of their in-plane fluidity, can provide insight not only into active or passive cell behaviors but also into vesicle d…
▽ More
Biomembranes consisting of two opposing phospholipid monolayers, which comprise the so-called lipid bilayer, are largely responsible for the dual solid-fluid behavior of individual cells and viruses. Quantifying the mechanical characteristics of biomembrane, including the dynamics of their in-plane fluidity, can provide insight not only into active or passive cell behaviors but also into vesicle design for drug delivery systems. Despite numerous studies on the mechanics of biomembranes, their dynamical viscoelastic properties have not yet been fully described. We thus quantify their viscoelasticity based on a two-dimensional (2D) fluid membrane model, and investigate this viscoelasticity under small amplitude oscillatory loadings in micron-scale membrane area. We use hydrodynamic equations of bilayer membranes, obtained by Onsager's variational principle, wherein the fluid membrane is assumed to be an almost planar bilayer membrane. Simulations are performed for a wide range of oscillatory frequencies $f$ and membrane tensions. Our numerical results show that as frequencies increase, membrane characteristics shift from an elastic-dominant to viscous-dominant state. Furthermore, such state transitions obtained with a 1-$μ$m-wide loading profile appear with frequencies between $O(f) = 10^1-10^2$ Hz, and almost independently of surface tensions. We discuss the formation mechanism of the viscous- or elastic-dominant transition based on relaxation rates that correspond to the eigenvalues of the dynamical matrix in the governing equations.
△ Less
Submitted 29 March, 2024; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Inertial migration of red blood cells under a Newtonian fluid in a circular channel
Authors:
Naoki Takeishi,
Hiroshi Yamashita,
Toshihiro Omori,
Naoto Yokoyama,
Shigeo Wada,
Masako Sugihara-Seki
Abstract:
We present a numerical analysis of the lateral movement and equilibrium radial positions of red blood cells (RBCs) with major diameter of 8 $μ$m under a Newtonian fluid in a circular channel with 50-$μ$m diameter. Each RBC, modelled as a biconcave capsule whose membrane satisfies strain-hardening characteristics, is simulated for different Reynolds numbers $Re$ and capillary numbers $Ca$, the latt…
▽ More
We present a numerical analysis of the lateral movement and equilibrium radial positions of red blood cells (RBCs) with major diameter of 8 $μ$m under a Newtonian fluid in a circular channel with 50-$μ$m diameter. Each RBC, modelled as a biconcave capsule whose membrane satisfies strain-hardening characteristics, is simulated for different Reynolds numbers $Re$ and capillary numbers $Ca$, the latter of which indicate the ratio of the fluid viscous force to the membrane elastic force. The effects of initial orientation angles and positions on the equilibrium radial position of an RBC centroid are also investigated. The numerical results show that depending on their initial orientations, RBCs have bistable flow modes, so-called rolling and tumbling motions. Most RBCs have a rolling motion. These stable modes are accompanied by different equilibrium radial positions, where tumbling RBCs are further away from the channel axis than rolling ones. The inertial migration of RBCs is achieved by alternating orientation angles, which are primarily affected by the initial orientation angles. Then the RBCs assume the aforementioned bistable modes during the migration, followed by further migration to the equilibrium radial position at much longer time periods. The power (or energy dissipation) associated with membrane deformations is introduced to quantify the state of membrane loads. The energy expenditures rely on stable flow modes, the equilibrium radial position of RBC centroids, and the viscosity ratio between the internal and external fluids.
△ Less
Submitted 24 September, 2022;
originally announced September 2022.
-
Viscoelasticity of suspension of red blood cells under oscillatory shear flow
Authors:
Naoki Takeishi,
Marco Edoardo Rosti,
Naoto Yokoyama,
Luca Brandt
Abstract:
We present a numerical analysis of the rheology of a suspension of red blood cells (RBCs) for different volume fractions in a wall-bounded, effectively inertialess, small amplitude oscillatory shear (SAOS) flow for a wide range of applied frequencies. The RBCs are modeled as biconcave capsules, whose membrane is an isotropic and hyperelastic material following the Skalak constitutive law. The freq…
▽ More
We present a numerical analysis of the rheology of a suspension of red blood cells (RBCs) for different volume fractions in a wall-bounded, effectively inertialess, small amplitude oscillatory shear (SAOS) flow for a wide range of applied frequencies. The RBCs are modeled as biconcave capsules, whose membrane is an isotropic and hyperelastic material following the Skalak constitutive law. The frequency-dependent viscoelasticity in the bulk suspension is quantified by the complex viscosity, defined by the amplitude of the particle shear stress and the phase difference between the stress and shear. SAOS flow basically impedes the deformation of individual RBCs as well as the magnitude of fluid-membrane interactions, resulting in a lower specific viscosity and first and second normal stress differences than in steady shear flow. Although it is known that the RBC deformation alone is sufficient to give rise to shear-thinning, our results show that the complex viscosity weakly depends on the frequency-modulated deformations or orientations of individual RBCs, but rather depends on combinations of the frequency-dependent amplitude and phase difference. The effect of the viscosity ratio between the cytoplasm and plasma and of the capillary number are also assessed.
△ Less
Submitted 17 March, 2024; v1 submitted 19 July, 2022;
originally announced July 2022.
-
Estimating counterfactual treatment outcomes over time in complex multiagent scenarios
Authors:
Keisuke Fujii,
Koh Takeuchi,
Atsushi Kuribayashi,
Naoya Takeishi,
Yoshinobu Kawahara,
Kazuya Takeda
Abstract:
Evaluation of intervention in a multiagent system, e.g., when humans should intervene in autonomous driving systems and when a player should pass to teammates for a good shot, is challenging in various engineering and scientific fields. Estimating the individual treatment effect (ITE) using counterfactual long-term prediction is practical to evaluate such interventions. However, most of the conven…
▽ More
Evaluation of intervention in a multiagent system, e.g., when humans should intervene in autonomous driving systems and when a player should pass to teammates for a good shot, is challenging in various engineering and scientific fields. Estimating the individual treatment effect (ITE) using counterfactual long-term prediction is practical to evaluate such interventions. However, most of the conventional frameworks did not consider the time-varying complex structure of multiagent relationships and covariate counterfactual prediction. This may lead to erroneous assessments of ITE and difficulty in interpretation. Here we propose an interpretable, counterfactual recurrent network in multiagent systems to estimate the effect of the intervention. Our model leverages graph variational recurrent neural networks and theory-based computation with domain knowledge for the ITE estimation framework based on long-term prediction of multiagent covariates and outcomes, which can confirm the circumstances under which the intervention is effective. On simulated models of an automated vehicle and biological agents with time-varying confounders, we show that our methods achieved lower estimation errors in counterfactual covariates and the most effective treatment timing than the baselines. Furthermore, using real basketball data, our methods performed realistic counterfactual predictions and evaluated the counterfactual passes in shot scenarios.
△ Less
Submitted 17 February, 2024; v1 submitted 4 June, 2022;
originally announced June 2022.
-
Asteroid Flyby Cycler Trajectory Design Using Deep Neural Networks
Authors:
Naoya Ozaki,
Kanta Yanagida,
Takuya Chikazawa,
Nishanth Pushparaj,
Naoya Takeishi,
Ryuki Hyodo
Abstract:
Asteroid exploration has been attracting more attention in recent years. Nevertheless, we have just visited tens of asteroids while we have discovered more than one million bodies. As our current observation and knowledge should be biased, it is essential to explore multiple asteroids directly to better understand the remains of planetary building materials. One of the mission design solutions is…
▽ More
Asteroid exploration has been attracting more attention in recent years. Nevertheless, we have just visited tens of asteroids while we have discovered more than one million bodies. As our current observation and knowledge should be biased, it is essential to explore multiple asteroids directly to better understand the remains of planetary building materials. One of the mission design solutions is utilizing asteroid flyby cycler trajectories with multiple Earth gravity assists. An asteroid flyby cycler trajectory design problem is a subclass of global trajectory optimization problems with multiple flybys, involving a trajectory optimization problem for a given flyby sequence and a combinatorial optimization problem to decide the sequence of the flybys. As the number of flyby bodies grows, the computation time of this optimization problem expands maliciously. This paper presents a new method to design asteroid flyby cycler trajectories utilizing a surrogate model constructed by deep neural networks approximating trajectory optimization results. Since one of the bottlenecks of machine learning approaches is the computation time to generate massive trajectory databases, we propose an efficient database generation strategy by introducing pseudo-asteroids satisfying the Karush-Kuhn-Tucker conditions. The numerical result applied to JAXA's DESTINY+ mission shows that the proposed method is practically applicable to space mission design and can significantly reduce the computational time for searching asteroid flyby sequences.
△ Less
Submitted 11 July, 2022; v1 submitted 23 November, 2021;
originally announced November 2021.
-
Learning interaction rules from multi-animal trajectories via augmented behavioral models
Authors:
Keisuke Fujii,
Naoya Takeishi,
Kazushi Tsutsui,
Emyo Fujioka,
Nozomi Nishiumi,
Ryoya Tanaka,
Mika Fukushiro,
Kaoru Ide,
Hiroyoshi Kohno,
Ken Yoda,
Susumu Takahashi,
Shizuko Hiryu,
Yoshinobu Kawahara
Abstract:
Extracting the interaction rules of biological agents from movement sequences pose challenges in various domains. Granger causality is a practical framework for analyzing the interactions from observed time-series data; however, this framework ignores the structures and assumptions of the generative process in animal behaviors, which may lead to interpretational problems and sometimes erroneous as…
▽ More
Extracting the interaction rules of biological agents from movement sequences pose challenges in various domains. Granger causality is a practical framework for analyzing the interactions from observed time-series data; however, this framework ignores the structures and assumptions of the generative process in animal behaviors, which may lead to interpretational problems and sometimes erroneous assessments of causality. In this paper, we propose a new framework for learning Granger causality from multi-animal trajectories via augmented theory-based behavioral models with interpretable data-driven models. We adopt an approach for augmenting incomplete multi-agent behavioral models described by time-varying dynamical systems with neural networks. For efficient and interpretable learning, our model leverages theory-based architectures separating navigation and motion processes, and the theory-guided regularization for reliable behavioral modeling. This can provide interpretable signs of Granger-causal effects over time, i.e., when specific others cause the approach or separation. In experiments using synthetic datasets, our method achieved better performance than various baselines. We then analyzed multi-animal datasets of mice, flies, birds, and bats, which verified our method and obtained novel biological insights.
△ Less
Submitted 25 October, 2021; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Physics-Integrated Variational Autoencoders for Robust and Interpretable Generative Modeling
Authors:
Naoya Takeishi,
Alexandros Kalousis
Abstract:
Integrating physics models within machine learning models holds considerable promise toward learning robust models with improved interpretability and abilities to extrapolate. In this work, we focus on the integration of incomplete physics models into deep generative models. In particular, we introduce an architecture of variational autoencoders (VAEs) in which a part of the latent space is ground…
▽ More
Integrating physics models within machine learning models holds considerable promise toward learning robust models with improved interpretability and abilities to extrapolate. In this work, we focus on the integration of incomplete physics models into deep generative models. In particular, we introduce an architecture of variational autoencoders (VAEs) in which a part of the latent space is grounded by physics. A key technical challenge is to strike a balance between the incomplete physics and trainable components such as neural networks for ensuring that the physics part is used in a meaningful manner. To this end, we propose a regularized learning method that controls the effect of the trainable components and preserves the semantics of the physics-based latent variables as intended. We not only demonstrate generative performance improvements over a set of synthetic and real-world datasets, but we also show that we learn robust models that can consistently extrapolate beyond the training distribution in a meaningful manner. Moreover, we show that we can control the generative process in an interpretable manner.
△ Less
Submitted 26 October, 2021; v1 submitted 25 February, 2021;
originally announced February 2021.
-
Discriminant Dynamic Mode Decomposition for Labeled Spatio-Temporal Data Collections
Authors:
Naoya Takeishi,
Keisuke Fujii,
Koh Takeuchi,
Yoshinobu Kawahara
Abstract:
Extracting coherent patterns is one of the standard approaches towards understanding spatio-temporal data. Dynamic mode decomposition (DMD) is a powerful tool for extracting coherent patterns, but the original DMD and most of its variants do not consider label information, which is often available as side information of spatio-temporal data. In this work, we propose a new method for extracting dis…
▽ More
Extracting coherent patterns is one of the standard approaches towards understanding spatio-temporal data. Dynamic mode decomposition (DMD) is a powerful tool for extracting coherent patterns, but the original DMD and most of its variants do not consider label information, which is often available as side information of spatio-temporal data. In this work, we propose a new method for extracting distinctive coherent patterns from labeled spatio-temporal data collections, such that they contribute to major differences in a labeled set of dynamics. We achieve such pattern extraction by incorporating discriminant analysis into DMD. To this end, we define a kernel function on subspaces spanned by sets of dynamic modes and develop an objective to take both reconstruction goodness as DMD and class-separation goodness as discriminant analysis into account. We illustrate our method using a synthetic dataset and several real-world datasets. The proposed method can be a useful tool for exploratory data analysis for understanding spatio-temporal data.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.
-
Decentralized policy learning with partial observation and mechanical constraints for multiperson modeling
Authors:
Keisuke Fujii,
Naoya Takeishi,
Yoshinobu Kawahara,
Kazuya Takeda
Abstract:
Extracting the rules of real-world multi-agent behaviors is a current challenge in various scientific and engineering fields. Biological agents independently have limited observation and mechanical constraints; however, most of the conventional data-driven models ignore such assumptions, resulting in lack of biological plausibility and model interpretability for behavioral analyses. Here we propos…
▽ More
Extracting the rules of real-world multi-agent behaviors is a current challenge in various scientific and engineering fields. Biological agents independently have limited observation and mechanical constraints; however, most of the conventional data-driven models ignore such assumptions, resulting in lack of biological plausibility and model interpretability for behavioral analyses. Here we propose sequential generative models with partial observation and mechanical constraints in a decentralized manner, which can model agents' cognition and body dynamics, and predict biologically plausible behaviors. We formulate this as a decentralized multi-agent imitation-learning problem, leveraging binary partial observation and decentralized policy models based on hierarchical variational recurrent neural networks with physical and biomechanical penalties. Using real-world basketball and soccer datasets, we show the effectiveness of our method in terms of the constraint violations, long-term trajectory prediction, and partial observation. Our approach can be used as a multi-agent simulator to generate realistic trajectories using real-world data.
△ Less
Submitted 1 December, 2023; v1 submitted 6 July, 2020;
originally announced July 2020.
-
Learning Dynamics Models with Stable Invariant Sets
Authors:
Naoya Takeishi,
Yoshinobu Kawahara
Abstract:
Invariance and stability are essential notions in dynamical systems study, and thus it is of great interest to learn a dynamics model with a stable invariant set. However, existing methods can only handle the stability of an equilibrium. In this paper, we propose a method to ensure that a dynamics model has a stable invariant set of general classes such as limit cycles and line attractors. We star…
▽ More
Invariance and stability are essential notions in dynamical systems study, and thus it is of great interest to learn a dynamics model with a stable invariant set. However, existing methods can only handle the stability of an equilibrium. In this paper, we propose a method to ensure that a dynamics model has a stable invariant set of general classes such as limit cycles and line attractors. We start with the approach by Manek and Kolter (2019), where they use a learnable Lyapunov function to make a model stable with regard to an equilibrium. We generalize it for general sets by introducing projection onto them. To resolve the difficulty of specifying a to-be stable invariant set analytically, we propose defining such a set as a primitive shape (e.g., sphere) in a latent space and learning the transformation between the original and latent spaces. It enables us to compute the projection easily, and at the same time, we can maintain the model's flexibility using various invertible neural networks for the transformation. We present experimental results that show the validity of the proposed method and the usefulness for long-term prediction.
△ Less
Submitted 29 October, 2020; v1 submitted 16 June, 2020;
originally announced June 2020.
-
An MCMC Method for Uncertainty Set Generation via Operator-Theoretic Metrics
Authors:
Anand Srinivasan,
Naoya Takeishi
Abstract:
Model uncertainty sets are required in many robust optimization problems, such as robust control and prediction with uncertainty, but there is no definite methodology to generate uncertainty sets for nonlinear dynamical systems. In this paper, we propose a method for model uncertainty set generation via Markov chain Monte Carlo. The proposed method samples from distributions over dynamical systems…
▽ More
Model uncertainty sets are required in many robust optimization problems, such as robust control and prediction with uncertainty, but there is no definite methodology to generate uncertainty sets for nonlinear dynamical systems. In this paper, we propose a method for model uncertainty set generation via Markov chain Monte Carlo. The proposed method samples from distributions over dynamical systems via metrics over transfer operators and is applicable to general nonlinear systems. We adapt Hamiltonian Monte Carlo for sampling high-dimensional transfer operators in a computationally efficient manner. We present numerical examples to validate the proposed method for uncertainty set generation.
△ Less
Submitted 5 September, 2020; v1 submitted 6 June, 2020;
originally announced June 2020.
-
A Characteristic Function for Shapley-Value-Based Attribution of Anomaly Scores
Authors:
Naoya Takeishi,
Yoshinobu Kawahara
Abstract:
In anomaly detection, the degree of irregularity is often summarized as a real-valued anomaly score. We address the problem of attributing such anomaly scores to input features for interpreting the results of anomaly detection. We particularly investigate the use of the Shapley value for attributing anomaly scores of semi-supervised detection methods. We propose a characteristic function specifica…
▽ More
In anomaly detection, the degree of irregularity is often summarized as a real-valued anomaly score. We address the problem of attributing such anomaly scores to input features for interpreting the results of anomaly detection. We particularly investigate the use of the Shapley value for attributing anomaly scores of semi-supervised detection methods. We propose a characteristic function specifically designed for attributing anomaly scores. The idea is to approximate the absence of some features by locally minimizing the anomaly score with regard to the to-be-absent features. We examine the applicability of the proposed characteristic function and other general approaches for interpreting anomaly scores on multiple datasets and multiple anomaly detection methods. The results indicate the potential utility of the attribution methods including the proposed one.
△ Less
Submitted 16 February, 2023; v1 submitted 9 April, 2020;
originally announced April 2020.
-
Shapley Values of Reconstruction Errors of PCA for Explaining Anomaly Detection
Authors:
Naoya Takeishi
Abstract:
We present a method to compute the Shapley values of reconstruction errors of principal component analysis (PCA), which is particularly useful in explaining the results of anomaly detection based on PCA. Because features are usually correlated when PCA-based anomaly detection is applied, care must be taken in computing a value function for the Shapley values. We utilize the probabilistic view of P…
▽ More
We present a method to compute the Shapley values of reconstruction errors of principal component analysis (PCA), which is particularly useful in explaining the results of anomaly detection based on PCA. Because features are usually correlated when PCA-based anomaly detection is applied, care must be taken in computing a value function for the Shapley values. We utilize the probabilistic view of PCA, particularly its conditional distribution, to exactly compute a value function for the Shapely values. We also present numerical examples, which imply that the Shapley values are advantageous for explaining detected anomalies than raw reconstruction errors of each feature.
△ Less
Submitted 22 February, 2020; v1 submitted 8 September, 2019;
originally announced September 2019.
-
Physically-interpretable classification of biological network dynamics for complex collective motions
Authors:
Keisuke Fujii,
Naoya Takeishi,
Motokazu Hojo,
Yuki Inaba,
Yoshinobu Kawahara
Abstract:
Understanding biological network dynamics is a fundamental issue in various scientific and engineering fields. Network theory is capable of revealing the relationship between elements and their propagation; however, for complex collective motions, the network properties often transiently and complexly change. A fundamental question addressed here pertains to the classification of collective motion…
▽ More
Understanding biological network dynamics is a fundamental issue in various scientific and engineering fields. Network theory is capable of revealing the relationship between elements and their propagation; however, for complex collective motions, the network properties often transiently and complexly change. A fundamental question addressed here pertains to the classification of collective motion network based on physically-interpretable dynamical properties. Here we apply a data-driven spectral analysis called graph dynamic mode decomposition, which obtains the dynamical properties for collective motion classification. Using a ballgame as an example, we classified the strategic collective motions in different global behaviours and discovered that, in addition to the physical properties, the contextual node information was critical for classification. Furthermore, we discovered the label-specific stronger spectra in the relationship among the nearest agents, providing physical and semantic interpretations. Our approach contributes to the understanding of principles of biological complex network dynamics from the perspective of nonlinear dynamical systems.
△ Less
Submitted 13 June, 2020; v1 submitted 13 May, 2019;
originally announced May 2019.
-
Knowledge-Based Regularization in Generative Modeling
Authors:
Naoya Takeishi,
Yoshinobu Kawahara
Abstract:
Prior domain knowledge can greatly help to learn generative models. However, it is often too costly to hard-code prior knowledge as a specific model architecture, so we often have to use general-purpose models. In this paper, we propose a method to incorporate prior knowledge of feature relations into the learning of general-purpose generative models. To this end, we formulate a regularizer that m…
▽ More
Prior domain knowledge can greatly help to learn generative models. However, it is often too costly to hard-code prior knowledge as a specific model architecture, so we often have to use general-purpose models. In this paper, we propose a method to incorporate prior knowledge of feature relations into the learning of general-purpose generative models. To this end, we formulate a regularizer that makes the marginals of a generative model to follow prescribed relative dependence of features. It can be incorporated into off-the-shelf learning methods of many generative models, including variational autoencoders and generative adversarial networks, as its gradients can be computed using standard backpropagation techniques. We show the effectiveness of the proposed method with experiments on multiple types of datasets and generative models.
△ Less
Submitted 10 December, 2020; v1 submitted 6 February, 2019;
originally announced February 2019.
-
Hemorheology in dilute, semi-dilute and dense suspensions of red blood cells
Authors:
Naoki Takeishi,
Marco E. Rosti,
Yohsuke Imai,
Shigeo Wada,
Luca Brandt
Abstract:
We present a numerical analysis of the rheology of a suspension of red blood cells (RBCs) in a wall-bounded shear flow. The flow is assumed as almost inertialess. The suspension of RBCs, modeled as biconcave capsules whose membrane follows the Skalak constitutive law, is simulated for a wide range of viscosity ratios between the cytoplasm and plasma: $λ$ = 0.1-10, for volume fractions up to $φ$ =…
▽ More
We present a numerical analysis of the rheology of a suspension of red blood cells (RBCs) in a wall-bounded shear flow. The flow is assumed as almost inertialess. The suspension of RBCs, modeled as biconcave capsules whose membrane follows the Skalak constitutive law, is simulated for a wide range of viscosity ratios between the cytoplasm and plasma: $λ$ = 0.1-10, for volume fractions up to $φ$ = 0.41 and for different capillary numbers ($Ca$). Our numerical results show that an RBC at low $Ca$ tends to orient to the shear plane and exhibits the so-called rolling motion, a stable mode with higher intrinsic viscosity than the so-called tumbling motion. As $Ca$ increases, the mode shifts from the rolling to the swinging motion. Hydrodynamic interactions (higher volume fraction) also allows RBCs to exhibit both tumbling or swinging motions resulting in a drop of the intrinsic viscosity for dilute and semi-dilute suspensions. Because of this mode change, conventional ways of modeling the relative viscosity as a polynomial function of $φ$ cannot be simply applied in suspensions of RBCs at low volume fractions. The relative viscosity for high volume fractions, however, can be well described as a function of an effective volume fraction, defined by the volume of spheres of radius equal to the semi-middle axis of the deformed RBC. We find that the relative viscosity successfully collapses on a single non-linear curve independently of $λ$ except for the case with $Ca \geq$ 0.4, where the fit works only in the case of low/moderate volume fraction, and fails in the case of a fully dense suspension.
△ Less
Submitted 1 May, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
Knowledge-Based Distant Regularization in Learning Probabilistic Models
Authors:
Naoya Takeishi,
Kosuke Akimoto
Abstract:
Exploiting the appropriate inductive bias based on the knowledge of data is essential for achieving good performance in statistical machine learning. In practice, however, the domain knowledge of interest often provides information on the relationship of data attributes only distantly, which hinders direct utilization of such domain knowledge in popular regularization methods. In this paper, we pr…
▽ More
Exploiting the appropriate inductive bias based on the knowledge of data is essential for achieving good performance in statistical machine learning. In practice, however, the domain knowledge of interest often provides information on the relationship of data attributes only distantly, which hinders direct utilization of such domain knowledge in popular regularization methods. In this paper, we propose the knowledge-based distant regularization framework, in which we utilize the distant information encoded in a knowledge graph for regularization of probabilistic model estimation. In particular, we propose to impose prior distributions on model parameters specified by knowledge graph embeddings. As an instance of the proposed framework, we present the factor analysis model with the knowledge-based distant regularization. We show the results of preliminary experiments on the improvement of the generalization capability of such model.
△ Less
Submitted 29 June, 2018;
originally announced June 2018.
-
Dynamic and Static Topic Model for Analyzing Time-Series Document Collections
Authors:
Rem Hida,
Naoya Takeishi,
Takehisa Yairi,
Koichi Hori
Abstract:
For extracting meaningful topics from texts, their structures should be considered properly. In this paper, we aim to analyze structured time-series documents such as a collection of news articles and a series of scientific papers, wherein topics evolve along time depending on multiple topics in the past and are also related to each other at each time. To this end, we propose a dynamic and static…
▽ More
For extracting meaningful topics from texts, their structures should be considered properly. In this paper, we aim to analyze structured time-series documents such as a collection of news articles and a series of scientific papers, wherein topics evolve along time depending on multiple topics in the past and are also related to each other at each time. To this end, we propose a dynamic and static topic model, which simultaneously considers the dynamic structures of the temporal topic evolution and the static structures of the topic hierarchy at each time. We show the results of experiments on collections of scientific papers, in which the proposed method outperformed conventional models. Moreover, we show an example of extracted topic structures, which we found helpful for analyzing research activities.
△ Less
Submitted 6 May, 2018;
originally announced May 2018.
-
Recent Developments in Aerial Robotics: A Survey and Prototypes Overview
Authors:
Chun Fui Liew,
Danielle DeLatte,
Naoya Takeishi,
Takehisa Yairi
Abstract:
In recent years, research and development in aerial robotics (i.e., unmanned aerial vehicles, UAVs) has been growing at an unprecedented speed, and there is a need to summarize the background, latest developments, and trends of UAV research. Along with a general overview on the definition, types, categories, and topics of UAV, this work describes a systematic way to identify 1,318 high-quality UAV…
▽ More
In recent years, research and development in aerial robotics (i.e., unmanned aerial vehicles, UAVs) has been growing at an unprecedented speed, and there is a need to summarize the background, latest developments, and trends of UAV research. Along with a general overview on the definition, types, categories, and topics of UAV, this work describes a systematic way to identify 1,318 high-quality UAV papers from more than thirty thousand that have been appeared in the top journals and conferences. On top of that, we provide a bird's-eye view of UAV research since 2001 by summarizing various statistical information, such as the year, type, and topic distribution of the UAV papers. We make our survey list public and believe that the list can not only help researchers identify, study, and compare their work, but is also useful for understanding research trends in the field. From our survey results, we find there are many types of UAV, and to the best of our knowledge, no literature has attempted to summarize all types in one place. With our survey list, we explain the types within our survey and outline the recent progress of each. We believe this summary can enhance readers' understanding on the UAVs and inspire researchers to propose new methods and new applications.
△ Less
Submitted 29 November, 2017; v1 submitted 27 November, 2017;
originally announced November 2017.
-
Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition
Authors:
Naoya Takeishi,
Yoshinobu Kawahara,
Takehisa Yairi
Abstract:
Spectral decomposition of the Koopman operator is attracting attention as a tool for the analysis of nonlinear dynamical systems. Dynamic mode decomposition is a popular numerical algorithm for Koopman spectral analysis; however, we often need to prepare nonlinear observables manually according to the underlying dynamics, which is not always possible since we may not have any a priori knowledge ab…
▽ More
Spectral decomposition of the Koopman operator is attracting attention as a tool for the analysis of nonlinear dynamical systems. Dynamic mode decomposition is a popular numerical algorithm for Koopman spectral analysis; however, we often need to prepare nonlinear observables manually according to the underlying dynamics, which is not always possible since we may not have any a priori knowledge about them. In this paper, we propose a fully data-driven method for Koopman spectral analysis based on the principle of learning Koopman invariant subspaces from observed data. To this end, we propose minimization of the residual sum of squares of linear least-squares regression to estimate a set of functions that transforms data into a form in which the linear regression fits well. We introduce an implementation with neural networks and evaluate performance empirically using nonlinear dynamical systems and applications.
△ Less
Submitted 30 January, 2018; v1 submitted 11 October, 2017;
originally announced October 2017.
-
Subspace dynamic mode decomposition for stochastic Koopman analysis
Authors:
Naoya Takeishi,
Yoshinobu Kawahara,
Takehisa Yairi
Abstract:
The analysis of nonlinear dynamical systems based on the Koopman operator is attracting attention in various applications. Dynamic mode decomposition (DMD) is a data-driven algorithm for Koopman spectral analysis, and several variants with a wide range of applications have been proposed. However, popular implementations of DMD suffer from observation noise on random dynamical systems and generate…
▽ More
The analysis of nonlinear dynamical systems based on the Koopman operator is attracting attention in various applications. Dynamic mode decomposition (DMD) is a data-driven algorithm for Koopman spectral analysis, and several variants with a wide range of applications have been proposed. However, popular implementations of DMD suffer from observation noise on random dynamical systems and generate an inaccurate estimation of the spectra of the stochastic Koopman operator. In this paper, we propose subspace DMD as an algorithm for the Koopman analysis of random dynamical systems with observation noise. Subspace DMD first computes the orthogonal projection of future snapshots to the space of past snapshots and then estimates the spectra of a linear model, and its output converges to the spectra of the stochastic Koopman operator under standard assumptions. We investigate the empirical performance of subspace DMD with several dynamical systems and show its utility for the Koopman analysis of random dynamical systems.
△ Less
Submitted 30 October, 2017; v1 submitted 13 May, 2017;
originally announced May 2017.