-
Building Hybrid B-Spline And Neural Network Operators
Authors:
Raffaele Romagnoli,
Jasmine Ratchford,
Mark H. Klein
Abstract:
Control systems are indispensable for ensuring the safety of cyber-physical systems (CPS), spanning various domains such as automobiles, airplanes, and missiles. Safeguarding CPS necessitates runtime methodologies that continuously monitor safety-critical conditions and respond in a verifiably safe manner. A fundamental aspect of many safety approaches involves predicting the future behavior of sy…
▽ More
Control systems are indispensable for ensuring the safety of cyber-physical systems (CPS), spanning various domains such as automobiles, airplanes, and missiles. Safeguarding CPS necessitates runtime methodologies that continuously monitor safety-critical conditions and respond in a verifiably safe manner. A fundamental aspect of many safety approaches involves predicting the future behavior of systems. However, achieving this requires accurate models that can operate in real time. Motivated by DeepONets, we propose a novel strategy that combines the inductive bias of B-splines with data-driven neural networks to facilitate real-time predictions of CPS behavior. We introduce our hybrid B-spline neural operator, establishing its capability as a universal approximator and providing rigorous bounds on the approximation error. These findings are applicable to a broad class of nonlinear autonomous systems and are validated through experimentation on a controlled 6-degree-of-freedom (DOF) quadrotor with a 12 dimensional state space. Furthermore, we conduct a comparative analysis of different network architectures, specifically fully connected networks (FCNN) and recurrent neural networks (RNN), to elucidate the practical utility and trade-offs associated with each architecture in real-world scenarios.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Emo**ize: Enriching Any Text with Emoji Translations
Authors:
Lars Henning Klein,
Roland Aydin,
Robert West
Abstract:
Emoji have become ubiquitous in written communication, on the Web and beyond. They can emphasize or clarify emotions, add details to conversations, or simply serve decorative purposes. This casual use, however, barely scratches the surface of the expressive power of emoji. To further unleash this power, we present Emo**ize, a method for translating arbitrary text phrases into sequences of one or…
▽ More
Emoji have become ubiquitous in written communication, on the Web and beyond. They can emphasize or clarify emotions, add details to conversations, or simply serve decorative purposes. This casual use, however, barely scratches the surface of the expressive power of emoji. To further unleash this power, we present Emo**ize, a method for translating arbitrary text phrases into sequences of one or more emoji without requiring human input. By leveraging the power of large language models, Emo**ize can choose appropriate emoji by disambiguating based on context (eg, cricket-bat vs bat) and can express complex concepts compositionally by combining multiple emoji (eq, "Emo**ize" is translated to input-latin-letters right-arrow grinning-face). In a cloze test--based user study, we show that Emo**ize's emoji translations increase the human guessability of masked words by 55%, whereas human-picked emoji translations do so by only 29%. These results suggest that emoji provide a sufficiently rich vocabulary to accurately translate a wide variety of words. Moreover, annotating words and phrases with Emo**ize's emoji translations opens the door to numerous downstream applications, including children learning how to read, adults learning foreign languages, and text understanding for people with learning disabilities.
△ Less
Submitted 7 March, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
On the Design of Region-Avoiding Metrics for Collision-Safe Motion Generation on Riemannian Manifolds
Authors:
Holger Klein,
Noémie Jaquier,
Andre Meixner,
Tamim Asfour
Abstract:
The generation of energy-efficient and dynamic-aware robot motions that satisfy constraints such as joint limits, self-collisions, and collisions with the environment remains a challenge. In this context, Riemannian geometry offers promising solutions by identifying robot motions with geodesics on the so-called configuration space manifold. While this manifold naturally considers the intrinsic rob…
▽ More
The generation of energy-efficient and dynamic-aware robot motions that satisfy constraints such as joint limits, self-collisions, and collisions with the environment remains a challenge. In this context, Riemannian geometry offers promising solutions by identifying robot motions with geodesics on the so-called configuration space manifold. While this manifold naturally considers the intrinsic robot dynamics, constraints such as joint limits, self-collisions, and collisions with the environment remain overlooked. In this paper, we propose a modification of the Riemannian metric of the configuration space manifold allowing for the generation of robot motions as geodesics that efficiently avoid given regions. We introduce a class of Riemannian metrics based on barrier functions that guarantee strict region avoidance by systematically generating accelerations away from no-go regions in joint and task space. We evaluate the proposed Riemannian metric to generate energy-efficient, dynamic-aware, and collision-free motions of a humanoid robot as geodesics and sequences thereof.
△ Less
Submitted 28 July, 2023;
originally announced July 2023.
-
Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators
Authors:
Ziyang Jiang,
Yiling Liu,
Michael H. Klein,
Ahmed Aloui,
Yiman Ren,
Keyu Li,
Vahid Tarokh,
David Carlson
Abstract:
Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For examp…
▽ More
Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For example, we may want to identify how changes in brain activity or structure mediate an antidepressant's effect on behavior, but we may only have access to electrophysiological or imaging brain measurements. To date, most CMA methods assume that the mediator is one-dimensional and observable, which oversimplifies such real-world scenarios. To overcome this limitation, we introduce a CMA framework that can handle complex and indirectly observed mediators based on the identifiable variational autoencoder (iVAE) architecture. We prove that the true joint distribution over observed and latent variables is identifiable with the proposed method. Additionally, our framework captures a disentangled representation of the indirectly observed mediator and yields accurate estimation of the direct and mediated effects in synthetic and semi-synthetic experiments, providing evidence of its potential utility in real-world applications.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Domain Adaptation via Rebalanced Sub-domain Alignment
Authors:
Yiling Liu,
Juncheng Dong,
Ziyang Jiang,
Ahmed Aloui,
Keyu Li,
Hunter Klein,
Vahid Tarokh,
David Carlson
Abstract:
Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitati…
▽ More
Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitation, we propose a novel generalization bound that reweights source classification error by aligning source and target sub-domains. We prove that our proposed generalization bound is at least as strong as existing bounds under realistic assumptions, and we empirically show that it is much stronger on real-world data. We then propose an algorithm to minimize this novel generalization bound. We demonstrate by numerical experiments that this approach improves performance in shifted class distribution scenarios compared to state-of-the-art methods.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
A Riemannian Take on Human Motion Analysis and Retargeting
Authors:
Holger Klein,
Noémie Jaquier,
Andre Meixner,
Tamim Asfour
Abstract:
Dynamic motions of humans and robots are widely driven by posture-dependent nonlinear interactions between their degrees of freedom. However, these dynamical effects remain mostly overlooked when studying the mechanisms of human movement generation. Inspired by recent works, we hypothesize that human motions are planned as sequences of geodesic synergies, and thus correspond to coordinated joint m…
▽ More
Dynamic motions of humans and robots are widely driven by posture-dependent nonlinear interactions between their degrees of freedom. However, these dynamical effects remain mostly overlooked when studying the mechanisms of human movement generation. Inspired by recent works, we hypothesize that human motions are planned as sequences of geodesic synergies, and thus correspond to coordinated joint movements achieved with piecewise minimum energy. The underlying computational model is built on Riemannian geometry to account for the inertial characteristics of the body. Through the analysis of various human arm motions, we find that our model segments motions into geodesic synergies, and successfully predicts observed arm postures, hand trajectories, as well as their respective velocity profiles. Moreover, we show that our analysis can further be exploited to transfer arm motions to robots by reproducing individual human synergies as geodesic paths in the robot configuration space.
△ Less
Submitted 2 August, 2022;
originally announced August 2022.
-
Multiple Domain Causal Networks
Authors:
Tianhui Zhou,
William E. Carson IV,
Michael Hunter Klein,
David Carlson
Abstract:
Observational studies are regarded as economic alternatives to randomized trials, often used in their stead to investigate and determine treatment efficacy. Due to lack of sample size, observational studies commonly combine data from multiple sources or different sites/centers. Despite the benefits of an increased sample size, a naive combination of multicenter data may result in incongruities ste…
▽ More
Observational studies are regarded as economic alternatives to randomized trials, often used in their stead to investigate and determine treatment efficacy. Due to lack of sample size, observational studies commonly combine data from multiple sources or different sites/centers. Despite the benefits of an increased sample size, a naive combination of multicenter data may result in incongruities stemming from center-specific protocols for generating cohorts or reactions towards treatments distinct to a given center, among other things. These issues arise in a variety of other contexts, including capturing a treatment effect related to an individual's unique biological characteristics. Existing methods for estimating heterogeneous treatment effects have not adequately addressed the multicenter context, but rather treat it simply as a means to obtain sufficient sample size. Additionally, previous approaches to estimating treatment effects do not straightforwardly generalize to the multicenter design, especially when required to provide treatment insights for patients from a new, unobserved center. To address these shortcomings, we propose Multiple Domain Causal Networks (MDCN), an approach that simultaneously strengthens the information sharing between similar centers while addressing the selection bias in treatment assignment through learning of a new feature embedding. In empirical evaluations, MDCN is consistently more accurate when estimating the heterogeneous treatment effect in new centers compared to benchmarks that adjust solely based on treatment imbalance or general center differences. Finally, we justify our approach by providing theoretical analyses that demonstrate that MDCN improves on the generalization bound of the new, unobserved target center.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
Estimating volcanic ash emissions using retrieved satellite ash columns and inverse ash transport modelling
Authors:
André R. Brodtkorb,
Anna Benedictow,
Heiko Klein,
Arve Kylling,
Agnes Nyiri,
Alvaro Valdebenito,
Espen Sollum
Abstract:
This paper describes the inversion procedure being used operationally at the Norwegian Meteorological Institute for estimating ash emission rates from retrieved satellite ash column amounts and a priori knowledge.
The overall procedure consists of five stages:
(1) generate a priori emission estimates;
(2) run forward simulations with unit emissions;
(3) collocate/match observations with em…
▽ More
This paper describes the inversion procedure being used operationally at the Norwegian Meteorological Institute for estimating ash emission rates from retrieved satellite ash column amounts and a priori knowledge.
The overall procedure consists of five stages:
(1) generate a priori emission estimates;
(2) run forward simulations with unit emissions;
(3) collocate/match observations with emission simulations;
(4) build system of linear equations; and
(5) solve overdetermined system.
We go through the mathematical foundations for the inversion procedure, performance for synthetic cases, and performance for real-world cases. The novelties of this paper includes pruning of the linear system of equations used in the inversion and inclusion of observations of ash cloud top altitude.
The source code used in this work is freely available under an open source license, and is possible to use for other similar applications.
△ Less
Submitted 20 May, 2020;
originally announced May 2020.
-
Neural Network-Based Collaborative Filtering for Question Sequencing
Authors:
Lior Sidi,
Hadar Klein
Abstract:
E-Learning systems (ELS) and Intelligent Tutoring Systems (ITS) play a significant part in today's education programs. Sequencing questions is the art of generating a personalized quiz for a target learner. A personalized test will enrich the learner's experience and will contribute to a more effective and efficient learning process. In this paper, we used the Neural Collaborative Filtering (NCF)…
▽ More
E-Learning systems (ELS) and Intelligent Tutoring Systems (ITS) play a significant part in today's education programs. Sequencing questions is the art of generating a personalized quiz for a target learner. A personalized test will enrich the learner's experience and will contribute to a more effective and efficient learning process. In this paper, we used the Neural Collaborative Filtering (NCF) model to generate question sequencing and compare it to a pair-wise memory-based question sequencing algorithm - EduRank. The NCF model showed significantly better ranking results than the EduRank model with an Average precision correlation score of 0.85 compared to 0.8.
△ Less
Submitted 25 April, 2020;
originally announced April 2020.
-
Boundedness-Preserving Implicit Correction of Mesh-Induced Errors for VoF Based Heat and Mass Transfer
Authors:
Simon Hill,
Daniel Deising,
Thomas Acher,
Harald Klein,
Dieter Bothe,
Holger Marschall
Abstract:
Spatial discretisation of geometrically complex computational domains often entails unstructured meshes of general topology for Computational Fluid Dynamics (CFD). Mesh skewness is then typically encountered causing severe deterioration of the formal order of accuracy of the discretisation, or boundedness of the solution, or both. Particularly methods inherently relying on the accurate and bounded…
▽ More
Spatial discretisation of geometrically complex computational domains often entails unstructured meshes of general topology for Computational Fluid Dynamics (CFD). Mesh skewness is then typically encountered causing severe deterioration of the formal order of accuracy of the discretisation, or boundedness of the solution, or both. Particularly methods inherently relying on the accurate and bounded transport of sharp fields suffer from all types of mesh-induced skewness errors, namely both non-orthogonality and non-conjunctionality errors. This work is devoted to a boundedness-preserving strategy to correct for skewness errors arising from discretisation of advection and diffusion terms within the context of interfacial heat and mass transfer based on the Volume-of-Fluid methodology. The implementation has been accomplished using a second-order finite volume method with support for unstructured meshes of general topology. We examine and advance suitable corrections for the finite volume discretisation of a consistent single-field model, where both accurate and bounded transport due to diffusion and advection is crucial. In order to ensure consistency of both the volume fraction and the species concentration transport, i.e. to avoid artificial heat or species transfer, corrections are studied for both cases. The cross interfacial jump and adjacent sharp gradients of species concentration render the correction for skewness-induced diffusion and advection errors additionally demanding and has not so far been addressed in the literature.
△ Less
Submitted 21 June, 2017; v1 submitted 31 May, 2017;
originally announced May 2017.
-
Cancer gene prioritization by integrative analysis of mRNA expression and DNA copy number data: a comparative review
Authors:
Leo Lahti,
Martin Schäfer,
Hans-Ulrich Klein,
Silvio Bicciato,
Martin Dugas
Abstract:
A variety of genome-wide profiling techniques are available to probe complementary aspects of genome structure and function. Integrative analysis of heterogeneous data sources can reveal higher-level interactions that cannot be detected based on individual observations. A standard integration task in cancer studies is to identify altered genomic regions that induce changes in the expression of the…
▽ More
A variety of genome-wide profiling techniques are available to probe complementary aspects of genome structure and function. Integrative analysis of heterogeneous data sources can reveal higher-level interactions that cannot be detected based on individual observations. A standard integration task in cancer studies is to identify altered genomic regions that induce changes in the expression of the associated genes based on joint analysis of genome-wide gene expression and copy number profiling measurements. In this review, we provide a comparison among various modeling procedures for integrating genome-wide profiling data of gene copy number and transcriptional alterations and highlight common approaches to genomic data integration. A transparent benchmarking procedure is introduced to quantitatively compare the cancer gene prioritization performance of the alternative methods. The benchmarking algorithms and data sets are available at http://intcomp.r-forge.r-project.org
△ Less
Submitted 20 November, 2011;
originally announced November 2011.