-
DADEE: Well-calibrated uncertainty quantification in neural networks for barriers-based robot safety
Authors:
Masoud Ataei,
Vikas Dhiman
Abstract:
Uncertainty-aware controllers that guarantee safety are critical for safety-critical applications. Among such controllers, Control Barrier Function (CBF)-based approaches are popular because they are fast, yet safe. However, most such works depend on Gaussian Processes (GPs) or MC-Dropout for learning and uncertainty estimation, and both approaches come with drawbacks: GPs are non-parametric methods that are slow, while MC-Dropout does not capture aleatoric uncertainty. On the other hand, modern Bayesian learning algorithms have shown promise in uncertainty quantification. The application of modern Bayesian learning methods to CBF-based controllers has not yet been studied. We aim to fill this gap by surveying uncertainty quantification algorithms and evaluating them on CBF-based safe controllers. We find that model variance-based algorithms (for example, Deep ensembles and MC-Dropout) and direct estimation-based algorithms (such as DEUP) have complementary strengths. Algorithms in the former category can only estimate uncertainty accurately out-of-domain, while those in the latter category can only do so in-domain. We combine the two approaches to obtain more accurate uncertainty estimates both in- and out-of-domain. As measured by the failure rate of a simulated robot, this results in a safer CBF-based robot controller.
Submitted 30 June, 2024;
originally announced July 2024.
-
Elicitron: An LLM Agent-Based Simulation Framework for Design Requirements Elicitation
Authors:
Mohammadmehdi Ataei,
Hyunmin Cheong,
Daniele Grandi,
Ye Wang,
Nigel Morris,
Alexander Tessier
Abstract:
Requirements elicitation, a critical yet time-consuming and challenging step in product development, often fails to capture the full spectrum of user needs. This may lead to products that fall short of expectations. This paper introduces a novel framework that leverages Large Language Models (LLMs) to automate and enhance the requirements elicitation process. LLMs are used to generate a vast array of simulated users (LLM agents), enabling the exploration of a much broader range of user needs and unforeseen use cases. These agents engage in product experience scenarios, explaining their actions, observations, and challenges. Subsequent agent interviews and analysis uncover valuable user needs, including latent ones. We validate our framework with three experiments. First, we explore different methodologies for diverse agent generation, discussing their advantages and shortcomings. We measure the diversity of identified user needs and demonstrate that context-aware agent generation leads to greater diversity. Second, we show how our framework effectively mimics empathic lead user interviews, identifying a greater number of latent needs than conventional human interviews. Third, we showcase that LLMs can be used to analyze interviews, capture needs, and classify them as latent or not. Our work highlights the potential of using LLM agents to accelerate early-stage product development, reduce costs, and increase innovation.
Submitted 4 April, 2024;
originally announced April 2024.
-
Reduced-order modeling of unsteady fluid flow using neural network ensembles
Authors:
Rakesh Halder,
Mohammadmehdi Ataei,
Hesam Salehipour,
Krzysztof Fidkowski,
Kevin Maki
Abstract:
The use of deep learning has become increasingly popular in reduced-order models (ROMs) to obtain low-dimensional representations of full-order models. Convolutional autoencoders (CAEs) are often used to this end as they are adept at handling data that are spatially distributed, including solutions to partial differential equations. When applied to unsteady physics problems, ROMs also require a model for time-series prediction of the low-dimensional latent variables. Long short-term memory (LSTM) networks, a type of recurrent neural network useful for modeling sequential data, are frequently employed in data-driven ROMs for autoregressive time-series prediction. When making predictions at unseen design points over long time horizons, error propagation is a frequently encountered issue, where errors made early on can compound over time and lead to large inaccuracies. In this work, we propose using bagging, a commonly used ensemble learning technique, to develop a fully data-driven ROM framework referred to as the CAE-eLSTM ROM that uses CAEs for spatial reconstruction of the full-order model and LSTM ensembles for time-series prediction. When applied to two unsteady fluid dynamics problems, our results show that the presented framework effectively reduces error propagation and leads to more accurate time-series prediction of latent variables at unseen points.
Submitted 7 February, 2024;
originally announced February 2024.
-
XLB: A differentiable massively parallel lattice Boltzmann library in Python
Authors:
Mohammadmehdi Ataei,
Hesam Salehipour
Abstract:
The lattice Boltzmann method (LBM) has emerged as a prominent technique for solving fluid dynamics problems due to its algorithmic potential for computational scalability. We introduce the XLB library, a Python-based differentiable LBM library based on the JAX platform. The architecture of XLB is predicated upon ensuring accessibility, extensibility, and computational performance, enabling effective scaling across CPU, TPU, multi-GPU, and distributed multi-GPU or TPU systems. The library can be readily augmented with novel boundary conditions, collision models, or multi-physics simulation capabilities. XLB's differentiability and data structures are compatible with the extensive JAX-based machine learning ecosystem, enabling it to address physics-based machine learning, optimization, and inverse problems. XLB has been successfully scaled to handle simulations with billions of cells, achieving giga-scale lattice updates per second. XLB is released under the permissive Apache-2.0 license and is available on GitHub at https://github.com/Autodesk/XLB.
Submitted 2 April, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
-
A Deep Learning Algorithm for Piecewise Linear Interface Construction (PLIC)
Authors:
Mohammadmehdi Ataei,
Erfan Pirmorad,
Franco Costa,
Sejin Han,
Chul B Park,
Markus Bussmann
Abstract:
Piecewise Linear Interface Construction (PLIC) is frequently used to geometrically reconstruct fluid interfaces in Computational Fluid Dynamics (CFD) modeling of two-phase flows. PLIC reconstructs interfaces from a scalar field that represents the volume fraction of each phase in each computational cell. Given the volume fraction and interface normal, the location of a linear interface is uniquely defined. For a cubic computational cell (3D), the position of the planar interface is determined by intersecting the cube with a plane, such that the volume of the resulting truncated polyhedron cell is equal to the volume fraction. Yet it is geometrically complex to find the exact position of the plane, and it involves calculations that can be a computational bottleneck of many CFD models. However, while the forward problem of 3D PLIC is challenging, the inverse problem, of finding the volume of the truncated polyhedron cell given a defined plane, is simple. In this work, we propose a deep learning model for the solution to the forward problem of PLIC by only making use of its inverse problem. The proposed model is up to several orders of magnitude faster than traditional schemes, which significantly reduces the computational bottleneck of PLIC in CFD simulations.
Submitted 27 July, 2021;
originally announced July 2021.
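The asymmetry described in the abstract above (computing the cut volume from a given plane is easy, while locating the plane from a given volume fraction is hard) can be illustrated with a minimal 2D sketch. This is not the paper's deep learning model: it is a plain bisection on a unit square cell, and the function names `cut_area` and `locate_interface` are hypothetical, chosen only for this illustration.

```python
def cut_area(nx, ny, d):
    """Easy direction: area of the unit square where nx*x + ny*y <= d."""
    corners = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    poly = []
    # Clip the square against the half-plane, walking its edges in order.
    for i in range(4):
        p, q = corners[i], corners[(i + 1) % 4]
        fp = nx * p[0] + ny * p[1] - d
        fq = nx * q[0] + ny * q[1] - d
        if fp <= 0.0:
            poly.append(p)
        if fp * fq < 0.0:  # the edge crosses the line: add the crossing point
            t = fp / (fp - fq)
            poly.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
    # Shoelace formula for the area of the clipped polygon.
    twice_area = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def locate_interface(nx, ny, fraction, iterations=60):
    """Hard direction, solved here by bisection on the easy one (an assumption
    of this sketch): find d such that cut_area(nx, ny, d) matches fraction."""
    values = [nx * x + ny * y for x in (0.0, 1.0) for y in (0.0, 1.0)]
    lo, hi = min(values), max(values)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if cut_area(nx, ny, mid) < fraction:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


d_axis = locate_interface(1.0, 0.0, 0.3)   # vertical interface
d_diag = locate_interface(1.0, 1.0, 0.5)   # diagonal interface
print(d_axis, d_diag)
```

Each bisection step calls the cheap area computation once, which is the kind of iterative cost a learned direct mapping would aim to avoid.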
-
Theory and Applications of Financial Chaos Index
Authors:
Masoud Ataei,
Shengyuan Chen,
Zijiang Yang,
M. Reza Peyghami
Abstract:
We develop a new stock market index that captures the chaos existing in the market by measuring the mutual changes of asset prices. This new index relies on a tensor-based embedding of the stock market information, which in turn frees it from the restrictive value- or capitalization-weighting assumptions that commonly underlie various other popular indexes. We show that our index is a robust estimator of the market volatility, which enables us to characterize the market by performing the task of segmentation with a high degree of reliability. In addition, we analyze the dynamics and kinematics of the realized market volatility as compared to the implied volatility by introducing a time-dependent dynamical system model. Our computational results, which pertain to the time period from January 1990 to December 2019, imply that there exists a bidirectional causal relation between the processes underlying the realized and implied volatility of the stock market within the given time period, where it is shown that the latter has a stronger causal effect on the former as compared to the opposite. This result connotes that the implied volatility of the market plays a key role in the characterization of the market's realized volatility.
Submitted 5 January, 2021;
originally announced January 2021.
-
Designing Sequence with Minimum PSL Using Chebyshev Distance and its Application for Chaotic MIMO Radar Waveform Design
Authors:
Hamid Esmaeili Najafabadi,
Mohammad Ataei,
Mohamad F. Sabahi
Abstract:
Controlling peak side-lobe level (PSL) is of great importance in high-resolution applications of multiple-input multiple-output (MIMO) radars. In this paper, the design of sequences with good autocorrelation properties is studied. The PSL of the autocorrelation is regarded as the main merit and is optimized through newly introduced cyclic algorithms, namely PSL Minimization Quadratic Approach (PMQA), PSL Minimization Algorithm, the smallest Rectangular (PMAR), and PSL Optimization Cyclic Algorithm (POCA). It is revealed that minimizing PSL results in better sequences in terms of autocorrelation side-lobes when compared with traditional integrated side-lobe level (ISL) minimization. In order to improve the performance of these algorithms, fast-randomized Singular Value Decomposition (SVD) is utilized. To achieve waveform design for MIMO radars, this algorithm is applied to the waveform generated from a modified Bernoulli chaotic system. The numerical experiments confirm the superiority of the newly developed algorithms compared to high-performance algorithms in mono-static and MIMO radars.
Submitted 7 October, 2020;
originally announced October 2020.
-
NPLIC: A Machine Learning Approach to Piecewise Linear Interface Construction
Authors:
Mohammadmehdi Ataei,
Markus Bussmann,
Vahid Shaayegan,
Franco Costa,
Sejin Han,
Chul B. Park
Abstract:
Volume of fluid (VOF) methods are extensively used to track fluid interfaces in numerical simulations, and many VOF algorithms require that the interface be reconstructed geometrically. For this purpose, the Piecewise Linear Interface Construction (PLIC) technique is most frequently used, which for reasons of geometric complexity can be slow and difficult to implement. Here, we propose an alternative neural network based method called NPLIC to perform PLIC calculations. The model is trained on a large synthetic dataset of PLIC solutions for square, cubic, triangular, and tetrahedral meshes. We show that this data-driven approach results in accurate calculations at a fraction of the usual computational cost, and a single neural network system can be used for interface reconstruction of different mesh types.
Submitted 24 January, 2021; v1 submitted 26 June, 2020;
originally announced July 2020.
-
A Geometrical Branch-and-Price (GEOM-BP) Algorithm for Big Bin Packing Problems
Authors:
Masoud Ataei,
Shengyuan Chen
Abstract:
The bin packing problem examines the minimum number of identical bins needed to pack a set of items of various weights. This problem arises in various areas of artificial intelligence that demand exact solutions in the shortest amount of time. Employing branch-and-bound and column generation techniques to derive the exact solutions to this problem usually requires designation of problem-specific branching rules compatible with the nature of the polluted pricing sub-problem of column generation. In this work, we present a new approach to deal with the forbidden bins which handles two-dimensional knapsack problems. Furthermore, a set of diving criteria are introduced which emphasize the importance of the geometrical features of the bins. It is further shown that the efficiency of the column generation technique can be significantly improved using an implicit sectional pricing scheme. The proposed algorithm outperforms the current state-of-the-art algorithms in the number of benchmark instances solved in less than one minute.
Submitted 1 September, 2019;
originally announced September 2019.
-
LBfoam: An open-source software package for the simulation of foaming using the Lattice Boltzmann Method
Authors:
Mohammadmehdi Ataei,
Vahid Shaayegan,
Franco Costa,
Sejin Han,
Chul B. Park,
Markus Bussmann
Abstract:
This paper presents a 2D/3D Free Surface Lattice Boltzmann Method simulation package called LBfoam for the simulation of foaming processes. The model incorporates the essential physics of foaming phenomena: gas diffusion into nucleated bubbles, bubble dynamics and coalescence, surface tension, the stabilizing disjoining pressure between bubbles, and Newtonian and non-Newtonian rheological models. The software can simulate the growth and interaction of bubbles, and predict final foam structures. The implementation is based on the Palabos library (in C++), which enables large-scale parallel simulations. The software is freely available under the GNU Affero General Public License version 3 at: https://github.com/mehdiataei/LBfoam
Submitted 19 May, 2020; v1 submitted 8 August, 2019;
originally announced August 2019.
-
Lehmer Transform and its Theoretical Properties
Authors:
Masoud Ataei,
Shengyuan Chen,
Xiaogang Wang
Abstract:
We propose a new class of transforms that we call the Lehmer transform, which is motivated by the Lehmer mean function. The proposed Lehmer transform decomposes a function of a sample into its constituent statistical moments. Theoretical properties of the proposed transform are presented. This transform could be very useful to provide an alternative method in analyzing non-stationary signals such as brain wave EEG.
Submitted 13 May, 2018;
originally announced May 2018.
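For readers unfamiliar with the Lehmer mean that motivates the transform above, here is a minimal sketch of the standard Lehmer mean of positive numbers, L_p(x) = sum(x_i^p) / sum(x_i^(p-1)). It illustrates only the classical mean, not the paper's transform; the function name `lehmer_mean` is chosen for this example.

```python
def lehmer_mean(values, p):
    """Standard Lehmer mean of positive numbers: sum(x**p) / sum(x**(p - 1))."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("this sketch assumes a non-empty list of positive numbers")
    numerator = sum(v ** p for v in values)
    denominator = sum(v ** (p - 1) for v in values)
    return numerator / denominator


sample = [1.0, 2.0, 4.0]
harmonic = lehmer_mean(sample, 0)        # p = 0 gives the harmonic mean
arithmetic = lehmer_mean(sample, 1)      # p = 1 gives the arithmetic mean
contraharmonic = lehmer_mean(sample, 2)  # p = 2 gives the contraharmonic mean
print(harmonic, arithmetic, contraharmonic)
```

Varying `p` sweeps the mean from values near the sample minimum (large negative `p`) to values near the sample maximum (large positive `p`).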
-
Description of Galois unipotent extensions
Authors:
Masoud Ataei,
Jan Minac,
Nguyen Duy Tan
Abstract:
Given an arbitrary field $F$, we describe all Galois extensions $L/F$ whose Galois groups are isomorphic to the group of upper triangular unipotent 4-by-4 matrices with entries in the field of two elements.
Submitted 16 September, 2016; v1 submitted 22 August, 2015;
originally announced August 2015.
-
Energy-Efficient Broadcasting for Cross Wireless Ad-Hoc Networks
Authors:
Mohammad R. Ataei,
Amir H. Banihashemi,
Thomas Kunz
Abstract:
In this paper, we propose solutions for the energy-efficient broadcasting over cross networks, where N nodes are located on two perpendicular lines. Our solutions consist of an algorithm which finds the optimal range assignment in polynomial time (O(N^12)), a near-optimal algorithm with linear complexity (O(N)), and a distributed algorithm with complexity O(1). To the best of our knowledge, this is the first study presenting an optimal solution for the minimum-energy broadcasting problem for a 2-D network (with cross configuration). We compare our algorithms with the broadcast incremental power (BIP) algorithm, one of the most commonly used methods for solving this problem with complexity O(N^2). We demonstrate that our near-optimal algorithm outperforms BIP, and that the distributed algorithm performs close to it. Moreover, the proposed distributed algorithm can be used for more general two-dimensional networks, where the nodes are located on a grid consisting of perpendicular line-segments. The performance of the proposed near-optimal and distributed algorithms tends to be closer to the optimal solution for larger networks.
Submitted 20 April, 2015;
originally announced April 2015.
-
Low-Complexity Energy-Efficient Broadcasting in One-Dimensional Wireless Networks
Authors:
Mohammad R. Ataei,
Amir H. Banihashemi,
Thomas Kunz
Abstract:
In this paper, we investigate the transmission range assignment for N wireless nodes located on a line (a linear wireless network) for broadcasting data from one specific node to all the nodes in the network with minimum energy. Our goal is to find a solution that has low complexity and yet performs close to optimal. We propose an algorithm for finding the optimal assignment (which results in the minimum energy consumption) with complexity O(N^2). An approximation algorithm with complexity O(N) is also proposed. It is shown that, for networks with uniformly distributed nodes, the linear-time approximate solution obtained by this algorithm on average performs practically identically to the optimal assignment. Both the optimal and the suboptimal algorithms require the full knowledge of the network topology and are thus centralized. We also propose a distributed algorithm of negligible complexity, i.e., with complexity O(1), which only requires the knowledge of the adjacent neighbors at each wireless node. Our simulations demonstrate that the distributed solution on average performs almost as well as the optimal one for networks with uniformly distributed nodes.
Submitted 17 April, 2012;
originally announced April 2012.
-
Minimal blocking sets in PG(n,2) and covering groups by subgroups
Authors:
Alireza Abdollahi,
M. J. Ataei,
A. Mohammadi Hassanabadi
Abstract:
In this paper we prove that a set of points $B$ of PG(n,2) is a minimal blocking set if and only if $<B>=PG(d,2)$ with $d$ odd and $B$ is a set of $d+2$ points of $PG(d,2)$ no $d+1$ of them in the same hyperplane. As a corollary to the latter result we show that if $G$ is a finite 2-group and $n$ is a positive integer, then $G$ admits a $\mathfrak{C}_{n+1}$-cover if and only if $n$ is even and $G\cong (C_2)^{n}$, where by a $\mathfrak{C}_m$-cover for a group $H$ we mean a set $\mathcal{C}$ of size $m$ of maximal subgroups of $H$ whose set-theoretic union is the whole $H$ and no proper subset of $\mathcal{C}$ has the latter property and the intersection of the maximal subgroups is core-free. Also for all $n<10$ we find all pairs $(m,p)$ ($m>0$ an integer and $p$ a prime number) for which there is a blocking set $B$ of size $n$ in $PG(m,p)$ such that $<B>=PG(m,p)$.
Submitted 16 August, 2007;
originally announced August 2007.