-
Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering
Authors:
Zhentao Xu,
Mark Jerome Cruz,
Matthew Guevara,
Tie Wang,
Manasi Deshpande,
Xiaofeng Wang,
Zheng Li
Abstract:
In customer service technical support, swiftly and accurately retrieving relevant past issues is critical for efficiently resolving customer inquiries. The conventional retrieval methods in retrieval-augmented generation (RAG) for large language models (LLMs) treat a large corpus of past issue tracking tickets as plain text, ignoring the crucial intra-issue structure and inter-issue relations, whi…
▽ More
In customer service technical support, swiftly and accurately retrieving relevant past issues is critical for efficiently resolving customer inquiries. The conventional retrieval methods in retrieval-augmented generation (RAG) for large language models (LLMs) treat a large corpus of past issue tracking tickets as plain text, ignoring the crucial intra-issue structure and inter-issue relations, which limits performance. We introduce a novel customer service question-answering method that amalgamates RAG with a knowledge graph (KG). Our method constructs a KG from historical issues for use in retrieval, retaining the intra-issue structure and inter-issue relations. During the question-answering phase, our method parses consumer queries and retrieves related sub-graphs from the KG to generate answers. This integration of a KG not only improves retrieval accuracy by preserving customer service structure information but also enhances answering quality by mitigating the effects of text segmentation. Empirical assessments on our benchmark datasets, utilizing key retrieval (MRR, Recall@K, NDCG@K) and text generation (BLEU, ROUGE, METEOR) metrics, reveal that our method outperforms the baseline by 77.6% in MRR and by 0.32 in BLEU. Our method has been deployed within LinkedIn's customer service team for approximately six months and has reduced the median per-issue resolution time by 28.6%.
△ Less
Submitted 6 May, 2024; v1 submitted 26 April, 2024;
originally announced April 2024.
-
PyCPL: The ESO Common Pipeline Library in Python v1.0
Authors:
Mrunmayi S. Deshpande,
Nuria P. F. Lorente,
Anthony Horton,
Brent Miszalski,
Ralf Palsa,
Lars Lundin,
Anthony Heng,
Aidan Farrell
Abstract:
PyCPL provides full access to ESO's Common Pipeline Library ( CPL) for astronomical data reduction within a Python environment. Not only does it offer a Python interface to the robust CPL library, but it also lets users and developers fully utilise the rest of the scientific Python ecosystem. We have written a C++ layer to CPL and with pybind11 (a third-party library) created a Pythonic API to CPL…
▽ More
PyCPL provides full access to ESO's Common Pipeline Library ( CPL) for astronomical data reduction within a Python environment. Not only does it offer a Python interface to the robust CPL library, but it also lets users and developers fully utilise the rest of the scientific Python ecosystem. We have written a C++ layer to CPL and with pybind11 (a third-party library) created a Pythonic API to CPL. Since CPL has been around for so long, it has been thoroughly tested and understood. In 2003 it was developed in C due to its efficiency and speed of execution. With the community however moving away from C/C++ programming and embracing Python for data processing tasks, there is a need to provide access to the CPL utilities within a Python environment. With the latest version being released users can now install PyCPL to run existing CPL recipes (written in C) and access the results from Python. It also provides the ability to create new recipes in Python using the functionality provided by CPL.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
"With Great Power Comes Great Responsibility!": Student and Instructor Perspectives on the influence of LLMs on Undergraduate Engineering Education
Authors:
Ishika Joshi,
Ritvik Budhiraja,
Pranav Deepak Tanna,
Lovenya Jain,
Mihika Deshpande,
Arjun Srivastava,
Srinivas Rallapalli,
Harshal D Akolekar,
Jagat Sesh Challa,
Dhruv Kumar
Abstract:
The rise in popularity of Large Language Models (LLMs) has prompted discussions in academic circles, with students exploring LLM-based tools for coursework inquiries and instructors exploring them for teaching and research. Even though a lot of work is underway to create LLM-based tools tailored for students and instructors, there is a lack of comprehensive user studies that capture the perspectiv…
▽ More
The rise in popularity of Large Language Models (LLMs) has prompted discussions in academic circles, with students exploring LLM-based tools for coursework inquiries and instructors exploring them for teaching and research. Even though a lot of work is underway to create LLM-based tools tailored for students and instructors, there is a lack of comprehensive user studies that capture the perspectives of students and instructors regarding LLMs. This paper addresses this gap by conducting surveys and interviews within undergraduate engineering universities in India. Using 1306 survey responses among students, 112 student interviews, and 27 instructor interviews around the academic usage of ChatGPT (a popular LLM), this paper offers insights into the current usage patterns, perceived benefits, threats, and challenges, as well as recommendations for enhancing the adoption of LLMs among students and instructors. These insights are further utilized to discuss the practical implications of LLMs in undergraduate engineering education and beyond.
△ Less
Submitted 30 September, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Revisiting Cosmological Constraints on Supersymmetric SuperWIMPs
Authors:
Meera Deshpande,
Jan Hamann,
Dipan Sengupta,
Martin White,
Anthony G. Williams,
Yvonne Y. Y. Wong
Abstract:
SuperWIMPs are extremely weakly interacting massive particles that inherit their relic abundance from late decays of frozen-out parent particles. Within supersymmetric models, gravitinos and axinos represent two of the most well-motivated superWIMPs. In this paper we revisit constraints on these scenarios from a variety of cosmological observations that probe their production mechanisms as well as…
▽ More
SuperWIMPs are extremely weakly interacting massive particles that inherit their relic abundance from late decays of frozen-out parent particles. Within supersymmetric models, gravitinos and axinos represent two of the most well-motivated superWIMPs. In this paper we revisit constraints on these scenarios from a variety of cosmological observations that probe their production mechanisms as well as the superWIMP kinematic properties in the early Universe. We consider in particular observables of Big Bang Nucleosynthesis and the Cosmic Microwave Background (spectral distortion and anisotropies), which limit the fractional energy injection from the late decays, as well as warm and mixed dark matter constraints derived from the Lyman-$α$ forest and other small-scale structure observables. We discuss complementary constraints from collider experiments, and argue that cosmological considerations rule out a significant part of the gravitino and the axino superWIMP parameter space.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
PDBImages: A Command Line Tool for Automated Macromolecular Structure Visualization
Authors:
Adam Midlik,
Sreenath Nair,
Stephen Anyango,
Mandar Deshpande,
David Sehnal,
Mihaly Varadi,
Sameer Velankar
Abstract:
Summary: PDBImages is an innovative, open-source Node.js package that harnesses the power of the popular macromolecule structure visualization software Mol*. Designed for use by the scientific community, PDBImages provides a means to generate high-quality images for PDB and AlphaFold DB models. Its unique ability to render and save images directly to files in a browserless mode sets it apart, offe…
▽ More
Summary: PDBImages is an innovative, open-source Node.js package that harnesses the power of the popular macromolecule structure visualization software Mol*. Designed for use by the scientific community, PDBImages provides a means to generate high-quality images for PDB and AlphaFold DB models. Its unique ability to render and save images directly to files in a browserless mode sets it apart, offering users a streamlined, automated process for macromolecular structure visualization. Here, we detail the implementation of PDBImages, enumerating its diverse image types and elaborating on its user-friendly setup. This powerful tool opens a new gateway for researchers to visualize, analyse, and share their work, fostering a deeper understanding of bioinformatics. Availability and Implementation: PDBImages is available as an npm package from https://www.npmjs.com/package/pdb-images. The source code is available from https://github.com/PDBeurope/pdb-images. Contact: [email protected], [email protected]
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
Lighthouses and Global Graph Stabilization: Active SLAM for Low-compute, Narrow-FoV Robots
Authors:
Mohit Deshpande,
Richard Kim,
Dhruva Kumar,
Jong ** Park,
Jim Zamiska
Abstract:
Autonomous exploration to build a map of an unknown environment is a fundamental robotics problem. However, the quality of the map directly influences the quality of subsequent robot operation. Instability in a simultaneous localization and map** (SLAM) system can lead to poorquality maps and subsequent navigation failures during or after exploration. This becomes particularly noticeable in cons…
▽ More
Autonomous exploration to build a map of an unknown environment is a fundamental robotics problem. However, the quality of the map directly influences the quality of subsequent robot operation. Instability in a simultaneous localization and map** (SLAM) system can lead to poorquality maps and subsequent navigation failures during or after exploration. This becomes particularly noticeable in consumer robotics, where compute budget and limited field-of-view are very common. In this work, we propose (i) the concept of lighthouses: panoramic views with high visual information content that can be used to maintain the stability of the map locally in their neighborhoods and (ii) the final stabilization strategy for global pose graph stabilization. We call our novel exploration strategy SLAM-aware exploration (SAE) and evaluate its performance on real-world home environments.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
DeepCPG Policies for Robot Locomotion
Authors:
Aditya M. Deshpande,
Eric Hurd,
Ali A. Minai,
Manish Kumar
Abstract:
Central Pattern Generators (CPGs) form the neural basis of the observed rhythmic behaviors for locomotion in legged animals. The CPG dynamics organized into networks allow the emergence of complex locomotor behaviors. In this work, we take this inspiration for develo** walking behaviors in multi-legged robots. We present novel DeepCPG policies that embed CPGs as a layer in a larger neural networ…
▽ More
Central Pattern Generators (CPGs) form the neural basis of the observed rhythmic behaviors for locomotion in legged animals. The CPG dynamics organized into networks allow the emergence of complex locomotor behaviors. In this work, we take this inspiration for develo** walking behaviors in multi-legged robots. We present novel DeepCPG policies that embed CPGs as a layer in a larger neural network and facilitate end-to-end learning of locomotion behaviors in deep reinforcement learning (DRL) setup. We demonstrate the effectiveness of this approach on physics engine-based insectoid robots. We show that, compared to traditional approaches, DeepCPG policies allow sample-efficient end-to-end learning of effective locomotion strategies even in the case of high-dimensional sensor spaces (vision). We scale the DeepCPG policies using a modular robot configuration and multi-agent DRL. Our results suggest that gradual complexification with embedded priors of these policies in a modular fashion could achieve non-trivial sensor and motor integration on a robot platform. These results also indicate the efficacy of bootstrap** more complex intelligent systems from simpler ones based on biological principles. Finally, we present the experimental results for a proof-of-concept insectoid robot system for which DeepCPG learned policies initially using the simulation engine and these were afterwards transferred to real-world robots without any additional fine-tuning.
△ Less
Submitted 25 February, 2023;
originally announced February 2023.
-
Learning to View: Decision Transformers for Active Object Detection
Authors:
Wenhao Ding,
Nathalie Majcherczyk,
Mohit Deshpande,
Xuewei Qi,
Ding Zhao,
Rajasimman Madhivanan,
Arnie Sen
Abstract:
Active perception describes a broad class of techniques that couple planning and perception systems to move the robot in a way to give the robot more information about the environment. In most robotic systems, perception is typically independent of motion planning. For example, traditional object detection is passive: it operates only on the images it receives. However, we have a chance to improve…
▽ More
Active perception describes a broad class of techniques that couple planning and perception systems to move the robot in a way to give the robot more information about the environment. In most robotic systems, perception is typically independent of motion planning. For example, traditional object detection is passive: it operates only on the images it receives. However, we have a chance to improve the results if we allow planning to consume detection signals and move the robot to collect views that maximize the quality of the results. In this paper, we use reinforcement learning (RL) methods to control the robot in order to obtain images that maximize the detection quality. Specifically, we propose using a Decision Transformer with online fine-tuning, which first optimizes the policy with a pre-collected expert dataset and then improves the learned policy by exploring better solutions in the environment. We evaluate the performance of proposed method on an interactive dataset collected from an indoor scenario simulator. Experimental results demonstrate that our method outperforms all baselines, including expert policy and pure offline RL methods. We also provide exhaustive analyses of the reward distribution and observation space.
△ Less
Submitted 23 January, 2023;
originally announced January 2023.
-
Investigations on convergence behaviour of Physics Informed Neural Networks across spectral ranges and derivative orders
Authors:
Mayank Deshpande,
Siddharth Agarwal,
Vukka Snigdha,
Arya Kumar Bhattacharya
Abstract:
An important inference from Neural Tangent Kernel (NTK) theory is the existence of spectral bias (SB), that is, low frequency components of the target function of a fully connected Artificial Neural Network (ANN) being learnt significantly faster than the higher frequencies during training. This is established for Mean Square Error (MSE) loss functions with very low learning rate parameters. Physi…
▽ More
An important inference from Neural Tangent Kernel (NTK) theory is the existence of spectral bias (SB), that is, low frequency components of the target function of a fully connected Artificial Neural Network (ANN) being learnt significantly faster than the higher frequencies during training. This is established for Mean Square Error (MSE) loss functions with very low learning rate parameters. Physics Informed Neural Networks (PINNs) are designed to learn the solutions of differential equations (DE) of arbitrary orders; in PINNs the loss functions are obtained as the residues of the conservative form of the DEs and represent the degree of dissatisfaction of the equations. So there has been an open question whether (a) PINNs also exhibit SB and (b) if so, how does this bias vary across the orders of the DEs. In this work, a series of numerical experiments are conducted on simple sinusoidal functions of varying frequencies, compositions and equation orders to investigate these issues. It is firmly established that under normalized conditions, PINNs do exhibit strong spectral bias, and this increases with the order of the differential equation.
△ Less
Submitted 7 January, 2023;
originally announced January 2023.
-
Hollow Rectangular Waveguide-fed Holographic Beamforming Antenna Additively Manufactured (3D Printed) with Conductive Polymer
Authors:
Insang Yoo,
Jonah Gollub,
Shengrong Ye,
Allen Gray,
Okan Yurduseven,
Manohar D. Deshpande,
David R. Smith
Abstract:
We present the design and fabrication of 3D printed holographic beamforming antennas. The antennas utilize additively manufactured hollow rectangular waveguides that feed radiating rectilinear slots inserted into the upper conducting wall. The lengths of the individual slots are altered to implement a holographic beamforming solution designed using a coupled dipole formalism. For rapid verificatio…
▽ More
We present the design and fabrication of 3D printed holographic beamforming antennas. The antennas utilize additively manufactured hollow rectangular waveguides that feed radiating rectilinear slots inserted into the upper conducting wall. The lengths of the individual slots are altered to implement a holographic beamforming solution designed using a coupled dipole formalism. For rapid verification, the designed antennas are fabricated using a desktop dual-extrusion fused filament 3D printer. The body of each antenna and its inner conducting surface are respectively printed using polylactic acid and biodegradable conductive polyester composite material (i.e., Electrifi), which is later deposited with a layer of copper on its surface to improve surface conductivity and reduce surface roughness. The beamforming performance of the fabricated antennas is confirmed via experiments. The 3D printed metasurface antennas using the proposed fabrication technique illustrate emerging capabilities in the rapid prototy** of complex electromagnetic structures.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Transformation of Node to Knowledge Graph Embeddings for Faster Link Prediction in Social Networks
Authors:
Archit Parnami,
Mayuri Deshpande,
Anant Kumar Mishra,
Minwoo Lee
Abstract:
Recent advances in neural networks have solved common graph problems such as link prediction, node classification, node clustering, node recommendation by develo** embeddings of entities and relations into vector spaces. Graph embeddings encode the structural information present in a graph. The encoded embeddings then can be used to predict the missing links in a graph. However, obtaining the op…
▽ More
Recent advances in neural networks have solved common graph problems such as link prediction, node classification, node clustering, node recommendation by develo** embeddings of entities and relations into vector spaces. Graph embeddings encode the structural information present in a graph. The encoded embeddings then can be used to predict the missing links in a graph. However, obtaining the optimal embeddings for a graph can be a computationally challenging task specially in an embedded system. Two techniques which we focus on in this work are 1) node embeddings from random walk based methods and 2) knowledge graph embeddings. Random walk based embeddings are computationally inexpensive to obtain but are sub-optimal whereas knowledge graph embeddings perform better but are computationally expensive. In this work, we investigate a transformation model which converts node embeddings obtained from random walk based methods to embeddings obtained from knowledge graph methods directly without an increase in the computational cost. Extensive experimentation shows that the proposed transformation model can be used for solving link prediction in real-time.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
Robust Deep Reinforcement Learning for Quadcopter Control
Authors:
Aditya M. Deshpande,
Ali A. Minai,
Manish Kumar
Abstract:
Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from…
▽ More
Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from Robust Control and RL. It opts for pessimistic optimization to handle potential gaps between policy transfer from one environment to another. The trained control policy is tested on the task of quadcopter positional control. RL agents were trained in a MuJoCo simulator. During testing, different environment parameters (unseen during the training) were used to validate the robustness of the trained policy for transfer from one environment to another. The robust policy outperformed the standard agents in these environments, suggesting that the added robustness increases generality and can adapt to non-stationary environments.
Codes: https://github.com/adipandas/gym_multirotor
△ Less
Submitted 6 November, 2021;
originally announced November 2021.
-
On the performance of GPU accelerated q-LSKUM based meshfree solvers in Fortran, C++, Python, and Julia
Authors:
Nischay Ram Mamidi,
Kumar Prasun,
Dhruv Saxena,
Anil Nemili,
Bharatkumar Sharma,
S. M. Deshpande
Abstract:
This report presents a comprehensive analysis of the performance of GPU accelerated meshfree CFD solvers for two-dimensional compressible flows in Fortran, C++, Python, and Julia. The programming model CUDA is used to develop the GPU codes. The meshfree solver is based on the least squares kinetic upwind method with entropy variables (q-LSKUM). To assess the computational efficiency of the GPU sol…
▽ More
This report presents a comprehensive analysis of the performance of GPU accelerated meshfree CFD solvers for two-dimensional compressible flows in Fortran, C++, Python, and Julia. The programming model CUDA is used to develop the GPU codes. The meshfree solver is based on the least squares kinetic upwind method with entropy variables (q-LSKUM). To assess the computational efficiency of the GPU solvers and to compare their relative performance, benchmark calculations are performed on seven levels of point distribution. To analyse the difference in their run-times, the computationally intensive kernel is profiled. Various performance metrics are investigated from the profiled data to determine the cause of observed variation in run-times. To address some of the performance related issues, various optimisation strategies are employed. The optimised GPU codes are compared with the naive codes, and conclusions are drawn from their performance.
△ Less
Submitted 16 August, 2021;
originally announced August 2021.
-
Sensor Placement with Optimal Precision for Temperature Estimation of Battery Systems
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya,
Kamesh Subbarao
Abstract:
The temperature distribution in the battery significantly impacts the short-term and long-term performance of battery systems. Therefore, efficient, safe, and reliable battery system operation requires an accurate estimation of the temperature field. The current industry standard for sensors to battery cell ratio is quite frugal. Thus, the problem of sensor placement for accurate temperature estim…
▽ More
The temperature distribution in the battery significantly impacts the short-term and long-term performance of battery systems. Therefore, efficient, safe, and reliable battery system operation requires an accurate estimation of the temperature field. The current industry standard for sensors to battery cell ratio is quite frugal. Thus, the problem of sensor placement for accurate temperature estimation becomes non-trivial, especially for large-scale systems. In this paper, we explore a greedy approach for sensor placement suitable for large-scale battery systems. An observer to estimate the thermal field is designed in an $\mathcal{H}_{\infty}$ framework while simultaneously minimizing the sensor precisions, thus lowering the overall thermal management system's economic cost.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Sensor Selection and Optimal Precision in $\mathcal{H}_2/\mathcal{H}_{\infty}$ Estimation Framework: Theory and Algorithms
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
We consider the problem of sensor selection for designing observer and filter for continuous linear time invariant systems such that the sensor precisions are minimized, and the estimation errors are bounded by the prescribed $\mathcal{H}_2/\mathcal{H}_{\infty}$ performance criteria. The proposed integrated framework formulates the precision minimization as a convex optimization problem subject to…
▽ More
We consider the problem of sensor selection for designing observer and filter for continuous linear time invariant systems such that the sensor precisions are minimized, and the estimation errors are bounded by the prescribed $\mathcal{H}_2/\mathcal{H}_{\infty}$ performance criteria. The proposed integrated framework formulates the precision minimization as a convex optimization problem subject to linear matrix inequalities, and it is solved using an algorithm based on the alternating direction method of multipliers (ADMM). We also present a greedy approach for sensor selection and demonstrate the performance of the proposed algorithms using numerical simulations.
△ Less
Submitted 6 April, 2021; v1 submitted 28 February, 2021;
originally announced March 2021.
-
Sparse Sensing Architectures with Optimal Precision for Tracking Multi-agent Systems in Sensing-denied Environments
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
In this paper the tracking problem of multi-agent systems, in a particular scenario where a segment of agents entering a sensing-denied environment or behaving as non-cooperative targets, is considered. The focus is on determining the optimal sensor precisions while simultaneously promoting sparseness in the sensor measurements to guarantee a specified estimation performance. The problem is formul…
▽ More
In this paper the tracking problem of multi-agent systems, in a particular scenario where a segment of agents entering a sensing-denied environment or behaving as non-cooperative targets, is considered. The focus is on determining the optimal sensor precisions while simultaneously promoting sparseness in the sensor measurements to guarantee a specified estimation performance. The problem is formulated in the discrete-time centralized Kalman filtering framework. A semi-definite program subject to linear matrix inequalities is solved to minimize the trace of precision matrix which is defined to be the inverse of sensor noise covariance matrix. Simulation results expose a trade-off between sensor precisions and sensing frequency.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Sparse Sensing and Optimal Precision: Robust $\mathcal{H}_{\infty}$ Optimal Observer Design with Model Uncertainty
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
We present a framework which incorporates three aspects of the estimation problem, namely, sparse sensor configuration, optimal precision, and robustness in the presence of model uncertainty. The problem is formulated in the $\mathcal{H}_{\infty}$ optimal observer design framework. We consider two types of uncertainties in the system, i.e. structured affine and unstructured uncertainties. The obje…
▽ More
We present a framework which incorporates three aspects of the estimation problem, namely, sparse sensor configuration, optimal precision, and robustness in the presence of model uncertainty. The problem is formulated in the $\mathcal{H}_{\infty}$ optimal observer design framework. We consider two types of uncertainties in the system, i.e. structured affine and unstructured uncertainties. The objective is to design an observer with a given $\mathcal{H}_{\infty}$ performance index with minimal number of sensors and minimal precision values, while guaranteeing the performance for all admissible uncertainties. The problem is posed as a convex optimization problem subject to linear matrix inequalities. Numerical simulations demonstrate the application of the theoretical results presented in this work.
△ Less
Submitted 3 September, 2020;
originally announced September 2020.
-
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors
Authors:
Aditya M. Deshpande,
Rumit Kumar,
Ali A. Minai,
Manish Kumar
Abstract:
In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadco…
▽ More
In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadcopter (comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation. The flight simulations utilize a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to learning the control policy from scratch for this new UAV design created by modifications in a conventional quadcopter, i.e., the addition of more degrees of freedom (4-actuators in conventional quadcopter to 8-actuators in tilt-rotor quadcopter). We demonstrate the robustness of our learned policy by showing the recovery of the tilt-rotor platform in the simulation from various non-static initial conditions in order to reach a desired state. The developmental policy for the tilt-rotor UAV also showed superior fault tolerance when compared with the policy learned from the scratch. The results show the ability of the presented approach to bootstrap the learned behavior from a simpler system (lower-dimensional action-space) to a more complex robot (comparatively higher-dimensional action-space) and reach better performance faster.
△ Less
Submitted 15 July, 2020;
originally announced July 2020.
-
Quaternion Feedback Based Autonomous Control of a Quadcopter UAV with Thrust Vectoring Rotors
Authors:
Rumit Kumar,
Mahathi Bhargavapuri,
Aditya M. Deshpande,
Siddharth Sridhar,
Kelly Cohen,
Manish Kumar
Abstract:
In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicl…
▽ More
In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicle requires singularity-free attitude control. Hence, quaternion state feedback is utilized to compute the control commands for the UAV motors while avoiding the gimbal lock condition experienced by Euler angle based controllers. The quaternion implementation also reduces the overall complexity of state estimation due to absence of trigonometric parameters. The quadcopter dynamic model and state space is utilized to design the attitude controller and control allocation for the UAV. The control allocation, in particular, is derived by linearizing the system about hover condition. This mathematical method renders the control allocation more accurate than existing approaches. Lyapunov stability analysis of the attitude controller is shown to prove global stability. The quaternion feedback attitude controller is commanded by an outer position controller loop which generates rotor-tilt and desired quaternions commands for the system. The performance of the UAV is evaluated by numerical simulations for tracking attitude step commands and for following a way-point navigation mission.
△ Less
Submitted 28 June, 2020;
originally announced June 2020.
-
Computer Vision Toolkit for Non-invasive Monitoring of Factory Floor Artifacts
Authors:
Aditya M. Deshpande,
Anil Kumar Telikicherla,
Vinay Jakkali,
David A. Wickelhaus,
Manish Kumar,
Sam Anand
Abstract:
Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line…
▽ More
Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line with Industry 4.0 requirements for factory data collection. Currently, technical challenges persist towards digitization of legacy systems due to the limitation for changes in their design and sensors. This novel toolkit is developed to facilitate easy integration of legacy production machinery and factory floor artifacts with the digital and smart manufacturing environment with no requirement of any physical changes in the machines. The system developed is modular, and allows real-time monitoring of production machinery. Modularity aspect allows the incorporation of new software applications in the current framework of CV Toolkit. To allow connectivity of this toolkit with manufacturing floors in a simple, deployable and cost-effective manner, the toolkit is integrated with a known manufacturing data standard, MTConnect, to "translate" the digital inputs into data streams that can be read by commercial status tracking and reporting software solutions. The proposed toolkit is demonstrated using a mock-panel environment developed in house at the University of Cincinnati to highlight its usability.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
One-Shot Recognition of Manufacturing Defects in Steel Surfaces
Authors:
Aditya M. Deshpande,
Ali A. Minai,
Manish Kumar
Abstract:
Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human acc…
▽ More
Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human accuracy. However, many of these approaches are data intensive. Training and deployment of such a system on manufacturing floors may become expensive and time-consuming. The need for large amounts of training data is one of the limitations of the applicability of these approaches in real-world manufacturing systems. In this work, we propose the application of a Siamese convolutional neural network to do one-shot recognition for such a task. Our results demonstrate how one-shot learning can be used in quality control of steel by identification of defects on the steel surface. This method can significantly reduce the requirements of training data and can also be run in real-time.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
Flight Control of Sliding Arm Quadcopter with Dynamic Structural Parameters
Authors:
Rumit Kumar,
Aditya M. Deshpande,
James Z. Wells,
Manish Kumar
Abstract:
The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital rol…
▽ More
The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital role in the stability and control of the system. The length of quadcopter arms is a variable parameter, and it is actuated using attitude feedback-based control law. The MoI parameters are computed in real-time and incorporated in the equations of motion of the system. The UAV utilizes the angular motion of propellers and variable quadcopter arm lengths for position and navigation control. The movement space of the CoG is a design parameter and it is bounded by actuator limitations and stability requirements of the system. A detailed information on equations of motion, flight controller design and possible applications of this system are provided. Further, the proposed shape-changing UAV system is evaluated by comparative numerical simulations for way point navigation mission and complex trajectory tracking.
△ Less
Submitted 27 April, 2020;
originally announced April 2020.
-
Prediction of separation and transition on a low-pressure turbine blade using a RANS grid
Authors:
Rajesh Ranjan,
S. M. Deshpande,
Roddam Narasimha
Abstract:
Flow past a high-lift low-pressure turbine (LPT) blade in a cascade could be quite complex as phenomena like separation and transition are often involved. For a highly loadedT106A blade at a high incidence and relatively low Reynolds number(25, 000 < Re < 1, 00, 000), separation-induced transition is observed on the suction side of the blade, making it a challenging problem for model-based simulat…
▽ More
Flow past a high-lift low-pressure turbine (LPT) blade in a cascade could be quite complex as phenomena like separation and transition are often involved. For a highly loadedT106A blade at a high incidence and relatively low Reynolds number(25, 000 < Re < 1, 00, 000), separation-induced transition is observed on the suction side of the blade, making it a challenging problem for model-based simulations. In this work, computations for this flow are carried out using RANS and hybrid LES/RANS approaches. The RANS simulations are performed with six popular low- Re turbulence models. While turbulence models by themselves fail to predict any separation on the T106A blade, the four-equation Langtry-Menter transition model predicts a short separation bubble. The characteristic of this bubble, however, is very different from what is observed in experiments and DNS, and therefore transition is not accurately predicted. An embedded hybrid LES/RANS approach, Limited numerical scales(LNS), with an automatic switch to LES in sufficiently resolved grids, is then used for predictions on the sameRANS grid. With the statistical turbulence on fine grids, LES-like behavior of LNS results in an unphysical drop in Reynolds stresses as the turbulent fluctuations are not appropriately represented on the resolved scale. Therefore, the LNS results are very similar to those obtained with turbulence models. However, when synthetic turbulence with correct statistical characteristics is used to stimulate the large eddies in the embedded LES zone, LNS is able to predict separation and recovers a solution very close to DNS and experimental results.
△ Less
Submitted 23 April, 2020;
originally announced April 2020.
-
Data-driven Solution of Stochastic Differential Equations Using Maximum Entropy Basis Functions
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
In this paper we present a data-driven approach for uncertainty propagation. In particular, we consider stochastic differential equations with parametric uncertainty. Solution of the differential equation is approximated using maximum entropy (maxent) basis functions similar to polynomial chaos expansions. Maxent basis functions are derived from available data by maximization of information-theore…
▽ More
In this paper we present a data-driven approach for uncertainty propagation. In particular, we consider stochastic differential equations with parametric uncertainty. Solution of the differential equation is approximated using maximum entropy (maxent) basis functions similar to polynomial chaos expansions. Maxent basis functions are derived from available data by maximization of information-theoretic entropy, therefore, there is no need to specify basis functions beforehand. We compare the proposed maxent based approach with existing methods.
△ Less
Submitted 3 April, 2020;
originally announced April 2020.
-
Kalman Filtering with Probabilistic Uncertainty in System Parameters
Authors:
Sunsoo Kim,
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
In this paper, we propose a robust Kalman filtering framework for systems with probabilistic uncertainty in system parameters. We consider two cases, namely discrete time systems, and continuous time systems with discrete measurements. The uncertainty, characterized by mean and variance of the states, is propagated using conditional expectations and polynomial chaos expansion framework. The result…
▽ More
In this paper, we propose a robust Kalman filtering framework for systems with probabilistic uncertainty in system parameters. We consider two cases, namely discrete time systems, and continuous time systems with discrete measurements. The uncertainty, characterized by mean and variance of the states, is propagated using conditional expectations and polynomial chaos expansion framework. The results obtained using the proposed filter are compared with existing robust filters in the literature. The proposed filter demonstrates better performance in terms of root mean squared error and rate of convergence.
△ Less
Submitted 8 July, 2020; v1 submitted 24 March, 2020;
originally announced March 2020.
-
Sparse Sensing and Optimal Precision: An Integrated Framework for $\mathcal{H}_2/\mathcal{H}_{\infty}$ Optimal Observer Design
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
In this paper, we simultaneously determine the optimal sensor precision and the observer gain, which achieves the specified accuracy in the state estimates. Along with the unknown observer gain, the formulation parameterizes the scaling of the exogenous inputs that correspond to the sensor noise. Reciprocal of this scaling is defined as the sensor precision, and sparseness is achieved by minimizin…
▽ More
In this paper, we simultaneously determine the optimal sensor precision and the observer gain, which achieves the specified accuracy in the state estimates. Along with the unknown observer gain, the formulation parameterizes the scaling of the exogenous inputs that correspond to the sensor noise. Reciprocal of this scaling is defined as the sensor precision, and sparseness is achieved by minimizing the $l_1$ norm of the precision vector. The optimization is performed with constraints guaranteeing specified accuracy in state estimates, which are defined in terms of $\mathcal{H}_2$ or $\mathcal{H}_{\infty}$ norms of the error dynamics. The results presented in this paper are applied to the linearized longitudinal model of an F-16 aircraft.
△ Less
Submitted 20 June, 2020; v1 submitted 24 March, 2020;
originally announced March 2020.
-
A unified framework to generate optimized compact finite difference schemes
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya,
Diego A. Donzis
Abstract:
A unified framework to derive optimized compact schemes for a uniform grid is presented. The optimal scheme coefficients are determined analytically by solving an optimization problem to minimize the spectral error subject to equality constraints that ensure specified order of accuracy. A rigorous stability analysis for the optimized schemes is also presented. We analytically prove the relation be…
▽ More
A unified framework to derive optimized compact schemes for a uniform grid is presented. The optimal scheme coefficients are determined analytically by solving an optimization problem to minimize the spectral error subject to equality constraints that ensure specified order of accuracy. A rigorous stability analysis for the optimized schemes is also presented. We analytically prove the relation between order of a derivative and symmetry or skew-symmetry of the optimal coefficients approximating it. We also show that other types of schemes e.g., spatially explicit, and biased finite differences, can be generated as special cases of the framework.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
Surrogate Modeling of Dynamics From Sparse Data Using Maximum Entropy Basis Functions
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
In this paper we present a data driven approach for approximating dynamical systems. A dynamics is approximated using basis functions, which are derived from maximization of the information-theoretic entropy, and can be generated directly from the data provided. This approach has advantages over other methods, where a dictionary of basis functions have to be provided by the user, which is non triv…
▽ More
In this paper we present a data driven approach for approximating dynamical systems. A dynamics is approximated using basis functions, which are derived from maximization of the information-theoretic entropy, and can be generated directly from the data provided. This approach has advantages over other methods, where a dictionary of basis functions have to be provided by the user, which is non trivial in some applications. We compare the accuracy of the proposed data-driven modeling approach to existing methods in the literature, and demonstrate that for some applications the maximum entropy basis functions provide significantly more accurate models.
△ Less
Submitted 7 November, 2019;
originally announced November 2019.
-
On Improved Statistical Accuracy of Low-Order Polynomial Chaos Approximations
Authors:
Vedang M. Deshpande,
Raktim Bhattacharya
Abstract:
Polynomial chaos expansion is a popular way to develop surrogate models for stochastic systems with arbitrary random variables. Standard techniques such as Galerkin projection, stochastic collocation, and least squares approximation, are applied to determine polynomial chaos coefficients, which define the surrogate model. Since the surrogate models are developed from a function approximation persp…
▽ More
Polynomial chaos expansion is a popular way to develop surrogate models for stochastic systems with arbitrary random variables. Standard techniques such as Galerkin projection, stochastic collocation, and least squares approximation, are applied to determine polynomial chaos coefficients, which define the surrogate model. Since the surrogate models are developed from a function approximation perspective, there is no reason to expect accuracy of statistics from these models. The statistical moments estimated from the surrogate model may significantly differ from the true moments, especially for lower order approximations. Often arbitrary high orders are required to recover, for example, the second moment. In this paper, we present modifications of standard techniques and determine polynomial chaos coefficients by solving a constrained optimization problem. We present this new approach for algebraic functions and differential equations with random parameters, and demonstrate that the surrogate models from the new approach are able to recover the first two moments exactly.
△ Less
Submitted 8 September, 2019;
originally announced September 2019.
-
A high-resolution DNS study of compressible flow past an LPT blade in a cascade
Authors:
Rajesh Ranjan,
S M Deshpande,
Roddam Narasimha
Abstract:
Flow past a low pressure turbine blade in a cascade at $Re \approx 52000$ and angle of incidence $α= 45.5^{0}$ is solved using a code developed in-house for solving 3D compressible Navier-Stokes equations. This code, named ANUROOP, has been developed in the finite volume framework using kinetic energy preserving second order central differencing scheme for calculating fluxes, and is compatible wit…
▽ More
Flow past a low pressure turbine blade in a cascade at $Re \approx 52000$ and angle of incidence $α= 45.5^{0}$ is solved using a code developed in-house for solving 3D compressible Navier-Stokes equations. This code, named ANUROOP, has been developed in the finite volume framework using kinetic energy preserving second order central differencing scheme for calculating fluxes, and is compatible with hybrid grids. ANUROOP was verified and validated against several test cases with Mach numbers ranging from 0.1 (Taylor-Green vortex) to 1.5 (compressible turbulent channel flow). The code was found to be robust and stable, and the kinetic energy decay obeys the compressible Navier-Stokes equations.
A hybrid grid, with a high resolution hexahedral orthogonal mesh in the boundary layer and unstructured (also hexahedral) elements in the rest of the domain, is used for the turbine blade simulation. Total grid size (160 million) is approximately an order of magnitude higher than in previous simulations for the same flow conditions and using similar numerical methods. The discrepancy in the pressure distribution in earlier studies compared to experimental data has been removed in this simulation. The trailing edge separation bubble has been characterized and a detailed discussion on the effect of surface curvature is presented.
△ Less
Submitted 29 November, 2016;
originally announced November 2016.
-
The Fe-Line Feature In The X-Ray Spectrum of Solar Flares: First Results From The SOXS Mission
Authors:
Rajmal Jain,
Anil K. Pradhan,
Vishal Joshi,
K. J. Shah,
Jayshree J. Trivedi,
S. L. Kayasth,
Vishal M. Shah,
M. R. Deshpande
Abstract:
We present the first results from the "Low Energy Detector" payload of the "Solar X-ray Spectrometer (SOXS)" mission, which was launched onboard the GSAT-2 Indian spacecraft on 08 May 2003 by the GSLV-D2 rocket to study solar flares. The SOXS Low Energy Detector (SLD) payload was designed, developed, and fabricated by the Physical Research Laboratory (PRL) in collaboration with the Space Applica…
▽ More
We present the first results from the "Low Energy Detector" payload of the "Solar X-ray Spectrometer (SOXS)" mission, which was launched onboard the GSAT-2 Indian spacecraft on 08 May 2003 by the GSLV-D2 rocket to study solar flares. The SOXS Low Energy Detector (SLD) payload was designed, developed, and fabricated by the Physical Research Laboratory (PRL) in collaboration with the Space Application Centre (SAC), Ahmedabad and the ISRO Satellite Centre (ISAC), Bangalore of Indian Space Research Organization (ISRO). The energy ranges of the Si PIN and CZT detectors are 4 - 25 keV and 4 - 56 keV respectively. The Si PIN provides sub-keV energy resolution while the CZT reveals ~1.7 keV energy resolution throughout the energy range. The high sensitivity and sub-keV energy resolution of the Si PIN detector allows measuring the intensity, peak energy, and the equivalent width of the Fe-line complex at approximately 6.7 keV as a function of time in all ten M-class flares studied in this investigation. The peak energy (Ep) of the Fe-line feature varies between 6.4 and 6.7 keV with increasing in temperature from 9 to 58 MK. We found that the equivalent width (w) of the Fe-line feature increases exponentially with temperature up to 30 MK and then increases very slowly up to 40 Mk. It remains between 3.5 and 4 keV in the temperature range of 30 - 45 MK. We compare our measurements of w with calculations made earlier by various investigators and propose that these measurements may improve theoretical models. We interpret the variation of both Ep and w with temperature as due to the changes in the ionization and recombination conditions in the plasma during the flare interval and as a consequence the contribution from different ionic emission lines also varies.
△ Less
Submitted 13 December, 2006;
originally announced December 2006.
-
p-modes in and away from a sunspot
Authors:
Brajesh Kumar,
R. Jain,
S. C. Tripathy,
Hari Om Vats,
M. R. Deshpande
Abstract:
A time series of GONG Dopplergrams for the period 10-14 May 1997 from Udaipur and Big Bear sites has been used to measure the velocity fluctuations in the sunspot (NOAA active region 8038) and quiet photosphere simultaneously. We observe that the power of pre-dominant p-mode is reduced in the sunspot as compared to quiet photosphere by 39-52% depending on the location of the sunspot region on th…
▽ More
A time series of GONG Dopplergrams for the period 10-14 May 1997 from Udaipur and Big Bear sites has been used to measure the velocity fluctuations in the sunspot (NOAA active region 8038) and quiet photosphere simultaneously. We observe that the power of pre-dominant p-mode is reduced in the sunspot as compared to quiet photosphere by 39-52% depending on the location of the sunspot region on the solar disk. We also observe a relative peak frequency deviation of p-modes in the sunspot, of the order of 80-310 $μ$Hz, which shows a linear dependence on the magnetic field gradient in the active region. The maximum frequency deviation of 310 $μ$Hz on 12 May appears to be an influence of a long duration solar flare that occurred in this active region. We interpret this relative peak frequency deviation as either due to power re-distribution of p-modes in the sunspot or a consequence of frequency modulation of these modes along the magnetic flux tubes due to rapidly varying magnetic field structure.
△ Less
Submitted 30 December, 1999;
originally announced December 1999.
-
Midinfrared Conductivity in Orientationally Disordered Doped Fullerides
Authors:
M. S. Deshpande,
E. J. Mele,
M. J. Rice,
H-Y Choi
Abstract:
The coupling between the intramolecular vibrational modes and the doped conduction electrons in $M_3C_{60}$ is studied by a calculation of the electronic contributions to the phonon self energies. The calculations are carried out for an orientationally ordered reference solid with symmetry $Fm \bar{3} m$ and for a model with quenched orientational disorder on the fullerene sites. In both cases,…
▽ More
The coupling between the intramolecular vibrational modes and the doped conduction electrons in $M_3C_{60}$ is studied by a calculation of the electronic contributions to the phonon self energies. The calculations are carried out for an orientationally ordered reference solid with symmetry $Fm \bar{3} m$ and for a model with quenched orientational disorder on the fullerene sites. In both cases, the dispersion and symmetry of the renormalized modes is governed by the electronic contributions. The current current correlation functions and frequency dependent conductivity through the midinfrared are calculated for both models. In the disordered structures, the renormalized modes derived from even parity intramolecular phonons are resonant with the dipole excited single particle spectrum, and modulate the predicted midinfrared conductivity. The spectra for this coupled system are calculated for several recently proposed microscopic models for the electron phonon coupling, and a comparison is made with recent experimental data which demonstrate this effect.
△ Less
Submitted 3 February, 1994;
originally announced February 1994.
-
Effective-Medium Theory for the Normal State in Orientationally Disordered Fullerides
Authors:
M. S. Deshpande,
S. C. Erwin,
S. Hong,
E. J. Mele
Abstract:
An effective-medium theory for studying the electronic structure of the orientationally disordered A3C60 fullerides is developed and applied to study various normal-state properties. The theory is based on a cluster-Bethe-lattice method in which the disordered medium is modelled by a three-band Bethe lattice, into which we embed a molecular cluster whose scattering properties are treated exactly…
▽ More
An effective-medium theory for studying the electronic structure of the orientationally disordered A3C60 fullerides is developed and applied to study various normal-state properties. The theory is based on a cluster-Bethe-lattice method in which the disordered medium is modelled by a three-band Bethe lattice, into which we embed a molecular cluster whose scattering properties are treated exactly. Various single-particle properties and the frequency-dependent conductivity are calculated in this model, and comparison is made with numerical calculations for disordered lattices, and with experiment.
△ Less
Submitted 20 September, 1993;
originally announced September 1993.