Search | arXiv e-print repository

arXiv:2406.01968

Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space Alignment

Authors: Tianyu Wang, Dwait Bhatt, Xiaolong Wang, Nikolay Atanasov

Abstract: This paper focuses on transferring control policies between robot manipulators with different morphology. While reinforcement learning (RL) methods have shown successful results in robot manipulation tasks, transferring a trained policy from simulation to a real robot or deploying it on a robot with different states, actions, or kinematics is challenging. To achieve cross-embodiment policy transfer, our key insight is to project the state and action spaces of the source and target robots to a common latent space representation. We first introduce encoders and decoders to associate the states and actions of the source robot with a latent space. The encoders, decoders, and a latent space control policy are trained simultaneously using loss functions measuring task performance, latent dynamics consistency, and encoder-decoder ability to reconstruct the original states and actions. To transfer the learned control policy, we only need to train target encoders and decoders that align a new target domain to the latent space. We use generative adversarial training with cycle consistency and latent dynamics losses without access to the task reward or reward tuning in the target domain. We demonstrate sim-to-sim and sim-to-real manipulation policy transfer with source and target robots of different states, actions, and embodiments. The source code is available at https://github.com/ExistentialRobotics/cross_embodiment_transfer.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 8 pages, 9 figures

arXiv:2312.15994

Practical Bias Mitigation through Proxy Sensitive Attribute Label Generation

Authors: Bhushan Chaudhary, Anubha Pandey, Deepak Bhatt, Darshika Tiwari

Abstract: Addressing bias in the trained machine learning system often requires access to sensitive attributes. In practice, these attributes are not available either due to legal and policy regulations or data unavailability for a given demographic. Existing bias mitigation algorithms are limited in their applicability to real-world scenarios as they require access to sensitive attributes to achieve fairness. In this research work, we aim to address this bottleneck through our proposed unsupervised proxy-sensitive attribute label generation technique. Towards this end, we propose a two-stage approach of unsupervised embedding generation followed by clustering to obtain proxy-sensitive labels. The efficacy of our work relies on the assumption that bias propagates through non-sensitive attributes that are correlated to the sensitive attributes and, when mapped to the high dimensional latent space, produces clusters of different demographic groups that exist in the data. Experimental results demonstrate that bias mitigation using existing algorithms such as Fair Mixup and Adversarial Debiasing yields comparable results on derived proxy labels when compared against using true sensitive attributes.

Submitted 26 December, 2023; originally announced December 2023.

Comments: Modelling Uncertainty in the Financial World (MUFin) Workshop in AAAI2023

arXiv:2312.11969

doi 10.1007/978-3-031-33374-3_41

GroupMixNorm Layer for Learning Fair Models

Authors: Anubha Pandey, Aditi Rai, Maneet Singh, Deepak Bhatt, Tanmoy Bhowmik

Abstract: Recent research has identified discriminatory behavior of automated prediction algorithms towards groups identified on specific protected attributes (e.g., gender, ethnicity, age group, etc.). When deployed in real-world scenarios, such techniques may demonstrate biased predictions resulting in unfair outcomes. Recent literature has witnessed algorithms for mitigating such biased behavior mostly by adding convex surrogates of fairness metrics such as demographic parity or equalized odds in the loss function, which are often not easy to estimate. This research proposes a novel in-processing based GroupMixNorm layer for mitigating bias from deep learning models. The GroupMixNorm layer probabilistically mixes group-level feature statistics of samples across different groups based on the protected attribute. The proposed method improves upon several fairness metrics with minimal impact on overall accuracy. Analysis on benchmark tabular and image datasets demonstrates the efficacy of the proposed method in achieving state-of-the-art performance. Further, the experimental analysis also suggests the robustness of the GroupMixNorm layer against new protected attributes during inference and its utility in eliminating bias from a pre-trained network.

Submitted 19 December, 2023; originally announced December 2023.

Comments: 12 pages, 6 figures, Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2023

arXiv:2210.14862

Visual Semantic Parsing: From Images to Abstract Meaning Representation

Authors: Mohamed Ashraf Abdelsalam, Zhan Shi, Federico Fancellu, Kalliopi Basioti, Dhaivat J. Bhatt, Vladimir Pavlovic, Afsaneh Fazly

Abstract: The success of scene graphs for visual scene understanding has brought attention to the benefits of abstracting a visual input (e.g., image) into a structured representation, where entities (people and objects) are nodes connected by edges specifying their relations. Building these representations, however, requires expensive manual annotation in the form of images paired with their scene graphs or frames. These formalisms remain limited in the nature of entities and relations they can capture. In this paper, we propose to leverage a widely-used meaning representation in the field of natural language processing, the Abstract Meaning Representation (AMR), to address these shortcomings. Compared to scene graphs, which largely emphasize spatial relationships, our visual AMR graphs are more linguistically informed, with a focus on higher-level semantic concepts extrapolated from visual input. Moreover, they allow us to generate meta-AMR graphs to unify information contained in multiple image descriptions under one representation. Through extensive experimentation and analysis, we demonstrate that we can re-purpose an existing text-to-AMR parser to parse images into AMRs. Our findings point to important future research directions for improved scene understanding.

Submitted 27 October, 2022; v1 submitted 26 October, 2022; originally announced October 2022.

Comments: published in CoNLL 2022

arXiv:2210.04996

Graph2Vid: Flow graph to Video Grounding for Weakly-supervised Multi-Step Localization

Authors: Nikita Dvornik, Isma Hadji, Hai Pham, Dhaivat Bhatt, Brais Martinez, Afsaneh Fazly, Allan D. Jepson

Abstract: In this work, we consider the problem of weakly-supervised multi-step localization in instructional videos. An established approach to this problem is to rely on a given list of steps. However, in reality, there is often more than one way to execute a procedure successfully, by following the set of steps in slightly varying orders. Thus, for successful localization in a given video, recent works require the actual order of procedure steps in the video, to be provided by human annotators at both training and test times. Instead, here, we only rely on generic procedural text that is not tied to a specific video. We represent the various ways to complete the procedure by transforming the list of instructions into a procedure flow graph which captures the partial order of steps. Using the flow graphs reduces both training and test time annotation requirements. To this end, we introduce the new problem of flow graph to video grounding. In this setup, we seek the optimal step ordering consistent with the procedure flow graph and a given video. To solve this problem, we propose a new algorithm - Graph2Vid - that infers the actual ordering of steps in the video and simultaneously localizes them. To show the advantage of our proposed formulation, we extend the CrossTask dataset with procedure flow graph information. Our experiments show that Graph2Vid is both more efficient than the baselines and yields strong step localization results, without the need for step order annotation.

Submitted 31 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: ECCV'22, oral

Journal ref: ECCV 2022

arXiv:2205.15747

Adversarial synthesis based data-augmentation for code-switched spoken language identification

Authors: Parth Shastri, Chirag Patil, Poorval Wanere, Dr. Shrinivas Mahajan, Dr. Abhishek Bhatt, Dr. Hardik Sailor

Abstract: Spoken Language Identification (LID) is an important sub-task of Automatic Speech Recognition (ASR) that is used to classify the language(s) in an audio segment. Automatic LID plays a useful role in multilingual countries. In various countries, identifying a language becomes hard due to the multilingual scenario where two or more languages are mixed together during conversation. Such a phenomenon of speech is called code-mixing or code-switching. This nature is followed not only in India but also in many Asian countries. Such code-mixed data is hard to find, which further reduces the capabilities of the spoken LID. Hence, this work primarily addresses this problem using data augmentation as a solution to the data scarcity of the code-switched class. This study focuses on Indic language code-mixed with English. Spoken LID is performed on Hindi, code-mixed with English. This research proposes a Generative Adversarial Network (GAN) based data augmentation technique performed using Mel spectrograms for audio data. GANs have already been proven to be accurate in representing the real data distribution in the image domain. The proposed research exploits these capabilities of GANs in speech domains such as speech classification, automatic speech recognition, etc. GANs are trained to generate Mel spectrograms of the minority code-mixed class, which are then used to augment data for the classifier. Utilizing GANs gives an overall improvement in Unweighted Average Recall of 3.5% compared to a Convolutional Recurrent Neural Network (CRNN) classifier used as the baseline reference.

Submitted 1 June, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

Comments: 9 pages, 8 figures, updated

arXiv:2203.15178

DesCert: Design for Certification

Authors: Natarajan Shankar, Devesh Bhatt, Michael Ernst, Minyoung Kim, Srivatsan Varadarajan, Suzanne Millstein, Jorge Navas, Jason Biatek, Huascar Sanchez, Anitha Murugesan, Hao Ren

Abstract: The goal of the DARPA Automated Rapid Certification Of Software (ARCOS) program is to "automate the evaluation of software assurance evidence to enable certifiers to determine rapidly that system risk is acceptable." As part of this program, the DesCert project focuses on the assurance-driven development of new software. The DesCert team consists of SRI International, Honeywell Research, and the University of Washington. We have adopted a formal, tool-based approach to the construction of software artifacts that are supported by rigorous evidence. The DesCert workflow integrates evidence generation into a design process that goes from requirements capture and analysis to the decomposition of the high-level software requirements into architecture properties and software components with assertional contracts, and on to software that can be analyzed both dynamically and statically. The generated evidence is organized by means of an assurance ontology and integrated into the RACK knowledge base.

Submitted 28 March, 2022; originally announced March 2022.

Comments: 142 pages, 63 figures

Report number: SRI-CSL-2022-1 MSC Class: 68N30 ACM Class: D.2.1; D.2.2; D.2.3; D.2.4; D.2.10; D.2.11

arXiv:2112.09866

Cascading Adaptors to Leverage English Data to Improve Performance of Question Answering for Low-Resource Languages

Authors: Hariom A. Pandya, Bhavik Ardeshna, Dr. Brijesh S. Bhatt

Abstract: Transformer based architectures have shown notable results on many downstream tasks including question answering. The availability of data, on the other hand, impedes obtaining legitimate performance for low-resource languages. In this paper, we investigate the applicability of pre-trained multilingual models to improve the performance of question answering in low-resource languages. We tested four combinations of language and task adapters using multilingual transformer architectures on seven languages similar to the MLQA dataset. Additionally, we have also proposed zero-shot transfer learning of low-resource question answering using language and task adapters. We observed that stacking the language and the task adapters improves the multilingual transformer models' performance significantly for low-resource languages.

Submitted 18 December, 2021; originally announced December 2021.

arXiv:2112.09860

Morpheme Boundary Detection & Grammatical Feature Prediction for Gujarati : Dataset & Model

Authors: Jatayu Baxi, Dr. Brijesh Bhatt

Abstract: Developing Natural Language Processing resources for a low resource language is a challenging but essential task. In this paper, we present a Morphological Analyzer for Gujarati. We have used a Bi-Directional LSTM based approach to perform morpheme boundary detection and grammatical feature tagging. We have created a data set of Gujarati words with lemma and grammatical features. The Bi-LSTM based model of Morph Analyzer discussed in the paper handles the language morphology effectively without the knowledge of any hand-crafted suffix rules. To the best of our knowledge, this is the first dataset and morph analyzer model for the Gujarati language which performs both grammatical feature tagging and morpheme boundary detection tasks.

Submitted 18 December, 2021; originally announced December 2021.

arXiv:2109.13913

$f$-Cal: Calibrated aleatoric uncertainty estimation from neural networks for robot perception

Authors: Dhaivat Bhatt, Kaustubh Mani, Dishank Bansal, Krishna Murthy, Hanju Lee, Liam Paull

Abstract: While modern deep neural networks are performant perception modules, performance (accuracy) alone is insufficient, particularly for safety-critical robotic applications such as self-driving vehicles. Robot autonomy stacks also require these otherwise blackbox models to produce reliable and calibrated measures of confidence on their predictions. Existing approaches estimate uncertainty from these neural network perception stacks by modifying network architectures, inference procedure, or loss functions. However, in general, these methods lack calibration, meaning that the predictive uncertainties do not faithfully represent the true underlying uncertainties (process noise). Our key insight is that calibration is only achieved by imposing constraints across multiple examples, such as those in a mini-batch; as opposed to existing approaches which only impose constraints per-sample, often leading to overconfident (thus miscalibrated) uncertainty estimates. By enforcing the distribution of outputs of a neural network to resemble a target distribution by minimizing an $f$-divergence, we obtain significantly better-calibrated models compared to prior approaches. Our approach, $f$-Cal, outperforms existing uncertainty calibration approaches on robot perception tasks such as object detection and monocular depth estimation over multiple real-world benchmarks.

Submitted 28 September, 2021; originally announced September 2021.

Comments: For more details about $f$-Cal, visit https://f-cal.github.io

arXiv:2109.04634

doi 10.4204/EPTCS.344.6

Knowledge-Assisted Reasoning of Model-Augmented System Requirements with Event Calculus and Goal-Directed Answer Set Programming

Authors: Brendan Hall, Sarat Chandra Varanasi, Jan Fiedor, Joaquín Arias, Kinjal Basu, Fang Li, Devesh Bhatt, Kevin Driscoll, Elmer Salazar, Gopal Gupta

Abstract: We consider requirements for cyber-physical systems represented in constrained natural language. We present novel automated techniques for aiding in the development of these requirements so that they are consistent and can withstand perceived failures. We show how cyber-physical systems' requirements can be modeled using the event calculus (EC), a formalism used in AI for representing actions and change. We also show how answer set programming (ASP) and its query-driven implementation s(CASP) can be used to directly realize the event calculus model of the requirements. This event calculus model can be used to automatically validate the requirements. Since ASP is an expressive knowledge representation language, it can also be used to represent contextual knowledge about cyber-physical systems, which, in turn, can be used to find gaps in their requirements specifications. We illustrate our approach through an altitude alerting system from the avionics domain.

Submitted 9 September, 2021; originally announced September 2021.

Comments: In Proceedings HCVS 2021, arXiv:2109.03988

Journal ref: EPTCS 344, 2021, pp. 79-90

arXiv:2105.08106

Multi-Modal Image Captioning for the Visually Impaired

Authors: Hiba Ahsan, Nikita Bhalla, Daivat Bhatt, Kaivankumar Shah

Abstract: One of the ways blind people understand their surroundings is by clicking images and relying on descriptions generated by image captioning systems. Current work on captioning images for the visually impaired does not use the textual data present in the image when generating captions. This problem is critical as many visual scenes contain text. Moreover, up to 21% of the questions asked by blind people about the images they click pertain to the text present in them. In this work, we propose altering AoANet, a state-of-the-art image captioning model, to leverage the text detected in the image as an input feature. In addition, we use a pointer-generator mechanism to copy the detected text to the caption when tokens need to be reproduced accurately. Our model outperforms AoANet on the benchmark dataset VizWiz, giving a 35% and 16.2% performance improvement on CIDEr and SPICE scores, respectively.

Submitted 17 May, 2021; originally announced May 2021.

Comments: 8 pages, 2 figures, 2 tables, accepted to NAACL-HLT SRW 2021

arXiv:2105.04144

Transitioning from Real to Synthetic data: Quantifying the bias in model

Authors: Aman Gupta, Deepak Bhatt, Anubha Pandey

Abstract: With the advent of generative modeling techniques, synthetic data and its use have penetrated various domains, from unstructured data such as image and text to structured datasets modeling healthcare outcomes, risk decisioning in the financial domain, and many more. It overcomes various challenges such as limited training data, class imbalance, and restricted access to datasets owing to privacy issues. To ensure the trained model used for automated decisioning purposes makes a fair decision, there exists prior work to quantify and mitigate those issues. This study aims to establish a trade-off between bias and fairness in the models trained using synthetic data. Variants of synthetic data generation techniques were studied to understand bias amplification, including differentially private generation schemes. Through experiments on a tabular dataset, we demonstrate that there exist varying levels of bias impact on models trained using synthetic data. Techniques generating less correlated features perform well, as evident through fairness metrics with 94%, 82%, and 88% relative drop in DPD (demographic parity difference), EoD (equality of odds) and EoP (equality of opportunity) respectively, and 24% relative improvement in DRP (demographic parity ratio) with respect to the real dataset. We believe the outcome of our research study will help data science practitioners understand the bias in the use of synthetic data.

Submitted 10 May, 2021; originally announced May 2021.

Comments: Accepted at Synthetic Data Generation Workshop at ICLR 2021 https://sdg-quality-privacy-bias.github.io/papers/

arXiv:2101.00495

Volterra model-based control for nonlinear systems via Carleman linearization

Authors: Dhruvi Bhatt, Shambhu Nath Sharma

Abstract: This paper presents detailed insights of embedding Carleman linearization into nonlinear systems for designing Volterra model-based control technique. Volterra series method is a competent mathematical tool, which extends the convolution integral for linear systems to nonlinear systems. First, we utilize the Carleman linearization technique to arrive at the bilinear approximation of the nonlinear system. Secondly, the third-order Volterra model representation is computed from the Carleman bilinearized model. Then, Volterra model-based control strategy is developed. The proposed method is effectuated on the benchmark van de Vusse reactor exhibiting non-minimum phase response. The open and closed-loop simulation results are presented, demonstrating the superiority and practical utility of the proposed method.

Submitted 2 January, 2021; originally announced January 2021.

Comments: 14 figures, 23 pages

MSC Class: 93B18; 45D05; 93B50; 34A34; 34H05; 93B52

arXiv:2009.05728

Abstractive Information Extraction from Scanned Invoices (AIESI) using End-to-end Sequential Approach

Authors: Shreeshiv Patel, Dvijesh Bhatt

Abstract: Recent proliferation in the field of Machine Learning and Deep Learning allows us to generate OCR models with higher accuracy. Optical Character Recognition (OCR) is the process of extracting text from documents and scanned images. For document data streamlining, we are interested in data such as payee name, total amount, address, etc. Extracted information helps to get complete insight of data, which can be helpful for fast document searching, efficient indexing in databases, data analytics, etc. Using AIESI we can eliminate human effort for key parameters extraction from scanned documents. Abstract Information Extraction from Scanned Invoices (AIESI) is a process of extracting information such as date, total amount, payee name, etc. from scanned receipts. In this paper we propose an improved method to ensemble all visual and textual features from invoices to extract key invoice parameters using Word wise BiLSTM.

Submitted 12 September, 2020; originally announced September 2020.

Comments: 6 pages, 7 images, to be published in upcoming relevant conference

arXiv:1910.05566

Filtering theory for a weakly coloured noise process

Authors: Shaival H. Nagarsheth, Dhruvi S. Bhatt, Shambhu N. Sharma

Abstract: The problem of analyzing the Ito stochastic differential system and its filtering has received attention. The classical approach to accomplish filtering for the Ito SDE is the Kushner equation. In contrast to the classical filtering approach, this paper presents filtering for the stochastic differential system affected by weakly coloured noise. As a special case, the process can be regarded as the Ornstein-Uhlenbeck (OU) process. The theory of this paper is based on a pioneering contribution of Stratonovich involving the perturbation-theoretic approach to noisy dynamical systems in combination with the notion of the filtering density evolution. Making use of the filtering density evolution equation, the stochastic evolution of the conditional moment is derived. A scalar Duffing system driven by the OU process is employed to test the effectiveness of the filtering theory of the paper. Numerical simulations involving four different sets of initial conditions and system parameters are utilized to examine the efficacy of the filtering algorithm of this paper.

Submitted 12 October, 2019; originally announced October 2019.

Comments: 12 pages, 4 figures

MSC Class: 60G99- Stochastic Processes; 60J65 - Brownian motion; 93E11 - Filtering

arXiv:1909.13042

Estimation of the van de Vusse reactor via Carleman embedding

Authors: Dhruvi S. Bhatt, Shambhu N. Sharma

Abstract: The van de Vusse reactor is an appealing benchmark problem in industrial control, since it has a non-minimum phase response. The van de Vusse stochasticity is attributed to the fluctuating input flow rate. The novelties of the paper are two. First, we utilize the surprising power of Ito stochastic calculus for applications to account for the van de Vusse stochasticity. Secondly, the Carleman embedding is unified with the Fokker-Planck equation for finding the estimation of the van de Vusse reactor. The revelation of the paper is that the Carleman linearized estimate of the van de Vusse reactor is more refined in contrast to the EKF predicted estimate. This paper will be useful to practitioners aspiring for formal methods for stochastically perturbed nonlinear reactors as well as system theorists aspiring for applications of their theoretical results to practical problems.

Submitted 28 September, 2019; originally announced September 2019.

Comments: 23 pages, 7 figures, 4 tables

MSC Class: 82C31: Stochastic methods; 93E03: Stochastic systems; general

arXiv:1903.01669 [pdf, other]

Deep Active Localization

Authors: Sai Krishna, Keehong Seo, Dhaivat Bhatt, Vincent Mai, Krishna Murthy, Liam Paull

Abstract: Active localization is the problem of generating robot actions that allow it to maximally disambiguate its pose within a reference map. Traditional approaches to this use an information-theoretic criterion for action selection and hand-crafted perceptual models. In this work we propose an end-to-end differentiable method for learning to take informative actions that is trainable entirely in simulation and then transferable to real robot hardware with zero refinement. The system is composed of two modules: a convolutional neural network for perception, and a deep reinforcement learned planning module. We introduce a multi-scale approach to the learned perceptual model since the accuracy needed to perform action selection with reinforcement learning is much less than the accuracy needed for robot control. We demonstrate that the resulting system outperforms using the traditional approach for either perception or planning. We also demonstrate our approaches robustness to different map configurations and other nuisance parameters through the use of domain randomization in training. The code is also compatible with the OpenAI gym framework, as well as the Gazebo simulator.

Submitted 5 March, 2019; originally announced March 2019.

Comments: 10 pages

arXiv:1902.01578 [pdf, other]

doi 10.1364/OE.27.031900

Detection of self-generated nanowaves on the interface of an evaporating sessile water droplet

Authors: Dhanush Bhatt, Rahul Vaippully, Bhavesh Kharbanda, Anand Dev Ranjan, Sulochana R., Viraj Dharod, Basudev Roy

Abstract: Evaporating sessile droplets have been known to exhibit oscillations on the air-liquid interface. These are generally over millimeter scales. Using a novel approach, we are able to measure surface height changes of 500 nm amplitude using optical trapping of a set of microscopic particles at the interface, particularly when the vertical thickness of the droplet reduces to less than 50 $μ$m. We find that at the later stages of the droplet evaporation, particularly when the convection currents become large, the top air-water interface starts to spontaneously oscillate vertically as a function of time in consistency with predictions. We also detect travelling wave trains moving in the azimuthal direction of the drop surface which are consistent with hydrothermal waves at a different combination of Reynolds, Prandtl and Evaporation than previously observed. This is the first time that wave-trains have been observed in water, being extremely challenging to detect both interferometrically and with infra-red cameras. We also find that such waves apply a force parallel to the interface along the propagation direction.

Submitted 5 February, 2019; originally announced February 2019.

Comments: 5 pages

arXiv:1811.04259 [pdf, other]

Study of adhesivity of surfaces using rotational optical tweezers

Authors: Rahul Vaipully, Dhanush Bhatt, Anand Dev Ranjan, Basudev Roy

Abstract: Optical tweezers are powerful tools for high resolution study of surface properties. Such experiments are traditionally performed by studying the active or the brownian fluctuation of trapped particles in the X, Y, Z direction. Here we find that employing the fourth dimension, rotation, allows for sensitive and fast probing of the surface. Optical tweezers are capable of rotating trapped birefringent microparticles when applied with circularly polarized light, thus called the Rotational Optical Tweezers. When the trapped birefringent microparticle is far enough away from the surface, the rotation rate is dependent only on the laser power. However, we find that if one traps close to a surface, the rotation rate goes to zero even at finite tweezers laser powers for some specific type of substrates. We suspect this to be due to interaction between the substrate and the birefringent particle, keeping in mind that the hydrodynamic drag for this mode of rotation cannot increase beyond 1.2 times the drag away from the surface. We use this to probe some surfaces and find that there is no binding for hydrophobic ones but hydrophilic ones particularly tend to show a power threshold beyond which the birefringent particle starts rotating. We calculate that the threshold energy of the tweezers is consistent with the Van der Waals potential energy, when the mode of interaction with the surface is purely physical. We also find that for chitosan, the mode of interaction is possibly different from Van der Waals. We place the particle on the threshold and observe stick-slip kind of rotational behaviour.

Submitted 10 November, 2018; originally announced November 2018.

Comments: 4 pages

arXiv:1806.09929 [pdf, other]

Chance Constraints Integrated MPC Navigation in Uncertainty amongst Dynamic Obstacles: An overlap of Gaussians approach

Authors: Dhaivat Bhatt, Akash Garg, Bharath Gopalakrishnan, K. Madhava Krishna

Abstract: In this paper, we formulate a novel trajectory optimization scheme that takes into consideration the state uncertainty of the robot and obstacle into its collision avoidance routine. The collision avoidance under uncertainty is modeled here as an overlap between two distributions that represent the state of the robot and obstacle respectively. We adopt the minmax procedure to characterize the area of overlap between two Gaussian distributions, and compare it with the method of Bhattacharyya distance. We provide closed form expressions that can characterize the overlap as a function of control. Our proposed algorithm can avoid overlapping uncertainty distributions in two possible ways. Firstly when a prescribed overlapping area that needs to be avoided is posed as a confidence contour lower bound, control commands are accordingly realized through a MPC framework such that these bounds are respected. Secondly in tight spaces control commands are computed such that the overlapping distribution respects a prescribed range of overlap characterized by lower and upper bounds of the confidence contours. We test our proposal with extensive set of simulations carried out under various constrained environmental configurations. We show usefulness of proposal under tight spaces where finding control maneuvers with minimal risk behavior becomes an inevitable task.

Submitted 26 June, 2018; originally announced June 2018.

Comments: 8 pages, 8 figures

arXiv:1405.6130 [pdf]

doi 10.14445/22312803/IJCTT-V7P143

A Study of Local Binary Pattern Method for Facial Expression Detection

Authors: Ms. Drashti H. Bhatt, Mr. Kirit R. Rathod, Mr. Shardul J. Agravat

Abstract: Face detection is a basic task for expression recognition. The reliability of face detection & face recognition approach has a major role on the performance and usability of the entire system. There are several ways to undergo face detection & recognition. We can use Image Processing Operations, various classifiers, filters or virtual machines for the former. Various strategies are being available for Facial Expression Detection. The field of facial expression detection can have various applications along with its importance & can be interacted between human being & computer. Many few options are available to identify a face in an image in accurate & efficient manner. Local Binary Pattern (LBP) based texture algorithms have gained popularity in these years. LBP is an effective approach to have facial expression recognition & is a feature-based approach.

Submitted 4 February, 2014; originally announced May 2014.

Comments: 3 pages, 2 images, International Journal of Computer Trends and Technology (IJCTT)

Journal ref: Ms.Drashti H. Bhatt , Mr.Kirit R. Rathod , Mr.Shardul J. Agravat. Article: A Study of Local Binary Pattern Method for Facial Expression Detection. IJCTT 7(3):151-153, January 2014. Published by Seventh Sense Research Group

arXiv:1109.0744 [pdf, ps, other]

Biomolecular transitions: efficient computation of pathways, free energies, and rates

Authors: Divesh Bhatt, Ivet Bahar

Abstract: We present an efficient method to compute transition rates between states for a two-state system. The method utilizes the equivalence between steady-state flux and mean first passage rate for such systems. More specifically, the procedure divides the configurational space into smaller regions and equilibrates trajectories within each region efficiently. The equilibrated conditional probabilities between each pair of regions lead to transition rates between the two states. We apply the procedure to a non-trivial coarse-grained model of a 70 residue section of the calcium binding protein, calmodulin. The procedure yields a significant increase in efficiency compared to brute-force simulations, and this efficiency increases dramatically with a decrease in temperature.

Submitted 4 September, 2011; originally announced September 2011.

arXiv:1109.0743 [pdf, ps, other]

Stochastic modeling of p53-regulated apoptosis upon radiation damage

Authors: Divesh Bhatt, Zoltan Oltvai, Ivet Bahar

Abstract: We develop and study the evolution of a model of radiation induced apoptosis in cells using stochastic simulations, and identified key protein targets for effective mitigation of radiation damage. We identified several key proteins associated with cellular apoptosis using an extensive literature survey. In particular, we focus on the p53 transcription dependent and p53 transcription independent pathways for mitochondrial apoptosis. Our model reproduces known p53 oscillations following radiation damage. The key, experimentally testable hypotheses that we generate are - inhibition of PUMA is an effective strategy for mitigation of radiation damage if the treatment is administered immediately, at later stages following radiation damage, inhibition of tBid is more effective.

Submitted 4 September, 2011; originally announced September 2011.

arXiv:1002.3802 [pdf, ps, other]

Automated sampling assessment for molecular simulations using the effective sample size

Authors: Xin Zhang, Divesh Bhatt, Daniel M. Zuckerman

Abstract: To quantify the progress in development of algorithms and forcefields used in molecular simulations, a method for the assessment of the sampling quality is needed. We propose a general method to assess the sampling quality through the estimation of the number of independent samples obtained from molecular simulations. This method is applicable to both dynamic and nondynamic methods and utilizes the variance in the populations of physical states to determine the ESS. We test the correctness and robustness of our procedure in a variety of systems--two-state toy model, all-atom butane, coarse-grained calmodulin, all-atom dileucine and Met-enkaphalin. We also introduce an automated procedure to obtain approximate physical states from dynamic trajectories: this procedure allows for sample--size estimation for systems for which physical states are not known in advance.

Submitted 19 February, 2010; originally announced February 2010.

arXiv:1002.2402 [pdf, ps, other]

Symmetry of forward and reverse path populations

Authors: Divesh Bhatt, Daniel M. Zuckerman

Abstract: In this note, we address formally the issue of symmetry for probabilities of different dynamical pathways in the forward and reverse directions of a conformational transition. Our discussion is based on a decomposition of equilibrium into opposing steady states, and makes clear the conditions necessary for symmetry to apply. From a practical point of view, we also discuss when approximate symmetry is to be expected.

Submitted 11 February, 2010; v1 submitted 11 February, 2010; originally announced February 2010.

arXiv:0910.5255 [pdf, ps, other]

doi 10.1063/1.3456985

Steady-state simulations using weighted ensemble path sampling

Authors: Divesh Bhatt, Bin W. Zhang, Daniel M. Zuckerman

Abstract: We extend the weighted ensemble (WE) path sampling method to perform rigorous statistical sampling for systems at steady state. The straightforward steady-state implementation of WE is directly practical for simple landscapes, but not when significant metastable intermediates states are present. We therefore develop an enhanced WE scheme, building on existing ideas, which accelerates attainment of steady state in complex systems. We apply both WE approaches to several model systems confirming their correctness and efficiency by comparison with brute-force results. The enhanced version is significantly faster than the brute force and straightforward WE for systems with WE bins that accurately reflect the reaction coordinate(s). The new WE methods can also be applied to equilibrium sampling, since equilibrium is a steady state.

Submitted 28 February, 2010; v1 submitted 27 October, 2009; originally announced October 2009.

arXiv:0910.5136 [pdf]

doi 10.1016/j.bpj.2008.12.3752

Thermal Motions of the E. Coli Glucose-Galactose Binding Protein Studied Using Well-Sampled Semi-Atomistic Simulations

Authors: Derek J. Cashman, Artem B. Mamonov, Divesh Bhatt, Daniel M. Zuckerman

Abstract: The E. coli glucose-galactose chemosensory receptor is a 309 residue, 32 kDa protein consisting of two distinct structural domains. In this computational study, we studied the protein's thermal fluctuations, including both the large scale interdomain movements that contribute to the receptor's mechanism of action, as well as smaller scale motions, using two different computational methods. We employ extremely fast, "semi-atomistic" Library-Based Monte Carlo (LBMC) simulations, which include all backbone atoms but "implicit" side chains. Our results were compared with previous experiments and an all-atom Langevin dynamics simulation. Both LBMC and Langevin dynamics simulations were performed using both the apo and glucose-bound form of the protein, with LBMC exhibiting significantly larger fluctuations. The LBMC simulations are also in general agreement with the disulfide trapping experiments of Careaga & Falke (JMB, 1992; Biophys. J., 1992), which indicate that distant residues in the crystal structure (i.e. beta carbons separated by 10 to 20 angstroms) form spontaneous transient contacts in solution. Our simulations illustrate several possible "mechanisms" (configurational pathways) for these fluctuations. We also observe several discrepancies between our calculations and experiment. Nevertheless, we believe that our semi-atomistic approach could be used to study the fluctuations in other proteins, perhaps for ensemble docking, or other analyses of protein flexibility in virtual screening studies.

Submitted 27 October, 2009; originally announced October 2009.

Comments: 23 pages, 4 figures, 2 tables

arXiv:0910.1582 [pdf, ps, other]

Heterogeneous path ensembles for conformational transitions in semi-atomistic models of adenylate kinase

Authors: Divesh Bhatt, Daniel M. Zuckerman

Abstract: We performed "weighted ensemble" path-sampling simulations of adenylate kinase, using several semi-atomistic protein models. Our study investigated both the biophysics of conformational transitions as well as the possibility of increasing model accuracy without sacrificing good sampling. Biophysically, the path ensembles show significant heterogeneity and the explicit possibility of two principle pathways in the Open-Closed transition. We recently showed, under certain conditions, a "symmetry of hetereogeneity" is expected between the forward and the reverse transitions: the fraction of transitions taking a specific pathway/channel will be the same in both the directions. Our path ensembles are analyzed in the light of the symmetry relation and its conditions. In the realm of modeling, we employed an all-atom backbone with various levels of residue interactions. Because reasonable path sampling required only a few weeks of single-processor computing time with these models, the addition of further chemical detail should be feasible.

Submitted 25 February, 2010; v1 submitted 8 October, 2009; originally announced October 2009.

arXiv:0809.3809 [pdf]

A library-based Monte Carlo technique enables rapid equilibrium sampling of a protein model with atomistic components

Authors: Artem B. Mamonov, Divesh Bhatt, Derek J. Cashman, Daniel M. Zuckerman

Abstract: There is significant interest in rapid protein simulations because of the time-scale limitations of all-atom methods. Exploiting the low cost and great availability of computer memory, we report a Monte Carlo technique for incorporating fully flexible atomistic protein components (e.g., peptide planes) into protein models without compromising sampling speed or statistical rigor. Building on existing approximate methods (e.g., Rosetta), the technique uses pre-generated statistical libraries of all-atom components which are swapped with the corresponding protein components during a simulation. The simple model we study consists of the three all-atom backbone residues -- Ala, Gly, and Pro -- with structure-based (Go-like) interactions. For the five different proteins considered in this study, LBMC can generate at least 30 statistically independent configurations in about a month of single CPU time. Minimal additional cost is required to add residue-specific interactions.

Submitted 4 December, 2008; v1 submitted 22 September, 2008; originally announced September 2008.

Showing 1–30 of 30 results for author: Bhatt, D