Search | arXiv e-print repository

arXiv:2010.07768 [pdf]

doi 10.1002/jbio.202000473

High-resolution single-shot phase-shifting interference microscopy using deep neural network for quantitative phase imaging of biological samples

Authors: Sunil Bhatt, Ankit Butola, Sheetal Raosaheb Kanade, Anand Kumar, Dalip Singh Mehta

Abstract: White light phase-shifting interference microscopy (WL-PSIM) is a prominent technique for high-resolution quantitative phase imaging (QPI) of industrial and biological specimens. However, multiple interferograms with accurate phase-shifts are essentially required in WL-PSIM for measuring the accurate phase of the object. Here, we present single-shot phase-shifting interferometric techniques for ac… ▽ More White light phase-shifting interference microscopy (WL-PSIM) is a prominent technique for high-resolution quantitative phase imaging (QPI) of industrial and biological specimens. However, multiple interferograms with accurate phase-shifts are essentially required in WL-PSIM for measuring the accurate phase of the object. Here, we present single-shot phase-shifting interferometric techniques for accurate phase measurement using filtered white light phase-shifting interference microscopy (F-WL-PSIM) and deep neural network (DNN). The methods are incorporated by training the DNN to generate 1) four phase-shifted frames and 2) direct phase from a single interferogram. The training of network is performed on two different samples i.e., optical waveguide and MG63 osteosarcoma cells. Further, performance of F-WL-PSIM+DNN framework is validated by comparing the phase map extracted from network generated and experimentally recorded interferograms. The current approach can further strengthen QPI techniques for high-resolution phase recovery using a single frame for different biomedical applications. △ Less

Submitted 3 May, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

arXiv:2009.09818 [pdf, other]

DeepActsNet: Spatial and Motion features from Face, Hands, and Body Combined with Convolutional and Graph Networks for Improved Action Recognition

Authors: Umar Asif, Deval Mehta, Stefan von Cavallar, Jianbin Tang, Stefan Harrer

Abstract: Existing action recognition methods mainly focus on joint and bone information in human body skeleton data due to its robustness to complex backgrounds and dynamic characteristics of the environments. In this paper, we combine body skeleton data with spatial and motion features from face and two hands, and present "Deep Action Stamps (DeepActs)", a novel data representation to encode actions from… ▽ More Existing action recognition methods mainly focus on joint and bone information in human body skeleton data due to its robustness to complex backgrounds and dynamic characteristics of the environments. In this paper, we combine body skeleton data with spatial and motion features from face and two hands, and present "Deep Action Stamps (DeepActs)", a novel data representation to encode actions from video sequences. We also present "DeepActsNet", a deep learning based ensemble model which learns convolutional and structural features from Deep Action Stamps for highly accurate action recognition. Experiments on three challenging action recognition datasets (NTU60, NTU120, and SYSU) show that the proposed model trained using Deep Action Stamps produce considerable improvements in the action recognition accuracy with less computational cost compared to the state-of-the-art methods. △ Less

Submitted 4 June, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

arXiv:2008.04853 [pdf]

doi 10.1109/VTCFall.2018.8690650

Study on State-of-the-art Cloud Services Integration Capabilities with Autonomous Ground Vehicles

Authors: Praveen Damacharla, Dhwani Mehta, Ahmad Y Javaid, Vijay K. Devabhaktuni

Abstract: Computing and intelligence are substantial requirements for the accurate performance of autonomous ground vehicles (AGVs). In this context, the use of cloud services in addition to onboard computers enhances computing and intelligence capabilities of AGVs. In addition, the vast amount of data processed in a cloud system contributes to overall performance and capabilities of the onboard system. Thi… ▽ More Computing and intelligence are substantial requirements for the accurate performance of autonomous ground vehicles (AGVs). In this context, the use of cloud services in addition to onboard computers enhances computing and intelligence capabilities of AGVs. In addition, the vast amount of data processed in a cloud system contributes to overall performance and capabilities of the onboard system. This research study entails a qualitative analysis to gather insights on the applicability of the leading cloud service providers in AGV operations. These services include Google Cloud, Microsoft Azure, Amazon AWS, and IBM Cloud. The study begins with a brief review of AGV technical requirements that are necessary to determine the rationale for identifying the most suitable cloud service. The qualitative analysis studies and addresses the applicability of the cloud service over the proposed generalized AGV's architecture integration, performance, and manageability. Our findings conclude that a generalized AGV architecture can be supported by state-of-the-art cloud service, but there should be a clear line of separation between the primary and secondary computing needs. Moreover, our results show significant lags while using cloud services and preventing their use in real-time AGV operation. △ Less

Submitted 11 August, 2020; originally announced August 2020.

Journal ref: 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall), Chicago, IL, USA, 2018, pp. 1-5

arXiv:2007.12263 [pdf]

Multi-modal on-chip nanoscopy and quantitative phase image reveals the morphology of liver sinusoidal enodthelial cells

Authors: David A. Coucheron, Ankit Butola, Karolina Szafranska, Azeem Ahmad, Jean-Claude Tinguely, Peter McCourt, Paramasivam Senthilkumaran, Dalip Singh Mehta, Balpreet Singh Ahluwalia

Abstract: Visualization of three-dimensional morphological changes in the subcellular structures of a biological specimen is one of the greatest challenges in life science. Despite conspicuous refinements in optical nanoscopy, determination of quantitative changes in subcellular structure, i.e., size and thickness, remains elusive. We present an integrated chip-based optical nanoscopy set-up that provides a… ▽ More Visualization of three-dimensional morphological changes in the subcellular structures of a biological specimen is one of the greatest challenges in life science. Despite conspicuous refinements in optical nanoscopy, determination of quantitative changes in subcellular structure, i.e., size and thickness, remains elusive. We present an integrated chip-based optical nanoscopy set-up that provides a lateral optical resolution of 61 nm combined with a highly sensitive quantitative phase microscopy (QPM) system with a spatial phase sensitivity of $\pm$20 mrad. We use the system to obtain the 3D morphology of liver sinusoidal endothelial cells (LSECs) combined with super-resolved spatial information. LSECs have a unique morphology with nanopores that are present in the plasma membrane, called fenestration. The fenestrations are grouped in clusters called sieve plates, which are around 100 nm thick. Thus, imaging and quantification of fenestration and sieve plate thickness requires resolution and sensitivity of sub-100 nm along both lateral and axial directions. In the chip-based nanoscope, the optical waveguides are used both for hosting and illuminating the sample. A strong evanescent field is generated on top of the waveguide surface for single molecule fluorescence excitation. The fluorescence signal is captured by an upright microscope, which is converted into a Linnik-type interferometer to sequentially acquire both super-resolved images and quantitative phase information of the sample. The multi-modal microscope provided an estimate of the fenestration diameter of 124$\pm$41 nm and revealed the average estimated thickness of the sieve plates in the range of 91.2$\pm$43.5 nm for two different cells. The combination of these techniques offers visualization of both the lateral size (using nanoscopy) and the thickness map of sieve plates, i.e. discrete clusters fenestrations in QPM mode. △ Less

Submitted 22 July, 2020; originally announced July 2020.

arXiv:2007.02397 [pdf]

doi 10.1364/OE.402666

High space-bandwidth in quantitative phase imaging using partially spatially coherent optical coherence microscopy and deep neural network

Authors: Ankit Butola, Sheetal Raosaheb Kanade, Sunil Bhatt, Vishesh Kumar Dubey, Anand Kumar, Azeem Ahmad, Dilip K Prasad, Paramasivam Senthilkumaran, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: Quantitative phase microscopy (QPM) is a label-free technique that enables to monitor morphological changes at subcellular level. The performance of the QPM system in terms of spatial sensitivity and resolution depends on the coherence properties of the light source and the numerical aperture (NA) of objective lenses. Here, we propose high space-bandwidth QPM using partially spatially coherent opt… ▽ More Quantitative phase microscopy (QPM) is a label-free technique that enables to monitor morphological changes at subcellular level. The performance of the QPM system in terms of spatial sensitivity and resolution depends on the coherence properties of the light source and the numerical aperture (NA) of objective lenses. Here, we propose high space-bandwidth QPM using partially spatially coherent optical coherence microscopy (PSC-OCM) assisted with deep neural network. The PSC source synthesized to improve the spatial sensitivity of the reconstructed phase map from the interferometric images. Further, compatible generative adversarial network (GAN) is used and trained with paired low-resolution (LR) and high-resolution (HR) datasets acquired from PSC-OCM system. The training of the network is performed on two different types of samples i.e. mostly homogenous human red blood cells (RBC) and on highly heterogenous macrophages. The performance is evaluated by predicting the HR images from the datasets captured with low NA lens and compared with the actual HR phase images. An improvement of 9 times in space-bandwidth product is demonstrated for both RBC and macrophages datasets. We believe that the PSC-OCM+GAN approach would be applicable in single-shot label free tissue imaging, disease classification and other high-resolution tomography applications by utilizing the longitudinal spatial coherence properties of the light source. △ Less

Submitted 5 July, 2020; originally announced July 2020.

arXiv:2006.14078 [pdf, other]

Machine learning the real discriminant locus

Authors: Edgar A. Bernal, Jonathan D. Hauenstein, Dhagash Mehta, Margaret H. Regan, Tingting Tang

Abstract: Parameterized systems of polynomial equations arise in many applications in science and engineering with the real solutions describing, for example, equilibria of a dynamical system, linkages satisfying design constraints, and scene reconstruction in computer vision. Since different parameter values can have a different number of real solutions, the parameter space is decomposed into regions whose… ▽ More Parameterized systems of polynomial equations arise in many applications in science and engineering with the real solutions describing, for example, equilibria of a dynamical system, linkages satisfying design constraints, and scene reconstruction in computer vision. Since different parameter values can have a different number of real solutions, the parameter space is decomposed into regions whose boundary forms the real discriminant locus. This article views locating the real discriminant locus as a supervised classification problem in machine learning where the goal is to determine classification boundaries over the parameter space, with the classes being the number of real solutions. For multidimensional parameter spaces, this article presents a novel sampling method which carefully samples the parameter space. At each sample point, homotopy continuation is used to obtain the number of real solutions to the corresponding polynomial system. Machine learning techniques including nearest neighbor and deep learning are used to efficiently approximate the real discriminant locus. One application of having learned the real discriminant locus is to develop a real homotopy method that only tracks the real solution paths unlike traditional methods which track all~complex~solution~paths. Examples show that the proposed approach can efficiently approximate complicated solution boundaries such as those arising from the equilibria of the Kuramoto model. △ Less

Submitted 8 August, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

Comments: 22 pages, 14 figures

arXiv:2006.00123 [pdf, other]

Machine Learning Fund Categorizations

Authors: Dhagash Mehta, Dhruv Desai, Jithin Pradeep

Abstract: Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many appli… ▽ More Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many applications ranging from sales and marketing to portfolio replication, portfolio diversification and tax loss harvesting. The current best method is data-vendor provided categorization which usually relies on curation by human experts with the help of available data. In this work, we establish that an industry wide well-regarded categorization system is learnable using machine learning and largely reproducible, and in turn constructing a truly data-driven categorization. We discuss the intellectual challenges in learning this man-made system, our results and their implications. △ Less

Submitted 29 May, 2020; originally announced June 2020.

Comments: 8 pages, 2-column format, 5 figures

arXiv:2005.08224 [pdf]

#Coronavirus or #Chinesevirus?!: Understanding the negative sentiment reflected in Tweets with racist hashtags across the development of COVID-19

Authors: Xin Pei, Deval Mehta

Abstract: Situated in the global outbreak of COVID-19, our study enriches the discussion concerning the emergent racism and xenophobia on social media. With big data extracted from Twitter, we focus on the analysis of negative sentiment reflected in tweets marked with racist hashtags, as racism and xenophobia are more likely to be delivered via the negative sentiment. Especially, we propose a stage-based ap… ▽ More Situated in the global outbreak of COVID-19, our study enriches the discussion concerning the emergent racism and xenophobia on social media. With big data extracted from Twitter, we focus on the analysis of negative sentiment reflected in tweets marked with racist hashtags, as racism and xenophobia are more likely to be delivered via the negative sentiment. Especially, we propose a stage-based approach to capture how the negative sentiment changes along with the three development stages of COVID-19, under which it transformed from a domestic epidemic into an international public health emergency and later, into the global pandemic. At each stage, sentiment analysis enables us to recognize the negative sentiment from tweets with racist hashtags, and keyword extraction allows for the discovery of themes in the expression of negative sentiment by these tweets. Under this public health crisis of human beings, this stage-based approach enables us to provide policy suggestions for the enactment of stage-specific intervention strategies to combat racism and xenophobia on social media in a more effective way. △ Less

Submitted 17 May, 2020; originally announced May 2020.

arXiv:2005.00116 [pdf, other]

Sequence Information Channel Concatenation for Improving Camera Trap Image Burst Classification

Authors: Bhuvan Malladihalli Shashidhara, Darshan Mehta, Yash Kale, Dan Morris, Megan Hazen

Abstract: Camera Traps are extensively used to observe wildlife in their natural habitat without disturbing the ecosystem. This could help in the early detection of natural or human threats to animals, and help towards ecological conservation. Currently, a massive number of such camera traps have been deployed at various ecological conservation areas around the world, collecting data for decades, thereby re… ▽ More Camera Traps are extensively used to observe wildlife in their natural habitat without disturbing the ecosystem. This could help in the early detection of natural or human threats to animals, and help towards ecological conservation. Currently, a massive number of such camera traps have been deployed at various ecological conservation areas around the world, collecting data for decades, thereby requiring automation to detect images containing animals. Existing systems perform classification to detect if images contain animals by considering a single image. However, due to challenging scenes with animals camouflaged in their natural habitat, it sometimes becomes difficult to identify the presence of animals from merely a single image. We hypothesize that a short burst of images instead of a single image, assuming that the animal moves, makes it much easier for a human as well as a machine to detect the presence of animals. In this work, we explore a variety of approaches, and measure the impact of using short image sequences (burst of 3 images) on improving the camera trap image classification. We show that concatenating masks containing sequence information and the images from the 3-image-burst across channels, improves the ROC AUC by 20% on a test-set from unseen camera-sites, as compared to an equivalent model that learns from a single image. △ Less

Submitted 5 June, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

Comments: 8 pages, 4 figures, 2 tables. Git repository can be found at: https://github.com/bhuvi3/camera_trap_animal_classification

ACM Class: I.4.9; I.4.10; I.2.10

arXiv:2004.12908 [pdf, other]

A Simple Lifelong Learning Approach

Authors: Joshua T. Vogelstein, Jayanta Dey, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Weiwei Yang, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf… ▽ More In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain performance on old tasks given new tasks. But striving to avoid forgetting sets the goal unnecessarily low. The goal of lifelong learning should be to use data to improve performance on both future tasks (forward transfer) and past tasks (backward transfer). In this paper, we show that a simple approach -- representation ensembling -- demonstrates both forward and backward transfer in a variety of simulated and benchmark data scenarios, including tabular, vision (CIFAR-100, 5-dataset, Split Mini-Imagenet, and Food1k), and speech (spoken digit), in contrast to various reference algorithms, which typically failed to transfer either forward or backward, or both. Moreover, our proposed approach can flexibly operate with or without a computational budget. △ Less

Submitted 11 June, 2024; v1 submitted 27 April, 2020; originally announced April 2020.

arXiv:2004.12256 [pdf, other]

doi 10.1038/s41467-020-19292-w

A Self-powered Analog Sensor-data-logging Device based on Fowler-Nordheim Dynamical Systems

Authors: Darshit Mehta, Kenji Aono, Shantanu Chakrabartty

Abstract: Continuous, battery-free operation of sensor nodes requires ultra-low-power sensing and data-logging techniques. Here we report that by directly coupling a sensor/transducer signal into globally asymptotically stable monotonic dynamical systems based on Fowler-Nordheim quantum tunneling, one can achieve self-powered sensing at an energy budget that is currently unachievable using conventional ener… ▽ More Continuous, battery-free operation of sensor nodes requires ultra-low-power sensing and data-logging techniques. Here we report that by directly coupling a sensor/transducer signal into globally asymptotically stable monotonic dynamical systems based on Fowler-Nordheim quantum tunneling, one can achieve self-powered sensing at an energy budget that is currently unachievable using conventional energy harvesting methods. The proposed device uses a differential architecture to compensate for environmental variations and the device can retain sensed information for durations ranging from hours to days. With a theoretical operating energy budget less than 10 attojoules, we demonstrate that when integrated with a miniature piezoelectric transducer the proposed sensor-data-logger can measure cumulative "action" due to ambient mechanical acceleration without any additional external power. △ Less

Submitted 3 October, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

Comments: 24 pages (including 11 supplementary pages) and 16 figures (including 11 supplementary figures)

arXiv:2004.10270 [pdf, other]

Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi

Authors: Devansh Mehta, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Anurag Shukla, Vishnu Prasad, Venkanna U, Amit Sharma, Kalika Bali

Abstract: The primary obstacle to develo** technologies for low-resource languages is the lack of usable data. In this paper, we report the adoption and deployment of 4 technology-driven methods of data collection for Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. In the process of data collection, we also help in its revival by expanding a… ▽ More The primary obstacle to develo** technologies for low-resource languages is the lack of usable data. In this paper, we report the adoption and deployment of 4 technology-driven methods of data collection for Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. In the process of data collection, we also help in its revival by expanding access to information in Gondi through the creation of linguistic resources that can be used by the community, such as a dictionary, children's stories, an app with Gondi content from multiple sources and an Interactive Voice Response (IVR) based mass awareness platform. At the end of these interventions, we collected a little less than 12,000 translated words and/or sentences and identified more than 650 community members whose help can be solicited for future translation efforts. The larger goal of the project is collecting enough data in Gondi to build and deploy viable language technologies like machine translation and speech to text systems that can help take the language onto the internet. △ Less

Submitted 26 January, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

Comments: Accepted at LREC 2020 (7 pages). D.M. and S.S. contributed equally

arXiv:2002.07377 [pdf]

High spatially sensitive quantitative phase imaging assisted with deep neural network for classification of human spermatozoa under stressed condition

Authors: Ankit Butola, Daria Popova, Dilip K Prasad, Azeem Ahmad, Anowarul Habib, Jean Claude Tinguely, Purusotam Basnet, Ganesh Acharya, Paramasivam Senthilkumaran, Dalip Singh Mehta, Balpreet Singh Ahluwalia

Abstract: Sperm cell motility and morphology observed under the bright field microscopy are the only criteria for selecting particular sperm cell during Intracytoplasmic Sperm Injection (ICSI) procedure of Assisted Reproductive Technology (ART). Several factors such as, oxidative stress, cryopreservation, heat, smoking and alcohol consumption, are negatively associated with the quality of sperm cell and fer… ▽ More Sperm cell motility and morphology observed under the bright field microscopy are the only criteria for selecting particular sperm cell during Intracytoplasmic Sperm Injection (ICSI) procedure of Assisted Reproductive Technology (ART). Several factors such as, oxidative stress, cryopreservation, heat, smoking and alcohol consumption, are negatively associated with the quality of sperm cell and fertilization potential due to the changing of sub-cellular structures and functions which are overlooked. A bright field imaging contrast is insufficient to distinguish tiniest morphological cell features that might influence the fertilizing ability of sperm cell. We developed a partially spatially coherent digital holographic microscope (PSC-DHM) for quantitative phase imaging (QPI) in order to distinguish normal sperm cells from sperm cells under different stress conditions such as cryopreservation, exposure to hydrogen peroxide and ethanol without any labeling. Phase maps of 10,163 sperm cells (2,400 control cells, 2,750 spermatozoa after cryopreservation, 2,515 and 2,498 cells under hydrogen peroxide and ethanol respectively) are reconstructed using the data acquired from PSC-DHM system. Total of seven feedforward deep neural networks (DNN) were employed for the classification of the phase maps for normal and stress affected sperm cells. When validated against the test dataset, the DNN provided an average sensitivity, specificity and accuracy of 84.88%, 95.03% and 85%, respectively. The current approach DNN and QPI techniques of quantitative information can be applied for further improving ICSI procedure and the diagnostic efficiency for the classification of semen quality in regards to their fertilization potential and other biomedical applications in general. △ Less

Submitted 18 February, 2020; originally announced February 2020.

arXiv:1907.00837 [pdf, other]

doi 10.1145/3386569.3392410

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

Authors: Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

Abstract: We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates successfully in generic scenes which may contain occlusions by objects and by other people. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible jo… ▽ More We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates successfully in generic scenes which may contain occlusions by objects and by other people. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals.We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy. In the second stage, a fully connected neural network turns the possibly partial (on account of occlusion) 2Dpose and 3Dpose features for each subject into a complete 3Dpose estimate per individual. The third stage applies space-time skeletal model fitting to the predicted 2D and 3D pose per subject to further reconcile the 2D and 3D pose, and enforce temporal coherence. Our method returns the full skeletal pose in joint angles for each subject. This is a further key distinction from previous work that do not produce joint angle results of a coherent skeleton in real time for multi-person scenes. The proposed system runs on consumer hardware at a previously unseen speed of more than 30 fps given 512x320 images as input while achieving state-of-the-art accuracy, which we will demonstrate on a range of challenging real-world scenes. △ Less

Submitted 30 April, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

Comments: To appear in ACM Transactions on Graphics (SIGGRAPH) 2020

arXiv:1907.00199 [pdf, other]

Incidents Are Meant for Learning, Not Repeating: Sharing Knowledge About Security Incidents in Cyber-Physical Systems

Authors: Faeq Alrimawi, Liliana Pasquale, Deepak Mehta, Nobukazu Yoshioka, Bashar Nuseibeh

Abstract: Cyber-physical systems (CPSs) are part of most critical infrastructures such as industrial automation and transportation systems. Thus, security incidents targeting CPSs can have disruptive consequences to assets and people. As prior incidents tend to re-occur, sharing knowledge about these incidents can help organizations be more prepared to prevent, mitigate or investigate future incidents. This… ▽ More Cyber-physical systems (CPSs) are part of most critical infrastructures such as industrial automation and transportation systems. Thus, security incidents targeting CPSs can have disruptive consequences to assets and people. As prior incidents tend to re-occur, sharing knowledge about these incidents can help organizations be more prepared to prevent, mitigate or investigate future incidents. This paper proposes a novel approach to enable representation and sharing of knowledge about CPS incidents across different organizations. To support sharing, we represent incident knowledge (incident patterns) capturing incident characteristics that can manifest again, such as incident activities or vulnerabilities exploited by offenders. Incident patterns are a more abstract representation of specific incident instances and, thus, are general enough to be applicable to various systems - different than the one in which the incident occurred. They can also avoid disclosing potentially sensitive information about an organization's assets and resources. We provide an automated technique to extract an incident pattern from a specific incident instance. To understand how an incident pattern can manifest again in other cyber-physical systems, we also provide an automated technique to instantiate incident patterns to specific systems. We demonstrate the feasibility of our approach in the application domain of smart buildings. We evaluate correctness, scalability, and performance using two substantive scenarios inspired by real-world systems and incidents. △ Less

Submitted 29 June, 2019; originally announced July 2019.

arXiv:1905.07628 [pdf, other]

Evolving Rewards to Automate Reinforcement Learning

Authors: Aleksandra Faust, Anthony Francis, Dar Mehta

Abstract: Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many classical control tasks guide RL training using complex rewards, which require tedious hand-tuning. We automate the reward search with AutoRL, an evolutionary layer over standard RL that treats reward tuning as hyperparame… ▽ More Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many classical control tasks guide RL training using complex rewards, which require tedious hand-tuning. We automate the reward search with AutoRL, an evolutionary layer over standard RL that treats reward tuning as hyperparameter optimization and trains a population of RL agents to find a reward that maximizes the task objective. AutoRL, evaluated on four Mujoco continuous control tasks over two RL algorithms, shows improvements over baselines, with the the biggest uplift for more complex tasks. The video can be found at: \url{https://youtu.be/svdaOFfQyC8}. △ Less

Submitted 18 May, 2019; originally announced May 2019.

Comments: Accepted to 6th AutoML@ICML

arXiv:1905.04967 [pdf, other]

Implicit Filter Sparsification In Convolutional Neural Networks

Authors: Dushyant Mehta, Kwang In Kim, Christian Theobalt

Abstract: We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. Through an extensive empirical study (Mehta et al., 2019) we hypothesize the mechanism behind the sparsification process, and find surprising links to certain f… ▽ More We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. Through an extensive empirical study (Mehta et al., 2019) we hypothesize the mechanism behind the sparsification process, and find surprising links to certain filter sparsification heuristics proposed in literature. Emergence of, and the subsequent pruning of selective features is observed to be one of the contributing mechanisms, leading to feature sparsity at par or better than certain explicit sparsification / pruning approaches. In this workshop article we summarize our findings, and point out corollaries of selective-featurepenalization which could also be employed as heuristics for filter pruning △ Less

Submitted 13 May, 2019; originally announced May 2019.

Comments: ODML-CDNNR 2019 (ICML'19 workshop) extended abstract of the CVPR 2019 paper "On Implicit Filter Level Sparsity in Convolutional Neural Networks, Mehta et al." (arXiv:1811.12495)

arXiv:1904.04245 [pdf]

doi 10.1364/JOSAA.36.000D41

Influence of laser spot size at diffuser plane on the longitudinal spatial coherence function of optical coherence microscopy system

Authors: Kashif Usmani, Azeem Ahmad, Rakesh Joshi, Vishesh Dubey, Ankit Butola, Dalip Singh Mehta

Abstract: Coherence properties and wavelength of light sources are indispensable for optical coherence microscopy/tomography as they greatly influence the signal to noise ratio, axial resolution, and penetration depth of the system. In the present letter, we investigated the longitudinal spatial coherence properties of the pseudo-thermal light source (PTS) as a function of spot size at the diffuser plane, w… ▽ More Coherence properties and wavelength of light sources are indispensable for optical coherence microscopy/tomography as they greatly influence the signal to noise ratio, axial resolution, and penetration depth of the system. In the present letter, we investigated the longitudinal spatial coherence properties of the pseudo-thermal light source (PTS) as a function of spot size at the diffuser plane, which is controlled by translating microscope objective lens towards or away from the diffuser plane. The axial resolution of PTS is found to be maximum ~ 13 microns for the beam spot size of 3.5 mm at the diffuser plane. The change in the axial resolution of the system as the spot size is increased at the diffuser plane is further confirmed by performing experiments on standard gauge blocks of height difference of 15 microns. Thus, by appropriately choosing the beam spot size at the diffuser plane, any monochromatic laser light source depending on the biological window can be utilized to obtain high axial-resolution with large penetration depth and speckle-free tomographic images of multilayered biological specimens irrespective of the source temporal coherence length. In addition, PTS could be an attractive alternative light source for achieving high axial-resolution without needing chromatic aberration corrected optics and dispersion-compensation mechanism, unlike conventional setups. △ Less

Submitted 7 April, 2019; originally announced April 2019.

Comments: 11 pages, 4 figures. arXiv admin note: text overlap with arXiv:1810.01994

arXiv:1904.03289 [pdf, other]

In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Authors: Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Gerard Pons-Moll, Christian Theobalt

Abstract: Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations to such in-the-wild corpora is hardly feasible in practice. Most existing 3D labelled data sets are… ▽ More Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations to such in-the-wild corpora is hardly feasible in practice. Most existing 3D labelled data sets are either synthetically created or feature in-studio images. 3D pose estimation algorithms trained on such data often have limited ability to generalize to real world scene diversity. We therefore propose a new deep learning based method for monocular 3D human pose estimation that shows high accuracy and generalizes better to in-the-wild scenes. It has a network architecture that comprises a new disentangled hidden space encoding of explicit 2D and 3D features, and uses supervision by a new learned projection model from predicted 3D pose. Our algorithm can be jointly trained on image data with 3D labels and image data with only 2D labels. It achieves state-of-the-art accuracy on challenging in-the-wild data. △ Less

Submitted 5 April, 2019; originally announced April 2019.

Comments: Accepted to CVPR 2019

arXiv:1812.02487 [pdf]

Deep learning architecture LightOCT for diagnostic decision support using optical coherence tomography images of biological samples

Authors: Ankit Butola, Dilip K. Prasad, Azeem Ahmad, Vishesh Dubey, Darakhshan Qaiser, Anurag Srivastava, Paramsivam Senthilkumaran, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: Optical coherence tomography (OCT) is being increasingly adopted as a label-free and non-invasive technique for biomedical applications such as cancer and ocular disease diagnosis. Diagnostic information for these tissues is manifest in textural and geometric features of the OCT images, which are used by human expertise to interpret and triage. However, it suffers delays due to the long process of… ▽ More Optical coherence tomography (OCT) is being increasingly adopted as a label-free and non-invasive technique for biomedical applications such as cancer and ocular disease diagnosis. Diagnostic information for these tissues is manifest in textural and geometric features of the OCT images, which are used by human expertise to interpret and triage. However, it suffers delays due to the long process of the conventional diagnostic procedure and shortage of human expertise. Here, a custom deep learning architecture, LightOCT, is proposed for the classification of OCT images into diagnostically relevant classes. LightOCT is a convolutional neural network with only two convolutional layers and a fully connected layer, but it is shown to provide excellent training and test results for diverse OCT image datasets. We show that LightOCT provides 98.9% accuracy in classifying 44 normal and 44 malignant (invasive ductal carcinoma) breast tissue volumetric OCT images. Also, >96% accuracy in classifying public datasets of ocular OCT images as normal, age-related macular degeneration and diabetic macular edema. Additionally, we show ~96% test accuracy for classifying retinal images as belonging to choroidal neovascularization, diabetic macular edema, drusen, and normal samples on a large public dataset of more than 100,000 images. The performance of the architecture is compared with transfer learning based deep neural networks. Through this, we show that LightOCT can provide significant diagnostic support for a variety of OCT images with sufficient training and minimal hyper-parameter tuning. The trained LightOCT networks for the three-classification problem will be released online to support transfer learning on other datasets. △ Less

Submitted 6 July, 2020; v1 submitted 6 December, 2018; originally announced December 2018.

arXiv:1812.01057 [pdf]

Highly stable common-path quantitative phase microscope for biomedical imaging

Authors: Azeem Ahmad, Vishesh Dubey, Ankit Butola, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: High temporal stability is the primary requirement of any quantitative phase microscope (QPM) systems for the early stage detection of various human related diseases. The high temporal stability of the system provides accurate measurement of membrane fluctuations of the biological cells, which can be good indicator of various diseases. We developed a single element highly stable common-path QPM sy… ▽ More High temporal stability is the primary requirement of any quantitative phase microscope (QPM) systems for the early stage detection of various human related diseases. The high temporal stability of the system provides accurate measurement of membrane fluctuations of the biological cells, which can be good indicator of various diseases. We developed a single element highly stable common-path QPM system to obtain temporally stable holograms of the biological specimens. With the proposed system, the temporal stability is obtained ~ 15 mrad without using any vibration isolation table. The capability of the proposed system is demonstrated on USAF resolution chart, polystyrene spheres (dia. 4.5 micron) and human red blood cells (RBCs). The membrane fluctuation of healthy human RBCs is further successfully measured and found to be equal to 63 nm. Contrary to its counterparts, present system offers energy efficient, cost effective and simple way of generating object and reference beam for the development of common-path QPM. △ Less

Submitted 3 December, 2018; originally announced December 2018.

Comments: 9 pages, 6 figures

arXiv:1812.00378 [pdf]

Sub-nanometer height sensitivity by phase shifting interference microscopy under ambient environmental fluctuations

Authors: Azeem Ahmad, Vishesh Dubey, Ankit Butola, Jean-Claude Tinguely, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: Phase shifting interferometric (PSI) techniques are among the most sensitive phase measurement methods. Owing to its high sensitivity, any minute phase change caused due to environmental instability results into, inaccurate phase measurement. Consequently, a well calibrated piezo electric transducer (PZT) and highly-stable environment is mandatory for measuring accurate phase map using PSI impleme… ▽ More Phase shifting interferometric (PSI) techniques are among the most sensitive phase measurement methods. Owing to its high sensitivity, any minute phase change caused due to environmental instability results into, inaccurate phase measurement. Consequently, a well calibrated piezo electric transducer (PZT) and highly-stable environment is mandatory for measuring accurate phase map using PSI implementation. Here, we present a new method of recording temporal phase shifted interferograms and a numerical algorithm, which can retrieve phase maps of the samples with negligible errors under the ambient environmental fluctuations. The method is implemented by recording a video of continuous temporally phase shifted interferograms and phase shifts were calculated between all the data frames using newly developed algorithm with a high accuracy less than or equal to 5.5*10-4*pi rad. To demonstrate the robustness of the proposed method, a manual translation of the stage was employed to introduce continuous temporal phase shift between data frames. The developed algorithm is first verified by performing quantitative phase imaging of optical waveguide and red blood cells using uncalibrated PZT under the influence of vibrations/air turbulence and compared with the well calibrated PZT results. Furthermore, we demonstrated the potential of the proposed approach by acquiring the quantitative phase imaging of an optical waveguide with a rib height of only 2 nm. By using 12-bit CMOS camera the height of shallow rib waveguide is measured with a height sensitivity of 4 Angstrom without using PZT and in presence of environmental fluctuations. △ Less

Submitted 2 December, 2018; originally announced December 2018.

Comments: 26 pages, 15 figures

arXiv:1811.12495 [pdf, other]

On Implicit Filter Level Sparsity in Convolutional Neural Networks

Authors: Dushyant Mehta, Kwang In Kim, Christian Theobalt

Abstract: We investigate filter level sparsity that emerges in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. We conduct an extensive experimental study casting our initial findings into hypotheses and conclusions about the mechanisms underlying the emergent filter lev… ▽ More We investigate filter level sparsity that emerges in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. We conduct an extensive experimental study casting our initial findings into hypotheses and conclusions about the mechanisms underlying the emergent filter level sparsity. This study allows new insight into the performance gap obeserved between adapative and non-adaptive gradient descent methods in practice. Further, analysis of the effect of training strategies and hyperparameters on the sparsity leads to practical suggestions in designing CNN training strategies enabling us to explore the tradeoffs between feature selectivity, network capacity, and generalization performance. Lastly, we show that the implicit sparsity can be harnessed for neural network speedup at par or better than explicit sparsification / pruning approaches, with no modifications to the typical training pipeline required. △ Less

Submitted 5 April, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

Comments: Accepted at CVPR 2019

arXiv:1810.11726 [pdf, other]

Towards Robust Deep Neural Networks

Authors: Timothy E. Wang, Yiming Gu, Dhagash Mehta, Xiaojun Zhao, Edgar A. Bernal

Abstract: We investigate the topics of sensitivity and robustness in feedforward and convolutional neural networks. Combining energy landscape techniques developed in computational chemistry with tools drawn from formal methods, we produce empirical evidence indicating that networks corresponding to lower-lying minima in the optimization landscape of the learning objective tend to be more robust. The robust… ▽ More We investigate the topics of sensitivity and robustness in feedforward and convolutional neural networks. Combining energy landscape techniques developed in computational chemistry with tools drawn from formal methods, we produce empirical evidence indicating that networks corresponding to lower-lying minima in the optimization landscape of the learning objective tend to be more robust. The robustness estimate used is the inverse of a proposed sensitivity measure, which we define as the volume of an over-approximation of the reachable set of network outputs under all additive $l_{\infty}$-bounded perturbations on the input data. We present a novel loss function which includes a sensitivity term in addition to the traditional task-oriented and regularization terms. In our experiments on standard machine learning and computer vision datasets, we show that the proposed loss function leads to networks which reliably optimize the robustness measure as well as other related metrics of adversarial robustness without significant degradation in the classification error. Experimental results indicate that the proposed method outperforms state-of-the-art sensitivity-based learning approaches with regards to robustness to adversarial attacks. We also show that although the introduced framework does not explicitly enforce an adversarial loss, it achieves competitive overall performance relative to methods that do. △ Less

Submitted 4 December, 2018; v1 submitted 27 October, 2018; originally announced October 2018.

Comments: Added further discussions, and supplementary material

arXiv:1810.07716 [pdf, other]

The loss surface of deep linear networks viewed through the algebraic geometry lens

Authors: Dhagash Mehta, Tianran Chen, Tingting Tang, Jonathan D. Hauenstein

Abstract: By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of the deep linear neural network models. After clarifying on the various definitions of "flat" minima, we show that the geometrically flat minima, which are merely artifacts of residual continuous symmetries of the deep linear networks, can be straightforwardly removed by a gene… ▽ More By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of the deep linear neural network models. After clarifying on the various definitions of "flat" minima, we show that the geometrically flat minima, which are merely artifacts of residual continuous symmetries of the deep linear networks, can be straightforwardly removed by a generalized $L_2$ regularization. Then, we establish upper bounds on the number of isolated stationary points of these networks with the help of algebraic geometry. Using these upper bounds and utilizing a numerical algebraic geometry method, we find all stationary points of modest depth and matrix size. We show that in the presence of the non-zero regularization, deep linear networks indeed possess local minima which are not the global minima. Our computational results clarify certain aspects of the loss surfaces of deep linear networks and provide novel insights. △ Less

Submitted 17 October, 2018; originally announced October 2018.

Comments: 16 pages (2-columns), 5 figures

arXiv:1810.03697 [pdf]

doi 10.1364/OE.27.004572

Characterization of color cross-talk of CCD detectors and its influence in multispectral quantitative phase imaging

Authors: Azeem Ahmad, Anand Kumar, Vishesh Dubey, Ankit Butola, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: Multi-spectral quantitative phase imaging (QPI) is an emerging imaging modality for wavelength dependent studies of several biological and industrial specimens. Simultaneous multi-spectral QPI is generally performed with color CCD cameras. However, color CCD cameras are suffered from the color crosstalk issue, which needed to be explored. Here, we present a new approach for accurately measuring th… ▽ More Multi-spectral quantitative phase imaging (QPI) is an emerging imaging modality for wavelength dependent studies of several biological and industrial specimens. Simultaneous multi-spectral QPI is generally performed with color CCD cameras. However, color CCD cameras are suffered from the color crosstalk issue, which needed to be explored. Here, we present a new approach for accurately measuring the color crosstalk of 2D area detectors, without needing prior information about camera specifications. Color crosstalk of two different cameras commonly used in QPI, single chip CCD (1-CCD) and three chip CCD (3-CCD), is systematically studied and compared using compact interference microscopy. The influence of color crosstalk on the fringe width and the visibility of the monochromatic constituents corresponding to three color channels of white light interferogram are studied both through simulations and experiments. It is observed that presence of color crosstalk changes the fringe width and visibility over the imaging field of view. This leads to an unwanted non-uniform background error in the multi-spectral phase imaging of the specimens. It is demonstrated that the color crosstalk of the detector is the key limiting factor for phase measurement accuracy of simultaneous multi-spectral QPI systems. △ Less

Submitted 5 October, 2018; originally announced October 2018.

Comments: 16 pages, 8 figures

arXiv:1810.01994 [pdf]

doi 10.1364/OL.44.001817

Study of longitudinal coherence properties of pseudo thermal light source as a function of source size and temporal coherence

Authors: Azeem Ahmad, Tanmoy Mahanty, Vishesh Dubey, Ankit Butola, Balpreet Singh Ahluwalia, Dalip Singh Mehta

Abstract: In conventional OCT, broadband light sources are generally utilized to obtain high axial resolution due to their low temporal coherence (TC) length. Purely monochromatic (i.e., high TC length) light sources like laser cannot be implemented to acquire high resolution optically sectioned images of the specimen. Contrary to this, pseudo thermal light source having high TC and low spatial coherence (S… ▽ More In conventional OCT, broadband light sources are generally utilized to obtain high axial resolution due to their low temporal coherence (TC) length. Purely monochromatic (i.e., high TC length) light sources like laser cannot be implemented to acquire high resolution optically sectioned images of the specimen. Contrary to this, pseudo thermal light source having high TC and low spatial coherence (SC) property can be employed to achieve high axial resolution comparable to broadband light source. In the present letter, a pseudo thermal light source is synthesized by passing a purely monochromatic laser beam through a rotating diffuser. The longitudinal coherence (LC) property of the pseudo thermal light source is studied as a function of source size and TC length. The LC length of the synthesized light source decreased as the source size increased. It is found that LC length of such light source becomes independent of the parent laser TC length for source size of greater than or equal to 3.3 mm and become almost constant at around 30 micron for both the lasers. Thus any monochromatic laser light source can be utilized to obtain high axial resolution in OCT system irrespective of its TC length. The maximum achievable axial resolution is found to be equal to 650 nm corresponding to 1.2 numerical aperture (NA) objective lens at 632 nm wavelength. The findings elucidate that pseudo thermal source being monochromatic in nature can improve the performance of existing OCT systems significantly. △ Less

Submitted 3 October, 2018; originally announced October 2018.

Comments: 5 pages, 4 figures

arXiv:1808.06481 [pdf, other]

On Dividing a Rectangle

Authors: Robert Dumitru, Quinn Perian, Alexander Nealey, Mohammed Mannan, Eddie Beck, Nick Castro, David Gay, Dipen Mehta, Anish Pandya, Ejay Cho

Abstract: This paper deals with the history of the following problem: "Can an arbitrary rectangle be dissected into 3 non-rectangular congruent regions?" We present a new elementary proof that the answer is indeed no. This paper deals with the history of the following problem: "Can an arbitrary rectangle be dissected into 3 non-rectangular congruent regions?" We present a new elementary proof that the answer is indeed no. △ Less

Submitted 14 August, 2018; originally announced August 2018.

arXiv:1804.02411 [pdf, ps, other]

doi 10.1103/PhysRevE.97.052307

The Loss Surface of XOR Artificial Neural Networks

Authors: Dhagash Mehta, Xiaojun Zhao, Edgar A. Bernal, David J. Wales

Abstract: Training an artificial neural network involves an optimization process over the landscape defined by the cost (loss) as a function of the network parameters. We explore these landscapes using optimisation tools developed for potential energy landscapes in molecular science. The number of local minima and transition states (saddle points of index one), as well as the ratio of transition states to m… ▽ More Training an artificial neural network involves an optimization process over the landscape defined by the cost (loss) as a function of the network parameters. We explore these landscapes using optimisation tools developed for potential energy landscapes in molecular science. The number of local minima and transition states (saddle points of index one), as well as the ratio of transition states to minima, grow rapidly with the number of nodes in the network. There is also a strong dependence on the regularisation parameter, with the landscape becoming more convex (fewer minima) as the regularisation term increases. We demonstrate that in our formulation, stationary points for networks with $N_h$ hidden nodes, including the minimal network required to fit the XOR data, are also stationary points for networks with $N_{h} +1$ hidden nodes when all the weights involving the additional nodes are zero. Hence, smaller networks optimized to train the XOR data are embedded in the landscapes of larger networks. Our results clarify certain aspects of the classification and sensitivity (to perturbations in the input data) of minima and saddle points for this system, and may provide insight into dropout and network compression. △ Less

Submitted 6 April, 2018; originally announced April 2018.

Comments: 19 pages, 6 figures. Submitted to journal in Oct, 2017

Journal ref: Phys. Rev. E 97, 052307 (2018)

arXiv:1801.10285 [pdf, other]

Optimal Configurations in Coverage Control with Polynomial Costs

Authors: Shaunak D. Bopardikar, Dhagash Mehta, Jonathan D. Hauenstein

Abstract: We revisit the static coverage control problem for placement of vehicles with simple motion on the real line, under the assumption that the cost is a polynomial function of the locations of the vehicles. The main contribution of this paper is to demonstrate the use of tools from numerical algebraic geometry, in particular, a numerical polynomial homotopy continuation method that guarantees to find… ▽ More We revisit the static coverage control problem for placement of vehicles with simple motion on the real line, under the assumption that the cost is a polynomial function of the locations of the vehicles. The main contribution of this paper is to demonstrate the use of tools from numerical algebraic geometry, in particular, a numerical polynomial homotopy continuation method that guarantees to find all solutions of polynomial equations, in order to characterize the \emph{global minima} for the coverage control problem. The results are then compared against a classic distributed approach involving the use of Lloyd descent, which is known to converge only to a local minimum under certain technical conditions. △ Less

Submitted 30 January, 2018; originally announced January 2018.

Comments: 6 pages, 2 figures

arXiv:1712.03453 [pdf, other]

Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB

Authors: Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, Christian Theobalt

Abstract: We propose a new single-shot method for multi-person 3D pose estimation in general scenes from a monocular RGB camera. Our approach uses novel occlusion-robust pose-maps (ORPM) which enable full body pose inference even under strong partial occlusions by other people and objects in the scene. ORPM outputs a fixed number of maps which encode the 3D joint locations of all people in the scene. Body p… ▽ More We propose a new single-shot method for multi-person 3D pose estimation in general scenes from a monocular RGB camera. Our approach uses novel occlusion-robust pose-maps (ORPM) which enable full body pose inference even under strong partial occlusions by other people and objects in the scene. ORPM outputs a fixed number of maps which encode the 3D joint locations of all people in the scene. Body part associations allow us to infer 3D pose for an arbitrary number of people without explicit bounding box prediction. To train our approach we introduce MuCo-3DHP, the first large scale training data set showing real images of sophisticated multi-person interactions and occlusions. We synthesize a large corpus of multi-person images by compositing images of individual people (with ground truth from mutli-view performance capture). We evaluate our method on our new challenging 3D annotated multi-person test set MuPoTs-3D where we achieve state-of-the-art performance. To further stimulate research in multi-person 3D pose estimation, we will make our new datasets, and associated code publicly available for research purposes. △ Less

Submitted 28 August, 2018; v1 submitted 9 December, 2017; originally announced December 2017.

Comments: International Conference on 3D Vision (3DV), 2018

arXiv:1712.01057 [pdf, other]

GANerated Hands for Real-time 3D Hand Tracking from Monocular RGB

Authors: Franziska Mueller, Florian Bernard, Oleksandr Sotnychenko, Dushyant Mehta, Srinath Sridhar, Dan Casas, Christian Theobalt

Abstract: We address the highly challenging problem of real-time 3D hand tracking based on a monocular RGB-only sequence. Our tracking method combines a convolutional neural network with a kinematic 3D hand model, such that it generalizes well to unseen data, is robust to occlusions and varying camera viewpoints, and leads to anatomically plausible as well as temporally smooth hand motions. For training our… ▽ More We address the highly challenging problem of real-time 3D hand tracking based on a monocular RGB-only sequence. Our tracking method combines a convolutional neural network with a kinematic 3D hand model, such that it generalizes well to unseen data, is robust to occlusions and varying camera viewpoints, and leads to anatomically plausible as well as temporally smooth hand motions. For training our CNN we propose a novel approach for the synthetic generation of training data that is based on a geometrically consistent image-to-image translation network. To be more specific, we use a neural network that translates synthetic images to "real" images, such that the so-generated images follow the same statistical distribution as real-world hand images. For training this translation network we combine an adversarial loss and a cycle-consistency loss with a geometric consistency loss in order to preserve geometric properties (such as hand pose) during translation. We demonstrate that our hand tracking system outperforms the current state-of-the-art on challenging RGB-only footage. △ Less

Submitted 4 December, 2017; originally announced December 2017.

arXiv:1709.02046 [pdf, ps, other]

doi 10.1039/C7CP03346J

Properties of Kinetic Transition Networks for Atomic Clusters and Glassy Solids

Authors: John W R Morgan, Dhagash Mehta, David J Wales

Abstract: A database of minima and transition states corresponds to a network where the minima represent nodes and the transition states correspond to edges between the pairs of minima they connect via steepest-descent paths. Here we construct networks for small clusters bound by the Morse potential for a selection of physically relevant parameters, in two and three dimensions. The properties of these unwei… ▽ More A database of minima and transition states corresponds to a network where the minima represent nodes and the transition states correspond to edges between the pairs of minima they connect via steepest-descent paths. Here we construct networks for small clusters bound by the Morse potential for a selection of physically relevant parameters, in two and three dimensions. The properties of these unweighted and undirected networks are analysed to examine two features: whether they are small-world, where the shortest path between nodes involves only a small number or edges; and whether they are scale-free, having a degree distribution that follows a power law. Small-world character is present, but statistical tests show that a power law is not a good fit, so the networks are not scale-free. These results for clusters are compared with the corresponding properties for the molecular and atomic structural glass formers ortho-terphenyl and binary Lennard-Jones. These glassy systems do not show small-world properties, suggesting that such behaviour is linked to the structure-seeking landscapes of the Morse clusters. △ Less

Submitted 6 September, 2017; originally announced September 2017.

Comments: 23 pages, 19 figures. Accepted for publication in Physical Chemistry Chemical Physics

arXiv:1708.09246 [pdf, ps, other]

Counting equilibria of the Kuramoto model using birationally invariant intersection index

Authors: Tianran Chen, Robert Davis, Dhagash Mehta

Abstract: Synchronization in networks of interconnected oscillators is a fascinating phenomenon that appear naturally in many independent fields of science and engineering. A substantial amount of work has been devoted to understanding all possible synchronization configurations on a given network. In this setting, a key problem is to determine the total number of such configurations. Through an algebraic f… ▽ More Synchronization in networks of interconnected oscillators is a fascinating phenomenon that appear naturally in many independent fields of science and engineering. A substantial amount of work has been devoted to understanding all possible synchronization configurations on a given network. In this setting, a key problem is to determine the total number of such configurations. Through an algebraic formulation, for tree and cycle graphs, we provide an upper bound on this number using the birationally invariant intersection index of a system of rational functions on a toric variety. △ Less

Submitted 30 August, 2017; originally announced August 2017.

MSC Class: 14Q99; 65H10

arXiv:1708.02136 [pdf, other]

MonoPerfCap: Human Performance Capture from Monocular Video

Authors: Weipeng Xu, Avishek Chatterjee, Michael Zollhöfer, Helge Rhodin, Dushyant Mehta, Hans-Peter Seidel, Christian Theobalt

Abstract: We present the first marker-less approach for temporally coherent 3D performance capture of a human with general clothing from monocular video. Our approach reconstructs articulated human skeleton motion as well as medium-scale non-rigid surface deformations in general scenes. Human performance capture is a challenging problem due to the large range of articulation, potentially fast motion, and co… ▽ More We present the first marker-less approach for temporally coherent 3D performance capture of a human with general clothing from monocular video. Our approach reconstructs articulated human skeleton motion as well as medium-scale non-rigid surface deformations in general scenes. Human performance capture is a challenging problem due to the large range of articulation, potentially fast motion, and considerable non-rigid deformations, even from multi-view data. Reconstruction from monocular video alone is drastically more challenging, since strong occlusions and the inherent depth ambiguity lead to a highly ill-posed reconstruction problem. We tackle these challenges by a novel approach that employs sparse 2D and 3D human pose detections from a convolutional neural network using a batch-based pose estimation strategy. Joint recovery of per-batch motion allows to resolve the ambiguities of the monocular reconstruction problem based on a low dimensional trajectory subspace. In addition, we propose refinement of the surface geometry based on fully automatically extracted silhouettes to enable medium-scale non-rigid alignment. We demonstrate state-of-the-art performance capture results that enable exciting applications such as video editing and free viewpoint video, previously infeasible from monocular video. Our qualitative and quantitative evaluation demonstrates that our approach significantly outperforms previous monocular methods in terms of accuracy, robustness and scene complexity that can be handled. △ Less

Submitted 23 February, 2018; v1 submitted 7 August, 2017; originally announced August 2017.

Comments: Accepted to ACM TOG 2018, to be presented on SIGGRAPH 2018

arXiv:1705.01583 [pdf, other]

doi 10.1145/3072959.3073596

VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera

Authors: Dushyant Mehta, Srinath Sridhar, Oleksandr Sotnychenko, Helge Rhodin, Mohammad Shafiei, Hans-Peter Seidel, Weipeng Xu, Dan Casas, Christian Theobalt

Abstract: We present the first real-time method to capture the full global 3D skeletal pose of a human in a stable, temporally consistent manner using a single RGB camera. Our method combines a new convolutional neural network (CNN) based pose regressor with kinematic skeleton fitting. Our novel fully-convolutional pose formulation regresses 2D and 3D joint positions jointly in real time and does not requir… ▽ More We present the first real-time method to capture the full global 3D skeletal pose of a human in a stable, temporally consistent manner using a single RGB camera. Our method combines a new convolutional neural network (CNN) based pose regressor with kinematic skeleton fitting. Our novel fully-convolutional pose formulation regresses 2D and 3D joint positions jointly in real time and does not require tightly cropped input frames. A real-time kinematic skeleton fitting method uses the CNN output to yield temporally stable 3D global pose reconstructions on the basis of a coherent kinematic skeleton. This makes our approach the first monocular RGB method usable in real-time applications such as 3D character control---thus far, the only monocular methods for such applications employed specialized RGB-D cameras. Our method's accuracy is quantitatively on par with the best offline 3D monocular RGB pose estimation methods. Our results are qualitatively comparable to, and sometimes better than, results from monocular RGB-D approaches, such as the Kinect. However, we show that our approach is more broadly applicable than RGB-D solutions, i.e. it works for outdoor scenes, community videos, and low quality commodity RGB cameras. △ Less

Submitted 3 May, 2017; originally announced May 2017.

Comments: Accepted to SIGGRAPH 2017

arXiv:1704.04792 [pdf, other]

Locating Power Flow Solution Space Boundaries: A Numerical Polynomial Homotopy Approach

Authors: Souvik Chandra, Dhagash Mehta, Aranya Chakrabortty

Abstract: The solution space of any set of power flow equations may contain different number of real-valued solutions. The boundaries that separate these regions are referred to as power flow solution space boundaries. Knowledge of these boundaries is important as they provide a measure for voltage stability. Traditionally, continuation based methods have been employed to compute these boundaries on the bas… ▽ More The solution space of any set of power flow equations may contain different number of real-valued solutions. The boundaries that separate these regions are referred to as power flow solution space boundaries. Knowledge of these boundaries is important as they provide a measure for voltage stability. Traditionally, continuation based methods have been employed to compute these boundaries on the basis of initial guesses for the solution. However, with rapid growth of renewable energy sources these boundaries will be increasingly affected by variable parameters such as penetration levels, locations of the renewable sources, and voltage set-points, making it difficult to generate an initial guess that can guarantee all feasible solutions for the power flow problem. In this paper we solve this problem by applying a numerical polynomial homotopy based continuation method. The proposed method guarantees to find all solution boundaries within a given parameter space up to a chosen level of discretization, independent of any initial guess. Power system operators can use this computational tool conveniently to plan the penetration levels of renewable sources at different buses. We illustrate the proposed method through simulations on 3-bus and 10-bus power system examples with renewable generation. △ Less

Submitted 16 April, 2017; originally announced April 2017.

Comments: 9 pages, 5 figures

arXiv:1704.02201 [pdf, other]

doi 10.1109/ICCV.2017.131

Real-time Hand Tracking under Occlusion from an Egocentric RGB-D Sensor

Authors: Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt

Abstract: We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments. Existing methods typically fail for hand-object interactions in cluttered scenes imaged from egocentric viewpoints, common for virtual or augmented reality applications. Our approach uses two subsequently applied Convolutional Neural Networks (CNNs) to… ▽ More We present an approach for real-time, robust and accurate hand pose estimation from moving egocentric RGB-D cameras in cluttered real environments. Existing methods typically fail for hand-object interactions in cluttered scenes imaged from egocentric viewpoints, common for virtual or augmented reality applications. Our approach uses two subsequently applied Convolutional Neural Networks (CNNs) to localize the hand and regress 3D joint locations. Hand localization is achieved by using a CNN to estimate the 2D position of the hand center in the input, even in the presence of clutter and occlusions. The localized hand position, together with the corresponding input depth value, is used to generate a normalized cropped image that is fed into a second CNN to regress relative 3D hand joint locations in real time. For added accuracy, robustness and temporal stability, we refine the pose estimates using a kinematic pose tracking energy. To train the CNNs, we introduce a new photorealistic dataset that uses a merged reality approach to capture and synthesize large amounts of annotated data of natural hand interaction in cluttered scenes. Through quantitative and qualitative evaluation, we show that our method is robust to self-occlusion and occlusions by objects, particularly in moving egocentric perspectives. △ Less

Submitted 5 October, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

Comments: Accepted at the International Conference on Computer Vision (ICCV) 2017

arXiv:1703.07915 [pdf, other]

doi 10.1039/C7CP01108C

Perspective: Energy Landscapes for Machine Learning

Authors: Andrew J. Ballard, Ritankar Das, Stefano Martiniani, Dhagash Mehta, Levent Sagun, Jacob D. Stevenson, David J. Wales

Abstract: Machine learning techniques are being increasingly used as flexible non-linear fitting and prediction tools in the physical sciences. Fitting functions that exhibit multiple solutions as local minima can be analysed in terms of the corresponding machine learning landscape. Methods to explore and visualise molecular potential energy landscapes can be applied to these machine learning landscapes to… ▽ More Machine learning techniques are being increasingly used as flexible non-linear fitting and prediction tools in the physical sciences. Fitting functions that exhibit multiple solutions as local minima can be analysed in terms of the corresponding machine learning landscape. Methods to explore and visualise molecular potential energy landscapes can be applied to these machine learning landscapes to gain new insight into the solution space involved in training and the nature of the corresponding predictions. In particular, we can define quantities analogous to molecular structure, thermodynamics, and kinetics, and relate these emergent properties to the structure of the underlying landscape. This Perspective aims to describe these analogies with examples from recent applications, and suggest avenues for new interdisciplinary research. △ Less

Submitted 22 March, 2017; originally announced March 2017.

Comments: 41 pages, 25 figures. Accepted for publication in Physical Chemistry Chemical Physics, 2017

arXiv:1611.09813 [pdf, other]

Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision

Authors: Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, Christian Theobalt

Abstract: We propose a CNN-based approach for 3D human body pose estimation from single RGB images that addresses the issue of limited generalizability of models trained solely on the starkly limited publicly available 3D pose data. Using only the existing 3D pose data and 2D pose data, we show state-of-the-art performance on established benchmarks through transfer of learned features, while also generalizi… ▽ More We propose a CNN-based approach for 3D human body pose estimation from single RGB images that addresses the issue of limited generalizability of models trained solely on the starkly limited publicly available 3D pose data. Using only the existing 3D pose data and 2D pose data, we show state-of-the-art performance on established benchmarks through transfer of learned features, while also generalizing to in-the-wild scenes. We further introduce a new training set for human body pose estimation from monocular images of real humans that has the ground truth captured with a multi-camera marker-less motion capture system. It complements existing corpora with greater diversity in pose, human appearance, clothing, occlusion, and viewpoints, and enables an increased scope of augmentation. We also contribute a new benchmark that covers outdoor and indoor scenes, and demonstrate that our 3D pose dataset shows better in-the-wild performance than existing annotated data, which is further improved in conjunction with transfer learning from 2D pose data. All in all, we argue that the use of transfer learning of representations in tandem with algorithmic and data contributions is crucial for general 3D body pose estimation. △ Less

Submitted 4 October, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

Comments: Accepted at the International Conference on 3D Vision (3DV) 2017

arXiv:1610.00100 [pdf]

doi 10.1016/j.nimb.2017.01.044

L shell x-ray production in high-Z elements using 4-6 MeV/u fluorine ions

Authors: Sunil Kumar, Udai Singh, M. Oswal, G. Singh, N. Singh, D. Mehta, G. Lapicki, T. Nandi

Abstract: L shell line and total x-ray production cross sections in 78Pt, 79Au, 82Pb, 83Bi, 90Th, and 92U targets ionized by 4-6 MeV/u fluorine ions were measured. These cross sections are compared with available theories for L shell ionization using single- and multiple-hole fluorescence and the Coster-Kronig yields. The ECPSSR and the ECUSAR theories exhibit good agreement with the measured data, whereas,… ▽ More L shell line and total x-ray production cross sections in 78Pt, 79Au, 82Pb, 83Bi, 90Th, and 92U targets ionized by 4-6 MeV/u fluorine ions were measured. These cross sections are compared with available theories for L shell ionization using single- and multiple-hole fluorescence and the Coster-Kronig yields. The ECPSSR and the ECUSAR theories exhibit good agreement with the measured data, whereas, the FBA theory overestimates them by a factor of two. Although for the F ion charge states q = 6-8 the multiple-hole atomic parameters do not significantly differ from the single-hole values, after an account for the multiple-holes, our data are better in agreement with the ECUSAR than the ECPSSR theory. △ Less

Submitted 1 October, 2016; originally announced October 2016.

Comments: 32 pages,7 figures

arXiv:1608.02301 [pdf, other]

Uncovering Voice Misuse Using Symbolic Mismatch

Authors: Marzyeh Ghassemi, Zeeshan Syed, Daryush D. Mehta, Jarrad H. Van Stan, Robert E. Hillman, John V. Guttag

Abstract: Voice disorders affect an estimated 14 million working-aged Americans, and many more worldwide. We present the first large scale study of vocal misuse based on long-term ambulatory data collected by an accelerometer placed on the neck. We investigate an unsupervised data mining approach to uncovering latent information about voice misuse. We segment signals from over 253 days of data from 22 sub… ▽ More Voice disorders affect an estimated 14 million working-aged Americans, and many more worldwide. We present the first large scale study of vocal misuse based on long-term ambulatory data collected by an accelerometer placed on the neck. We investigate an unsupervised data mining approach to uncovering latent information about voice misuse. We segment signals from over 253 days of data from 22 subjects into over a hundred million single glottal pulses (closures of the vocal folds), cluster segments into symbols, and use symbolic mismatch to uncover differences between patients and matched controls, and between patients pre- and post-treatment. Our results show significant behavioral differences between patients and controls, as well as between some pre- and post-treatment patients. Our proposed approach provides an objective basis for hel** diagnose behavioral voice disorders, and is a first step towards a more data-driven understanding of the impact of voice therapy. △ Less

Submitted 7 August, 2016; originally announced August 2016.

Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

arXiv:1605.08459 [pdf, other]

doi 10.1103/PhysRevLett.117.028301

Kinetic Transition Networks for the Thomson Problem and Smale's 7th Problem

Authors: Dhagash Mehta, Jianxu Chen, Danny Z. Chen, Halim Kusumaatmaja, David J. Wales

Abstract: The Thomson Problem, arrangement of identical charges on the surface of a sphere, has found many applications in physics, chemistry and biology. Here we show that the energy landscape of the Thomson Problem for $N$ particles with $N=132, 135, 138, 141, 144, 147$ and $150$ is single funnelled, characteristic of a structure-seeking organisation where the global minimum is easily accessible. Algorith… ▽ More The Thomson Problem, arrangement of identical charges on the surface of a sphere, has found many applications in physics, chemistry and biology. Here we show that the energy landscape of the Thomson Problem for $N$ particles with $N=132, 135, 138, 141, 144, 147$ and $150$ is single funnelled, characteristic of a structure-seeking organisation where the global minimum is easily accessible. Algorithmically constructing starting points close to the global minimum of such a potential with spherical constraints is one of Smale's 18 unsolved problems in mathematics for the 21st century because it is important in the solution of univariate and bivariate random polynomial equations. By analysing the kinetic transition networks, we show that a randomly chosen minimum is in fact always `close' to the global minimum in terms of the number of transition states that separate them, a characteristic of small world networks. △ Less

Submitted 8 June, 2016; v1 submitted 26 May, 2016; originally announced May 2016.

Comments: 6 pages, 2-column format. Accepted for publication in Physical Review Letters

Report number: ADP-16-2/T957

Journal ref: Phys. Rev. Lett. 117, 028301 (2016)

arXiv:1605.06940 [pdf, other]

Elastic Solver: Balancing Solution Time and Energy Consumption

Authors: Barry Hurley, Deepak Mehta, Barry O'Sullivan

Abstract: Combinatorial decision problems arise in many different domains such as scheduling, routing, packing, bioinformatics, and many more. Despite recent advances in develo** scalable solvers, there are still many problems which are often very hard to solve. Typically the most advanced solvers include elements which are stochastic in nature. If a same instance is solved many times using different seed… ▽ More Combinatorial decision problems arise in many different domains such as scheduling, routing, packing, bioinformatics, and many more. Despite recent advances in develo** scalable solvers, there are still many problems which are often very hard to solve. Typically the most advanced solvers include elements which are stochastic in nature. If a same instance is solved many times using different seeds then depending on the inherent characteristics of a problem instance and the solver, one can observe a highly-variant distribution of times spanning multiple orders of magnitude. Therefore, to solve a problem instance efficiently it is often useful to solve the same instance in parallel with different seeds. With the proliferation of cloud computing, it is natural to think about an elastic solver which can scale up by launching searches in parallel on thousands of machines (or cores). However, this could result in consuming a lot of energy. Moreover, not every instance would require thousands of machines. The challenge is to resolve the tradeoff between solution time and energy consumption optimally for a given problem instance. We analyse the impact of the number of machines (or cores) on not only solution time but also on energy consumption. We highlight that although solution time always drops as the number of machines increases, the relation between the number of machines and energy consumption is more complicated. In many cases, the optimal energy consumption may be achieved by a middle ground, we analyse this relationship in detail. The tradeoff between solution time and energy consumption is studied further, showing that the energy consumption of a solver can be reduced drastically if we increase the solution time marginally. We also develop a prediction model, demonstrating that such insights can be exploited to achieve faster solutions times in a more energy efficient manor. △ Less

Submitted 23 May, 2016; originally announced May 2016.

Comments: Keywords: Combinatorial Optimisation, Energy Minimisation, Parallel Solving

arXiv:1605.06451 [pdf, other]

Fixed Points of Belief Propagation -- An Analysis via Polynomial Homotopy Continuation

Authors: Christian Knoll, Franz Pernkopf, Dhagash Mehta, Tianran Chen

Abstract: Belief propagation (BP) is an iterative method to perform approximate inference on arbitrary graphical models. Whether BP converges and if the solution is a unique fixed point depends on both the structure and the parametrization of the model. To understand this dependence it is interesting to find \emph{all} fixed points. In this work, we formulate a set of polynomial equations, the solutions of… ▽ More Belief propagation (BP) is an iterative method to perform approximate inference on arbitrary graphical models. Whether BP converges and if the solution is a unique fixed point depends on both the structure and the parametrization of the model. To understand this dependence it is interesting to find \emph{all} fixed points. In this work, we formulate a set of polynomial equations, the solutions of which correspond to BP fixed points. To solve such a nonlinear system we present the numerical polynomial-homotopy-continuation (NPHC) method. Experiments on binary Ising models and on error-correcting codes show how our method is capable of obtaining all BP fixed points. On Ising models with fixed parameters we show how the structure influences both the number of fixed points and the convergence properties. We further asses the accuracy of the marginals and weighted combinations thereof. Weighting marginals with their respective partition function increases the accuracy in all experiments. Contrary to the conjecture that uniqueness of BP fixed points implies convergence, we find graphs for which BP fails to converge, even though a unique fixed point exists. Moreover, we show that this fixed point gives a good approximation, and the NPHC method is able to obtain this fixed point. △ Less

Submitted 30 May, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

arXiv:1604.02623 [pdf, ps, other]

Decomposing the parameter space of biological networks via a numerical discriminant approach

Authors: Heather A. Harrington, Dhagash Mehta, Helen M. Byrne, Jonathan D. Hauenstein

Abstract: Many systems in biology, physics and engineering can be described by systems of ordinary differential equation containing many parameters. When studying the dynamic behavior of these large, nonlinear systems, it is useful to identify and characterize the steady-state solutions as the model parameters vary, a technically challenging problem in a high-dimensional parameter landscape. Rather than sim… ▽ More Many systems in biology, physics and engineering can be described by systems of ordinary differential equation containing many parameters. When studying the dynamic behavior of these large, nonlinear systems, it is useful to identify and characterize the steady-state solutions as the model parameters vary, a technically challenging problem in a high-dimensional parameter landscape. Rather than simply determining the number and stability of steady-states at distinct points in parameter space, we decompose the parameter space into finitely many regions, the steady-state solutions being consistent within each distinct region. From a computational algebraic viewpoint, the boundary of these regions is contained in the discriminant locus. We develop global and local numerical algorithms for constructing the discriminant locus and classifying the parameter landscape. We showcase our numerical approaches by applying them to molecular and cell-network models. △ Less

Submitted 9 April, 2016; originally announced April 2016.

Comments: 13 pages, 4 figures

arXiv:1603.06078 [pdf, other]

doi 10.1111/cgf.13225

Deep Shading: Convolutional Neural Networks for Screen-Space Shading

Authors: Oliver Nalbach, Elena Arabadzhiyska, Dushyant Mehta, Hans-Peter Seidel, Tobias Ritschel

Abstract: In computer vision, convolutional neural networks (CNNs) have recently achieved new levels of performance for several inverse problems where RGB pixel appearance is mapped to attributes such as positions, normals or reflectance. In computer graphics, screen-space shading has recently increased the visual quality in interactive image synthesis, where per-pixel attributes such as positions, normals… ▽ More In computer vision, convolutional neural networks (CNNs) have recently achieved new levels of performance for several inverse problems where RGB pixel appearance is mapped to attributes such as positions, normals or reflectance. In computer graphics, screen-space shading has recently increased the visual quality in interactive image synthesis, where per-pixel attributes such as positions, normals or reflectance of a virtual 3D scene are converted into RGB pixel appearance, enabling effects like ambient occlusion, indirect light, scattering, depth-of-field, motion blur, or anti-aliasing. In this paper we consider the diagonal problem: synthesizing appearance from given per-pixel attributes using a CNN. The resulting Deep Shading simulates various screen-space effects at competitive quality and speed while not being programmed by human experts but learned from example images. △ Less

Submitted 3 August, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

ACM Class: I.3.7; I.2.6

arXiv:1603.05908 [pdf, ps, other]

Investigating the Maximum Number of Real Solutions to the Power Flow Equations: Analysis of Lossless Four-Bus Systems

Authors: Daniel K. Molzahn, Matthew Niemerg, Dhagash Mehta, Jonathan D. Hauenstein

Abstract: The power flow equations model the steady-state relationship between the power injections and voltage phasors in an electric power system. By separating the real and imaginary components of the voltage phasors, the power flow equations can be formulated as a system of quadratic polynomials. Only the real solutions to these polynomial equations are physically meaningful. This paper focuses on the m… ▽ More The power flow equations model the steady-state relationship between the power injections and voltage phasors in an electric power system. By separating the real and imaginary components of the voltage phasors, the power flow equations can be formulated as a system of quadratic polynomials. Only the real solutions to these polynomial equations are physically meaningful. This paper focuses on the maximum number of real solutions to the power flow equations. An upper bound on the number of real power flow solutions commonly used in the literature is the maximum number of complex solutions. There exist two- and three-bus systems for which all complex solutions are real. It is an open question whether this is also the case for larger systems. This paper investigates four-bus systems using techniques from numerical algebraic geometry and conjectures a negative answer to this question. In particular, this paper studies lossless, four-bus systems composed of PV buses connected by lines with arbitrary susceptances. Computing the Galois group, which is degenerate, enables conversion of the problem of counting the number of real solutions to the power flow equations into counting the number of positive roots of a univariate sextic polynomial. From this analysis, it is conjectured that the system has at most 16 real solutions, which is strictly less than the maximum number of complex solutions, namely 20. We also provide explicit parameter values where this system has 16 real solutions so that the conjectured upper bound is achievable. △ Less

Submitted 18 March, 2016; originally announced March 2016.

Comments: 6 pages. 1 figure. IEEE style

arXiv:1603.05905 [pdf, ps, other]

Three Formulations of the Kuramoto Model as a System of Polynomial Equations

Authors: Tianran Chen, Jakub Marecek, Dhagash Mehta, Matthew Niemerg

Abstract: We compare three formulations of stationary equations of the Kuramoto model as systems of polynomial equations. In the comparison, we present bounds on the numbers of real equilibria based on the work of Bernstein, Kushnirenko, and Khovanskii, and performance of methods for the optimisation over the set of equilibria based on the work of Lasserre, both of which could be of independent interest. We compare three formulations of stationary equations of the Kuramoto model as systems of polynomial equations. In the comparison, we present bounds on the numbers of real equilibria based on the work of Bernstein, Kushnirenko, and Khovanskii, and performance of methods for the optimisation over the set of equilibria based on the work of Lasserre, both of which could be of independent interest. △ Less

Submitted 22 June, 2019; v1 submitted 18 March, 2016; originally announced March 2016.

Journal ref: 57th Annual Allerton Conference on Communication, Control, and Computing (2019)

arXiv:1512.04987 [pdf, other]

On the Network Topology Dependent Solution Count of the Algebraic Load Flow Equations

Authors: Tianran Chen, Dhagash Mehta

Abstract: A large amount of research activity in power systems areas has focused on develo** computational methods to solve load flow equations where a key question is the maximum number of isolated solutions.Though several concrete upper bounds exist, recent studies have hinted that much sharper upper bounds that depend the topology of underlying power networks may exist. This paper establishes such a to… ▽ More A large amount of research activity in power systems areas has focused on develo** computational methods to solve load flow equations where a key question is the maximum number of isolated solutions.Though several concrete upper bounds exist, recent studies have hinted that much sharper upper bounds that depend the topology of underlying power networks may exist. This paper establishes such a topology dependent solution bound which is actually the best possible bound in the sense that it is always attainable. We also develop a geometric construction called adjacency polytope which accurately captures the topology of the underlying power network and is immensely useful in the computation of the solution bound. Finally we highlight the significant implications of the development of such solution bound in solving load flow equations. △ Less

Submitted 15 December, 2015; originally announced December 2015.

Comments: 8 figures, 18 pages

Report number: ADP-15-50/T952

Showing 51–100 of 162 results for author: Mehta, D