Showing 1–50 of 70 results for author: Mehta, D

Searching in archive cs.

  1. arXiv:2404.01585  [pdf, other]

    cs.DB cs.PF

    FLEXIS: FLEXible Frequent Subgraph Mining using Maximal Independent Sets

    Authors: Akshit Sharma, Sam Reinher, Dinesh Mehta, Bo Wu

    Abstract: Frequent Subgraph Mining (FSM) is the process of identifying common subgraph patterns that surpass a predefined frequency threshold. While FSM is widely applicable in fields like bioinformatics, chemical analysis, and social network anomaly detection, its execution remains time-consuming and complex. This complexity stems from the need to recognize high-frequency subgraphs and ascertain if they ex… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2402.04297  [pdf, other]

    cs.CV

    Road Surface Defect Detection -- From Image-based to Non-image-based: A Survey

    Authors: Jongmin Yu, Jiaqi Jiang, Sebastiano Fichera, Paolo Paoletti, Lisa Layzell, Devansh Mehta, Shan Luo

    Abstract: Ensuring traffic safety is crucial, which necessitates the detection and prevention of road surface defects. As a result, there has been a growing interest in the literature on the subject, leading to the development of various road surface defect detection methods. The methods for detecting road defects can be categorised in various ways depending on the input data types or training methodologies… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Survey papers

  3. arXiv:2402.04064  [pdf, other]

    cs.CV cs.AI

    Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

    Authors: Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

    Abstract: Road pavement detection and segmentation are critical for developing autonomous road repair systems. However, developing an instance segmentation method that simultaneously performs multi-class defect detection and segmentation is challenging due to the textural simplicity of road pavement image, the diversity of defect geometries, and the morphological ambiguity between classes. We propose a nove… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to the ICRA 2024

  4. arXiv:2401.17671  [pdf, other]

    cs.CL cs.AI q-bio.NC

    Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

    Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

    Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs,… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures and 4 supplementary figures

  5. arXiv:2312.13225  [pdf, other]

    cs.SE

    Automated DevOps Pipeline Generation for Code Repositories using Large Language Models

    Authors: Deep Mehta, Kartik Rawool, Subodh Gujar, Bowen Xu

    Abstract: Automating software development processes through the orchestration of GitHub Action workflows has revolutionized the efficiency and agility of software delivery pipelines. This paper presents a detailed investigation into the use of Large Language Models (LLMs) specifically, GPT 3.5 and GPT 4 to generate and evaluate GitHub Action workflows for DevOps tasks. Our methodology involves data collecti… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  6. arXiv:2311.01009  [pdf, other]

    cs.CV cs.AI

    Revamping AI Models in Dermatology: Overcoming Critical Challenges for Enhanced Skin Lesion Diagnosis

    Authors: Deval Mehta, Brigid Betz-Stablein, Toan D Nguyen, Yaniv Gal, Adrian Bowling, Martin Haskett, Maithili Sashindranath, Paul Bonnington, Victoria Mar, H Peter Soyer, Zongyuan Ge

    Abstract: The surge in developing deep learning models for diagnosing skin lesions through image analysis is notable, yet their clinical black faces challenges. Current dermatology AI models have limitations: limited number of possible diagnostic outputs, lack of real-world testing on uncommon skin lesions, inability to detect out-of-distribution images, and over-reliance on dermoscopic images. To address t… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  7. arXiv:2310.12428  [pdf, other]

    stat.ML cs.AI cs.LG q-fin.ST stat.ME

    Towards Enhanced Local Explainability of Random Forests: a Proximity-Based Approach

    Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

    Abstract: We initiate a novel approach to explain the out of sample performance of random forest (RF) models by exploiting the fact that any RF can be formulated as an adaptive weighted K nearest-neighbors model. Specifically, we use the proximity between points in the feature space learned by the RF to re-write random forest predictions exactly as a weighted average of the target labels of training data po… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 5 pages, 6 figures

  8. arXiv:2310.10760  [pdf, other]

    cs.CL q-fin.PM q-fin.ST stat.AP

    Towards reducing hallucination in extracting information from financial reports using Large Language Models

    Authors: Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

    Abstract: For a financial analyst, the question and answer (Q&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Op… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 4 pages + references. Accepted for publication in Workshop on Generative AI at the 3rd International Conference on AI-ML Systems 2023, Bengaluru, India

  9. arXiv:2309.08794  [pdf, other]

    cs.AI cs.CV

    Privacy-preserving Early Detection of Epileptic Seizures in Videos

    Authors: Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O`Brien, Zongyuan Ge

    Abstract: In this work, we contribute towards the development of video-based epileptic seizure classification by introducing a novel framework (SETR-PKD), which could achieve privacy-preserved early detection of seizures in videos. Specifically, our framework has two significant components - (1) It is built upon optical flow features extracted from the video of a seizure, which encodes the seizure motion se… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to MICCAI 2023

  10. arXiv:2309.06884  [pdf, other]

    cs.CV

    Autoencoder-Based Visual Anomaly Localization for Manufacturing Quality Control

    Authors: Devang Mehta, Noah Klarmann

    Abstract: Manufacturing industries require efficient and voluminous production of high-quality finished goods. In the context of Industry 4.0, visual anomaly detection poses an optimistic solution for automatically controlled product quality with high precision. In general, automation based on computer vision is a promising solution to prevent bottlenecks at the product quality checkpoint. We considered rec… ▽ More

    Submitted 3 November, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

  11. Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting

    Authors: Hamzeh Ghasemzadeh, Robert E. Hillman, Daryush D. Mehta

    Abstract: This study's first purpose is to provide quantitative evidence that would incentivize researchers to instead use the more robust method of nested cross-validation. The second purpose is to present methods and MATLAB codes for doing power analysis for ML-based analysis during the design of a study. Monte Carlo simulations were used to quantify the interactions between the employed cross-validation… ▽ More

    Submitted 22 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted at JSLHR

    Journal ref: Journal of Speech, Language, and Hearing Research (JSLHR),Volume 67 Issue 3, March 2024, Pages 753-781

  12. arXiv:2308.06882  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Quantifying Outlierness of Funds from their Categories using Supervised Similarity

    Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

    Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. H… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 tables, 8 figures

  13. arXiv:2305.18703  [pdf, other]

    cs.CL cs.AI

    Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

    Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of dom… ▽ More

    Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  14. arXiv:2305.00696  [pdf, other]

    cs.CV

    TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification

    Authors: Litao Yang, Deval Mehta, Sidong Liu, Dwarikanath Mahapatra, Antonio Di Ieva, Zongyuan Ge

    Abstract: Digital pathology based on whole slide images (WSIs) plays a key role in cancer diagnosis and clinical practice. Due to the high resolution of the WSI and the unavailability of patch-level annotations, WSI classification is usually formulated as a weakly supervised problem, which relies on multiple instance learning (MIL) based on patches of a WSI. In this paper, we aim to learn an optimal patch-l… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted for MIDL 2023

  15. arXiv:2304.14497  [pdf]

    cs.CV cs.RO eess.IV

    Vehicle Safety Management System

    Authors: Chanthini Bhaskar, Bharath Manoj Nair, Dev Mehta

    Abstract: Overtaking is a critical maneuver in driving that requires accurate information about the location and distance of other vehicles on the road. This study suggests a real-time overtaking assistance system that uses a combination of the You Only Look Once (YOLO) object detection algorithm and stereo vision techniques to accurately identify and locate vehicles in front of the driver, and estimate the… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  16. arXiv:2211.16172  [pdf, other]

    cs.CL cs.CY

    Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi

    Authors: Devansh Mehta, Harshita Diddee, Ananya Saxena, Anurag Shukla, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Vishnu Prasad, Venkanna U, Kalika Bali

    Abstract: The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this pr… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: In Submission (Revised) to Language Resources and Evaluation Journal. arXiv admin note: text overlap with arXiv:2004.10270

  17. arXiv:2209.04406  [pdf, other]

    q-bio.NC cs.SD eess.AS

    Longitudinal Acoustic Speech Tracking Following Pediatric Traumatic Brain Injury

    Authors: Camille Noufi, Adam C. Lammert, Daryush D. Mehta, James R. Williamson, Gregory Ciccarelli, Douglas Sturim, Jordan R. Green, Thomas F. Quatieri, Thomas F. Campbell

    Abstract: Recommendations for common outcome measures following pediatric traumatic brain injury (TBI) support the integration of instrumental measurements alongside perceptual assessment in recovery and treatment plans. A comprehensive set of sensitive, robust and non-invasive measurements is therefore essential in assessing variations in speech characteristics over time following pediatric TBI. In this ar… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

  18. arXiv:2208.13318  [pdf, other]

    cs.CY cs.AI cs.SI

    Multi-dimensional Racism Classification during COVID-19: Stigmatization, Offensiveness, Blame, and Exclusion

    Authors: Xin Pei, Deval Mehta

    Abstract: Transcending the binary categorization of racist texts, our study takes cues from social science theories to develop a multi-dimensional model for racism detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of BERT and topic modeling, this categorical detection enables insights into the underlying subtlety of racist discussion on digital platforms during COVID-19. Ou… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: Social Network Analysis and Mining (accepted, 2022). arXiv admin note: substantial text overlap with arXiv:2107.08347

  19. arXiv:2208.10639  [pdf, other]

    cs.HC

    Evaluating Cardiovascular Surgical Planning in Mobile Augmented Reality

    Authors: Haoyang Yang, Pratham Darrpan Mehta, Jonathan Leo, Zhiyan Zhou, Megan Dass, Anish Upadhayay, Timothy C. Slesnick, Fawwaz Shaw, Amanda Randles, Duen Horng Chau

    Abstract: Advanced surgical procedures for congenital heart diseases (CHDs) require precise planning before the surgeries. The conventional approach utilizes 3D-printing and cutting physical heart models, which is a time and resource intensive process. While rapid advances in augmented reality (AR) technologies have the potential to streamline surgical planning, there is limited research that evaluates such… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: IEEE VIS 2022. 2 pages, 1 figure

  20. arXiv:2208.08331  [pdf, other]

    eess.IV cs.CV cs.LG

    Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation

    Authors: Litao Yang, Deval Mehta, Dwarikanath Mahapatra, Zongyuan Ge

    Abstract: Recently, a lot of automated white blood cells (WBC) or leukocyte classification techniques have been developed. However, all of these methods only utilize a single modality microscopic image i.e. either blood smear or fluorescence based, thus missing the potential of a better learning from multimodal images. In this work, we develop an efficient multimodal architecture based on a first of its kin… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI 2022 workshop - MOVI2022

  21. arXiv:2206.15186  [pdf, other]

    cs.CV cs.AI cs.LG

    Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

    Authors: Deval Mehta, Yaniv Gal, Adrian Bowling, Paul Bonnington, Zongyuan Ge

    Abstract: Recent years have witnessed a rapid development of automated methods for skin lesion diagnosis and classification. Due to an increasing deployment of such systems in clinics, it has become important to develop a more robust system towards various Out-of-Distribution (OOD) samples (unknown skin lesions and conditions). However, the current deep learning models trained for skin lesion classification… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022 (top 13% paper; early accept)

  22. arXiv:2206.08236  [pdf, other]

    cs.CV cs.LG eess.IV

    Simple and Efficient Architectures for Semantic Segmentation

    Authors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

    Abstract: Though the state-of-the-art architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model acceleration tools, and further they make use of operations that are inefficient on current hardware. This paper demonstrates that a simple encoder-decoder architecture with a ResNet-like backbone and a sm… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022

  23. arXiv:2206.08009  [pdf, other]

    cs.CV cs.LG

    Balancing Discriminability and Transferability for Source-Free Domain Adaptation

    Authors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh Babu

    Abstract: Conventional domain adaptation (DA) techniques aim to improve domain transferability by learning domain-invariant representations; while concurrently preserving the task-discriminability knowledge gathered from the labeled source data. However, the requirement of simultaneous access to labeled source and unlabeled target renders them unsuitable for the challenging source-free DA setting. The trivi… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022. Project page: https://sites.google.com/view/mixup-sfda

  24. arXiv:2111.15629  [pdf, other]

    cs.SI cs.CL cs.IR cs.LG

    DiPD: Disruptive event Prediction Dataset from Twitter

    Authors: Sanskar Soni, Dev Mehta, Vinush Vishwanath, Aditi Seetha, Satyendra Singh Chouhan

    Abstract: Riots and protests, if gone out of control, can cause havoc in a country. We have seen examples of this, such as the BLM movement, climate strikes, CAA Movement, and many more, which caused disruption to a large extent. Our motive behind creating this dataset was to use it to develop machine learning systems that can give its users insight into the trending events going on and alert them about the… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

  25. arXiv:2107.08347  [pdf, other]

    cs.SI cs.CL

    Beyond a binary of (non)racist tweets: A four-dimensional categorical detection and analysis of racist and xenophobic opinions on Twitter in early Covid-19

    Authors: Xin Pei, Deval Mehta

    Abstract: Transcending the binary categorization of racist and xenophobic texts, this research takes cues from social science theories to develop a four dimensional category for racism and xenophobia detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of deep learning techniques, this categorical detection enables insights into the nuances of emergent topics reflected in raci… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

  26. arXiv:2107.07452  [pdf, other]

    cs.RO cs.AI

    GI-NNet & RGI-NNet: Development of Robotic Grasp Pose Models, Trainable with Large as well as Limited Labelled Training Datasets, under supervised and semi supervised paradigms

    Authors: Priya Shukla, Nilotpal Pramanik, Deepesh Mehta, G. C. Nandi

    Abstract: Our way of grasping objects is challenging for efficient, intelligent and optimal grasp by COBOTs. To streamline the process, here we use deep learning techniques to help robots learn to generate and execute appropriate grasps quickly. We developed a Generative Inception Neural Network (GI-NNet) model, capable of generating antipodal robotic grasps on seen as well as unseen objects. It is trained… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  27. arXiv:2106.12987  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Fund2Vec: Mutual Funds Similarity using Graph Learning

    Authors: Vipul Satone, Dhruv Desai, Dhagash Mehta

    Abstract: Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) am… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 2 column format, 8 pages, 8 figures, 5 tables

  28. An Adaptive Synaptic Array using Fowler-Nordheim Dynamic Analog Memory

    Authors: Darshit Mehta, Kenji Aono, Shantanu Chakrabartty

    Abstract: In this paper we present a synaptic array that uses dynamical states to implement an analog memory for energy-efficient training of machine learning (ML) systems. Each of the analog memory elements is a micro-dynamical system that is driven by the physics of Fowler-Nordheim (FN) quantum tunneling, whereas the system level learning modulates the state trajectory of the memory ensembles towards the… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 22 pages (incl. 7 supplementary pages), 11 figures (incl. 6 supplementary figures)

  29. arXiv:2104.04650  [pdf, other]

    cs.CV cs.AI

    Towards Automated and Marker-less Parkinson Disease Assessment: Predicting UPDRS Scores using Sit-stand videos

    Authors: Deval Mehta, Umar Asif, Tian Hao, Erhan Bilal, Stefan Von Cavallar, Stefan Harrer, Jeffrey Rogers

    Abstract: This paper presents a novel deep learning enabled, video based analysis framework for assessing the Unified Parkinsons Disease Rating Scale (UPDRS) that can be used in the clinic or at home. We report results from comparing the performance of the framework to that of trained clinicians on a population of 32 Parkinsons disease (PD) patients. In-person clinical assessments by trained neurologists ar… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR Workshops 2021

  30. arXiv:2102.06837  [pdf, other]

    cs.CV

    Learning Speech-driven 3D Conversational Gestures from Video

    Authors: Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Lingjie Liu, Hans-Peter Seidel, Gerard Pons-Moll, Mohamed Elgharib, Christian Theobalt

    Abstract: We propose the first approach to automatically and jointly synthesize both the synchronous 3D conversational body and hand gestures, as well as 3D face and head animations, of a virtual character from speech input. Our algorithm uses a CNN architecture that leverages the inherent correlation between facial expression and hand gestures. Synthesis of conversational body gestures is a multi-modal pro… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  31. arXiv:2101.04104  [pdf, other]

    cs.CV

    Neural Re-Rendering of Humans from a Single Image

    Authors: Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt

    Abstract: Human re-rendering from a single image is a starkly under-constrained problem, and state-of-the-art algorithms often exhibit undesired artefacts, such as over-smoothing, unrealistic distortions of the body parts and garments, or implausible changes of the texture. To address these challenges, we propose a new method for neural re-rendering of a human under a novel user-defined pose and viewpoint,… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Published in ECCV 2020

  32. arXiv:2012.08859  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces

    Authors: Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort

    Abstract: Current state-of-the-art Neural Architecture Search (NAS) methods neither efficiently scale to multiple hardware platforms, nor handle diverse architectural search-spaces. To remedy this, we present DONNA (Distilling Optimal Neural Network Architectures), a novel pipeline for rapid, scalable and diverse NAS, that scales to many user scenarios. DONNA consists of three phases. First, an accuracy pre… ▽ More

    Submitted 27 August, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted at ICCV2021. Main text 9 pages, Full text 21 pages, 18 figures

  33. arXiv:2011.06557  [pdf, other]

    stat.ML cs.LG stat.ME

    A partition-based similarity for classification distributions

    Authors: Hayden S. Helm, Ronak D. Mehta, Brandon Duderstadt, Weiwei Yang, Christoper M. White, Ali Geisa, Joshua T. Vogelstein, Carey E. Priebe

    Abstract: Herein we define a measure of similarity between classification distributions that is both principled from the perspective of statistical pattern recognition and useful from the perspective of machine learning practitioners. In particular, we propose a novel similarity on classification distributions, dubbed task similarity, that quantifies how an optimally-transformed optimal representation for a… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  34. arXiv:2009.09818  [pdf, other]

    cs.CV

    DeepActsNet: Spatial and Motion features from Face, Hands, and Body Combined with Convolutional and Graph Networks for Improved Action Recognition

    Authors: Umar Asif, Deval Mehta, Stefan von Cavallar, Jianbin Tang, Stefan Harrer

    Abstract: Existing action recognition methods mainly focus on joint and bone information in human body skeleton data due to its robustness to complex backgrounds and dynamic characteristics of the environments. In this paper, we combine body skeleton data with spatial and motion features from face and two hands, and present "Deep Action Stamps (DeepActs)", a novel data representation to encode actions from… ▽ More

    Submitted 4 June, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

  35. Study on State-of-the-art Cloud Services Integration Capabilities with Autonomous Ground Vehicles

    Authors: Praveen Damacharla, Dhwani Mehta, Ahmad Y Javaid, Vijay K. Devabhaktuni

    Abstract: Computing and intelligence are substantial requirements for the accurate performance of autonomous ground vehicles (AGVs). In this context, the use of cloud services in addition to onboard computers enhances computing and intelligence capabilities of AGVs. In addition, the vast amount of data processed in a cloud system contributes to overall performance and capabilities of the onboard system. Thi… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    Journal ref: 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall), Chicago, IL, USA, 2018, pp. 1-5

  36. arXiv:2006.14078  [pdf, other]

    stat.ML cs.LG cs.SC math.AG stat.AP

    Machine learning the real discriminant locus

    Authors: Edgar A. Bernal, Jonathan D. Hauenstein, Dhagash Mehta, Margaret H. Regan, Tingting Tang

    Abstract: Parameterized systems of polynomial equations arise in many applications in science and engineering with the real solutions describing, for example, equilibria of a dynamical system, linkages satisfying design constraints, and scene reconstruction in computer vision. Since different parameter values can have a different number of real solutions, the parameter space is decomposed into regions whose… ▽ More

    Submitted 8 August, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 22 pages, 14 figures

  37. arXiv:2006.00123  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.ML

    Machine Learning Fund Categorizations

    Authors: Dhagash Mehta, Dhruv Desai, Jithin Pradeep

    Abstract: Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many appli… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 8 pages, 2-column format, 5 figures

  38. arXiv:2005.08224  [pdf]

    cs.SI cs.CL

    #Coronavirus or #Chinesevirus?!: Understanding the negative sentiment reflected in Tweets with racist hashtags across the development of COVID-19

    Authors: Xin Pei, Deval Mehta

    Abstract: Situated in the global outbreak of COVID-19, our study enriches the discussion concerning the emergent racism and xenophobia on social media. With big data extracted from Twitter, we focus on the analysis of negative sentiment reflected in tweets marked with racist hashtags, as racism and xenophobia are more likely to be delivered via the negative sentiment. Especially, we propose a stage-based ap… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

  39. arXiv:2005.00116  [pdf, other]

    cs.CV cs.LG

    Sequence Information Channel Concatenation for Improving Camera Trap Image Burst Classification

    Authors: Bhuvan Malladihalli Shashidhara, Darshan Mehta, Yash Kale, Dan Morris, Megan Hazen

    Abstract: Camera Traps are extensively used to observe wildlife in their natural habitat without disturbing the ecosystem. This could help in the early detection of natural or human threats to animals, and help towards ecological conservation. Currently, a massive number of such camera traps have been deployed at various ecological conservation areas around the world, collecting data for decades, thereby re… ▽ More

    Submitted 5 June, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: 8 pages, 4 figures, 2 tables. Git repository can be found at: https://github.com/bhuvi3/camera_trap_animal_classification

    ACM Class: I.4.9; I.4.10; I.2.10

  40. arXiv:2004.12908  [pdf, other]

    cs.AI cs.LG stat.ML

    A Simple Lifelong Learning Approach

    Authors: Joshua T. Vogelstein, Jayanta Dey, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Weiwei Yang, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

    Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 April, 2020; originally announced April 2020.

  41. arXiv:2004.10270  [pdf, other]

    cs.CL cs.CY

    Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi

    Authors: Devansh Mehta, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Anurag Shukla, Vishnu Prasad, Venkanna U, Amit Sharma, Kalika Bali

    Abstract: The primary obstacle to developing technologies for low-resource languages is the lack of usable data. In this paper, we report the adoption and deployment of 4 technology-driven methods of data collection for Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. In the process of data collection, we also help in its revival by expanding a…

    Submitted 26 January, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: Accepted at LREC 2020 (7 pages). D.M. and S.S. contributed equally

  42. XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

    Authors: Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

    Abstract: We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates successfully in generic scenes which may contain occlusions by objects and by other people. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible jo…

    Submitted 30 April, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: To appear in ACM Transactions on Graphics (SIGGRAPH) 2020

  43. arXiv:1907.00199  [pdf, other]

    cs.CR

    Incidents Are Meant for Learning, Not Repeating: Sharing Knowledge About Security Incidents in Cyber-Physical Systems

    Authors: Faeq Alrimawi, Liliana Pasquale, Deepak Mehta, Nobukazu Yoshioka, Bashar Nuseibeh

    Abstract: Cyber-physical systems (CPSs) are part of most critical infrastructures such as industrial automation and transportation systems. Thus, security incidents targeting CPSs can have disruptive consequences to assets and people. As prior incidents tend to re-occur, sharing knowledge about these incidents can help organizations be more prepared to prevent, mitigate or investigate future incidents. This…

    Submitted 29 June, 2019; originally announced July 2019.

  44. arXiv:1905.07628  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Evolving Rewards to Automate Reinforcement Learning

    Authors: Aleksandra Faust, Anthony Francis, Dar Mehta

    Abstract: Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many classical control tasks guide RL training using complex rewards, which require tedious hand-tuning. We automate the reward search with AutoRL, an evolutionary layer over standard RL that treats reward tuning as hyperparame…

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Accepted to 6th AutoML@ICML

  45. arXiv:1905.04967  [pdf, other]

    cs.LG cs.CV stat.ML

    Implicit Filter Sparsification In Convolutional Neural Networks

    Authors: Dushyant Mehta, Kwang In Kim, Christian Theobalt

    Abstract: We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. Through an extensive empirical study (Mehta et al., 2019) we hypothesize the mechanism behind the sparsification process, and find surprising links to certain f…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: ODML-CDNNR 2019 (ICML'19 workshop) extended abstract of the CVPR 2019 paper "On Implicit Filter Level Sparsity in Convolutional Neural Networks, Mehta et al." (arXiv:1811.12495)

  46. arXiv:1904.03289  [pdf, other]

    cs.CV

    In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

    Authors: Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Gerard Pons-Moll, Christian Theobalt

    Abstract: Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations to such in-the-wild corpora is hardly feasible in practice. Most existing 3D labelled data sets are…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted to CVPR 2019

  47. arXiv:1811.12495  [pdf, other]

    cs.LG cs.CV eess.SP stat.ML

    On Implicit Filter Level Sparsity in Convolutional Neural Networks

    Authors: Dushyant Mehta, Kwang In Kim, Christian Theobalt

    Abstract: We investigate filter level sparsity that emerges in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. We conduct an extensive experimental study casting our initial findings into hypotheses and conclusions about the mechanisms underlying the emergent filter lev…

    Submitted 5 April, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Accepted at CVPR 2019

  48. arXiv:1810.11726  [pdf, other]

    stat.ML cond-mat.stat-mech cs.LG math.OC

    Towards Robust Deep Neural Networks

    Authors: Timothy E. Wang, Yiming Gu, Dhagash Mehta, Xiaojun Zhao, Edgar A. Bernal

    Abstract: We investigate the topics of sensitivity and robustness in feedforward and convolutional neural networks. Combining energy landscape techniques developed in computational chemistry with tools drawn from formal methods, we produce empirical evidence indicating that networks corresponding to lower-lying minima in the optimization landscape of the learning objective tend to be more robust. The robust…

    Submitted 4 December, 2018; v1 submitted 27 October, 2018; originally announced October 2018.

    Comments: Added further discussions, and supplementary material

  49. arXiv:1810.07716  [pdf, other]

    stat.ML cs.LG math.AG

    The loss surface of deep linear networks viewed through the algebraic geometry lens

    Authors: Dhagash Mehta, Tianran Chen, Tingting Tang, Jonathan D. Hauenstein

    Abstract: By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of the deep linear neural network models. After clarifying on the various definitions of "flat" minima, we show that the geometrically flat minima, which are merely artifacts of residual continuous symmetries of the deep linear networks, can be straightforwardly removed by a gene…

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: 16 pages (2-columns), 5 figures

  50. arXiv:1804.02411  [pdf, ps, other]

    stat.ML cond-mat.dis-nn cs.LG

    The Loss Surface of XOR Artificial Neural Networks

    Authors: Dhagash Mehta, Xiaojun Zhao, Edgar A. Bernal, David J. Wales

    Abstract: Training an artificial neural network involves an optimization process over the landscape defined by the cost (loss) as a function of the network parameters. We explore these landscapes using optimisation tools developed for potential energy landscapes in molecular science. The number of local minima and transition states (saddle points of index one), as well as the ratio of transition states to m…

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: 19 pages, 6 figures. Submitted to journal in Oct, 2017

    Journal ref: Phys. Rev. E 97, 052307 (2018)