Skip to main content

Showing 1–50 of 133 results for author: Mittal, A

.
  1. arXiv:2404.03880  [pdf, other

    cs.DB

    Semantic SQL -- Combining and optimizing semantic predicates in SQL

    Authors: Akash Mittal, Anshul Bheemreddy, Huili Tao

    Abstract: In recent years, the surge in unstructured data analysis, facilitated by advancements in Machine Learning (ML), has prompted diverse approaches for handling images, text documents, and videos. Analysts, leveraging ML models, can extract meaningful information from unstructured data and store it in relational databases, allowing the execution of SQL queries for further analysis. Simultaneously, vec… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2403.18060  [pdf, ps, other

    math.CO

    The Cordiality Game and the Game Cordiality Number

    Authors: Elliot Krop, Aryan Mittal, Michael C. Wigal

    Abstract: The cordiality game is played on a graph $G$ by two players, Admirable (A) and Impish (I), who take turns selecting \track{unlabeled} vertices of $G$. Admirable labels the selected vertices by $0$ and Impish by $1$, and the resulting label on any edge is the sum modulo $2$ of the labels of the vertices incident to that edge. The two players have opposite goals: Admirable attempts to minimize the n… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages

  3. arXiv:2402.18434  [pdf, other

    cs.LG cs.IR

    Graph Regularized Encoder Training for Extreme Classification

    Authors: Anshul Mittal, Shikhar Mohan, Deepak Saini, Suchith C. Prabhu, Jain jiao, Sumeet Agarwal, Soumen Chakrabarti, Purushottam Kar, Manik Varma

    Abstract: Deep extreme classification (XC) aims to train an encoder architecture and an accompanying classifier architecture to tag a data point with the most relevant subset of labels from a very large universe of labels. XC applications in ranking, recommendation and tagging routinely encounter tail labels for which the amount of training data is exceedingly small. Graph convolutional networks (GCN) prese… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2312.06665  [pdf

    eess.IV

    Predicting Neural Stem Cell Differentiation Using Deep Learning Models

    Authors: Chandra Suda, Nidhi Parthasarathy, Anika Mittal, Ian Young Chen, Ananya Jalihal

    Abstract: Neural stem cells have immense therapeutic potential for treating various neurological disorders. However, lengthy differentiation protocols hinder the translation of neural stem cells into clinical applications. In this study, we present a deep learning approach using convolutional neural networks (CNNs) to predict the fate of neural stem cell differentiation at an early stage. We trained a CNN m… ▽ More

    Submitted 19 November, 2023; originally announced December 2023.

  5. arXiv:2311.12727  [pdf, other

    cs.LG cs.CL

    Soft Random Sampling: A Theoretical and Empirical Analysis

    Authors: Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

    Abstract: Soft random sampling (SRS) is a simple yet effective approach for efficient training of large-scale deep neural networks when dealing with massive data. SRS selects a subset uniformly at random with replacement from the full data set in each epoch. In this paper, we conduct a theoretical and empirical analysis of SRS. First, we analyze its sampling dynamics including data coverage and occupancy. N… ▽ More

    Submitted 23 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

  6. arXiv:2310.18924  [pdf, other

    cs.LG

    Remaining useful life prediction of Lithium-ion batteries using spatio-temporal multimodal attention networks

    Authors: Sungho Suh, Dhruv Aditya Mittal, Hymalai Bello, Bo Zhou, Mayank Shekhar Jha, Paul Lukowicz

    Abstract: Lithium-ion batteries are widely used in various applications, including electric vehicles and renewable energy storage. The prediction of the remaining useful life (RUL) of batteries is crucial for ensuring reliable and efficient operation, as well as reducing maintenance costs. However, determining the life cycle of batteries in real-world scenarios is challenging, and existing methods have limi… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  7. arXiv:2310.14095  [pdf, other

    astro-ph.GA

    Neutral Hydrogen (HI) 21 cm as a probe: Investigating Spatial Variations in Interstellar Turbulent Properties

    Authors: Amit K. Mittal, Brian L Babler, Snezana Stanimirovic, Nickolas **el

    Abstract: Interstellar turbulence shapes the HI distribution in the Milky Way (MW). How this affects large-scale statistical properties of HI column density across the MW remains largely unconstrained. We use approx 13,000 square-degree GALFA-HI survey to map statistical fluctuations of HI over the 40 km s-1 velocity range. We calculate the spatial power spectrum (SPS) of HI column density image by running… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted in ApJ

  8. arXiv:2310.08891  [pdf, other

    cs.LG cs.IR

    EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval

    Authors: Ramnath Kumar, Anshul Mittal, Nilesh Gupta, Aditya Kusupati, Inderjit Dhillon, Prateek Jain

    Abstract: Dense embedding-based retrieval is now the industry standard for semantic search and ranking problems, like obtaining relevant web documents for a given query. Such techniques use a two-stage process: (a) contrastive learning to train a dual encoder to embed both the query and documents and (b) approximate nearest neighbor search (ANNS) for finding similar documents for a given query. These two st… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  9. arXiv:2310.08304  [pdf

    cs.CV cs.AI cs.LG

    CHIP: Contrastive Hierarchical Image Pretraining

    Authors: Arpit Mittal, Harshil Jhaveri, Swapnil Mallick, Abhishek Ajmera

    Abstract: Few-shot object classification is the task of classifying objects in an image with limited number of examples as supervision. We propose a one-shot/few-shot classification model that can classify an object of any unseen class into a relatively general category in an hierarchically based classification. Our model uses a three-level hierarchical contrastive loss based ResNet152 classifier for classi… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  10. arXiv:2309.14382  [pdf, other

    cs.CL cs.AI

    Agree To Disagree

    Authors: Abhinav Raghuvanshi, Siddhesh Pawar, Anirudh Mittal

    Abstract: How frequently do individuals thoroughly review terms and conditions before proceeding to register for a service, install software, or access a website? The majority of internet users do not engage in this practice. This trend is not surprising, given that terms and conditions typically consist of lengthy documents replete with intricate legal terminology and convoluted sentences. In this paper, w… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  11. arXiv:2309.05475  [pdf, other

    cs.CL

    Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model

    Authors: Neel Bhate, Ansh Mittal, Zhe He, Xiao Luo

    Abstract: Demographics, Social determinants of health, and family history documented in the unstructured text within the electronic health records are increasingly being studied to understand how this information can be utilized with the structured data to improve healthcare outcomes. After the GPT models were released, many studies have applied GPT models to extract this information from the narrative clin… ▽ More

    Submitted 13 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures

  12. Multi-modal Extreme Classification

    Authors: Anshul Mittal, Kunal Dahiya, Shreya Malani, Janani Ramaswamy, Seba Kuruvilla, Jitendra Ajmera, Keng-hao Chang, Sumeet Agarwal, Purushottam Kar, Manik Varma

    Abstract: This paper develops the MUFIN technique for extreme classification (XC) tasks with millions of labels where datapoints and labels are endowed with visual and textual descriptors. Applications of MUFIN to product-to-product recommendation and bid query prediction over several millions of products are presented. Contemporary multi-modal methods frequently rely on purely embedding-based methods. On t… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    ACM Class: H.3.3

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  13. arXiv:2308.13969  [pdf, other

    cs.CV cs.AI cs.LG

    Fixating on Attention: Integrating Human Eye Tracking into Vision Transformers

    Authors: Sharath Koorathota, Nikolas Papadopoulos, Jia Li Ma, Shruti Kumar, Xiaoxiao Sun, Arunesh Mittal, Patrick Adelman, Paul Sajda

    Abstract: Modern transformer-based models designed for computer vision have outperformed humans across a spectrum of visual tasks. However, critical tasks, such as medical image interpretation or autonomous driving, still require reliance on human judgments. This work demonstrates how human visual input, specifically fixations collected from an eye-tracking device, can be integrated into transformer models… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 25 pages, 9 figures, 3 tables

  14. arXiv:2308.12157  [pdf, other

    cs.CL cs.AI

    Evaluation of Faithfulness Using the Longest Supported Subsequence

    Authors: Anirudh Mittal, Timo Schick, Mikel Artetxe, Jane Dwivedi-Yu

    Abstract: As increasingly sophisticated language models emerge, their trustworthiness becomes a pivotal issue, especially in tasks such as summarization and question-answering. Ensuring their responses are contextually grounded and faithful is challenging due to the linguistic diversity and the myriad of possible answers. In this paper, we introduce a novel approach to evaluate faithfulness of machine-gener… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  15. arXiv:2307.12549  [pdf, other

    cs.CY

    Estimating Time to Clear Pendency of Cases in High Courts in India using Linear Regression

    Authors: Kshitiz Verma, Anshu Musaddi, Ansh Mittal, Anshul Jain

    Abstract: Indian Judiciary is suffering from burden of millions of cases that are lying pending in its courts at all the levels. The High Court National Judicial Data Grid (HC-NJDG) indexes all the cases pending in the high courts and publishes the data publicly. In this paper, we analyze the data that we have collected from the HC-NJDG portal on 229 randomly chosen days between August 31, 2017 to March 22,… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 12 pages, 9 figures, JURISIN 2022. arXiv admin note: text overlap with arXiv:2307.10615

  16. arXiv:2307.05006  [pdf, ps, other

    cs.CL cs.LG eess.AS

    Improving RNN-Transducers with Acoustic LookAhead

    Authors: Vinit S. Unni, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi

    Abstract: RNN-Transducers (RNN-Ts) have gained widespread acceptance as an end-to-end model for speech to text conversion because of their high accuracy and streaming capabilities. A typical RNN-T independently encodes the input audio and the text context, and combines the two encodings by a thin joint network. While this architecture provides SOTA streaming accuracy, it also makes the model vulnerable to s… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 5 pages, 1 fig, 7 tables, Proceedings of Interspeech 2023

  17. arXiv:2306.16048  [pdf, other

    cs.CV cs.AI

    Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity

    Authors: Zhenlin Xu, Yi Zhu, Tiffany Deng, Abhay Mittal, Yanbei Chen, Manchen Wang, Paolo Favaro, Joseph Tighe, Davide Modolo

    Abstract: This paper presents novel benchmarks for evaluating vision-language models (VLMs) in zero-shot recognition, focusing on granularity and specificity. Although VLMs excel in tasks like image captioning, they face challenges in open-world settings. Our benchmarks test VLMs' consistency in understanding concepts across semantic granularity levels and their response to varying text specificity. Finding… ▽ More

    Submitted 18 June, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: CVPR2024 MMFM workshop

  18. arXiv:2306.14904  [pdf, other

    cs.FL

    Determining Smallest Path Size of Multiplication Transducers Without a Restricted Digit Set

    Authors: Aditya Mittal, Karthik Mittal

    Abstract: Directed multiplication transducers are a tool for performing non-decimal base multiplication without an additional conversion to base 10. This allows for faster computation and provides easier visualization depending on the problem at hand. By building these multiplication transducers computationally, new patterns can be identified as these transducers can be built with much larger bases and mult… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 15 pages, 4 figures, submitted at SoCal-Nevada MAA Session 2022 and Cal State East Bay Student Research Symposium 2022

  19. arXiv:2306.14812  [pdf, other

    cs.RO cs.CV

    MOVES: Movable and Moving LiDAR Scene Segmentation in Label-Free settings using Static Reconstruction

    Authors: Prashant Kumar, Dhruv Makwana, Onkar Susladkar, Anurag Mittal, Prem Kumar Kalra

    Abstract: Accurate static structure reconstruction and segmentation of non-stationary objects is of vital importance for autonomous navigation applications. These applications assume a LiDAR scan to consist of only static structures. In the real world however, LiDAR scans consist of non-stationary dynamic structures - moving and movable objects. Current solutions use segmentation information to isolate and… ▽ More

    Submitted 15 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 35 pages, 8 figures, 6 tables

  20. arXiv:2306.09988  [pdf, other

    math.NA

    A Numerically Robust and Stable Time-Space Pseudospectral Approach for Generalized Burgers-Fisher Equation

    Authors: Harvindra Singh, Lokendra Balyan, A. K. Mittal, Parul Saini

    Abstract: In this article, we present the time-space Chebyshev pseudospectral method (TS-CPsM) to approximate a solution to the generalised Burgers-Fisher (gBF) equation. The Chebyshev-Gauss-Lobatto (CGL) points serve as the foundation for the recommended method, which makes use of collocations in both the time and space directions. Further, using a map**, the non-homogeneous initial-boundary value proble… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  21. arXiv:2306.04849  [pdf, other

    cs.CV

    ScaleDet: A Scalable Multi-Dataset Object Detector

    Authors: Yanbei Chen, Manchen Wang, Abhay Mittal, Zhenlin Xu, Paolo Favaro, Joseph Tighe, Davide Modolo

    Abstract: Multi-dataset training provides a viable solution for exploiting heterogeneous large-scale datasets without extra annotation cost. In this work, we propose a scalable multi-dataset detector (ScaleDet) that can scale up its generalization across datasets when increasing the number of training datasets. Unlike existing multi-dataset learners that mostly rely on manual relabelling efforts or sophisti… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: CVPR 2023

  22. arXiv:2304.10050  [pdf, other

    cs.CV

    Neural Radiance Fields: Past, Present, and Future

    Authors: Ansh Mittal

    Abstract: The various aspects like modeling and interpreting 3D environments and surroundings have enticed humans to progress their research in 3D Computer Vision, Computer Graphics, and Machine Learning. An attempt made by Mildenhall et al in their paper about NeRFs (Neural Radiance Fields) led to a boom in Computer Graphics, Robotics, Computer Vision, and the possible scope of High-Resolution Low Storage… ▽ More

    Submitted 14 January, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 413 pages, 9 figures, 277 citations

  23. arXiv:2303.08230  [pdf, other

    cs.LG stat.ML

    Bayesian Beta-Bernoulli Process Sparse Coding with Deep Neural Networks

    Authors: Arunesh Mittal, Kai Yang, Paul Sajda, John Paisley

    Abstract: Several approximate inference methods have been proposed for deep discrete latent variable models. However, non-parametric methods which have previously been successfully employed for classical sparse coding models have largely been unexplored in the context of deep models. We propose a non-parametric iterative algorithm for learning discrete latent representations in such deep models. Additionall… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  24. arXiv:2301.09420  [pdf, other

    cs.LG cs.AI cs.CV

    On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS Environment

    Authors: Ansh Mittal, Aditya Malte

    Abstract: Multi-Agent RL or MARL is one of the complex problems in Autonomous Driving literature that hampers the release of fully-autonomous vehicles today. Several simulators have been in iteration after their inception to mitigate the problem of complex scenarios with multiple agents in Autonomous Driving. One such simulator--SMARTS, discusses the importance of cooperative multi-agent learning. For this… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: 6 pages, 5 figures

  25. arXiv:2212.14125  [pdf, other

    cs.HC

    MuTable (Music Table): Turn any surface into musical instrument

    Authors: Akash Mittal, Ragini Gupta

    Abstract: With the rise in pervasive computing solutions, interactive surfaces have gained a large popularity across multi-application domains including smart boards for education, touch-enabled kiosks for smart retail and smart mirrors for smart homes. Despite the increased popularity of such interactive surfaces, existing platforms are mostly limited to custom built surfaces with attached sensors and hard… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  26. arXiv:2212.05933  [pdf, other

    q-fin.ST cs.LG

    Nostradamus: Weathering Worth

    Authors: Alapan Chaudhuri, Zeeshan Ahmed, Ashwin Rao, Shivansh Subramanian, Shreyas Pradhan, Abhishek Mittal

    Abstract: Nostradamus, inspired by the French astrologer and reputed seer, is a detailed study exploring relations between environmental factors and changes in the stock market. In this paper, we analyze associative correlation and causation between environmental elements (including natural disasters, climate and weather conditions) and stock prices, using historical stock market data, historical climate da… ▽ More

    Submitted 17 January, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 13 pages, 13 figures; updated abstract; updated format to Springer LNCS

  27. arXiv:2212.04122  [pdf, other

    cs.MA cs.GT

    Reducing Collision Risk in Multi-Agent Path Planning: Application to Air traffic Management

    Authors: Sarah H. Q. Li, Avi Mittal, Pierre-Loïc Garoche, Açıkmeşe, Behçet

    Abstract: To minimize collision risks in the multi-agent path planning problem with stochastic transition dynamics, we formulate a Markov decision process congestion game with a multi-linear congestion cost. Players within the game complete individual tasks while minimizing their own collision risks. We show that the set of Nash equilibria coincides with the first-order KKT points of a non-convex optimizati… ▽ More

    Submitted 10 December, 2022; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 6 pages, 2 figures

  28. arXiv:2210.16892  [pdf, other

    cs.LG

    Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training

    Authors: Ashish Mittal, Durga Sivasubramanian, Rishabh Iyer, Preethi Jyothi, Ganesh Ramakrishnan

    Abstract: Training state-of-the-art ASR systems such as RNN-T often has a high associated financial and environmental cost. Training with a subset of training data could mitigate this problem if the subset selected could achieve on-par performance with training with the entire dataset. Although there are many data subset selection(DSS) algorithms, direct application to the RNN-T is difficult, especially the… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  29. Estimation methods for elementary chirp model parameters

    Authors: Anjali Mittal, Rhythm Grover, Debasis Kundu, Amit Mitra

    Abstract: In this paper, we propose some estimation techniques to estimate the elementary chirp model parameters, which are encountered in sonar, radar, acoustics, and other areas. We derive asymptotic theoretical properties of least squares estimators and approximate least squares estimators for the one-component elementary chirp model. It is proved that the proposed estimators are strongly consistent and… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  30. Alternate stabilization methods for CZTSSe photovoltaic devices by thermal treatment, dark electric bias and illumination

    Authors: W. Ananda, M. Rennhofer, A. Mittal, N. Zechner, W. Lang

    Abstract: Reliable measurement routines are crucial for power rating and yield prediction of photovoltaic emerging thinfilm technologies. Copper-Zinc-Tin-Sulfur-Selenium (CZTSSe) thin-film photovoltaic devices are an emerging technology made of abundant elements. Still, sufficient stabilization methods prior to electric power measurement are missing in the international standardization, while existing stand… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

  31. arXiv:2207.11838  [pdf, other

    cs.CV cs.AI

    SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions

    Authors: Ansh Mittal, Shuvam Ghosal, Rishibha Bansal

    Abstract: Detecting suspicious activities in surveillance videos is a longstanding problem in real-time surveillance that leads to difficulties in detecting crimes. Hence, we propose a novel approach for detecting and summarizing suspicious activities in surveillance videos. We have also created ground truth summaries for the UCF-Crime video dataset. We modify a pre-existing approach for this task by levera… ▽ More

    Submitted 22 October, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: 14 pages, 6 figures, 6 tables

  32. arXiv:2207.04452  [pdf, other

    cs.LG cs.IR

    NGAME: Negative Mining-aware Mini-batching for Extreme Classification

    Authors: Kunal Dahiya, Nilesh Gupta, Deepak Saini, Akshay Soni, Yajun Wang, Kushal Dave, Jian Jiao, Gururaj K, Prasenjit Dey, Amit Singh, Deepesh Hada, Vidit Jain, Bhawna Paliwal, Anshul Mittal, Sonu Mehta, Ramachandran Ramjee, Sumeet Agarwal, Purushottam Kar, Manik Varma

    Abstract: Extreme Classification (XC) seeks to tag data points with the most relevant subset of labels from an extremely large label set. Performing deep XC with dense, learnt representations for data points and labels has attracted much attention due to its superiority over earlier XC methods that used sparse, hand-crafted features. Negative mining techniques have emerged as a critical component of all dee… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  33. arXiv:2205.01825  [pdf, other

    cs.CL cs.AI cs.LG

    AmbiPun: Generating Humorous Puns with Ambiguous Context

    Authors: Anirudh Mittal, Yufei Tian, Nanyun Peng

    Abstract: In this paper, we propose a simple yet effective way to generate pun sentences that does not require any training on existing puns. Our approach is inspired by humor theories that ambiguity comes from the context rather than the pun word itself. Given a pair of definitions of a pun word, our model first produces a list of related concepts through a reverse dictionary. We then utilize one-shot GPT3… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: To appear in NAACL 2022

  34. arXiv:2203.13628  [pdf, other

    cs.SD cs.CL eess.AS

    DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning

    Authors: Sreyan Ghosh, Ashish Seth, and Deepak Mittal, Maneesh Singh, S. Umesh

    Abstract: Inspired by the recent progress in self-supervised learning for computer vision, in this paper we introduce DeLoRes, a new general-purpose audio representation learning approach. Our main objective is to make our network learn representations in a resource-constrained setting (both data and compute), that can generalize well across a diverse set of downstream tasks. Inspired from the Barlow Twins… ▽ More

    Submitted 26 June, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted to AAAI 2022 workshop on Self-supervised Learning for Audio and Speech Processing

  35. arXiv:2203.02317  [pdf, other

    cs.CL cs.LG

    Adaptive Discounting of Implicit Language Models in RNN-Transducers

    Authors: Vinit Unni, Shreya Khare, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi, Samarth Bharadwaj

    Abstract: RNN-Transducer (RNN-T) models have become synonymous with streaming end-to-end ASR systems. While they perform competitively on a number of evaluation categories, rare words pose a serious challenge to RNN-T models. One main reason for the degradation in performance on rare words is that the language model (LM) internal to RNN-Ts can become overconfident and lead to hallucinated predictions that a… ▽ More

    Submitted 21 February, 2022; originally announced March 2022.

    Comments: Proceedings for ICASSP 2022

  36. arXiv:2201.11479  [pdf, other

    cs.CV cs.AI cs.MM eess.IV

    Eye-focused Detection of Bell's Palsy in Videos

    Authors: Sharik Ali Ansari, Koteswar Rao Jerripothula, Pragya Nagpal, Ankush Mittal

    Abstract: In this paper, we present how Bell's Palsy, a neurological disorder, can be detected just from a subject's eyes in a video. We notice that Bell's Palsy patients often struggle to blink their eyes on the affected side. As a result, we can observe a clear contrast between the blinking patterns of the two eyes. Although previous works did utilize images/videos to detect this disorder, none have expli… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Published in the Proceedings of the 34th Canadian Conference on Artificial Intelligence. Please cite this paper in the following manner: S. A. Ansari, K. R. Jerripothula, P. Nagpal, and A. Mittal. "Eye-focused Detection of Bell's Palsy in Videos". In: Proceedings of the 34th Canadian Conference on Artificial Intelligence (June 8, 2021). doi: 10.21428/594757db.d2f8342b

  37. arXiv:2201.11407  [pdf, other

    cs.CV cs.AI

    Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions

    Authors: Saikat Dutta, Arulkumar Subramaniam, Anurag Mittal

    Abstract: Video frame interpolation aims to synthesize one or multiple frames between two consecutive frames in a video. It has a wide range of applications including slow-motion video generation, frame-rate up-scaling and develo** video codecs. Some older works tackled this problem by assuming per-pixel linear motion between video frames. However, objects often follow a non-linear motion pattern in the r… ▽ More

    Submitted 12 April, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted at CLIC workshop, CVPR 2022. Code: https://github.com/saikatdutta/NME-VFI

  38. arXiv:2112.14314  [pdf, other

    cs.LG

    Improving Prediction of Cognitive Performance using Deep Neural Networks in Sparse Data

    Authors: Sharath Koorathota, Arunesh Mittal, Richard P. Sloan, Paul Sajda

    Abstract: Cognition in midlife is an important predictor of age-related mental decline and statistical models that predict cognitive performance can be useful for predicting decline. However, existing models struggle to capture complex relationships between physical, sociodemographic, psychological and mental health factors that effect cognition. Using data from an observational, cohort study, Midlife in th… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

  39. arXiv:2112.03984  [pdf, other

    cs.CL cs.AI

    Emotion-Cause Pair Extraction in Customer Reviews

    Authors: Arpit Mittal, Jeel Tejaskumar Vaishnav, Aishwarya Kaliki, Nathan Johns, Wyatt Pease

    Abstract: Emotion-Cause Pair Extraction (ECPE) is a complex yet popular area in Natural Language Processing due to its importance and potential applications in various domains. In this report , we aim to present our work in ECPE in the domain of online reviews. With a manually annotated dataset, we explore an algorithm to extract emotion cause pairs using a neural network. In addition, we propose a model us… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 7 Pages, 8 Figures

  40. arXiv:2111.13972  [pdf, other

    cs.CL

    Tap** BERT for Preposition Sense Disambiguation

    Authors: Siddhesh Pawar, Shyam Thombre, Anirudh Mittal, Girishkumar Ponkiya, Pushpak Bhattacharyya

    Abstract: Prepositions are frequently occurring polysemous words. Disambiguation of prepositions is crucial in tasks like semantic role labelling, question answering, text entailment, and noun compound paraphrasing. In this paper, we propose a novel methodology for preposition sense disambiguation (PSD), which does not use any linguistic tools. In a supervised setting, the machine learning model is presente… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    ACM Class: I.2.7

  41. arXiv:2111.07370  [pdf, other

    cs.CV

    Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks

    Authors: Arulkumar Subramaniam, Jayesh Vaidya, Muhammed Abdul Majeed Ameen, Athira Nambiar, Anurag Mittal

    Abstract: Video-based computer vision tasks can benefit from estimation of the salient regions and interactions between those regions. Traditionally, this has been done by identifying the object regions in the images by utilizing pre-trained models to perform object detection, object segmentation and/or object pose estimation. Although using pre-trained models is a viable approach, it has several limitation… ▽ More

    Submitted 1 August, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: 26 pages, 14 figures, Preprint submitted to CVIU journal

  42. DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents

    Authors: Kunal Dahiya, Deepak Saini, Anshul Mittal, Ankush Shaw, Kushal Dave, Akshay Soni, Himanshu Jain, Sumeet Agarwal, Manik Varma

    Abstract: Scalability and accuracy are well recognized challenges in deep extreme multi-label learning where the objective is to train architectures for automatically annotating a data point with the most relevant subset of labels from an extremely large label set. This paper develops the DeepXML framework that addresses these challenges by decomposing the deep extreme multi-label task into four simpler sub… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    ACM Class: F.2.2; I.2.7

    Journal ref: Web Search and Data Mining 2021

  43. arXiv:2111.00490  [pdf, other

    cs.CL cs.AI

    DSC-IITISM at FinCausal 2021: Combining POS tagging with Attention-based Contextual Representations for Identifying Causal Relationships in Financial Documents

    Authors: Gunjan Haldar, Aman Mittal, Pradyumna Gupta

    Abstract: Causality detection draws plenty of attention in the field of Natural Language Processing and linguistics research. It has essential applications in information retrieval, event prediction, question answering, financial analysis, and market research. In this study, we explore several methods to identify and extract cause-effect pairs in financial documents using transformers. For this purpose, we… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: 5 pages, 5 tables

    MSC Class: 68T50 (Primary); 91F20 (Secondary) ACM Class: I.2.7

  44. arXiv:2110.12765  [pdf, other

    cs.CL cs.AI

    "So You Think You're Funny?": Rating the Humour Quotient in Standup Comedy

    Authors: Anirudh Mittal, Pranav Jeevan, Prerak Gandhi, Diptesh Kanojia, Pushpak Bhattacharyya

    Abstract: Computational Humour (CH) has attracted the interest of Natural Language Processing and Computational Linguistics communities. Creating datasets for automatic measurement of humour quotient is difficult due to multiple possible interpretations of the content. In this work, we create a multi-modal humour-annotated dataset ($\sim$40 hours) using stand-up comedy clips. We devise a novel scoring mecha… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Accepted at EMNLP 2021 Main Conference (short papers); 4 pages, 1 figure, 3 tables

  45. arXiv:2109.06488  [pdf

    cs.CL

    Multilevel profiling of situation and dialogue-based deep networks for movie genre classification using movie trailers

    Authors: Dinesh Kumar Vishwakarma, Mayank **dal, Ayush Mittal, Aditya Sharma

    Abstract: Automated movie genre classification has emerged as an active and essential area of research and exploration. Short duration movie trailers provide useful insights about the movie as video content consists of the cognitive and the affective level features. Previous approaches were focused upon either cognitive or affective content analysis. In this paper, we propose a novel multi-modality: situati… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: 21 pages, 7 figures

  46. arXiv:2108.12585  [pdf, other

    cs.CV

    On the Significance of Question Encoder Sequence Model in the Out-of-Distribution Performance in Visual Question Answering

    Authors: Gouthaman KV, Anurag Mittal

    Abstract: Generalizing beyond the experiences has a significant role in develo** practical AI systems. It has been shown that current Visual Question Answering (VQA) models are over-dependent on the language-priors (spurious correlations between question-types and their most frequent answers) from the train set and pose poor performance on Out-of-Distribution (OOD) test sets. This conduct limits their gen… ▽ More

    Submitted 21 December, 2021; v1 submitted 28 August, 2021; originally announced August 2021.

  47. arXiv:2108.00368  [pdf, other

    cs.CL cs.IR cs.LG

    DECAF: Deep Extreme Classification with Label Features

    Authors: Anshul Mittal, Kunal Dahiya, Sheshansh Agrawal, Deepak Saini, Sumeet Agarwal, Purushottam Kar, Manik Varma

    Abstract: Extreme multi-label classification (XML) involves tagging a data point with its most relevant subset of labels from an extremely large label set, with several applications such as product-to-product recommendation with millions of products. Although leading XML algorithms scale to millions of labels, they largely ignore label meta-data such as textual descriptions of the labels. On the other hand,… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    ACM Class: F.2.2; I.2.7

    Journal ref: Web Search and Data Mining 2021

  48. arXiv:2108.00261  [pdf, other

    cs.CL cs.IR cs.LG

    ECLARE: Extreme Classification with Label Graph Correlations

    Authors: Anshul Mittal, Noveen Sachdeva, Sheshansh Agrawal, Sumeet Agarwal, Purushottam Kar, Manik Varma

    Abstract: Deep extreme classification (XC) seeks to train deep architectures that can tag a data point with its most relevant subset of labels from an extremely large label set. The core utility of XC comes from predicting labels that are rarely seen during training. Such rare labels hold the key to personalized recommendations that can delight and surprise a user. However, the large number of rare labels a… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    ACM Class: F.2.2; I.2.7

    Journal ref: The Web Conference 2021

  49. arXiv:2107.07842  [pdf, other

    cs.IR cs.AI

    A Survey of Knowledge Graph Embedding and Their Applications

    Authors: Shivani Choudhary, Tarun Luthra, Ashima Mittal, Rajat Singh

    Abstract: Knowledge Graph embedding provides a versatile technique for representing knowledge. These techniques can be used in a variety of applications such as completion of knowledge graph to predict missing information, recommender systems, question answering, query expansion, etc. The information embedded in Knowledge graph though being structured is challenging to consume in a real-world application. K… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 9 figures

  50. Representation based meta-learning for few-shot spoken intent recognition

    Authors: Ashish Mittal, Samarth Bharadwaj, Shreya Khare, Saneem Chemmengath, Karthik Sankaranarayanan, Brian Kingsbury

    Abstract: Spoken intent detection has become a popular approach to interface with various smart devices with ease. However, such systems are limited to the preset list of intents-terms or commands, which restricts the quick customization of personal devices to new intents. This paper presents a few-shot spoken intent classification approach with task-agnostic representations via meta-learning paradigm. Spec… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: Accepted paper at Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October, 2020