
Showing 1–49 of 49 results for author: Prabhu, A

Searching in archive cs.
  1. arXiv:2406.17249  [pdf, other]

    cs.RO

    SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation

    Authors: Xu Liu, Jiuzhou Lei, Ankit Prabhu, Yuezhan Tao, Igor Spasojevic, Pratik Chaudhari, Nikolay Atanasov, Vijay Kumar

    Abstract: This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Mapping (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Preliminary release

  2. arXiv:2404.09447  [pdf, other]

    cs.CV cs.LG

    kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

    Authors: Zhongrui Gui, Shuyang Sun, Runjia Li, Jianhao Yuan, Zhaochong An, Karsten Roth, Ameya Prabhu, Philip Torr

    Abstract: Rapid advancements in continual segmentation have yet to bridge the gap of scaling to large continually expanding vocabularies under compute-constrained scenarios. We discover that traditional continual training leads to catastrophic forgetting under compute constraints, unable to outperform zero-shot segmentation methods. We introduce a novel strategy for semantic and panoptic segmentation with z…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures

  3. arXiv:2404.06405  [pdf, other]

    cs.AI cs.CG cs.CL cs.LG

    Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

    Authors: Shiven Sinha, Ameya Prabhu, Ponnurangam Kumaraguru, Siddharth Bhat, Matthias Bethge

    Abstract: Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 2…

    Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Work in Progress. Released for wider feedback

  4. arXiv:2404.05764  [pdf]

    eess.IV cs.CV

    Study of the effect of Sharpness on Blind Video Quality Assessment

    Authors: Anantha Prabhu, David Pratap, Narayana Darapeni, Anwesh P R

    Abstract: Introduction: Video Quality Assessment (VQA) is one of the important areas of study in this modern era, where video is a crucial component of communication with applications in every field. Rapid technology developments in mobile technology enabled anyone to create videos resulting in a varied range of video quality scenarios. Objectives: Though VQA was present for some time with the classical met…

    Submitted 6 April, 2024; originally announced April 2024.

  5. arXiv:2404.04125  [pdf, other]

    cs.CV cs.CL cs.LG

    No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

    Authors: Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge

    Abstract: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation. However, it is unclear how meaningful the notion of "zero-shot" generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream conce…

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Extended version of the short paper accepted at DPFM, ICLR'24

  6. arXiv:2402.19472  [pdf, other]

    cs.LG cs.CV

    Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress

    Authors: Ameya Prabhu, Vishaal Udandarao, Philip Torr, Matthias Bethge, Adel Bibi, Samuel Albanie

    Abstract: Standardized benchmarks drive progress in machine learning. However, with repeated testing, the risk of overfitting grows as algorithms over-exploit benchmark idiosyncrasies. In our work, we seek to mitigate this challenge by compiling ever-expanding large-scale benchmarks called Lifelong Benchmarks. As exemplars of our approach, we create Lifelong-CIFAR10 and Lifelong-ImageNet, containing (for no…

    Submitted 29 February, 2024; originally announced February 2024.

  7. arXiv:2402.14015  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Corrective Machine Unlearning

    Authors: Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal

    Abstract: Machine Learning models increasingly face data integrity challenges due to the use of large-scale training datasets drawn from the internet. We study what model developers can do if they detect that some data was manipulated or incorrect. Such manipulated data can cause adverse effects like vulnerability to backdoored samples, systematic biases, and in general, reduced accuracy on certain input do…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 17 pages, 7 figures

  8. arXiv:2402.08823  [pdf, other]

    cs.CV cs.LG

    RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

    Authors: Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

    Abstract: We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks acro…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Tech Report

  9. arXiv:2311.11293  [pdf, other]

    cs.LG

    From Categories to Classifier: Name-Only Continual Learning by Exploring the Web

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Ser-Nam Lim, Bernard Ghanem, Philip H. S. Torr, Adel Bibi

    Abstract: Continual Learning (CL) often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice. We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation. In this scenario, learners adapt to new category shifts using only category names without the luxury of annot…

    Submitted 19 November, 2023; originally announced November 2023.

  10. arXiv:2310.02162  [pdf, other]

    cs.RO

    TreeScope: An Agricultural Robotics Dataset for LiDAR-Based Mapping of Trees in Forests and Orchards

    Authors: Derek Cheng, Fernando Cladera Ojeda, Ankit Prabhu, Xu Liu, Alan Zhu, Patrick Corey Green, Reza Ehsani, Pratik Chaudhari, Vijay Kumar

    Abstract: Data collection for forestry, timber, and agriculture currently relies on manual techniques which are labor-intensive and time-consuming. We seek to demonstrate that robotics offers improvements over these techniques and accelerate agricultural research, beginning with semantic segmentation and diameter estimation of trees in forests and orchards. We present TreeScope v1.0, the first robotics data…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) for review

  11. arXiv:2309.04502  [pdf, other]

    cs.CV

    On the Efficacy of Multi-scale Data Samplers for Vision Applications

    Authors: Elvis Nunez, Thomas Merth, Anish Prabhu, Mehrdad Farajtabar, Mohammad Rastegari, Sachin Mehta, Maxwell Horton

    Abstract: Multi-scale resolution training has seen an increased adoption across multiple vision tasks, including classification and detection. Training with smaller resolutions enables faster training at the expense of a drop in accuracy. Conversely, training with larger resolutions has been shown to improve performance, but memory constraints often make this infeasible. In this paper, we empirically study…

    Submitted 8 September, 2023; originally announced September 2023.

  12. arXiv:2308.06658  [pdf, other]

    cs.RO

    Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

    Authors: Igor Spasojevic, Xu Liu, Ankit Prabhu, Alejandro Ribeiro, George J. Pappas, Vijay Kumar

    Abstract: This paper addresses the problem of active collaborative localization in heterogeneous robot teams with unknown data association. It involves positioning a small number of identical unmanned ground vehicles (UGVs) at desired positions so that an unmanned aerial vehicle (UAV) can, through unlabelled measurements of UGVs, uniquely determine its global pose. We model the problem as a sequential two p…

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: To appear in IROS 2023

  13. arXiv:2306.09479  [pdf, other]

    cs.CL cs.AI cs.CY

    Inverse Scaling: When Bigger Isn't Better

    Authors: Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, et al. (2 additional authors not shown)

    Abstract: Work on scaling laws has found that large language models (LMs) show predictable improvements to overall loss with increased scale (model size, training data, and compute). Here, we present evidence for the claim that LMs may show inverse scaling, or worse task performance with increased scale, e.g., due to flaws in the training objective and data. We present empirical evidence of inverse scaling…

    Submitted 12 May, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Published in TMLR (2023), 39 pages

    Journal ref: Transactions on Machine Learning Research (TMLR), 10/2023, https://openreview.net/forum?id=DwgRm72GQF

  14. arXiv:2305.09275  [pdf, other]

    cs.LG cs.AI cs.CV

    Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

    Authors: Hasan Abed Al Kader Hammoud, Ameya Prabhu, Ser-Nam Lim, Philip H. S. Torr, Adel Bibi, Bernard Ghanem

    Abstract: We revisit the common practice of evaluating adaptation of Online Continual Learning (OCL) algorithms through the metric of online accuracy, which measures the accuracy of the model on the immediate next few samples. However, we show that this metric is unreliable, as even vacuous blind classifiers, which do not use input images for prediction, can achieve unrealistically high online accuracy by e…

    Submitted 16 May, 2023; originally announced May 2023.

  15. arXiv:2305.09253  [pdf, other]

    cs.CV cs.LG

    Online Continual Learning Without the Storage Constraint

    Authors: Ameya Prabhu, Zhipeng Cai, Puneet Dokania, Philip Torr, Vladlen Koltun, Ozan Sener

    Abstract: Traditional online continual learning (OCL) research has primarily focused on mitigating catastrophic forgetting with fixed and limited storage allocation throughout an agent's lifetime. However, a broad range of real-world applications are primarily constrained by computational costs rather than storage limitations. In this paper, we target such applications, investigating the online continual le…

    Submitted 2 November, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Tech Report [Additional Experiments and Improved ACM]

  16. arXiv:2303.11165  [pdf, other]

    cs.LG cs.CV

    Computationally Budgeted Continual Learning: What Does Matter?

    Authors: Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi

    Abstract: Continual Learning (CL) aims to sequentially train models on streams of incoming data that vary in distribution by preserving previous knowledge while adapting to new data. Current CL literature focuses on restricted access to previously seen data, while imposing no constraints on the computational budget for training. This is unreasonable for applications in-the-wild, where systems are primarily…

    Submitted 14 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  17. arXiv:2302.01047  [pdf, other]

    cs.LG cs.AI cs.CV

    Real-Time Evaluation in Online Continual Learning: A New Hope

    Authors: Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip H. S. Torr, Bernard Ghanem

    Abstract: Current evaluations of Continual Learning (CL) methods typically assume that there is no constraint on training time and computation. This is an unrealistic assumption for any real-world setting, which motivates us to propose: a practical real-time evaluation of continual learning, in which the stream does not wait for the model to complete training before revealing the next data for predictions.…

    Submitted 24 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted at CVPR'23 as Highlight (Top 2.5%)

  18. arXiv:2211.16882  [pdf, other]

    cs.CV cs.RO

    MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

    Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

    Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented…

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

  19. arXiv:2211.02946  [pdf, other]

    cs.HC cs.RO

    HREyes: Design, Development, and Evaluation of a Novel Method for AUVs to Communicate Information and Gaze Direction

    Authors: Michael Fulton, Aditya Prabhu, Junaed Sattar

    Abstract: We present the design, development, and evaluation of HREyes: biomimetic communication devices which use light to communicate information and, for the first time, gaze direction from AUVs to humans. First, we introduce two types of information displays using the HREye devices: active lucemes and ocular lucemes. Active lucemes communicate information explicitly through animations, while ocular luce…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: Under submission at ICRA23

  20. Active Metric-Semantic Mapping by Multiple Aerial Robots

    Authors: Xu Liu, Ankit Prabhu, Fernando Cladera, Ian D. Miller, Lifeng Zhou, Camillo J. Taylor, Vijay Kumar

    Abstract: Traditional approaches for active mapping focus on building geometric maps. For most real-world applications, however, actionable information is related to semantically meaningful objects in the environment. We propose an approach to the active metric-semantic mapping problem that enables multiple heterogeneous robots to collaboratively build a map of the environment. The robots actively explore t…

    Submitted 13 August, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: ICRA 2023 (2023 International Conference on Robotics and Automation)

    Journal ref: ICRA 2023 (2023 International Conference on Robotics and Automation)

  21. arXiv:2207.10237  [pdf, other]

    cs.CV

    SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks

    Authors: Chien-Yu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari

    Abstract: Recent isotropic networks, such as ConvMixer and vision transformers, have found significant success across visual recognition tasks, matching or outperforming non-isotropic convolutional neural networks (CNNs). Isotropic architectures are particularly well-suited to cross-layer weight sharing, an effective neural network compression technique. In this paper, we perform an empirical evaluation on…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  22. arXiv:2202.04433  [pdf, other]

    cs.CY

    Co-WIN: Really Winning? Analysing Inequity in India's Vaccination Response

    Authors: Tanvi Karandikar, Avinash Prabhu, Mehul Mathur, Megha Arora, Hemank Lamba, Ponnurangam Kumaraguru

    Abstract: The COVID-19 pandemic has so far accounted for reported 5.5M deaths worldwide, with 8.7% of these coming from India. The pandemic exacerbated the weakness of the Indian healthcare system. As of January 20, 2022, India is the second worst affected country with 38.2M reported cases and 487K deaths. According to epidemiologists, vaccines are an essential tool to prevent the spread of the pandemic. In…

    Submitted 5 June, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

  23. arXiv:2201.06640  [pdf, other]

    cs.LG cs.CV

    Towards Adversarial Evaluations for Inexact Machine Unlearning

    Authors: Shashwat Goel, Ameya Prabhu, Amartya Sanyal, Ser-Nam Lim, Philip Torr, Ponnurangam Kumaraguru

    Abstract: Machine Learning models face increased concerns regarding the storage of personal user data and adverse impacts of corrupted data like backdoors or systematic bias. Machine Unlearning can address these by allowing post-hoc deletion of affected training data from a learned model. Achieving this task exactly is computationally expensive; consequently, recent works have proposed inexact unlearning al…

    Submitted 22 February, 2023; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: Tech Report

  24. arXiv:2112.00448  [pdf, other]

    cs.CV

    On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification

    Authors: Rutika Moharir, Arun D Prabhu, Sukumar Moharana, Gopi Ramena, Rachit S Munjal

    Abstract: Automatic identification of script is an essential component of a multilingual OCR engine. In this paper, we present an efficient, lightweight, real-time and on-device spatial attention based CNN-LSTM network for scene text script identification, feasible for deployment on resource constrained mobile devices. Our network consists of a CNN, equipped with a spatial attention module which helps reduc…

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in CVIP 2021

  25. arXiv:2111.12395  [pdf, other]

    cs.SI

    I'll be back: Examining Restored Accounts On Twitter

    Authors: Arnav Kapoor, Rishi Raj Jain, Avinash Prabhu, Tanvi Karandikar, Ponnurangam Kumaraguru

    Abstract: Online social networks like Twitter actively monitor their platform to identify accounts that go against their rules. Twitter enforces account level moderation, i.e. suspension of a Twitter account in severe cases of platform abuse. A point of note is that these suspensions are sometimes temporary and even incorrect. Twitter provides a redressal mechanism to 'restore' suspended accounts. We refer…

    Submitted 24 November, 2021; originally announced November 2021.

  26. arXiv:2110.15923  [pdf, other]

    cs.SI

    Efficient Representation of Interaction Patterns with Hyperbolic Hierarchical Clustering for Classification of Users on Twitter

    Authors: Tanvi Karandikar, Avinash Prabhu, Avinash Tulasi, Arun Balaji Buduru, Ponnurangam Kumaraguru

    Abstract: Social media platforms play an important role in democratic processes. During the 2019 General Elections of India, political parties and politicians widely used Twitter to share their ideals, advocate their agenda and gain popularity. Twitter served as a ground for journalists, politicians and voters to interact. The organic nature of these interactions can be upended by malicious accounts on Twit…

    Submitted 1 November, 2021; v1 submitted 29 October, 2021; originally announced October 2021.

  27. arXiv:2110.14197  [pdf, other]

    cs.AR cs.LG

    Encoder-Decoder Networks for Analyzing Thermal and Power Delivery Networks

    Authors: Vidya A. Chhabria, Vipul Ahuja, Ashwath Prabhu, Nikhil Patil, Palkesh Jain, Sachin S. Sapatnekar

    Abstract: Power delivery network (PDN) analysis and thermal analysis are computationally expensive tasks that are essential for successful IC design. Algorithmically, both these analyses have similar computational structure and complexity as they involve the solution to a partial differential equation of the same form. This paper converts these analyses into image-to-image and sequence-to-sequence translati…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 26 pages, 17 figures, Submitted to TODAES for review. arXiv admin note: text overlap with arXiv:2009.09009

  28. arXiv:2110.04252  [pdf, other]

    cs.LG cs.CV

    LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time

    Authors: Elvis Nunez, Maxwell Horton, Anish Prabhu, Anurag Ranjan, Ali Farhadi, Mohammad Rastegari

    Abstract: When deploying deep learning models to a device, it is traditionally assumed that available computational resources (compute, memory, and power) remain static. However, real-world computing systems do not always provide stable resource guarantees. Computational resources need to be conserved when load from other processes is high or battery power is low. Inspired by recent works on neural network…

    Submitted 8 October, 2021; originally announced October 2021.

  29. arXiv:2110.03860  [pdf, other]

    cs.CV cs.LG

    Token Pooling in Vision Transformers

    Authors: Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel

    Abstract: Despite the recent success in many applications, the high computational requirements of vision transformers limit their use in resource-constrained settings. While many existing methods improve the quadratic complexity of attention, in most vision transformers, self-attention is not the major computation bottleneck, e.g., more than 80% of the computation is spent on fully-connected layers. To impr…

    Submitted 11 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2023

  30. STRIDE : Scene Text Recognition In-Device

    Authors: Rachit S Munjal, Arun D Prabhu, Nikhil Arora, Sukumar Moharana, Gopi Ramena

    Abstract: Optical Character Recognition (OCR) systems have been widely used in various applications for extracting semantic information from images. To give the user more control over their privacy, an on-device solution is needed. The current state-of-the-art models are too heavy and complex to be deployed on-device. We develop an efficient lightweight scene text recognition (STR) system, which has only 0.…

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: accepted in IJCNN 2021

  31. arXiv:2104.01261  [pdf, other]

    cs.SI

    Small World Student Network at the University of Texas at Dallas in Times of Social Distancing

    Authors: Kailash Subramanian, Joshua M. Williams, Daniel C. DeAnda, Aditya A. Agrawal, Andrei Racila, Aditi R. Prabhu, Lawrence Redlinger, Christopher Wendt, Ravi Prakash

    Abstract: To limit the spread of the novel coronavirus on college campuses, a common strategy for the Fall 2020 and Spring 2021 terms has been to offer instruction weighted toward hybrid or fully online modalities. Colleges are now considering whether and how to expand hybrid or fully in-person instruction for future terms, and learn lessons from this experience for future use. Our paper uses Fall 2019 enro…

    Submitted 2 April, 2021; originally announced April 2021.

  32. arXiv:2104.00795  [pdf, other]

    cs.LG cs.CV

    No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks

    Authors: Shyamgopal Karthik, Ameya Prabhu, Puneet K. Dokania, Vineet Gandhi

    Abstract: There has been increasing interest in building deep hierarchy-aware classifiers that aim to quantify and reduce the severity of mistakes, and not just reduce the number of errors. The idea is to exploit the label hierarchy (e.g., the WordNet ontology) and consider graph distances as a proxy for mistake severity. Surprisingly, on examining mistake-severity distributions of the top-1 prediction, we…

    Submitted 1 April, 2021; originally announced April 2021.

  33. Monocular Multi-Layer Layout Estimation for Warehouse Racks

    Authors: Meher Shashwat Nigam, Avinash Prabhu, Anurag Sahu, Puru Gupta, Tanvi Karandikar, N. Sai Shankar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

    Abstract: Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, Rac…

    Submitted 28 October, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Visit our project repository at https://github.com/Avinash2468/RackLay

  34. arXiv:2101.06914  [pdf, other]

    cs.CY cs.SI

    Capitol (Pat)riots: A comparative study of Twitter and Parler

    Authors: Hitkul, Avinash Prabhu, Dipanwita Guhathakurta, Jivitesh jain, Mallika Subramanian, Manvith Reddy, Shradha Sehgal, Tanvi Karandikar, Amogh Gulati, Udit Arora, Rajiv Ratn Shah, Ponnurangam Kumaraguru

    Abstract: On 6 January 2021, a mob of right-wing conservatives stormed the USA Capitol Hill interrupting the session of congress certifying 2020 Presidential election results. Immediately after the start of the event, posts related to the riots started to trend on social media. A social media platform which stood out was a free speech endorsing social media platform Parler; it is being claimed as the platfo…

    Submitted 18 January, 2021; originally announced January 2021.

  35. arXiv:2012.13427  [pdf]

    cs.IT cs.DL stat.OT

    Reproducible Workflow

    Authors: Anirudh Prabhu, Peter Fox

    Abstract: Reproducibility has been consistently identified as an important component of scientific research. Although there is a general consensus on the importance of reproducibility along with the other commonly used 'R' terminology (i.e., Replicability, Repeatability etc.), there is some disagreement on the usage of these terms, including conflicting definitions used by different parts of the research co…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 7 pages, no figures. Submitted as an entry to the "Encyclopedia of Mathematical Geosciences."

  36. Codeswitched Sentence Creation using Dependency Parsing

    Authors: Dhruval Jain, Arun D Prabhu, Shubham Vatsal, Gopi Ramena, Naresh Purre

    Abstract: Codeswitching has become one of the most common occurrences across multilingual speakers of the world, especially in countries like India which encompasses around 23 official languages with the number of bilingual speakers being around 300 million. The scarcity of Codeswitched data becomes a bottleneck in the exploration of this domain with respect to various Natural Language Processing (NLP) task…

    Submitted 5 December, 2020; originally announced December 2020.

  37. On-Device Sentence Similarity for SMS Dataset

    Authors: Arun D Prabhu, Nikhil Arora, Shubham Vatsal, Gopi Ramena, Sukumar Moharana, Naresh Purre

    Abstract: Determining the sentence similarity between Short Message Service (SMS) texts/sentences plays a significant role in mobile device industry. Gauging the similarity between SMS data is thus necessary for various applications like enhanced searching and navigation, clubbing together SMS of similar type when given a custom label or tag is provided by user irrespective of their sender etc. The problem…

    Submitted 4 December, 2020; originally announced December 2020.

  38. On-Device Text Image Super Resolution

    Authors: Dhruval Jain, Arun D Prabhu, Gopi Ramena, Manoj Goyal, Debi Prasanna Mohanty, Sukumar Moharana, Naresh Purre

    Abstract: Recent research on super-resolution (SR) has witnessed major developments with the advancements of deep convolutional neural networks. There is a need for information extraction from scenic text images or even document images on device, most of which are low-resolution (LR) images. Therefore, SR becomes an essential pre-processing step as Bicubic Upsampling, which is conventionally present in smar…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: Accepted to the International Conference on Pattern Recognition(ICPR), 2020

  39. arXiv:2009.09009  [pdf, other]

    cs.AR cs.AI cs.LG

    Thermal and IR Drop Analysis Using Convolutional Encoder-Decoder Networks

    Authors: Vidya A. Chhabria, Vipul Ahuja, Ashwath Prabhu, Nikhil Patil, Palkesh Jain, Sachin S. Sapatnekar

    Abstract: Computationally expensive temperature and power grid analyses are required during the design cycle to guide IC design. This paper employs encoder-decoder based generative (EDGe) networks to map these analyses to fast and accurate image-to-image and sequence-to-sequence translation tasks. The network takes a power map as input and outputs the corresponding temperature or IR drop map. We propose two…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted in ASP-DAC 2021 conference

  40. arXiv:2006.02609  [pdf, other]

    cs.CV

    Simple Unsupervised Multi-Object Tracking

    Authors: Shyamgopal Karthik, Ameya Prabhu, Vineet Gandhi

    Abstract: Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised re-identification network, thus sidestepping the labeling costs entirely, required for training. Given unlabeled videos, our proposed method (SimpleReID) first gene…

    Submitted 3 June, 2020; originally announced June 2020.

  41. arXiv:1911.11433  [pdf, other]

    cs.LG cs.CV cs.IR eess.IV stat.ML

    "You might also like this model": Data Driven Approach for Recommending Deep Learning Models for Unknown Image Datasets

    Authors: Ameya Prabhu, Riddhiman Dasgupta, Anush Sankaran, Srikanth Tamilselvam, Senthil Mani

    Abstract: For an unknown (new) classification dataset, choosing an appropriate deep learning architecture is often a recursive, time-taking, and laborious process. In this research, we propose a novel technique to recommend a suitable architecture from a repository of known models. Further, we predict the performance accuracy of the recommended architecture on the given unknown dataset, without the need for…

    Submitted 20 May, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019, New in ML Group

  42. arXiv:1909.09389  [pdf, other]

    cs.CL cs.LG

    Sampling Bias in Deep Active Classification: An Empirical Study

    Authors: Ameya Prabhu, Charles Dognin, Maneesh Singh

    Abstract: The exploding cost and time needed for data labeling and model training are bottlenecks for training DNN models on large datasets. Identifying smaller representative data samples with strategies like active learning can help mitigate such bottlenecks. Previous works on active learning in NLP identify the problem of sampling bias in the samples acquired by uncertainty-based querying and develop cos…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: Accepted at EMNLP 2019

  43. arXiv:1906.02256  [pdf, other]

    cs.CV cs.LG

    Butterfly Transform: An Efficient FFT Based Neural Architecture Design

    Authors: Keivan Alizadeh Vahid, Anish Prabhu, Ali Farhadi, Mohammad Rastegari

    Abstract: In this paper, we show that extending the butterfly operations from the FFT algorithm to a general Butterfly Transform (BFT) can be beneficial in building an efficient block structure for CNN designs. Pointwise convolutions, which we refer to as channel fusions, are the main computational bottleneck in the state-of-the-art efficient CNNs (e.g. MobileNets). We introduce a set of criteria for chann…

    Submitted 16 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  44. arXiv:1809.05839  [pdf, other]

    cs.HC cs.LG stat.ML

    A Generic Multi-modal Dynamic Gesture Recognition System using Machine Learning

    Authors: Gautham Krishna G, Karthik Subramanian Nathan, Yogesh Kumar B, Ankith A Prabhu, Ajay Kannan, Vineeth Vijayaraghavan

    Abstract: Human computer interaction facilitates intelligent communication between humans and computers, in which gesture recognition plays a prominent role. This paper proposes a machine learning system to identify dynamic gestures using tri-axial acceleration data acquired from two public datasets. These datasets, uWave and Sony, were acquired using accelerometers embedded in Wii remotes and smartwatches,…

    Submitted 16 September, 2018; originally announced September 2018.

    Comments: Accepted at IEEE Future of Information and Communications Conference (FICC 2018)

  45. arXiv:1804.03867  [pdf, other]

    cs.CV

    Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory

    Authors: Ameya Prabhu, Vishal Batchu, Rohit Gajawada, Sri Aurobindo Munagala, Anoop Namboodiri

    Abstract: Binarization is an extreme network compression approach that provides large computational speedups along with energy and memory savings, albeit at significant accuracy costs. We investigate the question of where to binarize inputs at layer-level granularity and show that selectively binarizing the inputs to specific layers in the network could lead to significant improvements in accuracy while pre…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Accepted in WACV'18 (Oral)

  46. arXiv:1804.02941  [pdf, other]

    cs.CV

    Distribution-Aware Binarization of Neural Networks for Sketch Recognition

    Authors: Ameya Prabhu, Vishal Batchu, Sri Aurobindo Munagala, Rohit Gajawada, Anoop Namboodiri

    Abstract: Deep neural networks are highly effective at a range of computational tasks. However, they tend to be computationally expensive, especially in vision-related problems, and also have large memory requirements. One of the most effective methods to achieve significant improvements in computational/spatial efficiency is to binarize the weights and activations in a network. However, naive binarization…

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: Accepted at WACV '18 (Oral)

  47. arXiv:1711.08757  [pdf, other]

    cs.CV

    Deep Expander Networks: Efficient Deep Networks from Graph Theory

    Authors: Ameya Prabhu, Girish Varma, Anoop Namboodiri

    Abstract: Efficient CNN designs like ResNets and DenseNet were proposed to improve accuracy vs efficiency trade-offs. They essentially increased the connectivity, allowing efficient information flow across layers. Inspired by these techniques, we propose to model connections between filters of a CNN using graphs which are simultaneously sparse and well connected. Sparsity results in efficiency while well co…

    Submitted 26 July, 2018; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: ECCV'18

  48. arXiv:1611.00472  [pdf, other]

    cs.CL

    Towards Sub-Word Level Compositions for Sentiment Analysis of Hindi-English Code Mixed Text

    Authors: Ameya Prabhu, Aditya Joshi, Manish Shrivastava, Vasudeva Varma

    Abstract: Sentiment analysis (SA) using code-mixed data from social media has several applications in opinion mining ranging from customer satisfaction to social campaign analysis in multilingual societies. Advances in this area are impeded by the lack of a suitable annotated dataset. We introduce a Hindi-English (Hi-En) code-mixed dataset for sentiment analysis and perform empirical analysis comparing the…

    Submitted 2 November, 2016; originally announced November 2016.

    Comments: Accepted paper at COLING 2016

  49. arXiv:1610.09756  [pdf, other]

    cs.CL cs.LG

    Towards Deep Learning in Hindi NER: An approach to tackle the Labelled Data Scarcity

    Authors: Vinayak Athavale, Shreenivas Bharadwaj, Monik Pamecha, Ameya Prabhu, Manish Shrivastava

    Abstract: In this paper we describe an end-to-end neural model for Named Entity Recognition (NER) which is based on Bi-Directional RNN-LSTM. Almost all NER systems for Hindi use language-specific features and handcrafted rules with gazetteers. Our model is language independent and uses no domain-specific features or any handcrafted rules. Our models rely on semantic information in the form of word vectors wh…

    Submitted 16 November, 2016; v1 submitted 30 October, 2016; originally announced October 2016.

    Comments: 7 pages

    Report number: https://aclweb.org/anthology/W/W16/W16-6320.pdf