
Showing 1–6 of 6 results for author: Mitheran, S

Searching in archive cs.
  1. arXiv:2207.11250  [pdf, other]

    cs.CV cs.LG eess.IV

    Rich Feature Distillation with Feature Affinity Module for Efficient Image Dehazing

    Authors: Sai Mitheran, Anushri Suresh, Nisha J. S., Varun P. Gopi

    Abstract: Single-image haze removal is a long-standing hurdle for computer vision applications. Several works have been focused on transferring advances from image classification, detection, and segmentation to the niche of image dehazing, primarily focusing on contrastive learning and knowledge distillation. However, these approaches prove computationally expensive, raising concern regarding their applicab…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Preprint version. Accepted at Optik

  2. arXiv:2206.08175  [pdf, other]

    cs.LG

    Not All Lotteries Are Made Equal

    Authors: Surya Kant Sahu, Sai Mitheran, Somya Suhans Mahapatra

    Abstract: The Lottery Ticket Hypothesis (LTH) states that for a reasonably sized neural network, a sub-network within the same network yields no less performance than the dense counterpart when trained from the same initialization. This work investigates the relation between model size and the ease of finding these sparse sub-networks. We show through experiments that, surprisingly, under a finite budget, s…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML 2022 HAET Workshop

  3. Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding

    Authors: Lalithkumar Seenivasan, Sai Mitheran, Mobarakol Islam, Hongliang Ren

    Abstract: Global and local relational reasoning enable scene understanding models to perform human-like scene analysis and understanding. Scene understanding enables better semantic segmentation and object-to-object interaction detection. In the medical domain, a robust surgical scene understanding model allows the automation of surgical skill evaluation, real-time monitoring of surgeon's performance and po…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: Code available at: https://github.com/lalithjets/Global-reasoned-multi-task-model

  4. arXiv:2109.10252   

    cs.LG cs.CL cs.SD eess.AS

    Audiomer: A Convolutional Transformer For Keyword Spotting

    Authors: Surya Kant Sahu, Sai Mitheran, Juhi Kamdar, Meet Gandhi

    Abstract: Transformers have seen an unprecedented rise in Natural Language Processing and Computer Vision tasks. However, in audio tasks, they are either infeasible to train due to extremely large sequence length of audio waveforms or incur a performance penalty when trained on Fourier-based features. In this work, we introduce an architecture, Audiomer, where we combine 1D Residual Networks with Performer…

    Submitted 1 February, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: The results and claims made are incorrect due to data leakage and an erroneous split of datasets

  5. arXiv:2107.06212  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    'CADSketchNet' -- An Annotated Sketch dataset for 3D CAD Model Retrieval with Deep Neural Networks

    Authors: Bharadwaj Manda, Shubham Dhayarkar, Sai Mitheran, V. K. Viekash, Ramanathan Muthuganapathy

    Abstract: Ongoing advancements in the fields of 3D modelling and digital archiving have led to an outburst in the amount of data stored digitally. Consequently, several retrieval systems have been developed depending on the type of data stored in these databases. However, unlike text data or images, performing a search for 3D models is non-trivial. Among 3D models, retrieving 3D Engineering/CAD models or me…

    Submitted 20 July, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: Computers & Graphics Journal, Special Section on 3DOR 2021

    Journal ref: Computers & Graphics, Volume 99, 2021, Pages 100-113, ISSN 0097-8493

  6. arXiv:2107.01516  [pdf, other]

    cs.IR cs.LG

    Introducing Self-Attention to Target Attentive Graph Neural Networks

    Authors: Sai Mitheran, Abhinav Java, Surya Kant Sahu, Arshad Shaikh

    Abstract: Session-based recommendation systems suggest relevant items to users by modeling user behavior and preferences using short-term anonymous sessions. Existing methods leverage Graph Neural Networks (GNNs) that propagate and aggregate information from neighboring nodes i.e., local message passing. Such graph-based architectures have representational limits, as a single sub-graph is susceptible to ove…

    Submitted 7 January, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: Accepted at AISP 2022

    ACM Class: H.3.3; I.2.1