Skip to main content

Showing 1–9 of 9 results for author: Agneeswaran, V

.
  1. arXiv:2404.16112  [pdf, other

    cs.LG cs.AI cs.CV cs.MM eess.IV

    Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges

    Authors: Badri Narayana Patro, Vijay Srinivas Agneeswaran

    Abstract: Sequence modeling is a crucial area across various domains, including Natural Language Processing (NLP), speech recognition, time series forecasting, music generation, and bioinformatics. Recurrent Neural Networks (RNNs) and Long Short Term Memory Networks (LSTMs) have historically dominated sequence modeling tasks like Machine Translation, Named Entity Recognition (NER), etc. However, the advance… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  2. arXiv:2404.09302  [pdf, other

    cs.LG cs.AI cs.DC

    High Significant Fault Detection in Azure Core Workload Insights

    Authors: Pranay Lohia, Laurent Boue, Sharath Rangappa, Vijay Agneeswaran

    Abstract: Azure Core workload insights have time-series data with different metric units. Faults or Anomalies are observed in these time-series data owing to faults observed with respect to metric name, resources region, dimensions, and its dimension value associated with the data. For Azure Core, an important task is to highlight faults or anomalies to the user on a dashboard that they can perceive easily.… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  3. arXiv:2403.18063  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis

    Authors: Badri N. Patro, Suhas Ranganath, Vinay P. Namboodiri, Vijay S. Agneeswaran

    Abstract: Transformers have revolutionized image modeling tasks with adaptations like DeIT, Swin, SVT, Biformer, STVit, and FDVIT. However, these models often face challenges with inductive bias and high quadratic complexity, making them less efficient for high-resolution images. State space models (SSMs) such as Mamba, V-Mamba, ViM, and SiMBA offer an alternative to handle high resolution images in compute… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  4. arXiv:2403.15360  [pdf, other

    cs.CV cs.LG eess.IV eess.SY

    SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series

    Authors: Badri N. Patro, Vijay S. Agneeswaran

    Abstract: Transformers have widely adopted attention networks for sequence mixing and MLPs for channel mixing, playing a pivotal role in achieving breakthroughs across domains. However, recent literature highlights issues with attention networks, including low inductive bias and quadratic complexity concerning input sequence length. State Space Models (SSMs) like S4 and others (Hippo, Global Convolutions, l… ▽ More

    Submitted 24 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2311.01310  [pdf, other

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Scattering Vision Transformer: Spectral Mixing Matters

    Authors: Badri N. Patro, Vijay Srinivas Agneeswaran

    Abstract: Vision transformers have gained significant attention and achieved state-of-the-art performance in various computer vision tasks, including image classification, instance segmentation, and object detection. However, challenges remain in addressing attention complexity and effectively capturing fine-grained information within images. Existing solutions often resort to down-sampling operations, such… ▽ More

    Submitted 20 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted @NeurIPS 2023

  6. arXiv:2304.06446  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    SpectFormer: Frequency and Attention is what you need in a Vision Transformer

    Authors: Badri N. Patro, Vinay P. Namboodiri, Vijay Srinivas Agneeswaran

    Abstract: Vision transformers have been applied successfully for image recognition tasks. There have been either multi-headed self-attention based (ViT \cite{dosovitskiy2020image}, DeIT, \cite{touvron2021training}) similar to the original work in textual models or more recently based on spectral layers (Fnet\cite{lee2021fnet}, GFNet\cite{rao2021global}, AFNO\cite{guibas2021efficient}). We hypothesize that b… ▽ More

    Submitted 14 April, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: The project page is available at this webpage \url{https://badripatro.github.io/SpectFormers/}

  7. arXiv:2302.08374  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Efficiency 360: Efficient Vision Transformers

    Authors: Badri N. Patro, Vijay Srinivas Agneeswaran

    Abstract: Transformers are widely used for solving tasks in natural language processing, computer vision, speech, and music domains. In this paper, we talk about the efficiency of transformers in terms of memory (the number of parameters), computation cost (number of floating points operations), and performance of models, including accuracy, the robustness of the model, and fair \& bias-free features. We ma… ▽ More

    Submitted 23 February, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  8. arXiv:2207.13287  [pdf, other

    cs.LG

    Detecting Concept Drift in the Presence of Sparsity -- A Case Study of Automated Change Risk Assessment System

    Authors: Vishwas Choudhary, Binay Gupta, Anirban Chatterjee, Subhadip Paul, Kunal Banerjee, Vijay Agneeswaran

    Abstract: Missing values, widely called as \textit{sparsity} in literature, is a common characteristic of many real-world datasets. Many imputation methods have been proposed to address this problem of data incompleteness or sparsity. However, the accuracy of a data imputation method for a given feature or a set of features in a dataset is highly dependent on the distribution of the feature values and its c… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  9. arXiv:2108.07951  [pdf, other

    cs.LG cs.AI

    Look Before You Leap! Designing a Human-Centered AI System for Change Risk Assessment

    Authors: Binay Gupta, Anirban Chatterjee, Harika Matha, Kunal Banerjee, Lalitdutt Parsai, Vijay Agneeswaran

    Abstract: Reducing the number of failures in a production system is one of the most challenging problems in technology driven industries, such as, the online retail industry. To address this challenge, change management has emerged as a promising sub-field in operations that manages and reviews the changes to be deployed in production in a systematic manner. However, it is practically impossible to manually… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.