Skip to main content

Showing 1–29 of 29 results for author: Gowda, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05066  [pdf, other

    cs.DS

    Efficient Centroid-Linkage Clustering

    Authors: MohammadHossein Bateni, Laxman Dhulipala, Willem Fletcher, Kishen N Gowda, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki

    Abstract: We give an efficient algorithm for Centroid-Linkage Hierarchical Agglomerative Clustering (HAC), which computes a $c$-approximate clustering in roughly $n^{1+O(1/c^2)}$ time. We obtain our result by combining a new Centroid-Linkage HAC algorithm with a novel fully dynamic data structure for nearest neighbor search which works under adaptive updates. We also evaluate our algorithm empirically. By… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.02929  [pdf, other

    cs.CV cs.LG

    Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models

    Authors: Zihan Ye, Shreyank N. Gowda, Xiaobo **, Xiaowei Huang, Haotian Xu, Yaochu **, Kaizhu Huang

    Abstract: Zero-Shot Learning (ZSL) aims to enable classifiers to identify unseen classes by enhancing data efficiency at the class level. This is achieved by generating image features from pre-defined semantics of unseen classes. However, most current approaches heavily depend on the number of samples from seen classes, i.e. they do not consider instance-level effectiveness. In this paper, we demonstrate th… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2404.19019  [pdf, other

    cs.DS cs.DC

    Optimal Parallel Algorithms for Dendrogram Computation and Single-Linkage Clustering

    Authors: Laxman Dhulipala, Xiaojun Dong, Kishen N Gowda, Yan Gu

    Abstract: Computing a Single-Linkage Dendrogram (SLD) is a key step in the classic single-linkage hierarchical clustering algorithm. Given an input edge-weighted tree $T$, the SLD of $T$ is a binary dendrogram that summarizes the $n-1$ clusterings obtained by contracting the edges of $T$ in order of weight. Existing algorithms for computing the SLD all require $Ω(n\log n)$ work where $n = |T|$. Furthermore,… ▽ More

    Submitted 12 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: To appear at SPAA 2024

  4. arXiv:2404.14730  [pdf, other

    cs.DS cs.CC cs.DC

    It's Hard to HAC with Average Linkage!

    Authors: MohammadHossein Bateni, Laxman Dhulipala, Kishen N Gowda, D Ellis Hershkowitz, Rajesh Jayaram, Jakub Łącki

    Abstract: Average linkage Hierarchical Agglomerative Clustering (HAC) is an extensively studied and applied method for hierarchical clustering. Recent applications to massive datasets have driven significant interest in near-linear-time and efficient parallel algorithms for average linkage HAC. We provide hardness results that rule out such algorithms. On the sequential side, we establish a runtime lower… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: To appear at ICALP 2024

  5. arXiv:2401.17883  [pdf, other

    cs.CV

    Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques

    Authors: Shreyank N Gowda, Yash Thakre, Shashank Narayana Gowda, Xiaobo **

    Abstract: This paper offers a comprehensive analysis of recent advancements in video inpainting techniques, a critical subset of computer vision and artificial intelligence. As a process that restores or fills in missing or corrupted portions of video sequences with plausible content, video inpainting has evolved significantly with the advent of deep learning methodologies. Despite the plethora of existing… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  6. arXiv:2401.11406  [pdf, other

    cs.CV

    Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts

    Authors: Kiyoon Kim, Shreyank N Gowda, Panagiotis Eustratiadis, Antreas Antoniou, Robert B Fisher

    Abstract: Despite recent advances in video action recognition achieving strong performance on existing benchmarks, these models often lack robustness when faced with natural distribution shifts between training and test data. We propose two novel evaluation methods to assess model resilience to such distribution disparity. One method uses two different datasets collected from different sources and uses one… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  7. arXiv:2310.06522  [pdf, other

    cs.LG cs.CV

    Watt For What: Rethinking Deep Learning's Energy-Performance Relationship

    Authors: Shreyank N Gowda, Xinyue Hao, Gen Li, Laura Sevilla-Lara, Shashank Narayana Gowda

    Abstract: Deep learning models have revolutionized various fields, from image recognition to natural language processing, by achieving unprecedented levels of accuracy. However, their increasing energy consumption has raised concerns about their environmental impact, disadvantaging smaller entities in research and exacerbating global energy consumption. In this paper, we explore the trade-off between model… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  8. arXiv:2309.17327  [pdf, other

    cs.CV

    Telling Stories for Common Sense Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara

    Abstract: Video understanding has long suffered from reliance on large labeled datasets, motivating research into zero-shot learning. Recent progress in language modeling presents opportunities to advance zero-shot video analysis, but constructing an effective semantic space relating action classes remains challenging. We address this by introducing a novel dataset, Stories, which contains rich textual desc… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  9. arXiv:2309.01390  [pdf, other

    cs.CV cs.AI

    Bridging the Projection Gap: Overcoming Projection Bias Through Parameterized Distance Learning

    Authors: Chong Zhang, Mingyu **, Qinkai Yu, Haochen Xue, Shreyank N Gowda, Xiaobo **

    Abstract: Generalized zero-shot learning (GZSL) aims to recognize samples from both seen and unseen classes using only seen class samples for training. However, GZSL methods are prone to bias towards seen classes during inference due to the projection function being learned from seen classes. Most methods focus on learning an accurate projection, but bias in the projection is inevitable. We address this pro… ▽ More

    Submitted 2 April, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 18 pages, 9 figures

  10. arXiv:2308.16041  [pdf, other

    cs.CV

    From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

    Authors: Shreyank N Gowda, Dheeraj Pandey, Shashank Narayana Gowda

    Abstract: Recent advancements in deep learning and computer vision have led to a surge of interest in generating realistic talking heads. This paper presents a comprehensive survey of state-of-the-art methods for talking head generation. We systematically categorises them into four main approaches: image-driven, audio-driven, video-driven and others (including neural radiance fields (NeRF), and 3D-based met… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  11. arXiv:2306.04822  [pdf, other

    cs.CV

    Optimizing ViViT Training: Time and Memory Reduction for Action Recognition

    Authors: Shreyank N Gowda, Anurag Arnab, Jonathan Huang

    Abstract: In this paper, we address the challenges posed by the substantial training time and memory consumption associated with video transformers, focusing on the ViViT (Video Vision Transformer) model, in particular the Factorised Encoder version, as our baseline for action recognition tasks. The factorised encoder variant follows the late-fusion approach that is adopted by many state of the art approach… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  12. arXiv:2304.02846  [pdf, other

    cs.CV

    Synthetic Sample Selection for Generalized Zero-Shot Learning

    Authors: Shreyank N Gowda

    Abstract: Generalized Zero-Shot Learning (GZSL) has emerged as a pivotal research domain in computer vision, owing to its capability to recognize objects that have not been seen during training. Despite the significant progress achieved by generative techniques in converting traditional GZSL to fully supervised learning, they tend to generate a large number of synthetic features that are often redundant, th… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Paper accepted in CVPRW 2023

  13. Impact, Attention, Influence: Early Assessment of Autonomous Driving Datasets

    Authors: Daniel Bogdoll, Jonas Hendl, Felix Schreyer, Nishanth Gowda, Michael Färber, J. Marius Zöllner

    Abstract: Autonomous Driving (AD), the area of robotics with the greatest potential impact on society, has gained a lot of momentum in the last decade. As a result of this, the number of datasets in AD has increased rapidly. Creators and users of datasets can benefit from a better understanding of developments in the field. While scientometric analysis has been conducted in other fields, it rarely revolves… ▽ More

    Submitted 31 March, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: Daniel Bogdoll and Jonas Hendl contributed equally. Accepted for publication at ICCRE 2023

  14. arXiv:2210.13395  [pdf, other

    cs.DS

    Improved Bi-point Rounding Algorithms and a Golden Barrier for $k$-Median

    Authors: Kishen N. Gowda, Thomas Pensyl, Aravind Srinivasan, Khoa Trinh

    Abstract: The current best approximation algorithms for $k$-median rely on first obtaining a structured fractional solution known as a bi-point solution, and then rounding it to an integer solution. We improve this second step by unifying and refining previous approaches. We describe a hierarchy of increasingly-complex partitioning schemes for the facilities, along with corresponding sets of algorithms and… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  15. arXiv:2209.15501  [pdf, other

    cs.CV

    A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos

    Authors: Anil Batra, Shreyank N Gowda, Frank Keller, Laura Sevilla-Lara

    Abstract: Understanding the steps required to perform a task is an important skill for AI systems. Learning these steps from instructional videos involves two subproblems: (i) identifying the temporal boundary of sequentially occurring segments and (ii) summarizing these steps in natural language. We refer to this task as Procedure Segmentation and Summarization (PSS). In this paper, we take a closer look a… ▽ More

    Submitted 7 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted at BMVC 2022

  16. arXiv:2208.10095  [pdf, other

    cs.LG cs.CY cs.DS

    Socially Fair Center-based and Linear Subspace Clustering

    Authors: Sruthi Gorantla, Kishen N. Gowda, Amit Deshpande, Anand Louis

    Abstract: Center-based clustering (e.g., $k$-means, $k$-medians) and clustering using linear subspaces are two most popular techniques to partition real-world data into smaller clusters. However, when the data consists of sensitive demographic groups, significantly different clustering cost per point for different sensitive groups can lead to fairness-related harms (e.g., different quality-of-service). The… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages

  17. arXiv:2206.04790  [pdf, other

    cs.CV

    Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

    Authors: Shreyank N Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara

    Abstract: We address the problem of data augmentation for video action recognition. Standard augmentation strategies in video are hand-designed and sample the space of possible augmented data points either at random, without knowing which augmented points will be better, or through heuristics. We propose to learn what makes a good video for action recognition and select only high-quality samples for augment… ▽ More

    Submitted 23 July, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to ECCV-2022

  18. arXiv:2201.10394  [pdf, other

    cs.CV

    Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

    Authors: Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara

    Abstract: We address the problem of capturing temporal information for video classification in 2D networks, without increasing their computational cost. Existing approaches focus on modifying the architecture of 2D networks (e.g. by including filters in the temporal dimension to turn them into 3D networks, or using optical flow, etc.), which increases computation cost. Instead, we propose a novel sampling s… ▽ More

    Submitted 10 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: BMVC 2022

  19. arXiv:2107.13029  [pdf, other

    cs.CV

    A New Split for Evaluating True Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach

    Abstract: Zero-shot action recognition is the task of classifying action categories that are not available in the training set. In this setting, the standard evaluation protocol is to use existing action recognition datasets(e.g. UCF101) and randomly split the classes into seen and unseen. However, most recent work builds on representations pre-trained on the Kinetics dataset, where classes largely overlap… ▽ More

    Submitted 13 September, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted to GCPR 2021

  20. arXiv:2107.00443  [pdf, other

    cs.RO

    Test Framework for a Virtual Competition Testbed

    Authors: Liam Wellacott, Emilyann Nault, Ioannis Skottis, Alexandre Colle, Shreyank N Gowda, Pierre Nicolay, Emily Rolley-Parnell

    Abstract: Virtual environments have been utilised in robotics research as a tool to assess systems before deploying them in the field. The COVID-19 pandemic has brought about additional motivation for the development of virtual benchmarks in order to aid in safe and productive development. In-person robotics competitions have also halted, thus limiting the scope of opportunities for students and researchers… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

  21. arXiv:2101.07042  [pdf, other

    cs.CV

    CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach

    Abstract: Zero-shot action recognition is the task of recognizingaction classes without visual examples, only with a seman-tic embedding which relates unseen to seen classes. Theproblem can be seen as learning a function which general-izes well to instances of unseen classes without losing dis-crimination between classes. Neural networks can modelthe complex boundaries between visual classes, which ex-plain… ▽ More

    Submitted 23 July, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted to ECCV-22

  22. arXiv:2012.10671  [pdf, other

    cs.CV

    SMART Frame Selection for Action Recognition

    Authors: Shreyank N Gowda, Marcus Rohrbach, Laura Sevilla-Lara

    Abstract: Action recognition is computationally expensive. In this paper, we address the problem of frame selection to improve the accuracy of action recognition. In particular, we show that selecting good frames helps in action recognition performance even in the trimmed videos domain. Recent work has successfully leveraged frame selection for long, untrimmed videos, where much of the content is not releva… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: To be published in AAAI-21

  23. arXiv:2009.13949  [pdf, other

    cs.DS cs.CC

    Improved FPT Algorithms for Deletion to Forest-like Structures

    Authors: Kishen N. Gowda, Aditya Lonkar, Fahad Panolan, Vraj Patel, Saket Saurabh

    Abstract: The Feedback Vertex Set problem is undoubtedly one of the most well-studied problems in Parameterized Complexity. In this problem, given an undirected graph $G$ and a non-negative integer $k$, the objective is to test whether there exists a subset $S\subseteq V(G)$ of size at most $k$ such that $G-S$ is a forest. After a long line of improvement, recently, Li and Nederlof [SODA, 2020] designed a r… ▽ More

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: ISAAC 2020, 36 pages. arXiv admin note: text overlap with arXiv:1906.12298, arXiv:1103.0534 by other authors

  24. arXiv:2005.13039  [pdf, other

    cs.CV

    ALBA : Reinforcement Learning for Video Object Segmentation

    Authors: Shreyank N Gowda, Panagiotis Eustratiadis, Timothy Hospedales, Laura Sevilla-Lara

    Abstract: We consider the challenging problem of zero-shot video object segmentation (VOS). That is, segmenting and tracking multiple moving objects within a video fully automatically, without any manual initialization. We treat this as a grou** problem by exploiting object proposals and making a joint inference about grou** over both space and time. We propose a network architecture for tractably perfo… ▽ More

    Submitted 14 August, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  25. A Parameterized Perspective on Attacking and Defending Elections

    Authors: Kishen N. Gowda, Neeldhara Misra, Vraj Patel

    Abstract: We consider the problem of protecting and manipulating elections by recounting and changing ballots, respectively. Our setting involves a plurality-based election held across multiple districts, and the problem formulations are based on the model proposed recently by~[Elkind et al, IJCAI 2019]. It turns out that both of the manipulation and protection problems are NP-complete even in fairly simple… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

  26. arXiv:2003.05005  [pdf, other

    cs.CV cs.CR

    Using an ensemble color space model to tackle adversarial examples

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Minute pixel changes in an image drastically change the prediction that the deep learning model makes. One of the most significant problems that could arise due to this, for instance, is autonomous driving. Many methods have been proposed to combat this with varying amounts of success. We propose a 3 step method for defending such attacks. First, we denoise the image using statistical methods. Sec… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

  27. StegColNet: Steganalysis based on an ensemble colorspace approach

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Image steganography refers to the process of hiding information inside images. Steganalysis is the process of detecting a steganographic image. We introduce a steganalysis approach that uses an ensemble color space model to obtain a weighted concatenated feature activation map. The concatenated map helps to obtain certain features explicit to each color space. We use a levy-flight grey wolf optimi… ▽ More

    Submitted 16 October, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

  28. arXiv:1906.07421  [pdf

    cs.CV

    Using colorization as a tool for automatic makeup suggestion

    Authors: Shreyank Narayana Gowda

    Abstract: Colorization is the method of converting an image in grayscale to a fully color image. There are multiple methods to do the same. Old school methods used machine learning algorithms and optimization techniques to suggest possible colors to use. With advances in the field of deep learning, colorization results have improved consistently with improvements in deep learning architectures. The latest d… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

  29. arXiv:1902.00267  [pdf, ps, other

    cs.CV

    ColorNet: Investigating the importance of color spaces for image classification

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Image classification is a fundamental application in computer vision. Recently, deeper networks and highly connected networks have shown state of the art performance for image classification tasks. Most datasets these days consist of a finite number of color images. These color images are taken as input in the form of RGB images and classification is done without modifying them. We explore the imp… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Journal ref: Asian Conference on Computer Vision 2018