Skip to main content

Showing 1–7 of 7 results for author: Subramanyam, R

.
  1. arXiv:2307.04838  [pdf, other

    cs.CV cs.LG

    CREPE: Learnable Prompting With CLIP Improves Visual Relationship Prediction

    Authors: Rakshith Subramanyam, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: In this paper, we explore the potential of Vision-Language Models (VLMs), specifically CLIP, in predicting visual object relationships, which involves interpreting visual features from images into language-based relations. Current state-of-the-art methods use complex graphical models that utilize language cues and visual features to address this challenge. We hypothesize that the strong language p… ▽ More

    Submitted 19 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  2. arXiv:2305.13284  [pdf, other

    cs.CV cs.AI

    Target-Aware Generative Augmentations for Single-Shot Adaptation

    Authors: Kowshik Thopalli, Rakshith Subramanyam, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: In this paper, we address the problem of adapting models from a source domain to a target domain, a task that has become increasingly important due to the brittle generalization of deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic toolbox data augmentations in cases of limited target data availability. We consider the challenging set… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at International Conference Machine Learning (ICML) 2023

  3. arXiv:2210.16692  [pdf, other

    cs.CV cs.LG stat.ML

    Single-Shot Domain Adaptation via Target-Aware Generative Augmentation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Spring Berman, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  4. arXiv:2207.12346  [pdf, other

    cs.LG

    Contrastive Knowledge-Augmented Meta-Learning for Few-Shot Classification

    Authors: Rakshith Subramanyam, Mark Heimann, Jayram Thathachar, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: Model agnostic meta-learning algorithms aim to infer priors from several observed tasks that can then be used to adapt to a new task with few examples. Given the inherent diversity of tasks arising in existing benchmarks, recent methods use separate, learnable structure, such as hierarchies or graphs, for enabling task-specific adaptation of the prior. While these approaches have produced signific… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  5. arXiv:1802.09850  [pdf, other

    cs.CV cs.LG

    Solving Inverse Computational Imaging Problems using Deep Pixel-level Prior

    Authors: Akshat Dave, Anil Kumar Vadathya, Ramana Subramanyam, Rahul Baburajan, Kaushik Mitra

    Abstract: Signal reconstruction is a challenging aspect of computational imaging as it often involves solving ill-posed inverse problems. Recently, deep feed-forward neural networks have led to state-of-the-art results in solving various inverse imaging problems. However, being task specific, these networks have to be learned for each inverse problem. On the other hand, a more flexible approach would be to… ▽ More

    Submitted 23 April, 2018; v1 submitted 27 February, 2018; originally announced February 2018.

  6. arXiv:1707.09538  [pdf, other

    cs.MM cs.CL

    Benchmarking Multimodal Sentiment Analysis

    Authors: Erik Cambria, Devamanyu Hazarika, Soujanya Poria, Amir Hussain, R. B. V. Subramaanyam

    Abstract: We propose a framework for multimodal sentiment analysis and emotion recognition using convolutional neural network-based feature extraction from text and visual modalities. We obtain a performance improvement of 10% over the state of the art by combining visual, text and audio features. We also discuss some major issues frequently ignored in multimodal sentiment analysis research: the role of spe… ▽ More

    Submitted 29 July, 2017; originally announced July 2017.

    Comments: Accepted in CICLing 2017

  7. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures