Skip to main content

Showing 1–6 of 6 results for author: Ramaiah, C

.
  1. arXiv:2208.01813  [pdf, other

    cs.CV

    TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation

    Authors: Jun Wang, Mingfei Gao, Yuqian Hu, Ramprasaath R. Selvaraju, Chetan Ramaiah, Ran Xu, Joseph F. JaJa, Larry S. Davis

    Abstract: Text-VQA aims at answering questions that require understanding the textual cues in an image. Despite the great progress of existing Text-VQA methods, their performance suffers from insufficient human-labeled question-answer (QA) pairs. However, we observe that, in general, the scene text is not fully exploited in the existing datasets -- only a small portion of the text in each image participates… ▽ More

    Submitted 7 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: BMVC 2022

  2. arXiv:2204.13207  [pdf, other

    cs.CV cs.AI cs.LG

    Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework

    Authors: Shu Zhang, Ran Xu, Caiming Xiong, Chetan Ramaiah

    Abstract: Current contrastive learning frameworks focus on leveraging a single supervisory signal to learn representations, which limits the efficacy on unseen data and downstream tasks. In this paper, we present a hierarchical multi-label representation learning framework that can leverage all available labels and preserve the hierarchical relationship between classes. We introduce novel hierarchy preservi… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR, 2022

  3. arXiv:2112.07820  [pdf, other

    cs.CV cs.AI

    Value Retrieval with Arbitrary Queries for Form-like Documents

    Authors: Mingfei Gao, Le Xue, Chetan Ramaiah, Chen Xing, Ran Xu, Caiming Xiong

    Abstract: We propose value retrieval with arbitrary queries for form-like documents to reduce human effort of processing forms. Unlike previous methods that only address a fixed set of field items, our method predicts target value for an arbitrary query based on the understanding of the layout and semantics of a form. To further boost model performance, we propose a simple document language modeling (Simple… ▽ More

    Submitted 15 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  4. arXiv:2112.04345  [pdf, other

    cs.CV cs.LG

    Burn After Reading: Online Adaptation for Cross-domain Streaming Data

    Authors: Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah

    Abstract: In the context of online privacy, many methods propose complex privacy and security preserving measures to protect sensitive data. In this paper, we argue that: not storing any sensitive data is the best form of security. Thus we propose an online framework that "burns after reading", i.e. each online sample is immediately deleted after it is processed. Meanwhile, we tackle the inevitable distribu… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  5. arXiv:2001.05086  [pdf, other

    cs.CV

    Proposal Learning for Semi-Supervised Object Detection

    Authors: Peng Tang, Chetan Ramaiah, Yan Wang, Ran Xu, Caiming Xiong

    Abstract: In this paper, we focus on semi-supervised object detection to boost performance of proposal-based object detectors (a.k.a. two-stage object detectors) by training on both labeled and unlabeled data. However, it is non-trivial to train object detectors on unlabeled data due to the unavailability of ground truth labels. To address this problem, we present a proposal learning approach to learn propo… ▽ More

    Submitted 23 April, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

  6. arXiv:1307.0414  [pdf, other

    stat.ML cs.LG

    Challenges in Representation Learning: A report on three machine learning contests

    Authors: Ian J. Goodfellow, Dumitru Erhan, Pierre Luc Carrier, Aaron Courville, Mehdi Mirza, Ben Hamner, Will Cukierski, Yichuan Tang, David Thaler, Dong-Hyun Lee, Yingbo Zhou, Chetan Ramaiah, Fangxiang Feng, Ruifan Li, Xiaojie Wang, Dimitris Athanasakis, John Shawe-Taylor, Maxim Milakov, John Park, Radu Ionescu, Marius Popescu, Cristian Grozea, James Bergstra, **g**g Xie, Lukasz Romaszko , et al. (3 additional authors not shown)

    Abstract: The ICML 2013 Workshop on Challenges in Representation Learning focused on three challenges: the black box learning challenge, the facial expression recognition challenge, and the multimodal learning challenge. We describe the datasets created for these challenges and summarize the results of the competitions. We provide suggestions for organizers of future challenges and some comments on what kin… ▽ More

    Submitted 1 July, 2013; originally announced July 2013.

    Comments: 8 pages, 2 figures