Showing 1–7 of 7 results for author: Rangrej, S B

Searching in archive cs.
  1. arXiv:2310.08312  [pdf, other]

    cs.CV cs.LG

    GePSAn: Generative Procedure Step Anticipation in Cooking Videos

    Authors: Mohamed Ashraf Abdelsalam, Samrudhdhi B. Rangrej, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Afsaneh Fazly

    Abstract: We study the problem of future step anticipation in procedural videos. Given a video of an ongoing procedural activity, we predict a plausible next procedure step described in rich natural language. While most previous work focuses on the problem of data scarcity in procedural video datasets, another core challenge of future anticipation is how to account for multiple plausible future realizations i…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: published at ICCV 2023

  2. arXiv:2210.13605  [pdf, other]

    cs.CV

    GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction

    Authors: Samrudhdhi B Rangrej, Kevin J Liang, Tal Hassner, James J Clark

    Abstract: Many online action prediction models observe complete frames to locate and attend to informative subregions in the frames called glimpses and recognize an ongoing action based on global and local information. However, in applications with constrained resources, an agent may not be able to observe the complete frame, yet must still locate useful glimpses to predict an incomplete action based on loc…

    Submitted 18 April, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  3. arXiv:2204.05494  [pdf, other]

    cs.CV stat.ML

    Few-shot Learning with Noisy Labels

    Authors: Kevin J Liang, Samrudhdhi B. Rangrej, Vladan Petrovic, Tal Hassner

    Abstract: Few-shot learning (FSL) methods typically assume clean support sets with accurately labeled samples when training on novel classes. This assumption can often be unrealistic: support sets, no matter how small, can still include mislabeled samples. Robustness to label noise is therefore essential for FSL methods to be practical, but this problem surprisingly remains largely unexplored. To address mi…

    Submitted 31 July, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  4. arXiv:2204.00656  [pdf, other]

    cs.CV

    Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes

    Authors: Samrudhdhi B. Rangrej, Chetan L. Srinidhi, James J. Clark

    Abstract: Most hard attention models initially observe a complete scene to locate and sense informative glimpses, and predict the class label of a scene based on glimpses. However, in many applications (e.g., aerial imaging), observing an entire scene is not always feasible due to the limited time and resources available for acquisition. In this paper, we develop a Sequential Transformers Attention Model (STAM)…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022

  5. arXiv:2111.07534  [pdf, other]

    cs.CV

    A Probabilistic Hard Attention Model For Sequentially Observed Scenes

    Authors: Samrudhdhi B. Rangrej, James J. Clark

    Abstract: A visual hard attention model actively selects and observes a sequence of subregions in an image to make a prediction. The majority of hard attention models determine the attention-worthy regions by first analyzing a complete image. However, it may be the case that the entire image is not available initially but instead sensed gradually through a series of partial observations. In this paper, we d…

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: Accepted to BMVC 2021

  6. arXiv:2104.00177  [pdf, other]

    cs.CV

    Visual Attention in Imaginative Agents

    Authors: Samrudhdhi B. Rangrej, James J. Clark

    Abstract: We present a recurrent agent who perceives surroundings through a series of discrete fixations. At each timestep, the agent imagines a variety of plausible scenes consistent with the fixation history. The next fixation is planned using uncertainty in the content of the imagined scenes. As time progresses, the agent becomes more certain about the content of the surrounding, and the variety in the i…

    Submitted 31 March, 2021; originally announced April 2021.

  7. arXiv:1806.08859  [pdf, other]

    cs.CV

    A deep learning framework for segmentation of retinal layers from OCT images

    Authors: Karthik Gopinath, Samrudhdhi B Rangrej, Jayanthi Sivaswamy

    Abstract: Segmentation of retinal layers from Optical Coherence Tomography (OCT) volumes is a fundamental problem for any computer aided diagnostic algorithm development. This requires preprocessing steps such as denoising, region of interest extraction, flattening and edge detection all of which involve separate parameter tuning. In this paper, we explore deep learning techniques to automate all these step…

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted in The 4th Asian Conference on Pattern Recognition (ACPR 2017)