Skip to main content

Showing 1–7 of 7 results for author: Sawhney, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.00794  [pdf, other

    cs.CV

    STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos

    Authors: Anshul Shah, Benjamin Lundell, Harpreet Sawhney, Rama Chellappa

    Abstract: We address the problem of extracting key steps from unlabeled procedural videos, motivated by the potential of Augmented Reality (AR) headsets to revolutionize job training and performance. We decompose the problem into two steps: representation learning and key steps extraction. We propose a training objective, Bootstrapped Multi-Cue Contrastive (BMC2) loss to learn discriminative representations… ▽ More

    Submitted 9 September, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: Accepted at ICCV 2023

  2. arXiv:2207.04398  [pdf, other

    cs.CV cs.AI

    Self-supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation

    Authors: Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke

    Abstract: We present a self-supervised learning (SSL) method suitable for semi-global tasks such as object detection and semantic segmentation. We enforce local consistency between self-learned features, representing corresponding image locations of transformed versions of the same image, by minimizing a pixel-level local contrastive (LC) loss during training. LC-loss can be added to existing self-supervise… ▽ More

    Submitted 7 December, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: accepted to WACV 2023

  3. Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition

    Authors: Huseyin Coskun, Zeeshan Zia, Bugra Tekin, Federica Bogo, Nassir Navab, Federico Tombari, Harpreet Sawhney

    Abstract: The lack of large-scale real datasets with annotations makes transfer learning a necessity for video activity understanding. We aim to develop an effective method for few-shot transfer learning for first-person action classification. We leverage independently trained local visual cues to learn representations that can be transferred from a source domain, which provides primitive action labels, to… ▽ More

    Submitted 7 December, 2021; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: Paper has been accepted in Transactions on Pattern Analysis and Machine Intelligence

    Journal ref: year = {5555}, volume = {}, number = {01}, issn = {1939-3539}, pages = {1-1},

  4. arXiv:1604.03130  [pdf

    cs.CY

    Video Analysis for Body-worn Cameras in Law Enforcement

    Authors: Jason J. Corso, Alexandre Alahi, Kristen Grauman, Gregory D. Hager, Louis-Philippe Morency, Harpreet Sawhney, Yaser Sheikh

    Abstract: The social conventions and expectations around the appropriate use of imaging and video has been transformed by the availability of video cameras in our pockets. The impact on law enforcement can easily be seen by watching the nightly news; more and more arrests, interventions, or even routine stops are being caught on cell phones or surveillance video, with both positive and negative consequences… ▽ More

    Submitted 7 May, 2018; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: A Computing Community Consortium (CCC) white paper, 9 pages

  5. arXiv:1512.00818  [pdf, other

    cs.CV cs.CL cs.LG

    Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos

    Authors: Mohamed Elhoseiny, **gen Liu, Hui Cheng, Harpreet Sawhney, Ahmed Elgammal

    Abstract: We propose a new zero-shot Event Detection method by Multi-modal Distributional Semantic embedding of videos. Our model embeds object and action concepts as well as other available modalities from videos into a distributional semantic space. To our knowledge, this is the first Zero-Shot event detection model that is built on top of distributional semantics and extends it in the following direction… ▽ More

    Submitted 15 December, 2015; v1 submitted 2 December, 2015; originally announced December 2015.

    Comments: To appear in AAAI 2016

  6. arXiv:1510.07317  [pdf, other

    cs.CV

    Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries

    Authors: S. Hussain Raza, Omar Javed, Aveek Das, Harpreet Sawhney, Hui Cheng, Irfan Essa

    Abstract: We present an algorithm to estimate depth in dynamic video scenes. We propose to learn and infer depth in videos from appearance, motion, occlusion boundaries, and geometric context of the scene. Using our method, depth can be estimated from unconstrained videos with no requirement of camera pose estimation, and with significant background/foreground motions. We start by decomposing a video into s… ▽ More

    Submitted 25 October, 2015; originally announced October 2015.

    Comments: British Machine Vision Conference (BMVC) 2014

  7. arXiv:cs/0109043   

    cs.CY

    PUC Autonomy and Policy Innovation: Local Telephone Competition in Arkansas and New York

    Authors: Hokyu Lee, Harmeet Sawhney

    Abstract: In the pre-divestiture era, the regulatory environment in the U.S. was fairly uniform and harmonious with the FCC setting the course and the accommodative state PUCs making corresponding changes in their own policies. The divestiture fractured this monolithic system as it forced the PUCs to respond to new forces unleashed in their own backyards. Soon there was great diversity in the overall regu… ▽ More

    Submitted 21 September, 2001; originally announced September 2001.

    Comments: 29th TPRC Conference, 2001

    Report number: TPRC-2001-026 ACM Class: K.4.m