Skip to main content

Showing 1–11 of 11 results for author: Elhamifar, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.14900  [pdf, other

    cs.CV

    BIT: Bi-Level Temporal Modeling for Efficient Supervised Action Segmentation

    Authors: Zijia Lu, Ehsan Elhamifar

    Abstract: We address the task of supervised action segmentation which aims to partition a video into non-overlap** segments, each representing a different action. Recent works apply transformers to perform temporal modeling at the frame-level, which suffer from high computational cost and cannot well capture action dependencies over long temporal horizons. To address these issues, we propose an efficient… ▽ More

    Submitted 7 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 9 pages, 6 figures

  2. arXiv:2306.12559  [pdf, other

    cs.CV cs.SD eess.AS

    Exploring the Role of Audio in Video Captioning

    Authors: Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang

    Abstract: Recent focus in video captioning has been on designing architectures that can consume both video and text modalities, and using large-scale video datasets with text transcripts for pre-training, such as HowTo100M. Though these approaches have achieved significant improvement, the audio modality is often ignored in video captioning. In this work, we present an audio-visual framework, which aims to… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  3. arXiv:2207.05137  [pdf, other

    cs.CV

    Towards Effective Multi-Label Recognition Attacks via Knowledge Graph Consistency

    Authors: Hassan Mahmood, Ehsan Elhamifar

    Abstract: Many real-world applications of image recognition require multi-label learning, whose goal is to find all labels in an image. Thus, robustness of such systems to adversarial image perturbations is extremely important. However, despite a large body of recent research on adversarial attacks, the scope of the existing works is mainly limited to the multi-class setting, where each image contains a sin… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  4. arXiv:2111.12698  [pdf, other

    cs.CV

    Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling

    Authors: Dat Huynh, Jason Kuen, Zhe Lin, Jiuxiang Gu, Ehsan Elhamifar

    Abstract: Open-vocabulary instance segmentation aims at segmenting novel classes without mask annotations. It is an important step toward reducing laborious human supervision. Most existing works first pretrain a model on captioned images covering many novel classes and then finetune it on limited base classes with mask annotations. However, the high-level textual information learned from caption pretrainin… ▽ More

    Submitted 19 April, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  5. arXiv:2105.10438  [pdf, other

    cs.CV

    Compositional Fine-Grained Low-Shot Learning

    Authors: Dat Huynh, Ehsan Elhamifar

    Abstract: We develop a novel compositional generative model for zero- and few-shot learning to recognize fine-grained classes with a few or no training samples. Our key observation is that generating holistic features for fine-grained classes fails to capture small attribute differences between classes. Therefore, we propose a feature composition framework that learns to extract attribute features from trai… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

  6. arXiv:2002.09927  [pdf, other

    cs.LG stat.ML

    Weighting Is Worth the Wait: Bayesian Optimization with Importance Sampling

    Authors: Setareh Ariafar, Zelda Mariet, Ehsan Elhamifar, Dana Brooks, Jennifer Dy, Jasper Snoek

    Abstract: Many contemporary machine learning models require extensive tuning of hyperparameters to perform well. A variety of methods, such as Bayesian optimization, have been developed to automate and expedite this process. However, tuning remains extremely costly as it typically requires repeatedly fully training models. We propose to accelerate the Bayesian optimization approach to hyperparameter tuning… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

  7. arXiv:1407.6810  [pdf, other

    cs.LG stat.ML

    Dissimilarity-based Sparse Subset Selection

    Authors: Ehsan Elhamifar, Guillermo Sapiro, S. Shankar Sastry

    Abstract: Finding an informative subset of a large collection of data points or models is at the center of many problems in computer vision, recommender systems, bio/health informatics as well as image and natural language processing. Given pairwise dissimilarities between the elements of a `source set' and a `target set,' we consider the problem of finding a subset of the source set, called representatives… ▽ More

    Submitted 8 April, 2016; v1 submitted 25 July, 2014; originally announced July 2014.

  8. arXiv:1301.2603  [pdf, ps, other

    cs.LG cs.IT math.OC math.ST stat.ML

    Robust subspace clustering

    Authors: Mahdi Soltanolkotabi, Ehsan Elhamifar, Emmanuel J. Candès

    Abstract: Subspace clustering refers to the task of finding a multi-subspace representation that best fits a collection of points taken from a high-dimensional space. This paper introduces an algorithm inspired by sparse subspace clustering (SSC) [In IEEE Conference on Computer Vision and Pattern Recognition, CVPR (2009) 2790-2797] to cluster noisy data, and develops some novel theory demonstrating its corr… ▽ More

    Submitted 23 May, 2014; v1 submitted 11 January, 2013; originally announced January 2013.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1199 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1199

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 2, 669-699

  9. arXiv:1203.1005  [pdf, other

    cs.CV cs.IR cs.IT cs.LG math.OC stat.ML

    Sparse Subspace Clustering: Algorithm, Theory, and Applications

    Authors: Ehsan Elhamifar, Rene Vidal

    Abstract: In many real-world problems, we are dealing with collections of high-dimensional data, such as images, videos, text and web documents, DNA microarray data, and more. Often, high-dimensional data lie close to low-dimensional structures corresponding to several classes or categories the data belongs to. In this paper, we propose and study an algorithm, called Sparse Subspace Clustering (SSC), to clu… ▽ More

    Submitted 4 February, 2013; v1 submitted 5 March, 2012; originally announced March 2012.

  10. arXiv:1201.3674  [pdf, other

    cs.CV cs.LG stat.ML

    On the Lagrangian Biduality of Sparsity Minimization Problems

    Authors: Dheeraj Singaraju, Ehsan Elhamifar, Roberto Tron, Allen Y. Yang, S. Shankar Sastry

    Abstract: Recent results in Compressive Sensing have shown that, under certain conditions, the solution to an underdetermined system of linear equations with sparsity-based regularization can be accurately recovered by solving convex relaxations of the original problem. In this work, we present a novel primal-dual analysis on a class of sparsity minimization problems. We show that the Lagrangian bidual (i.e… ▽ More

    Submitted 17 January, 2012; originally announced January 2012.

  11. arXiv:1104.0654  [pdf, other

    math.OC cs.CV cs.IT

    Block-Sparse Recovery via Convex Optimization

    Authors: Ehsan Elhamifar, Rene Vidal

    Abstract: Given a dictionary that consists of multiple blocks and a signal that lives in the range space of only a few blocks, we study the problem of finding a block-sparse representation of the signal, i.e., a representation that uses the minimum number of blocks. Motivated by signal/image processing and computer vision applications, such as face recognition, we consider the block-sparse recovery problem… ▽ More

    Submitted 13 April, 2012; v1 submitted 4 April, 2011; originally announced April 2011.

    Comments: IEEE Transactions on Signal Processing