Skip to main content

Showing 1–8 of 8 results for author: Golestaneh, S A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.00638  [pdf, other

    cs.CV

    Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

    Authors: Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Juergen Gall, Mehdi Noroozi

    Abstract: This paper introduces a unified framework for video action segmentation via sequence to sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to current state-of-the-art frame-level prediction methods, we view action segmentation as a seq2seq translation task, i.e., map** a sequence of video frames to a sequence of action segments. Our proposed method involves a s… ▽ More

    Submitted 11 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 (Main Conference)

  2. arXiv:2112.09260  [pdf, other

    cs.CV

    How to augment your ViTs? Consistency loss and StyleAug, a random style transfer augmentation

    Authors: Akash Umakantha, Joao D. Semedo, S. Alireza Golestaneh, Wan-Yi S. Lin

    Abstract: The Vision Transformer (ViT) architecture has recently achieved competitive performance across a variety of computer vision tasks. One of the motivations behind ViTs is weaker inductive biases, when compared to convolutional neural networks (CNNs). However this also makes ViTs more difficult to train. They require very large training datasets, heavy regularization, and strong data augmentations. T… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

  3. arXiv:2108.06858  [pdf, other

    eess.IV cs.CV

    No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency

    Authors: S. Alireza Golestaneh, Saba Dadsetan, Kris M. Kitani

    Abstract: The goal of No-Reference Image Quality Assessment (NR-IQA) is to estimate the perceptual image quality in accordance with subjective evaluations, it is a complex and unsolved problem due to the absence of the pristine reference image. In this paper, we propose a novel model to address the NR-IQA task by leveraging a hybrid approach that benefits from Convolutional Neural Networks (CNNs) and self-a… ▽ More

    Submitted 5 January, 2022; v1 submitted 15 August, 2021; originally announced August 2021.

  4. arXiv:2008.03789  [pdf, other

    cs.CV

    3D Human Motion Estimation via Motion Compression and Refinement

    Authors: Zhengyi Luo, S. Alireza Golestaneh, Kris M. Kitani

    Abstract: We develop a technique for generating smooth and accurate 3D human pose and motion estimates from RGB video sequences. Our method, which we call Motion Estimation via Variational Autoencoder (MEVA), decomposes a temporal sequence of human motion into a smooth motion representation using auto-encoder-based motion compression and a residual representation learned through motion refinement. This two-… ▽ More

    Submitted 5 October, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

    Comments: Accepted by ACCV 2020 (Oral). Project page: https://zhengyiluo.github.io/projects/meva/

  5. arXiv:2008.01860  [pdf, other

    cs.CV

    Importance of Self-Consistency in Active Learning for Semantic Segmentation

    Authors: S. Alireza Golestaneh, Kris M. Kitani

    Abstract: We address the task of active learning in the context of semantic segmentation and show that self-consistency can be a powerful source of self-supervision to greatly improve the performance of a data-driven model with access to only a small amount of labeled data. Self-consistency uses the simple observation that the results of semantic segmentation for a specific image should not change under tra… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: Accepted in The British Machine Vision Conference (BMVC) 2020

  6. arXiv:2006.03783  [pdf, other

    cs.CV eess.IV

    No-Reference Image Quality Assessment via Feature Fusion and Multi-Task Learning

    Authors: S. Alireza Golestaneh, Kris Kitani

    Abstract: Blind or no-reference image quality assessment (NR-IQA) is a fundamental, unsolved, and yet challenging problem due to the unavailability of a reference image. It is vital to the streaming and social media industries that impact billions of viewers daily. Although previous NR-IQA methods leveraged different feature extraction approaches, the performance bottleneck still exists. In this paper, we p… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

  7. arXiv:1804.08020  [pdf, other

    cs.CV

    Synthesized Texture Quality Assessment via Multi-scale Spatial and Statistical Texture Attributes of Image and Gradient Magnitude Coefficients

    Authors: S. Alireza Golestaneh, Lina Karam

    Abstract: Perceptual quality assessment for synthesized textures is a challenging task. In this paper, we propose a training-free reduced-reference (RR) objective quality assessment method that quantifies the perceived quality of synthesized textures. The proposed reduced-reference synthesized texture quality assessment metric is based on measuring the spatial and statistical attributes of the texture image… ▽ More

    Submitted 26 April, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

  8. arXiv:1703.07478  [pdf, other

    cs.CV

    Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes

    Authors: S. Alireza Golestaneh, Lina J. Karam

    Abstract: The detection of spatially-varying blur without having any information about the blur type is a challenging task. In this paper, we propose a novel effective approach to address the blur detection problem from a single image without requiring any knowledge about the blur type, level, or camera settings. Our approach computes blur detection maps based on a novel High-frequency multiscale Fusion and… ▽ More

    Submitted 11 April, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: Accepted to CVPR 2017