Skip to main content

Showing 1–6 of 6 results for author: Foroutan, Y

.
  1. arXiv:2404.12547  [pdf, other

    cs.CV

    Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting

    Authors: Yalda Foroutan, Daniel Rebain, Kwang Moo Yi, Andrea Tagliasacchi

    Abstract: 3D Gaussian Splatting has recently been embraced as a versatile and effective method for scene reconstruction and novel view synthesis, owing to its high-quality results and compatibility with hardware rasterization. Despite its advantages, Gaussian Splatting's reliance on high-quality point cloud initialization by Structure-from-Motion (SFM) algorithms is a significant limitation to be overcome.… ▽ More

    Submitted 23 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2307.02430  [pdf, other

    eess.IV cs.CV

    Base Layer Efficiency in Scalable Human-Machine Coding

    Authors: Yalda Foroutan, Alon Harell, Anderson de Andrade, Ivan V. Bajić

    Abstract: A basic premise in scalable human-machine coding is that the base layer is intended for automated machine analysis and is therefore more compressible than the same content would be for human viewing. Use cases for such coding include video surveillance and traffic monitoring, where the majority of the content will never be seen by humans. Therefore, base layer efficiency is of paramount importance… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 5 pages, 6 figures, IEEE ICIP 2023

  3. arXiv:2305.17295  [pdf, other

    eess.IV cs.IT

    Rate-Distortion Theory in Coding for Machines and its Application

    Authors: Alon Harell, Yalda Foroutan, Nilesh Ahuja, Parual Datta, Bhavya Kanzariya, V. Srinivasa Somayaulu, Omesh Tickoo, Anderson de Andrade, Ivan V. Bajic

    Abstract: Recent years have seen a tremendous growth in both the capability and popularity of automatic machine analysis of images and video. As a result, a growing need for efficient compression methods optimized for machine vision, rather than human vision, has emerged. To meet this growing demand, several methods have been developed for image and video coding for machines. Unfortunately, while there is a… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  4. arXiv:2305.10453  [pdf, other

    eess.IV cs.CV

    VVC+M: Plug and Play Scalable Image Coding for Humans and Machines

    Authors: Alon Harell, Yalda Foroutan, Ivan V. Bajic

    Abstract: Compression for machines is an emerging field, where inputs are encoded while optimizing the performance of downstream automated analysis. In scalable coding for humans and machines, the compressed representation used for machines is further utilized to enable input reconstruction. Often performed by jointly optimizing the compression scheme for both machine task and human perception, this results… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  5. arXiv:2305.02562  [pdf, ps, other

    eess.IV cs.IT cs.LG

    Conditional and Residual Methods in Scalable Coding for Humans and Machines

    Authors: Anderson de Andrade, Alon Harell, Yalda Foroutan, Ivan V. Bajić

    Abstract: We present methods for conditional and residual coding in the context of scalable coding for humans and machines. Our focus is on optimizing the rate-distortion performance of the reconstruction task using the information available in the computer vision task. We include an information analysis of both approaches to provide baselines and also propose an entropy model suitable for conditional codin… ▽ More

    Submitted 4 July, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: IEEE ICME Workshop on Coding for Machines, Brisbane, Australia, 2023

  6. arXiv:2012.13188  [pdf, other

    cs.CV cs.HC

    Control of Computer Pointer Using Hand Gesture Recognition in Motion Pictures

    Authors: Yalda Foroutan, Ahmad Kalhor, Saeid Mohammadi Nejati, Samad Sheikhaei

    Abstract: This paper presents a user interface designed to enable computer cursor control through hand detection and gesture classification. A comprehensive hand dataset comprising 6720 image samples was collected, encompassing four distinct classes: fist, palm, pointing to the left, and pointing to the right. The images were captured from 15 individuals in various settings, including simple backgrounds wit… ▽ More

    Submitted 9 June, 2023; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: 9 pages, 6 figures, 2 tables