Skip to main content

Showing 1–10 of 10 results for author: Zakhor, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.12903  [pdf, other

    cs.CV

    Gallery Filter Network for Person Search

    Authors: Lucas Jaffe, Avideh Zakhor

    Abstract: In person search, we aim to localize a query person from one scene in other gallery scenes. The cost of this search operation is dependent on the number of gallery scenes, making it beneficial to reduce the pool of likely scenes. We describe and demonstrate the Gallery Filter Network (GFN), a novel module which can efficiently discard gallery scenes from the search process, and benefit scoring for… ▽ More

    Submitted 25 October, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: WACV 2023; Code: https://github.com/LukeJaffe/GFN

  2. arXiv:2202.07706   

    cs.CV

    Misinformation Detection in Social Media Video Posts

    Authors: Kehan Wang, David Chan, Seth Z. Zhao, John Canny, Avideh Zakhor

    Abstract: With the growing adoption of short-form video by social media platforms, reducing the spread of misinformation through video posts has become a critical challenge for social media providers. In this paper, we develop methods to detect misinformation in social media posts, exploiting modalities such as video and text. Due to the lack of large-scale public data for misinformation detection in multi-… ▽ More

    Submitted 30 July, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: We discovered an error in our dataset construction where retweets were not properly filtered. This resulted in test data leakage in training data, and the results reported are affected

  3. Recognition-Aware Learned Image Compression

    Authors: Maxime Kawawa-Beaudan, Ryan Roggenkemper, Avideh Zakhor

    Abstract: Learned image compression methods generally optimize a rate-distortion loss, trading off improvements in visual distortion for added bitrate. Increasingly, however, compressed imagery is used as an input to deep learning networks for various tasks such as classification, object detection, and superresolution. We propose a recognition-aware learned compression method, which optimizes a rate-distort… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: Electronic Imaging Symposium, Computational Imaging XX Conference, January 2022

  4. arXiv:2201.03786  [pdf, other

    cs.CV

    Drone Object Detection Using RGB/IR Fusion

    Authors: Lizhi Yang, Ruhang Ma, Avideh Zakhor

    Abstract: Object detection using aerial drone imagery has received a great deal of attention in recent years. While visible light images are adequate for detecting objects in most scenarios, thermal cameras can extend the capabilities of object detection to night-time or occluded objects. As such, RGB and Infrared (IR) fusion methods for object detection are useful and important. One of the biggest challeng… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted to Electronic Imaging Symposium, Computational Imaging XX Conference, 2022

  5. arXiv:2106.02777  [pdf, other

    cs.NI cs.HC cs.LG

    Immediate Proximity Detection Using Wi-Fi-Enabled Smartphones

    Authors: Zach Van Hyfte, Avideh Zakhor

    Abstract: Smartphone apps for exposure notification and contact tracing have been shown to be effective in controlling the COVID-19 pandemic. However, Bluetooth Low Energy tokens similar to those broadcast by existing apps can still be picked up far away from the transmitting device. In this paper, we present a new class of methods for detecting whether or not two Wi-Fi-enabled devices are in immediate phys… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 12 pages, 1 figure

  6. arXiv:2105.12855  [pdf, other

    cs.CV

    Multi-Modal Semantic Inconsistency Detection in Social Media News Posts

    Authors: Scott McCrae, Kehan Wang, Avideh Zakhor

    Abstract: As computer-generated content and deepfakes make steady improvements, semantic approaches to multimedia forensics will become more important. In this paper, we introduce a novel classification architecture for identifying semantic inconsistencies between video appearance and text caption in social media news posts. We develop a multi-modal fusion framework to identify mismatches between videos and… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  7. Fast, Accurate Barcode Detection in Ultra High-Resolution Images

    Authors: Jerome Quenum, Kehan Wang, Avideh Zakhor

    Abstract: Object detection in Ultra High-Resolution (UHR) images has long been a challenging problem in computer vision due to the varying scales of the targeted objects. When it comes to barcode detection, resizing UHR input images to smaller sizes often leads to the loss of pertinent information, while processing them directly is highly inefficient and computationally expensive. In this paper, we propose… ▽ More

    Submitted 10 June, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: 5 pages, 4 figures, 3 tables, GitHub Link added, Initial ArXiv Submission is 13 Feb 2021, Accepted at IEEE International Conference on Image Processing, September 2021, USA

    Report number: pp. 1019-1023

    Journal ref: 2021 IEEE International Conference on Image Processing (ICIP)

  8. arXiv:2012.09409  [pdf, other

    cs.CV cs.LG

    Temporal LiDAR Frame Prediction for Autonomous Driving

    Authors: David Deng, Avideh Zakhor

    Abstract: Anticipating the future in a dynamic scene is critical for many fields such as autonomous driving and robotics. In this paper we propose a class of novel neural network architectures to predict future LiDAR frames given previous ones. Since the ground truth in this application is simply the next frame in the sequence, we can train our models in a self-supervised fashion. Our proposed architectures… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: In 3DV 2020

  9. arXiv:2012.03998  [pdf, other

    cs.CV

    GenScan: A Generative Method for Populating Parametric 3D Scan Datasets

    Authors: Mohammad Keshavarzi, Oladapo Afolabi, Luisa Caldas, Allen Y. Yang, Avideh Zakhor

    Abstract: The availability of rich 3D datasets corresponding to the geometrical complexity of the built environments is considered an ongoing challenge for 3D deep learning methodologies. To address this challenge, we introduce GenScan, a generative system that populates synthetic 3D scan datasets in a parametric fashion. The system takes an existing captured 3D scan as an input and outputs alternative vari… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  10. Duodepth: Static Gesture Recognition Via Dual Depth Sensors

    Authors: Ilya Chugunov, Avideh Zakhor

    Abstract: Static gesture recognition is an effective non-verbal communication channel between a user and their devices; however many modern methods are sensitive to the relative pose of the user's hands with respect to the capture device, as parts of the gesture can become occluded. We present two methodologies for gesture recognition via synchronized recording from two depth cameras to alleviate this occlu… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 26th International Conference on Image Processing

    Journal ref: 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, 2019, pp. 3467-3471