Skip to main content

Showing 1–12 of 12 results for author: Yucel, M K

.
  1. arXiv:2403.04508  [pdf, other

    cs.CV cs.GR

    Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces

    Authors: Evangelos Skartados, Mehmet Kerim Yucel, Bruno Manganelli, Anastasios Drosou, Albert Saà-Garriga

    Abstract: Neural Radiance Fields (NeRF) have quickly become the primary approach for 3D reconstruction and novel view synthesis in recent years due to their remarkable performance. Despite the huge interest in NeRF methods, a practical use case of NeRFs has largely been ignored; the exploration of the scene space modelled by a NeRF. In this paper, for the first time in the literature, we propose and formall… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted at ACM MMSys'24

  2. A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation

    Authors: Francesco Barbato, Umberto Michieli, Mehmet Kerim Yucel, Pietro Zanuttigh, Mete Ozay

    Abstract: In multimedia understanding tasks, corrupted samples pose a critical challenge, because when fed to machine learning models they lead to performance degradation. In the past, three groups of approaches have been proposed to handle noisy data: i) enhancer and denoiser modules to improve the quality of the noisy data, ii) data augmentation approaches, and iii) domain adaptation strategies. All the a… ▽ More

    Submitted 29 February, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted at ACM MMSys'24. 10 pages, 7 figures, 8 tables

  3. arXiv:2307.11823  [pdf, other

    cs.CV cs.AI

    HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness

    Authors: Mehmet Kerim Yucel, Ramazan Gokberk Cinbis, Pinar Duygulu

    Abstract: Convolutional Neural Networks (CNN) are known to exhibit poor generalization performance under distribution shifts. Their generalization have been studied extensively, and one line of work approaches the problem from a frequency-centric perspective. These studies highlight the fact that humans and CNNs might focus on different frequency components of an image. First, inspired by these observations… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  4. arXiv:2306.15377  [pdf, other

    cs.CV

    TrickVOS: A Bag of Tricks for Video Object Segmentation

    Authors: Evangelos Skartados, Konstantinos Georgiadis, Mehmet Kerim Yucel, Koskinas Ioannis, Armando Domi, Anastasios Drosou, Bruno Manganelli, Albert Saa-Garriga

    Abstract: Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where we can improve such methods; i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS; a generic, method-agnostic bag of tricks addressing each aspect with i) a struct… ▽ More

    Submitted 28 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted to ICIP 2023

  5. arXiv:2303.12862  [pdf, other

    cs.CV

    LP-IOANet: Efficient High Resolution Document Shadow Removal

    Authors: Konstantinos Georgiadis, M. Kerim Yucel, Evangelos Skartados, Valia Dimaridou, Anastasios Drosou, Albert Saa-Garriga, Bruno Manganelli

    Abstract: Document shadow removal is an integral task in document enhancement pipelines, as it improves visibility, readability and thus the overall quality. Assuming that the majority of practical document shadow removal scenarios require real-time, accurate models that can produce high-resolution outputs in-the-wild, we propose Laplacian Pyramid with Input/Output Attention Network (LP-IOANet), a novel pip… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  6. arXiv:2303.07815  [pdf, other

    cs.CV

    MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation

    Authors: Roy Miles, Mehmet Kerim Yucel, Bruno Manganelli, Albert Saa-Garriga

    Abstract: This paper tackles the problem of semi-supervised video object segmentation on resource-constrained devices, such as mobile phones. We formulate this problem as a distillation task, whereby we demonstrate that small space-time-memory networks with finite memory can achieve competitive results with state of the art, but at a fraction of the computational cost (32 milliseconds per frame on a Samsung… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  7. arXiv:2210.16078  [pdf, other

    cs.CV

    Adaptive Mask-based Pyramid Network for Realistic Bokeh Rendering

    Authors: Konstantinos Georgiadis, Albert Saà-Garriga, Mehmet Kerim Yucel, Anastasios Drosou, Bruno Manganelli

    Abstract: Bokeh effect highlights an object (or any part of the image) while blurring the rest of the image, and creates a visually pleasant artistic effect. Due to the sensor-based limitations on mobile devices, machine learning (ML) based bokeh rendering has gained attention as a reliable alternative. In this paper, we focus on several improvements in ML-based bokeh rendering; i) on-device performance wit… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Advances in Image Manipulation Workshop. See the workshop website for posters and recordings

  8. How Robust are Discriminatively Trained Zero-Shot Learning Models?

    Authors: Mehmet Kerim Yucel, Ramazan Gokberk Cinbis, Pinar Duygulu

    Abstract: Data shift robustness has been primarily investigated from a fully supervised perspective, and robustness of zero-shot learning (ZSL) models have been largely neglected. In this paper, we present novel analyses on the robustness of discriminative ZSL to image corruptions. We subject several ZSL models to a large set of common corruptions and defenses. In order to realize the corruption analysis, w… ▽ More

    Submitted 27 January, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  9. arXiv:2105.12053  [pdf, other

    cs.CV

    Real-time Monocular Depth Estimation with Sparse Supervision on Mobile

    Authors: Mehmet Kerim Yucel, Valia Dimaridou, Anastasios Drosou, Albert Saà-Garriga

    Abstract: Monocular (relative or metric) depth estimation is a critical task for various applications, such as autonomous vehicles, augmented reality and image editing. In recent years, with the increasing availability of mobile devices, accurate and mobile-friendly depth models have gained importance. Increasingly accurate models typically require more computational resources, which inhibits the use of suc… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: To appear at CVPR 2021 Mobile AI (MAI) Workshop

  10. arXiv:2009.07576  [pdf, other

    cs.CV

    Red Carpet to Fight Club: Partially-supervised Domain Transfer for Face Recognition in Violent Videos

    Authors: Yunus Can Bilge, Mehmet Kerim Yucel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis, Pinar Duygulu

    Abstract: In many real-world problems, there is typically a large discrepancy between the characteristics of data used in training versus deployment. A prime example is the analysis of aggression videos: in a criminal incidence, typically suspects need to be identified based on their clean portrait-like photos, instead of their prior video recordings. This results in three major challenges; large domain dis… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: To appear in WACV 2021

  11. arXiv:2008.07651  [pdf, other

    cs.CV

    A Deep Dive into Adversarial Robustness in Zero-Shot Learning

    Authors: Mehmet Kerim Yucel, Ramazan Gokberk Cinbis, Pinar Duygulu

    Abstract: Machine learning (ML) systems have introduced significant advances in various fields, due to the introduction of highly complex models. Despite their success, it has been shown multiple times that machine learning models are prone to imperceptible perturbations that can severely degrade their accuracy. So far, existing studies have primarily focused on models where supervision across all classes w… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: To appear in ECCV 2020, Workshop on Adversarial Robustness in the Real World

  12. arXiv:1805.07566  [pdf, other

    cs.CV

    Wildest Faces: Face Detection and Recognition in Violent Settings

    Authors: Mehmet Kerim Yucel, Yunus Can Bilge, Oguzhan Oguz, Nazli Ikizler-Cinbis, Pinar Duygulu, Ramazan Gokberk Cinbis

    Abstract: With the introduction of large-scale datasets and deep learning models capable of learning complex representations, impressive advances have emerged in face detection and recognition tasks. Despite such advances, existing datasets do not capture the difficulty of face recognition in the wildest scenarios, such as hostile disputes or fights. Furthermore, existing datasets do not represent completel… ▽ More

    Submitted 19 May, 2018; originally announced May 2018.

    Comments: Submitted to BMVC 2018