Skip to main content

Showing 1–16 of 16 results for author: Güney, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.06907  [pdf, other

    cs.CV

    Self-supervised Object-Centric Learning for Videos

    Authors: Görkay Aydemir, Weidi Xie, Fatma Güney

    Abstract: Unsupervised multi-object segmentation has shown impressive results on images by utilizing powerful semantics learned from self-supervised pretraining. An additional modality such as depth or motion is often used to facilitate the segmentation in video sequences. However, the performance improvements observed in synthetic sequences, which rely on the robustness of an additional cue, do not transla… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  2. arXiv:2309.09756  [pdf, other

    cs.RO cs.CV

    Privileged to Predicted: Towards Sensorimotor Reinforcement Learning for Urban Driving

    Authors: Ege Onat Özsüer, Barış Akgün, Fatma Güney

    Abstract: Reinforcement Learning (RL) has the potential to surpass human performance in driving without needing any expert supervision. Despite its promise, the state-of-the-art in sensorimotor self-driving is dominated by imitation learning methods due to the inherent shortcomings of RL algorithms. Nonetheless, RL agents are able to discover highly successful policies when provided with privileged ground t… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 7 pages

  3. arXiv:2309.04302  [pdf, other

    cs.CV

    Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes

    Authors: Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzard, Fatma Güney, Hanno Gottschalk

    Abstract: In the life cycle of highly automated systems operating in an open and dynamic environment, the ability to adjust to emerging challenges is crucial. For systems integrating data-driven AI-based components, rapid responses to deployment issues require fast access to related data for testing and reconfiguration. In the context of automated driving, this especially applies to road obstacles that were… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 11 pages, 7 figures, and 3 tables

  4. arXiv:2307.14187  [pdf, other

    cs.CV cs.RO

    ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation

    Authors: Görkay Aydemir, Adil Kaan Akan, Fatma Güney

    Abstract: Forecasting future trajectories of agents in complex traffic scenes requires reliable and efficient predictions for all agents in the scene. However, existing methods for trajectory prediction are either inefficient or sacrifice accuracy. To address this challenge, we propose ADAPT, a novel approach for jointly predicting the trajectories of all agents in the scene with dynamic weight learning. Ou… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  5. arXiv:2307.08027  [pdf, other

    cs.CV

    Multi-Object Discovery by Low-Dimensional Object Motion

    Authors: Sadra Safadoust, Fatma Güney

    Abstract: Recent work in unsupervised multi-object segmentation shows impressive results by predicting motion from a single image despite the inherent ambiguity in predicting motion without the next image. On the other hand, the set of possible motions for an image can be constrained to a low-dimensional space by considering the scene structure and moving objects in it. We propose to model pixel-wise geomet… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  6. arXiv:2301.02092  [pdf, other

    cs.CV

    DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax

    Authors: Sadra Safadoust, Fatma Güney

    Abstract: Current self-supervised monocular depth estimation methods are mostly based on estimating a rigid-body motion representing camera motion. These methods suffer from the well-known scale ambiguity problem in their predictions. We propose DepthP+P, a method that learns to estimate outputs in metric scale by following the traditional planar parallax paradigm. We first align the two frames using a comm… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  7. arXiv:2211.14293  [pdf, other

    cs.CV

    RbA: Segmenting Unknown Regions Rejected by All

    Authors: Nazir Nayal, Mısra Yavuz, João F. Henriques, Fatma Güney

    Abstract: Standard semantic segmentation models owe their success to curated datasets with a fixed set of semantic categories, without contemplating the possibility of identifying unknown objects from novel categories. Existing methods in outlier detection suffer from a lack of smoothness and objectness in their predictions, due to limitations of the per-pixel classification paradigm. Furthermore, additiona… ▽ More

    Submitted 29 March, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  8. arXiv:2210.16795  [pdf, other

    cs.CV

    Two-Level Temporal Relation Model for Online Video Instance Segmentation

    Authors: Çağan Selim Çoban, Oğuzhan Keskin, Jordi Pont-Tuset, Fatma Güney

    Abstract: In Video Instance Segmentation (VIS), current approaches either focus on the quality of the results, by taking the whole video as input and processing it offline; or on speed, by handling it frame by frame at the cost of competitive performance. In this work, we propose an online method that is on par with the performance of the offline counterparts. We introduce a message-passing graph neural net… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  9. arXiv:2207.00255  [pdf, other

    cs.CV cs.RO

    Trajectory Forecasting on Temporal Graphs

    Authors: Görkay Aydemir, Adil Kaan Akan, Fatma Güney

    Abstract: Predicting future locations of agents in the scene is an important problem in self-driving. In recent years, there has been a significant progress in representing the scene and the agents in it. The interactions of agents with the scene and with each other are typically modeled with a Graph Neural Network. However, the graph structure is mostly static and fails to represent the temporal changes in… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  10. arXiv:2203.13641  [pdf, other

    cs.CV cs.LG

    StretchBEV: Stretching Future Instance Prediction Spatially and Temporally

    Authors: Adil Kaan Akan, Fatma Güney

    Abstract: In self-driving, predicting future in terms of location and motion of all the agents around the vehicle is a crucial requirement for planning. Recently, a new joint formulation of perception and prediction has emerged by fusing rich sensory information perceived from multiple cameras into a compact bird's-eye view representation to perform prediction. However, the quality of future predictions deg… ▽ More

    Submitted 10 August, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: ECCV 2022

  11. arXiv:2203.10528  [pdf, other

    cs.CV cs.LG

    Stochastic Video Prediction with Structure and Motion

    Authors: Adil Kaan Akan, Sadra Safadoust, Fatma Güney

    Abstract: While stochastic video prediction models enable future prediction under uncertainty, they mostly fail to model the complex dynamics of real-world scenes. For example, they cannot provide reliable predictions for scenes with a moving camera and independently moving foreground objects in driving scenarios. The existing methods fail to fully capture the dynamics of the structured world by only focusi… ▽ More

    Submitted 29 April, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Under review at TPAMI

  12. arXiv:2111.04780  [pdf, other

    cs.CV

    Frustum Fusion: Pseudo-LiDAR and LiDAR Fusion for 3D Detection

    Authors: Farzin Negahbani, Onur Berk Töre, Fatma Güney, Baris Akgun

    Abstract: Most autonomous vehicles are equipped with LiDAR sensors and stereo cameras. The former is very accurate but generates sparse data, whereas the latter is dense, has rich texture and color information but difficult to extract robust 3D representations from. In this paper, we propose a novel data fusion algorithm to combine accurate point clouds with dense but less accurate point clouds obtained fro… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    ACM Class: I.4.8

  13. arXiv:2110.11275  [pdf, other

    cs.CV

    Self-Supervised Monocular Scene Decomposition and Depth Estimation

    Authors: Sadra Safadoust, Fatma Güney

    Abstract: Self-supervised monocular depth estimation approaches either ignore independently moving objects in the scene or need a separate segmentation step to identify them. We propose MonoDepthSeg to jointly estimate depth and segment moving objects from monocular video without using any ground-truth labels. We decompose the scene into a fixed number of components where each component corresponds to a reg… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 3DV 2021

  14. arXiv:2108.02760  [pdf, other

    cs.CV

    SLAMP: Stochastic Latent Appearance and Motion Prediction

    Authors: Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma Güney

    Abstract: Motion is an important cue for video prediction and often utilized by separating video content into static and dynamic components. Most of the previous work utilizing motion is deterministic but there are stochastic methods that can model the inherent uncertainty of the future. Existing stochastic models either do not reason about motion explicitly or make limiting assumptions about the static par… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  15. arXiv:1712.08416  [pdf, other

    cs.CV

    On the Integration of Optical Flow and Action Recognition

    Authors: Laura Sevilla-Lara, Yiyi Liao, Fatma Guney, Varun Jampani, Andreas Geiger, Michael J. Black

    Abstract: Most of the top performing action recognition methods use optical flow as a "black box" input. Here we take a deeper look at the combination of flow and action recognition, and investigate why optical flow is helpful, what makes a flow method good for action recognition, and how we can make it better. In particular, we investigate the impact of different flow algorithms and input transformations t… ▽ More

    Submitted 22 December, 2017; originally announced December 2017.

  16. arXiv:1704.05519  [pdf, other

    cs.CV cs.RO

    Computer Vision for Autonomous Vehicles: Problems, Datasets and State of the Art

    Authors: Joel Janai, Fatma Güney, Aseem Behl, Andreas Geiger

    Abstract: Recent years have witnessed enormous progress in AI-related fields such as computer vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it becomes increasingly difficult to stay up-to-date or enter the field as a beginner. While several survey papers on particular sub-problems have appeared, no comprehensive survey on problems, datasets, and methods in computer vi… ▽ More

    Submitted 17 March, 2021; v1 submitted 18 April, 2017; originally announced April 2017.