Showing 1–7 of 7 results for author: Grujicic, D

  1. arXiv:2401.15223  [pdf, other]

    cs.CV cs.LG

    Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis

    Authors: Mingshi Li, Dusan Grujicic, Steven De Saeger, Stien Heremans, Ben Somers, Matthew B. Blaschko

    Abstract: In recent years, machine learning has become crucial in remote sensing analysis, particularly in the domain of Land-use/Land-cover (LULC). The synergy of machine learning and satellite imagery analysis has demonstrated significant productivity in this field, as evidenced by several studies. A notable challenge within this area is the semantic segmentation mapping of land usage over extensive terri…

    Submitted 26 January, 2024; originally announced January 2024.

  2. arXiv:2307.07483  [pdf, other]

    cs.CV

    Multimodal Distillation for Egocentric Action Recognition

    Authors: Gorjan Radevski, Dusan Grujicic, Marie-Francine Moens, Matthew Blaschko, Tinne Tuytelaars

    Abstract: The focal point of egocentric video understanding is modelling hand-object interactions. Standard models, e.g. CNNs or Vision Transformers, which receive RGB frames as input perform well. However, their performance improves further by employing additional input modalities that provide complementary cues, such as object detections, optical flow, audio, etc. The added complexity of the modality-spec…

    Submitted 18 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV 2023; Codebase released at https://github.com/gorjanradevski/multimodal-distillation

  3. arXiv:2210.04331  [pdf, other]

    cs.CV

    Students taught by multimodal teachers are superior action recognizers

    Authors: Gorjan Radevski, Dusan Grujicic, Matthew Blaschko, Marie-Francine Moens, Tinne Tuytelaars

    Abstract: The focal point of egocentric video understanding is modelling hand-object interactions. Standard models -- CNNs, Vision Transformers, etc. -- which receive RGB frames as input perform well, however, their performance improves further by employing additional modalities such as object detections, optical flow, audio, etc. as input. The added complexity of the required modality-specific modules, on…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: Extended abstract accepted at the 2nd Ego4D Workshop @ ECCV 2022

  4. arXiv:2112.05419  [pdf, other]

    cs.AI cs.CL cs.CV cs.HC cs.LG

    Predicting Physical World Destinations for Commands Given to Self-Driving Cars

    Authors: Dusan Grujicic, Thierry Deruyttere, Marie-Francine Moens, Matthew Blaschko

    Abstract: In recent years, we have seen significant steps taken in the development of self-driving cars. Multiple companies are starting to roll out impressive systems that work in a variety of settings. These systems can sometimes give the impression that full self-driving is just around the corner and that we would soon build cars without even a steering wheel. The increase in the level of autonomy and co…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022. First two authors have contributed equally. Extended camera-ready version including the appendix and references to it in the main text

  5. arXiv:2009.08792  [pdf, other]

    cs.CV cs.AI

    Commands 4 Autonomous Vehicles (C4AV) Workshop Summary

    Authors: Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Yu Liu, Luc Van Gool, Matthew Blaschko, Tinne Tuytelaars, Marie-Francine Moens

    Abstract: The task of visual grounding requires locating the most relevant region or object in an image, given a natural language query. So far, progress on this task was mostly measured on curated datasets, which are not always representative of human spoken language. In this work, we deviate from recent, popular task settings and consider the problem under an autonomous vehicle scenario. In particular, we…

    Submitted 18 September, 2020; originally announced September 2020.

  6. arXiv:2004.13822  [pdf, other]

    cs.CL cs.LG

    A Baseline for the Commands For Autonomous Vehicles Challenge

    Authors: Simon Vandenhende, Thierry Deruyttere, Dusan Grujicic

    Abstract: The Commands For Autonomous Vehicles (C4AV) challenge requires participants to solve an object referral task in a real-world setting. More specifically, we consider a scenario where a passenger can pass free-form natural language commands to a self-driving car. This problem is particularly challenging, as the language is much less constrained compared to existing benchmarks, and object references…

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: Technical Report

  7. arXiv:1909.10838  [pdf, other]

    cs.AI cs.CL cs.RO

    Talk2Car: Taking Control of Your Self-Driving Car

    Authors: Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Luc Van Gool, Marie-Francine Moens

    Abstract: A long-term goal of artificial intelligence is to have an agent execute commands communicated through natural language. In many cases the commands are grounded in a visual environment shared by the human who gives the command and the agent. Execution of the command then requires mapping the command into the physical visual space, after which the appropriate action can be taken. In this paper we co…

    Submitted 26 August, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: 14 pages, accepted at EMNLP-IJCNLP 2019 - Added Talk2Nav Reference