Showing 1–33 of 33 results for author: Šegvić, S

  1. arXiv:2402.15374  [pdf, other]

    cs.CV cs.LG

    Outlier detection by ensembling uncertainty with negative objectness

    Authors: Anja Delić, Matej Grcić, Siniša Šegvić

    Abstract: Outlier detection is an essential capability in safety-critical applications of supervised visual recognition. Most of the existing methods deliver best results by encouraging standard closed-set models to produce low-confidence predictions in negative training data. However, that approach conflates prediction uncertainty with recognition of the negative class. We therefore reconsider direct predi…

    Submitted 10 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2310.06085  [pdf, other]

    cs.CV cs.LG

    Quantile-based Maximum Likelihood Training for Outlier Detection

    Authors: Masoud Taghikhah, Nishant Kumar, Siniša Šegvić, Abouzar Eslami, Stefan Gumhold

    Abstract: Discriminative learning effectively predicts true object class for image classification. However, it often results in false positives for outliers, posing critical concerns in applications like autonomous driving and video surveillance systems. Previous attempts to address this challenge involved training image classifiers through contrastive learning using actual outlier data or synthesizing outl…

    Submitted 2 June, 2024; v1 submitted 20 August, 2023; originally announced October 2023.

    Comments: Camera Ready Version. Accepted at AAAI 2024. Code available at https://github.com/taghikhah/QuantOD

  3. arXiv:2305.15227  [pdf, ps, other]

    cs.CV cs.LG

    Real time dense anomaly detection by learning on synthetic negative data

    Authors: Anja Delić, Matej Grcić, Siniša Šegvić

    Abstract: Most approaches to dense anomaly detection rely on generative modeling or on discriminative methods that train with negative data. We consider a recent hybrid method that optimizes the same shared representation according to cross-entropy of the discriminative predictions, and negative log likelihood of the predicted energy-based density. We extend that work with a jointly trained generative flow…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 3 pages

  4. arXiv:2303.06999  [pdf, other]

    cs.CV cs.LG

    Identifying Label Errors in Object Detection Datasets by Loss Inspection

    Authors: Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Siniša Šegvić, Matthias Rottmann

    Abstract: Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as wel…

    Submitted 19 December, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  5. arXiv:2302.07106  [pdf, other]

    cs.CV

    Normalizing Flow based Feature Synthesis for Outlier-Aware Object Detection

    Authors: Nishant Kumar, Siniša Šegvić, Abouzar Eslami, Stefan Gumhold

    Abstract: Real-world deployment of reliable object detectors is crucial for applications such as autonomous driving. However, general-purpose object detectors like Faster R-CNN are prone to providing overconfident predictions for outlier objects. Recent outlier-aware object detection approaches estimate the density of instance-wide features with class-conditional Gaussians and train on synthesized outlier f…

    Submitted 28 May, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted as CVPR 2023 Highlight (Top 10% of all acceptance)

  6. Hybrid Open-set Segmentation with Synthetic Negative Data

    Authors: Matej Grcić, Siniša Šegvić

    Abstract: Open-set segmentation can be conceived by complementing closed-set classification with anomaly detection. Many of the existing dense anomaly detectors operate through generative modelling of regular data or by discriminating with respect to negative data. These two approaches optimize different objectives and therefore exhibit different failure modes. Consequently, we propose a novel anomaly score…

    Submitted 24 April, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Published in IEEE TPAMI

  7. arXiv:2301.03407  [pdf, other]

    cs.CV

    On Advantages of Mask-level Recognition for Outlier-aware Segmentation

    Authors: Matej Grcić, Josip Šarić, Siniša Šegvić

    Abstract: Most dense recognition approaches bring a separate decision in each particular pixel. These approaches deliver competitive performance in usual closed-set setups. However, important applications in the wild typically require strong performance in presence of outliers. We show that this demanding setup greatly benefits from mask-level predictions, even in the case of non-finetuned baseline models. M…

    Submitted 5 April, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted to CVPR 2023 workshop on Visual Anomaly and Novelty Detection (VAND)

  8. Weakly supervised training of universal visual concepts for multi-domain semantic segmentation

    Authors: Petra Bevandić, Marin Oršić, Ivan Grubišić, Josip Šarić, Siniša Šegvić

    Abstract: Deep supervised models have an unprecedented capacity to absorb large quantities of training data. Hence, training on multiple datasets becomes a method of choice towards strong generalization in usual scenes and graceful performance degradation in edge cases. Unfortunately, different datasets often have incompatible labels. For instance, the Cityscapes road class subsumes all driving surfaces, wh…

    Submitted 12 March, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 27 pages, 16 figures, 10 tables, accepted to International Journal of Computer Vision

    Journal ref: International Journal of Computer Vision, 2024, 1-23

  9. arXiv:2211.04165  [pdf, other]

    cs.CV

    Dynamic loss balancing and sequential enhancement for road-safety assessment and traffic scene classification

    Authors: Marin Kačan, Marko Ševrović, Siniša Šegvić

    Abstract: Road-safety inspection is an indispensable instrument for reducing road-accident fatalities attributed to road infrastructure. Recent work formalizes road-safety assessment in terms of carefully selected risk factors that are also known as road-safety attributes. In current practice, these attributes are manually annotated in geo-referenced monocular video for each road segment. We propose to red…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2207.08445  [pdf, other]

    cs.CV

    Automatic universal taxonomies for multi-domain semantic segmentation

    Authors: Petra Bevandić, Siniša Šegvić

    Abstract: Training semantic segmentation models on multiple datasets has sparked a lot of recent interest in the computer vision community. This interest has been motivated by expensive annotations and a desire to achieve proficiency across multiple visual domains. However, established datasets have mutually incompatible labels which disrupt principled inference in the wild. We address this issue by automat…

    Submitted 26 October, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: BMVC 2022, 8 pages, 5 figures, 3 tables

  11. arXiv:2207.02606  [pdf, other]

    cs.CV

    DenseHybrid: Hybrid Anomaly Detection for Dense Open-set Recognition

    Authors: Matej Grcić, Petra Bevandić, Siniša Šegvić

    Abstract: Anomaly detection can be conceived either through generative modelling of regular training data or by discriminating with respect to negative training data. These two approaches exhibit different failure modes. Consequently, hybrid algorithms present an attractive research goal. Unfortunately, dense anomaly detection requires translational equivariance and very large input resolutions. These requi…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted on ECCV 2022

  12. Panoptic SwiftNet: Pyramidal Fusion for Real-time Panoptic Segmentation

    Authors: Josip Šarić, Marin Oršić, Siniša Šegvić

    Abstract: Dense panoptic prediction is a key ingredient in many existing applications such as autonomous driving, automated warehouses or remote sensing. Many of these applications require fast inference over large input resolutions on affordable or even embedded hardware. We propose to achieve this goal by trading off backbone capacity for multi-scale feature extraction. In comparison with contemporaneous…

    Submitted 18 April, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Code available at: https://github.com/jsaric/panoptic-swiftnet

    Journal ref: Remote Sensing. 2023, 15(8), 1968;

  13. arXiv:2112.12833  [pdf, other]

    cs.CV

    Dense Out-of-Distribution Detection by Robust Learning on Synthetic Negative Data

    Authors: Matej Grcić, Petra Bevandić, Zoran Kalafatić, Siniša Šegvić

    Abstract: Standard machine learning is unable to accommodate inputs which do not belong to the training distribution. The resulting models often give rise to confident incorrect predictions which may lead to devastating consequences. This problem is especially demanding in the context of dense prediction since input images may be only partially anomalous. Previous work has addressed dense out-of-distributio…

    Submitted 31 July, 2023; v1 submitted 23 December, 2021; originally announced December 2021.

  14. arXiv:2108.11224  [pdf, other]

    cs.CV

    Multi-domain semantic segmentation with overlapping labels

    Authors: Petra Bevandić, Marin Oršić, Ivan Grubišić, Josip Šarić, Siniša Šegvić

    Abstract: Deep supervised models have an unprecedented capacity to absorb large quantities of training data. Hence, training on many datasets becomes a method of choice towards graceful degradation in unusual scenes. Unfortunately, different datasets often use incompatible labels. For instance, the Cityscapes road class subsumes all driving surfaces, while Vistas defines separate classes for road markings,…

    Submitted 2 November, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: 18 pages, 8 figures, 11 tables

  15. Revisiting consistency for semi-supervised semantic segmentation

    Authors: Ivan Grubišić, Marin Oršić, Siniša Šegvić

    Abstract: Semi-supervised learning is an attractive technique in practical deployments of deep models since it relaxes the dependence on labeled data. It is especially important in the scope of dense prediction because pixel-level annotation requires significant effort. This paper considers semi-supervised algorithms that enforce consistent predictions over perturbed unlabeled inputs. We study the advantages o…

    Submitted 20 January, 2023; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: The source code is available at https://github.com/Ivan1248/semisup-seg-efficient

    Journal ref: Sensors. 2023; 23(2):940

  16. arXiv:2106.04627  [pdf, other]

    cs.LG cs.AI cs.CV

    Densely connected normalizing flows

    Authors: Matej Grcić, Ivan Grubišić, Siniša Šegvić

    Abstract: Normalizing flows are bijective mappings between inputs and latent representations with a fully factorized distribution. They are very attractive due to exact likelihood evaluation and efficient sampling. However, their effective capacity is often insufficient since the bijectivity constraint limits the model width. We address this issue by incrementally padding intermediate representations with no…

    Submitted 2 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS2021

  17. Dense Semantic Forecasting in Video by Joint Regression of Features and Feature Motion

    Authors: Josip Šarić, Sacha Vražić, Siniša Šegvić

    Abstract: Dense semantic forecasting anticipates future events in video by inferring pixel-level semantics of an unobserved future image. We present a novel approach that is applicable to various single-frame architectures and tasks. Our approach consists of two modules. Feature-to-motion (F2M) module forecasts a dense deformation field that warps past features into their future positions. Feature-to-featur…

    Submitted 16 December, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: 13 pages, 10 figures

  18. Dense outlier detection and open-set recognition based on training with noisy negative images

    Authors: Petra Bevandić, Ivan Krešo, Marin Oršić, Siniša Šegvić

    Abstract: Deep convolutional models often produce inadequate predictions for inputs foreign to the training distribution. Consequently, the problem of detecting outlier images has recently been receiving a lot of attention. Unlike most previous work, we address this problem in the dense prediction context in order to be able to locate outlier objects in front of in-distribution background. Our approach is b…

    Submitted 12 March, 2024; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: Published in Image and Vision Computing

    Journal ref: Image and Vision Computing, Vol. 124, 2022, 104490

  19. arXiv:2011.11094  [pdf, other]

    cs.CV

    Dense open-set recognition with synthetic outliers generated by Real NVP

    Authors: Matej Grcić, Petra Bevandić, Siniša Šegvić

    Abstract: Today's deep models are often unable to detect inputs which do not belong to the training distribution. This gives rise to confident incorrect predictions which could lead to devastating consequences in many important application fields such as healthcare and autonomous driving. Interestingly, both discriminative and generative models appear to be equally affected. Consequently, this vulnerability…

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: Accepted to VISAPP 2021 conference

  20. arXiv:2010.09067  [pdf, other]

    cs.CV

    Multimodal semantic forecasting based on conditional generation of future features

    Authors: Kristijan Fugošić, Josip Šarić, Siniša Šegvić

    Abstract: This paper considers semantic forecasting in road-driving scenes. Most existing approaches address this problem as deterministic regression of future features or future predictions given observed frames. However, such approaches ignore the fact that the future cannot always be guessed with certainty. For example, when a car is about to turn around a corner, the road which is currently occluded by bui…

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted to German Conference on Pattern Recognition 2020. 24 pages, 11 figures, 5 tables

  21. arXiv:2009.01636  [pdf, ps, other]

    cs.CV

    Multi-domain semantic segmentation with pyramidal fusion

    Authors: Petra Bevandić, Marin Oršić, Ivan Grubišić, Josip Šarić, Siniša Šegvić

    Abstract: We present our submission to the semantic segmentation contest of the Robust Vision Challenge held at ECCV 2020. The contest requires submitting the same model to seven benchmarks from three different domains. Our approach is based on the SwiftNet architecture with pyramidal fusion. We address inconsistent taxonomies with a single-level 193-dimensional softmax output. We strive to train with large…

    Submitted 7 October, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: 2 pages, 2 tables, no figures

  22. arXiv:1908.01098  [pdf, other]

    cs.CV

    Simultaneous Semantic Segmentation and Outlier Detection in Presence of Domain Shift

    Authors: Petra Bevandić, Ivan Krešo, Marin Oršić, Siniša Šegvić

    Abstract: Recent success on realistic road driving datasets has increased interest in exploring robust performance in real-world applications. One of the major unsolved problems is to identify image content which can not be reliably recognized with a given inference engine. We therefore study approaches to recover a dense outlier map alongside the primary task with a single forward pass, by relying on share…

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: Accepted to German Conference on Pattern Recognition 2019. 25 pages, 10 figures, 9 tables

  23. arXiv:1907.11475  [pdf, other]

    cs.CV

    Single Level Feature-to-Feature Forecasting with Deformable Convolutions

    Authors: Josip Šarić, Marin Oršić, Tonći Antunović, Sacha Vražić, Siniša Šegvić

    Abstract: Future anticipation is of vital importance in autonomous driving and other decision-making systems. We present a method to anticipate semantic segmentation of future frames in driving scenarios based on feature-to-feature forecasting. Our method is based on a semantic segmentation model without lateral connections within the upsampling path. Such design ensures that the forecasting addresses only…

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: Accepted to German Conference on Pattern Recognition 2019. 19 pages, 8 figures, 7 tables

  24. arXiv:1907.07045  [pdf, other]

    cs.CV cs.LG cs.RO

    Pedestrian Tracking by Probabilistic Data Association and Correspondence Embeddings

    Authors: Borna Bićanić, Marin Oršić, Ivan Marković, Siniša Šegvić, Ivan Petrović

    Abstract: This paper studies the interplay between kinematics (position and velocity) and appearance cues for establishing correspondences in multi-target pedestrian tracking. We investigate tracking-by-detection approaches based on a deep learning detector, joint integrated probabilistic data association (JIPDA), and appearance-based tracking of deep correspondence embeddings. We first addressed the fixed-…

    Submitted 16 July, 2019; originally announced July 2019.

    Journal ref: 22nd International Conference on Information Fusion (FUSION) (2019)

  25. arXiv:1905.05661  [pdf, other]

    cs.CV

    Efficient Ladder-style DenseNets for Semantic Segmentation of Large Images

    Authors: Ivan Krešo, Josip Krapac, Siniša Šegvić

    Abstract: Recent progress of deep image classification models has provided great potential to improve state-of-the-art performance in related computer vision tasks. However, the transition to semantic segmentation is hampered by strict memory limitations of contemporary GPUs. The extent of feature map caching required by convolutional backprop poses significant challenges even for moderately sized Pascal im…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 12 pages, 6 figures, under review

  26. arXiv:1903.08469  [pdf, other]

    cs.CV

    In Defense of Pre-trained ImageNet Architectures for Real-time Semantic Segmentation of Road-driving Images

    Authors: Marin Oršić, Ivan Krešo, Petra Bevandić, Siniša Šegvić

    Abstract: Recent success of semantic segmentation approaches on demanding road driving datasets has spurred interest in many related application fields. Many of these applications involve real-time prediction on mobile platforms such as cars, drones and various kinds of robots. Real-time setup is challenging due to extraordinary computational complexity involved. Many previous works address the challenge wi…

    Submitted 12 April, 2019; v1 submitted 20 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019. 8 pages, 8 figures, 5 tables. PyTorch source is available at https://github.com/orsic/swiftnet

  27. arXiv:1808.07703  [pdf, other]

    cs.CV

    Discriminative out-of-distribution detection for semantic segmentation

    Authors: Petra Bevandić, Ivan Krešo, Marin Oršić, Siniša Šegvić

    Abstract: Most classification and segmentation datasets assume a closed-world scenario in which predictions are expressed as a distribution over a predetermined set of visual classes. However, such an assumption implies unavoidable and often unnoticeable failures in presence of out-of-distribution (OOD) input. These failures are bound to happen in most real-life applications since current visual ontologies are f…

    Submitted 1 October, 2018; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: This paper has been withdrawn from AutoNUE workshop at ECCV 2018 due to ECCV registration being closed

  28. arXiv:1806.03465  [pdf, other]

    cs.CV cs.LG

    Robust Semantic Segmentation with Ladder-DenseNet Models

    Authors: Ivan Krešo, Marin Oršić, Petra Bevandić, Siniša Šegvić

    Abstract: We present semantic segmentation experiments with a model capable of performing predictions on four benchmark datasets: Cityscapes, ScanNet, WildDash and KITTI. We employ a ladder-style convolutional architecture featuring a modified DenseNet-169 model in the downsampling datapath, and only one convolution in each stage of the upsampling datapath. Due to limited computing resources, we perform the tr…

    Submitted 9 June, 2018; originally announced June 2018.

    Comments: 4 pages, 4 figures, CVPR 2018 Robust Vision Challenge Workshop

  29. arXiv:1310.0319   

    cs.CV

    Second Croatian Computer Vision Workshop (CCVW 2013)

    Authors: Sven Lončarić, Siniša Šegvić

    Abstract: Proceedings of the Second Croatian Computer Vision Workshop (CCVW 2013, http://www.fer.unizg.hr/crv/ccvw2013) held September 19, 2013, in Zagreb, Croatia. The workshop was organized by the Center of Excellence for Computer Vision of the University of Zagreb.

    Submitted 3 November, 2013; v1 submitted 1 October, 2013; originally announced October 2013.

    Comments: Papers presented at the Second Croatian Computer Vision Workshop CCVW 2013

  30. arXiv:1310.0316  [pdf]

    cs.CV

    Classifying Traffic Scenes Using The GIST Image Descriptor

    Authors: Ivan Sikirić, Karla Brkić, Siniša Šegvić

    Abstract: This paper investigates classification of traffic scenes in a very low bandwidth scenario, where an image should be coded by a small number of features. We introduce a novel dataset, called the FM1 dataset, consisting of 5615 images of eight different traffic scenes: open highway, open road, settlement, tunnel, tunnel exit, toll booth, heavy traffic and the overpass. We evaluate the suitability of…

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: Part of the Proceedings of the Croatian Computer Vision Workshop, CCVW 2013, Year 1

    Report number: UniZg-CRV-CCVW/2013/0013

  31. arXiv:1310.0311  [pdf]

    cs.CV

    Multiclass Road Sign Detection using Multiplicative Kernel

    Authors: Valentina Zadrija, Siniša Šegvić

    Abstract: We consider the problem of multiclass road sign detection using a classification function with a multiplicative kernel composed of two kernels. We show that problems of detection and within-foreground classification can be jointly solved by using one kernel to measure object-background differences and another one to account for within-class variations. The main idea behind this approach is that r…

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: Part of the Proceedings of the Croatian Computer Vision Workshop, CCVW 2013, Year 1

    Report number: UniZg-CRV-CCVW/2013/0016

  32. arXiv:1310.0310  [pdf]

    cs.CV

    A Novel Georeferenced Dataset for Stereo Visual Odometry

    Authors: Ivan Krešo, Marko Ševrović, Siniša Šegvić

    Abstract: In this work, we present a novel dataset for assessing the accuracy of stereo visual odometry. The dataset has been acquired by a small-baseline stereo rig mounted on the top of a moving car. The groundtruth is supplied by a consumer grade GPS device without IMU. Synchronization and alignment between GPS readings and stereo frames are recovered after the acquisition. We show that the attained grou…

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: Part of the Proceedings of the Croatian Computer Vision Workshop, CCVW 2013, Year 1

    Report number: UniZg-CRV-CCVW/2013/0017

  33. arXiv:1310.0308  [pdf]

    cs.CV

    Combining Spatio-Temporal Appearance Descriptors and Optical Flow for Human Action Recognition in Video Data

    Authors: Karla Brkić, Srđan Rašić, Axel Pinz, Siniša Šegvić, Zoran Kalafatić

    Abstract: This paper proposes combining spatio-temporal appearance (STA) descriptors with optical flow for human action recognition. The STA descriptors are local histogram-based descriptors of space-time, suitable for building a partial representation of arbitrary spatio-temporal phenomena. Because of the possibility of iterative refinement, they are interesting in the context of online human action recogn…

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: Part of the Proceedings of the Croatian Computer Vision Workshop, CCVW 2013, Year 1

    Report number: UniZg-CRV-CCVW/2013/0011