Showing 1–16 of 16 results for author: Zagoruyko, S

  1. arXiv:2405.02162  [pdf, other]

    cs.CV cs.AI cs.RO

    Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models

    Authors: Mohamad Al Mdfaa, Raghad Salameh, Sergey Zagoruyko, Gonzalo Ferrer

    Abstract: In the field of robotics and computer vision, efficient and accurate semantic mapping remains a significant challenge due to the growing demand for intelligent machines that can comprehend and interact with complex environments. Conventional panoptic mapping methods, however, are limited by predefined semantic classes, thus making them ineffective for handling novel or unforeseen objects. In respo…

    Submitted 3 May, 2024; originally announced May 2024.

  2. arXiv:2302.03802  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking

    Authors: Ziqi Pang, Jie Li, Pavel Tokmakov, Dian Chen, Sergey Zagoruyko, Yu-Xiong Wang

    Abstract: This work proposes an end-to-end multi-camera 3D multi-object tracking (MOT) framework. It emphasizes spatio-temporal continuity and integrates both past and future reasoning for tracked objects. Thus, we name it "Past-and-Future reasoning for Tracking" (PF-Track). Specifically, our method adapts the "tracking by attention" framework and represents tracked instances coherently over time with objec…

    Submitted 3 April, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: CVPR 2023 Camera Ready, 15 pages, 8 figures

  3. arXiv:2211.02131  [pdf, other]

    cs.RO cs.LG

    Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts

    Authors: Stefano Pini, Christian S. Perone, Aayush Ahuja, Ana Sofia Rufino Ferreira, Moritz Niendorf, Sergey Zagoruyko

    Abstract: The goal of autonomous vehicles is to navigate public roads safely and comfortably. To enforce safety, traditional planning approaches rely on handcrafted rules to generate trajectories. Machine learning-based systems, on the other hand, scale with data and are able to learn more complex behaviors. However, they often ignore that agents and self-driving vehicle trajectory distributions can be leve…

    Submitted 3 November, 2022; originally announced November 2022.

  4. arXiv:2210.02174  [pdf, other]

    cs.LG cs.RO

    CW-ERM: Improving Autonomous Driving Planning with Closed-loop Weighted Empirical Risk Minimization

    Authors: Eesha Kumar, Yiming Zhang, Stefano Pini, Simon Stent, Ana Ferreira, Sergey Zagoruyko, Christian S. Perone

    Abstract: The imitation learning of self-driving vehicle policies through behavioral cloning is often carried out in an open-loop fashion, ignoring the effect of actions to future states. Training such policies purely with Empirical Risk Minimization (ERM) can be detrimental to real-world performance, as it biases policy networks towards matching only open-loop behavior, showing poor results when evaluated…

    Submitted 11 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: v2: minor update in dataset and results (no changes in improvements or conclusions)

  5. arXiv:2005.12872  [pdf, other]

    cs.CV

    End-to-End Object Detection with Transformers

    Authors: Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko

    Abstract: We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression procedure or anchor generation that explicitly encode our prior knowledge about the task. The main ingredients of the new framework, called DEtection TRansformer or DET…

    Submitted 28 May, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  6. arXiv:2001.09832  [pdf, other]

    cs.LG stat.ML

    Polygames: Improved Zero Learning

    Authors: Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-** Lin, Xavier Martinet, Vegard Mella, Jeremy Rapin, Baptiste Roziere, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko

    Abstract: Since DeepMind's AlphaZero, Zero learning quickly became the state-of-the-art method for many board games. It can be improved using a fully convolutional structure (no fully connected layer). Using such an architecture plus global pooling, we can create bots independent of the board size. The training can be made more robust by keeping track of the best checkpoints during the training and by train…

    Submitted 27 January, 2020; originally announced January 2020.

  7. arXiv:1904.10348  [pdf, other]

    cs.RO cs.CV

    Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning

    Authors: Yann Labbé, Sergey Zagoruyko, Igor Kalevatykh, Ivan Laptev, Justin Carpentier, Mathieu Aubry, Josef Sivic

    Abstract: We address the problem of visually guided rearrangement planning with many movable objects, i.e., finding a sequence of actions to move a set of objects from an initial arrangement to a desired one, while relying on visual inputs coming from an RGB camera. To do so, we introduce a complete pipeline relying on two key contributions. First, we introduce an efficient and scalable rearrangement planni…

    Submitted 1 April, 2020; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters (RA-L)

  8. arXiv:1812.11027  [pdf, other]

    cs.LG stat.ML

    Exploring Weight Symmetry in Deep Neural Networks

    Authors: Xu Shell Hu, Sergey Zagoruyko, Nikos Komodakis

    Abstract: We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the symmetry constraints, one would expect a dramatic drop in accuracy. Surprisingly, we show that this is not the case, and, depending on network size, symmetry can ha…

    Submitted 10 January, 2019; v1 submitted 28 December, 2018; originally announced December 2018.

  9. Compressing the Input for CNNs with the First-Order Scattering Transform

    Authors: Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko, Michal Valko

    Abstract: We study the first-order scattering transform as a candidate for reducing the signal processed by a convolutional neural network (CNN). We show theoretical and empirical evidence that in the case of natural images and sufficiently small translation invariance, this transform preserves most of the signal information needed for classification while substantially reducing the spatial resolution and t…

    Submitted 27 September, 2018; originally announced September 2018.

    Journal ref: ECCV 2018

  10. arXiv:1809.06367  [pdf, other]

    cs.LG cs.CV stat.ML

    Scattering Networks for Hybrid Representation Learning

    Authors: Edouard Oyallon, Sergey Zagoruyko, Gabriel Huang, Nikos Komodakis, Simon Lacoste-Julien, Matthew Blaschko, Eugene Belilovsky

    Abstract: Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modelling images. In particular, by working in scattering space, we achieve competitive results both for supervised and unsupervised learning tasks, while making progress towards constructing more interpretable CNNs. For supervised learning, we…

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1703.08961

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2018, pp.11

  11. arXiv:1706.00388  [pdf, other]

    cs.CV

    DiracNets: Training Very Deep Neural Networks Without Skip-Connections

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Deep neural networks with skip-connections, such as ResNet, show excellent performance in various image classification benchmarks. It is though observed that the initial motivation behind them - training deeper networks - does not actually hold true, and the benefits come from increased capacity, rather than from depth. Motivated by this, and inspired from ResNet, we propose a simple Dirac weight…

    Submitted 26 January, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

  12. arXiv:1703.08961  [pdf, ps, other]

    cs.CV cs.LG

    Scaling the Scattering Transform: Deep Hybrid Networks

    Authors: Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko

    Abstract: We use the scattering network as a generic and fixed initialization of the first layers of a supervised hybrid deep network. We show that early layers do not necessarily need to be learned, providing the best results to-date with pre-defined representations while being competitive with Deep CNNs. Using a shallow cascade of 1 x 1 convolutions, which encodes scattering coefficients that correspond…

    Submitted 4 April, 2017; v1 submitted 27 March, 2017; originally announced March 2017.

  13. arXiv:1612.03928  [pdf, other]

    cs.CV

    Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Attention plays a critical role in human visual experience. Furthermore, it has recently been demonstrated that attention can also play an important role in the context of applying artificial neural networks to a variety of tasks from fields such as computer vision and NLP. In this work we show that, by properly defining attention for convolutional neural networks, we can actually use this type of…

    Submitted 12 February, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  14. arXiv:1605.07146  [pdf, other]

    cs.CV cs.LG cs.NE

    Wide Residual Networks

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Deep residual networks were shown to be able to scale up to thousands of layers and still have improving performance. However, each fraction of a percent of improved accuracy costs nearly doubling the number of layers, and so training very deep residual networks has a problem of diminishing feature reuse, which makes these networks very slow to train. To tackle these problems, in this paper we con…

    Submitted 14 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  15. arXiv:1604.02135  [pdf, other]

    cs.CV

    A MultiPath Network for Object Detection

    Authors: Sergey Zagoruyko, Adam Lerer, Tsung-Yi Lin, Pedro O. Pinheiro, Sam Gross, Soumith Chintala, Piotr Dollár

    Abstract: The recent COCO object detection dataset presents several new challenges for object detection. In particular, it contains objects at a broad range of scales, less prototypical images, and requires more precise localization. To address these challenges, we test three modifications to the standard Fast R-CNN object detector: (1) skip connections that give the detector access to features at multiple…

    Submitted 8 August, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

  16. arXiv:1504.03641  [pdf, other]

    cs.CV cs.LG cs.NE

    Learning to Compare Image Patches via Convolutional Neural Networks

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: In this paper we show how to learn directly from image data (i.e., without resorting to manually-designed features) a general similarity function for comparing image patches, which is a task of fundamental importance for many computer vision problems. To encode such a function, we opt for a CNN-based model that is trained to account for a wide variety of changes in image appearance. To that end, w…

    Submitted 14 April, 2015; originally announced April 2015.

    Comments: CVPR 2015