Skip to main content

Showing 1–31 of 31 results for author: Kämäräinen, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16707  [pdf, other

    cs.LG cs.AI

    Probabilistic Subgoal Representations for Hierarchical Reinforcement learning

    Authors: Vivienne Huiling Wang, Tinghuai Wang, Wenyan Yang, Joni-Kristian Kämäräinen, Joni Pajarinen

    Abstract: In goal-conditioned hierarchical reinforcement learning (HRL), a high-level policy specifies a subgoal for the low-level policy to reach. Effective HRL hinges on a suitable subgoal represen tation function, abstracting state space into latent subgoal space and inducing varied low-level behaviors. Existing methods adopt a subgoal representation that provides a deterministic map** from state space… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2309.17260  [pdf, other

    cs.RO cs.AI cs.LG

    PlaceNav: Topological Navigation through Place Recognition

    Authors: Lauri Suomela, Jussi Kalliola, Harry Edelman, Joni-Kristian Kämäräinen

    Abstract: Recent results suggest that splitting topological navigation into robot-independent and robot-specific components improves navigation performance by enabling the robot-independent part to be trained with data collected by robots of different types. However, the navigation methods' performance is still limited by the scarcity of suitable training data and they suffer from poor computational scaling… ▽ More

    Submitted 29 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICRA2024 camera ready

  3. arXiv:2303.09334  [pdf, other

    cs.CV

    Depth-Aware Image Compositing Model for Parallax Camera Motion Blur

    Authors: German F. Torres, Joni-Kristian Kämäräinen

    Abstract: Camera motion introduces spatially varying blur due to the depth changes in the 3D world. This work investigates scene configurations where such blur is produced under parallax camera motion. We present a simple, yet accurate, Image Compositing Blur (ICB) model for depth-dependent spatially varying blur. The (forward) model produces realistic motion blur from a single image, depth map, and camera… ▽ More

    Submitted 30 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  4. arXiv:2303.02646  [pdf, other

    cs.RO

    Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

    Authors: Wenyan Yang, Alexandre Angleraud, Roel S. Pieters, Joni Pajarinen, Joni-Kristian Kämäräinen

    Abstract: Robot control for tactile feedback-based manipulation can be difficult due to the modeling of physical contacts, partial observability of the environment, and noise in perception and control. This work focuses on solving partial observability of contact-rich manipulation tasks as a Sequence-to-Sequence (Seq2Seq)} Imitation Learning (IL) problem. The proposed Seq2Seq model produces a robot-environm… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  5. arXiv:2302.08865  [pdf, other

    cs.LG cs.AI

    Swapped goal-conditioned offline reinforcement learning

    Authors: Wenyan Yang, Huiling Wang, Dingding Cai, Joni Pajarinen, Joni-Kristen Kämäräinen

    Abstract: Offline goal-conditioned reinforcement learning (GCRL) can be challenging due to overfitting to the given dataset. To generalize agents' skills outside the given dataset, we propose a goal-swap** procedure that generates additional trajectories. To alleviate the problem of noise and extrapolation errors, we present a general offline reinforcement learning method called deterministic Q-advantage… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.07741

  6. arXiv:2302.07741  [pdf, other

    cs.LG cs.AI

    Prioritized offline Goal-swap** Experience Replay

    Authors: Wenyan Yang, Joni Pajarinen, Dinging Cai, Joni Kämäräinen

    Abstract: In goal-conditioned offline reinforcement learning, an agent learns from previously collected data to go to an arbitrary goal. Since the offline data only contains a finite number of trajectories, a main challenge is how to generate more data. Goal-swap** generates additional data by switching trajectory goals but while doing so produces a large number of invalid trajectories. To address this is… ▽ More

    Submitted 5 March, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  7. arXiv:2203.14134  [pdf, other

    cs.CV

    RGBD Object Tracking: An In-depth Review

    Authors: **yu Yang, Zhe Li, Song Yan, Feng Zheng, Aleš Leonardis, Joni-Kristian Kämäräinen, Ling Shao

    Abstract: RGBD object tracking is gaining momentum in computer vision research thanks to the development of depth sensors. Although numerous RGBD trackers have been proposed with promising performance, an in-depth review for comprehensive understanding of this area is lacking. In this paper, we firstly review RGBD object trackers from different perspectives, including RGBD fusion, depth usage, and tracking… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

    Comments: 13 pages

  8. arXiv:2203.13048  [pdf, other

    cs.CV

    Benchmarking Visual Localization for Autonomous Navigation

    Authors: Lauri Suomela, Jussi Kalliola, Atakan Dag, Harry Edelman, Joni-Kristian Kämäräinen

    Abstract: This work introduces a simulator-based benchmark for visual localization in the autonomous navigation context. The dynamic benchmark enables investigation of how variables such as the time of day, weather, and camera perspective affect the navigation performance of autonomous agents that utilize visual localization for closed-loop control. The experimental part of the paper studies the effects of… ▽ More

    Submitted 18 October, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: WACV2023 camera ready

  9. arXiv:2201.09635  [pdf, other

    cs.LG

    State-Conditioned Adversarial Subgoal Generation

    Authors: Vivienne Huiling Wang, Joni Pajarinen, Tinghuai Wang, Joni-Kristian Kämäräinen

    Abstract: Hierarchical reinforcement learning (HRL) proposes to solve difficult tasks by performing decision-making and control at successively higher levels of temporal abstraction. However, off-policy HRL often suffers from the problem of a non-stationary high-level policy since the low-level policy is constantly changing. In this paper, we propose a novel HRL approach for mitigating the non-stationarity… ▽ More

    Submitted 13 March, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

  10. arXiv:2110.11679  [pdf, other

    cs.CV

    Depth-only Object Tracking

    Authors: Song Yan, **yu Yang, Ales Leonardis, Joni-Kristian Kamarainen

    Abstract: Depth (D) indicates occlusion and is less sensitive to illumination changes, which make depth attractive modality for Visual Object Tracking (VOT). Depth is used in RGBD object tracking where the best trackers are deep RGB trackers with additional heuristic using depth maps. There are two potential reasons for the heuristics: 1) the lack of large RGBD tracking datasets to train deep RGBD trackers… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Accepted to BMVC2021

  11. arXiv:2108.13962  [pdf, other

    cs.CV

    DepthTrack : Unveiling the Power of RGBD Tracking

    Authors: Song Yan, **yu Yang, Jani Käpylä, Feng Zheng, Aleš Leonardis, Joni-Kristian Kämäräinen

    Abstract: RGBD (RGB plus depth) object tracking is gaining momentum as RGBD sensors have become popular in many application fields such as robotics.However, the best RGBD trackers are extensions of the state-of-the-art deep RGB trackers. They are trained with RGB data and the depth channel is used as a sidekick for subtleties such as occlusion detection. This can be explained by the fact that there are no s… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV2021

  12. arXiv:2108.07514  [pdf, other

    cs.RO cs.AI

    Monolithic vs. hybrid controller for multi-objective Sim-to-Real learning

    Authors: Atakan Dag, Alexandre Angleraud, Wenyan Yang, Nataliya Strokina, Roel S. Pieters, Minna Lanz, Joni-Kristian Kamarainen

    Abstract: Simulation to real (Sim-to-Real) is an attractive approach to construct controllers for robotic tasks that are easier to simulate than to analytically solve. Working Sim-to-Real solutions have been demonstrated for tasks with a clear single objective such as "reach the target". Real world applications, however, often consist of multiple simultaneous objectives such as "reach the target" but "avoid… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

  13. arXiv:2103.12379  [pdf, other

    cs.RO cs.LG eess.SY

    Neural Network Controller for Autonomous Pile Loading Revised

    Authors: Wenyan Yang, Nataliya Strokina, Nikolay Serbenyuk, Joni Pajarinen, Reza Ghabcheloo, Juho Vihonen, Mohammad M. Aref, Joni-Kristian Kämäräinen

    Abstract: We have recently proposed two pile loading controllers that learn from human demonstrations: a neural network (NNet) [1] and a random forest (RF) controller [2]. In the field experiments the RF controller obtained clearly better success rates. In this work, the previous findings are drastically revised by experimenting summer time trained controllers in winter conditions. The winter experiments re… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 7 pages

  14. arXiv:2101.02515  [pdf, other

    cs.CV

    Learning Anthropometry from Rendered Humans

    Authors: Song Yan, Joni-Kristian Kämäräinen

    Abstract: Accurate estimation of anthropometric body measurements from RGB images has many potential applications in industrial design, online clothing, medical diagnosis and ergonomics. Research on this topic is limited by the fact that there exist only generated datasets which are based on fitting a 3D body mesh to 3D body scans in the commercial CAESAR dataset. For 2D only silhouettes are generated. To c… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  15. arXiv:2011.04612  [pdf, other

    cs.CV cs.GR

    Fast Fourier Intrinsic Network

    Authors: Yanlin Qian, Miao**g Shi, Joni-Kristian Kämäräinen, Jiri Matas

    Abstract: We address the problem of decomposing an image into albedo and shading. We propose the Fast Fourier Intrinsic Network, FFI-Net in short, that operates in the spectral domain, splitting the input into several spectral bands. Weights in FFI-Net are optimized in the spectral domain, allowing faster convergence to a lower error. FFI-Net is lightweight and does not need auxiliary networks for training.… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: WACV 2021 - camera ready

  16. arXiv:2003.03763  [pdf, other

    cs.CV

    A Benchmark for Temporal Color Constancy

    Authors: Yanlin Qian, Jani Käpylä, Joni-Kristian Kämäräinen, Samu Koskinen, Jiri Matas

    Abstract: Temporal Color Constancy (CC) is a recently proposed approach that challenges the conventional single-frame color constancy. The conventional approach is to use a single frame - shot frame - to estimate the scene illumination color. In temporal CC, multiple frames from the view finder sequence are used to estimate the color. However, there are no realistic large scale temporal color constancy data… ▽ More

    Submitted 8 March, 2020; originally announced March 2020.

    Comments: 16 pages, 6 figures

  17. arXiv:1912.00660  [pdf, other

    cs.CV

    DAL -- A Deep Depth-aware Long-term Tracker

    Authors: Yanlin Qian, Alan Lukežič, Matej Kristan, Joni-Kristian Kämäräinen, Jiri Matas

    Abstract: The best RGBD trackers provide high accuracy but are slow to run. On the other hand, the best RGB trackers are fast but clearly inferior on the RGBD datasets. In this work, we propose a deep depth-aware long-term tracker that achieves state-of-the-art RGBD tracking performance and is fast to run. We reformulate deep discriminative correlation filter (DCF) to embed the depth information into deep f… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 10 pages

  18. arXiv:1911.00694  [pdf, other

    cs.CV cs.LG

    Anthropometric clothing measurements from 3D body scans

    Authors: Song Yan, Johan Wirta, Joni-Kristian Kämäräinen

    Abstract: We propose a full processing pipeline to acquire anthropometric measurements from 3D measurements. The first stage of our pipeline is a commercial point cloud scanner. In the second stage, a pre-defined body model is fitted to the captured point cloud. We have generated one male and one female model from the SMPL library. The fitting process is based on non-rigid Iterative Closest Point (ICP) algo… ▽ More

    Submitted 2 November, 2019; originally announced November 2019.

  19. arXiv:1909.02933  [pdf, other

    cs.RO

    AR-based interaction for safe human-robot collaborative manufacturing

    Authors: Antti Hietanen, Jyrki Latokartano, Roel Pieters, Minna Lanz, Joni-Kristian Kämäräinen

    Abstract: Industrial standards define safety requirements for Human-Robot Collaboration (HRC) in industrial manufacturing. The standards particularly require real-time monitoring and securing of the minimum protective distance between a robot and an operator. In this work, we propose a depth-sensor based model for workspace monitoring and an interactive Augmented Reality (AR) User Interface (UI) for safe HR… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

    Comments: 7 pages, 7 figures

  20. arXiv:1907.00618  [pdf, other

    cs.CV

    CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark

    Authors: Alan Lukežič, Ugur Kart, Jani Käpylä, Ahmed Durmush, Joni-Kristian Kämäräinen, Jiří Matas, Matej Kristan

    Abstract: A long-term visual object tracking performance evaluation methodology and a benchmark are proposed. Performance measures are designed by following a long-term tracking definition to maximize the analysis probing strength. The new measures outperform existing ones in interpretation potential and in better distinguishing between different tracking behaviors. We show that these measures generalize th… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

  21. arXiv:1906.02783  [pdf, other

    cs.CV

    Object Pose Estimation in Robotics Revisited

    Authors: Antti Hietanen, Jyrki Latokartano, Alessandro Foi, Roel Pieters, Ville Kyrki, Minna Lanz, Joni-Kristian Kämäräinen

    Abstract: Vision based object gras** and manipulation in robotics require accurate estimation of object's 6D pose. The 6D pose estimation has received significant attention in computer vision community and multiple datasets and evaluation metrics have been proposed. However, the existing metrics measure how well two geometrical surfaces are aligned - ground truth vs. estimated pose - which does not direct… ▽ More

    Submitted 21 May, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: 29 pages, 8 figures

  22. arXiv:1902.10466  [pdf, other

    cs.CV

    Flash Lightens Gray Pixels

    Authors: Yanlin Qian, Song Yan, Joni-Kristian Kämäräinen, Jiri Matas

    Abstract: In the real world, a scene is usually cast by multiple illuminants and herein we address the problem of spatial illumination estimation. Our solution is based on detecting gray pixels with the help of flash photography. We show that flash photography significantly improves the performance of gray pixel detection without illuminant prior, training data or calibration of the flash. We also introduce… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 5 pages including refs, 4 figures, submitted to International Conference on Image Processing

  23. arXiv:1901.03198  [pdf, other

    cs.CV

    On Finding Gray Pixels

    Authors: Yanlin Qian, Joni-Kristian Kämäräinen, Jarno Nikkanen, Jiri Matas

    Abstract: We propose a novel grayness index for finding gray pixels and demonstrate its effectiveness and efficiency in illumination estimation. The grayness index, GI in short, is derived using the Dichromatic Reflection Model and is learning-free. GI allows to estimate one or multiple illumination sources in color-biased images. On standard single-illumination and multiple-illumination estimation benchmar… ▽ More

    Submitted 2 May, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Comments: appear in IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) 2019. 9 pages, 7 figures. this article is an extension of arXiv:1803.08326

  24. arXiv:1811.10863  [pdf, other

    cs.CV

    Object Tracking by Reconstruction with View-Specific Discriminative Correlation Filters

    Authors: Ugur Kart, Alan Lukezic, Matej Kristan, Joni-Kristian Kamarainen, Jiri Matas

    Abstract: Standard RGB-D trackers treat the target as an inherently 2D structure, which makes modelling appearance changes related even to simple out-of-plane rotation highly challenging. We address this limitation by proposing a novel long-term RGB-D tracker - Object Tracking by Reconstruction (OTR). The tracker performs online 3D target reconstruction to facilitate robust learning of a set of view-specifi… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

  25. arXiv:1808.05848  [pdf, other

    cs.CV

    Performance Analysis and Robustification of Single-query 6-DoF Camera Pose Estimation

    Authors: Junsheng Fu, Said Pertuz, Jiri Matas, Joni-Kristian Kämäräinen

    Abstract: We consider a single-query 6-DoF camera pose estimation with reference images and a point cloud, i.e. the problem of estimating the position and orientation of a camera by using reference images and a point cloud. In this work, we perform a systematic comparison of three state-of-the-art strategies for 6-DoF camera pose estimation, i.e. feature-based, photometric-based and mutual-information-based… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

  26. arXiv:1805.08009  [pdf, other

    cs.CV

    Object Detection in Equirectangular Panorama

    Authors: Wenyan Yang, Yanlin Qian, Francesco Cricri, Lixin Fan, Joni-Kristian Kamarainen

    Abstract: We introduced a high-resolution equirectangular panorama (360-degree, virtual reality) dataset for object detection and propose a multi-projection variant of YOLO detector. The main challenge with equirectangular panorama image are i) the lack of annotated training data, ii) high-resolution imagery and iii) severe geometric distortions of objects near the panorama projection poles. In this work, w… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: 6 pages

  27. arXiv:1803.08326  [pdf, other

    cs.CV

    Revisiting Gray Pixel for Statistical Illumination Estimation

    Authors: Yanlin Qian, Said Pertuz, Jarno Nikkanen, Joni-Kristian Kämäräinen, Jiri Matas

    Abstract: We present a statistical color constancy method that relies on novel gray pixel detection and mean shift clustering. The method, called Mean Shifted Grey Pixel -- MSGP, is based on the observation: true-gray pixels are aligned towards one single direction. Our solution is compact, easy to compute and requires no training. Experiments on two real-world benchmarks show that the proposed approach out… ▽ More

    Submitted 9 January, 2019; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: updated and will appear in VISSAP 2019 (long paper)

  28. arXiv:1802.09227  [pdf, other

    cs.CV

    Depth Masked Discriminative Correlation Filter

    Authors: Uğur Kart, Joni-Kristian Kämäräinen, Jiří Matas, Lixin Fan, Francesco Cricri

    Abstract: Depth information provides a strong cue for occlusion detection and handling, but has been largely omitted in generic object tracking until recently due to lack of suitable benchmark datasets and applications. In this work, we propose a Depth Masked Discriminative Correlation Filter (DM-DCF) which adopts novel depth segmentation based occlusion detection that stops correlation filter updating and… ▽ More

    Submitted 10 October, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: 6 pages, accepted to ICPR 2018. ©2018 IEEE

  29. Pose Estimation using Local Structure-Specific Shape and Appearance Context

    Authors: Anders Glent Buch, Dirk Kraft, Joni-Kristian Kamarainen, Henrik Gordon Petersen, Norbert Krüger

    Abstract: We address the problem of estimating the alignment pose between two models using structure-specific local descriptors. Our descriptors are generated using a combination of 2D image data and 3D contextual shape data, resulting in a set of semi-local descriptors containing rich appearance and shape information for both edge and texture structures. This is achieved by defining feature space relations… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Journal ref: 2013 IEEE International Conference on Robotics and Automation (ICRA)

  30. arXiv:1703.05393  [pdf, other

    cs.CV

    Convolutional Low-Resolution Fine-Grained Classification

    Authors: Dingding Cai, Ke Chen, Yanlin Qian, Joni-Kristian Kämäräinen

    Abstract: Successful fine-grained image classification methods learn subtle details between visually similar (sub-)classes, but the problem becomes significantly more challenging if the details are missing due to low resolution. Encouraged by the recent success of Convolutional Neural Network (CNN) architectures in image classification, we propose a novel resolution-aware deep model which combines convoluti… ▽ More

    Submitted 16 October, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

  31. arXiv:1607.03856  [pdf, other

    cs.CV

    Deep Structured-Output Regression Learning for Computational Color Constancy

    Authors: Yanlin Qian, Ke Chen, Joni-Kristian Kamarainen, Jarno Nikkanen, Jiri Matas

    Abstract: Computational color constancy that requires esti- mation of illuminant colors of images is a fundamental yet active problem in computer vision, which can be formulated into a regression problem. To learn a robust regressor for color constancy, obtaining meaningful imagery features and capturing latent correlations across output variables play a vital role. In this work, we introduce a novel deep s… ▽ More

    Submitted 11 August, 2016; v1 submitted 13 July, 2016; originally announced July 2016.