
Showing 1–7 of 7 results for author: Sejnova, G

  1. arXiv:2404.01932

    cs.RO cs.LG

    Bridging Language, Vision and Action: Multimodal VAEs in Robotic Manipulation Tasks

    Authors: Gabriela Sejnova, Michal Vavrecka, Karla Stepanova

    Abstract: In this work, we focus on unsupervised vision-language-action mapping in the area of robotic manipulation. Recently, multiple approaches employing pre-trained large language and vision models have been proposed for this task. However, they are computationally demanding and require careful fine-tuning of the produced outputs. A more lightweight alternative would be the implementation of multimodal…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 5 figures, 2 tables, conference

  2. arXiv:2312.06280

    cs.LG cs.AI

    Adaptive Compression of the Latent Space in Variational Autoencoders

    Authors: Gabriela Sejnova, Michal Vavrecka, Karla Stepanova

    Abstract: Variational Autoencoders (VAEs) are powerful generative models that have been widely used in various fields, including image and text generation. However, one of the known challenges in using VAEs is the model's sensitivity to its hyperparameters, such as the latent space size. This paper presents a simple extension of VAEs for automatically determining the optimal latent space size during the tra…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures

  3. arXiv:2310.15321

    cs.RO

    How language of interaction affects the user perception of a robot

    Authors: Barbara Sienkiewicz, Gabriela Sejnova, Paul Gajewski, Michal Vavrecka, Bipin Indurkhya

    Abstract: Spoken language is the most natural way for a human to communicate with a robot. It may seem intuitive that a robot should communicate with users in their native language. However, it is not clear if a user's perception of a robot is affected by the language of interaction. We investigated this question by conducting a study with twenty-three native Czech participants who were also fluent in Eng…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: ICSR 2023

  4. Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators

    Authors: Jiri Sedlar, Karla Stepanova, Radoslav Skoviera, Jan K. Behrens, Matus Tuna, Gabriela Sejnova, Josef Sivic, Robert Babuska

    Abstract: This paper introduces a dataset for training and evaluating methods for 6D pose estimation of hand-held tools in task demonstrations captured by a standard RGB camera. Despite the significant progress of 6D pose estimation methods, their performance is usually limited for heavily occluded objects, which is a common case in imitation learning, where the object is typically partially occluded by the…

    Submitted 5 April, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: The dataset and code are publicly available at http://imitrob.ciirc.cvut.cz/imitrobdataset.php

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 5, pp. 2788-2795, 2023

  5. arXiv:2209.03048

    cs.LG

    Benchmarking Multimodal Variational Autoencoders: CdSprites+ Dataset and Toolkit

    Authors: Gabriela Sejnova, Michal Vavrecka, Karla Stepanova

    Abstract: Multimodal Variational Autoencoders (VAEs) have been the subject of intense research in the past years as they can integrate multiple modalities into a joint representation and can thus serve as a promising tool for both data classification and generation. Several approaches toward multimodal VAE learning have been proposed so far, their comparison and evaluation have however been rather inconsist…

    Submitted 24 November, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

  6. arXiv:2012.11643

    cs.RO cs.AI

    myGym: Modular Toolkit for Visuomotor Robotic Tasks

    Authors: Michal Vavrecka, Nikita Sokovnin, Megi Mejdrechova, Gabriela Sejnova, Marek Otahal

    Abstract: We introduce a novel virtual robotic toolkit myGym, developed for reinforcement learning (RL), intrinsic motivation and imitation learning tasks trained in a 3D simulator. The trained tasks can then be easily transferred to real-world robotic scenarios. The modular structure of the simulator enables users to train and validate their algorithms on a large number of scenarios with various robots, en…

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: 6 pages, 5 figures

  7. arXiv:1901.08335

    cs.HC cs.RO

    Teaching robots to imitate a human with no on-teacher sensors. What are the key challenges?

    Authors: Radoslav Skoviera, Karla Stepanova, Michael Tesar, Gabriela Sejnova, Jiri Sedlar, Michal Vavrecka, Robert Babuska, Josef Sivic

    Abstract: In this paper, we consider the problem of learning object manipulation tasks from human demonstration using RGB or RGB-D cameras. We highlight the key challenges in capturing sufficiently good data with no tracking devices, starting from sensor selection and accurate 6DoF pose estimation to natural language processing. In particular, we focus on two showcases: gluing task with a glue gun and simp…

    Submitted 24 January, 2019; originally announced January 2019.

    Journal ref: The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018, Workshop on: Towards Intelligent Social Robots: From Naive Robots to Robot Sapiens http://intelligent-social-robots-ws.com/materials/