Skip to main content

Showing 1–8 of 8 results for author: Kujanpää, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02696  [pdf, other

    cs.LG

    iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

    Authors: Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

    Abstract: Learning representations for reinforcement learning (RL) has shown much promise for continuous control. We propose an efficient representation learning method using only a self-supervised latent-state consistency loss. Our approach employs an encoder and a dynamics model to map observations to latent states and predict future latent states, respectively. We achieve high performance and prevent rep… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 9 pages, 11 figures

  2. arXiv:2401.03236  [pdf, other

    cs.RO

    Challenges of Data-Driven Simulation of Diverse and Consistent Human Driving Behaviors

    Authors: Kalle Kujanpää, Daulet Baimukashev, Shibei Zhu, Shoaib Azam, Farzeen Munir, Gokhan Alcan, Ville Kyrki

    Abstract: Building simulation environments for develo** and testing autonomous vehicles necessitates that the simulators accurately model the statistical realism of the real-world environment, including the interaction with other vehicles driven by human drivers. To address this requirement, an accurate human behavior model is essential to incorporate the diversity and consistency of human driving behavio… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  3. arXiv:2310.12819  [pdf, other

    cs.AI cs.LG

    Hybrid Search for Efficient Planning with Completeness Guarantees

    Authors: Kalle Kujanpää, Joni Pajarinen, Alexander Ilin

    Abstract: Solving complex planning problems has been a long-standing challenge in computer science. Learning-based subgoal search methods have shown promise in tackling these problems, but they often suffer from a lack of completeness guarantees, meaning that they may fail to find a solution even if one exists. In this paper, we propose an efficient approach to augment a subgoal search method to achieve com… ▽ More

    Submitted 28 November, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Poster

  4. arXiv:2309.00249  [pdf, other

    cs.RO

    Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles

    Authors: Yuhang Yang, Kalle Kujanpaa, Amin Babadi, Joni Pajarinen, Alexander Ilin

    Abstract: Develo** reliable autonomous driving algorithms poses challenges in testing, particularly when it comes to safety-critical traffic scenarios involving pedestrians. An open question is how to simulate rare events, not necessarily found in autonomous driving datasets or scripted simulations, but which can occur in testing, and, in the end may lead to severe pedestrian related accidents. This paper… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 6 pages; 5 figures; 2 tables

  5. arXiv:2301.12962  [pdf, other

    cs.AI cs.LG

    Hierarchical Imitation Learning with Vector Quantized Models

    Authors: Kalle Kujanpää, Joni Pajarinen, Alexander Ilin

    Abstract: The ability to plan actions on multiple levels of abstraction enables intelligent agents to solve complex tasks effectively. However, learning the models for both low and high-level planning from demonstrations has proven challenging, especially with higher-dimensional inputs. To address this issue, we propose to use reinforcement learning to identify subgoals in expert trajectories by associating… ▽ More

    Submitted 29 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To appear at ICML 2023

  6. arXiv:2210.01426  [pdf, other

    cs.AI cs.LG cs.RO

    Continuous Monte Carlo Graph Search

    Authors: Kalle Kujanpää, Amin Babadi, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen

    Abstract: Online planning is crucial for high performance in many complex sequential decision-making tasks. Monte Carlo Tree Search (MCTS) employs a principled mechanism for trading off exploration for exploitation for efficient online planning, and it outperforms comparison methods in many discrete decision-making domains such as Go, Chess, and Shogi. Subsequently, extensions of MCTS to continuous domains… ▽ More

    Submitted 7 February, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted at AAMAS 2024 (full paper & oral)

  7. Automating Privilege Escalation with Deep Reinforcement Learning

    Authors: Kalle Kujanpää, Willie Victor, Alexander Ilin

    Abstract: AI-based defensive solutions are necessary to defend networks and information assets against intelligent automated attacks. Gathering enough realistic data for training machine learning-based defenses is a significant practical challenge. An intelligent red teaming agent capable of performing realistic attacks can alleviate this problem. However, there is little scientific evidence demonstrating t… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: To appear at AISec'21 (aisec.cc)

  8. arXiv:2006.09763  [pdf, other

    stat.ML cs.LG

    Longitudinal Variational Autoencoder

    Authors: Siddharth Ramchandran, Gleb Tikhonov, Kalle Kujanpää, Miika Koskinen, Harri Lähdesmäki

    Abstract: Longitudinal datasets measured repeatedly over time from individual subjects, arise in many biomedical, psychological, social, and other studies. A common approach to analyse high-dimensional data that contains missing values is to learn a low-dimensional representation using variational autoencoders (VAEs). However, standard VAEs assume that the learnt representations are i.i.d., and fail to capt… ▽ More

    Submitted 20 April, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS-2021), pp. 3898-3906. PMLR, 2021