Skip to main content

Showing 1–27 of 27 results for author: Ilin, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07097  [pdf, other

    cs.LG cs.AI

    Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems

    Authors: Katsiaryna Haitsiukevich, Onur Poyraz, Pekka Marttinen, Alexander Ilin

    Abstract: This paper explores the efficacy of diffusion-based generative models as neural operators for partial differential equations (PDEs). Neural operators are neural networks that learn a map** from the parameter space to the solution space of PDEs from data, and they can also solve the inverse problem of estimating the parameter from the solution. Diffusion models excel in many domains, but their po… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Preprint submitted to IEEE MLSP 2024

  2. arXiv:2402.02906  [pdf, other

    cs.CV cs.LG

    ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis

    Authors: Bernard Spiegl, Andrea Perin, Stéphane Deny, Alexander Ilin

    Abstract: Deep learning is providing a wealth of new approaches to the old problem of novel view synthesis, from Neural Radiance Field (NeRF) based approaches to end-to-end style architectures. Each approach offers specific strengths but also comes with specific limitations in their applicability. This work introduces ViewFusion, a state-of-the-art end-to-end generative approach to novel view synthesis with… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2310.12819  [pdf, other

    cs.AI cs.LG

    Hybrid Search for Efficient Planning with Completeness Guarantees

    Authors: Kalle Kujanpää, Joni Pajarinen, Alexander Ilin

    Abstract: Solving complex planning problems has been a long-standing challenge in computer science. Learning-based subgoal search methods have shown promise in tackling these problems, but they often suffer from a lack of completeness guarantees, meaning that they may fail to find a solution even if one exists. In this paper, we propose an efficient approach to augment a subgoal search method to achieve com… ▽ More

    Submitted 28 November, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Poster

  4. arXiv:2309.00249  [pdf, other

    cs.RO

    Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles

    Authors: Yuhang Yang, Kalle Kujanpaa, Amin Babadi, Joni Pajarinen, Alexander Ilin

    Abstract: Develo** reliable autonomous driving algorithms poses challenges in testing, particularly when it comes to safety-critical traffic scenarios involving pedestrians. An open question is how to simulate rare events, not necessarily found in autonomous driving datasets or scripted simulations, but which can occur in testing, and, in the end may lead to severe pedestrian related accidents. This paper… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 6 pages; 5 figures; 2 tables

  5. arXiv:2305.13092  [pdf, other

    cs.CL

    Improved Compositional Generalization by Generating Demonstrations for Meta-Learning

    Authors: Sam Spilsbury, Alexander Ilin

    Abstract: Meta-learning and few-shot prompting are viable methods to induce certain types of compositional behaviour. However, these methods can be very sensitive to the choice of support examples used. Choosing good supports from the training data for a given test query is already a difficult problem, but in some cases solving this may not even be enough. We consider a grounded language learning problem (g… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  6. arXiv:2301.12962  [pdf, other

    cs.AI cs.LG

    Hierarchical Imitation Learning with Vector Quantized Models

    Authors: Kalle Kujanpää, Joni Pajarinen, Alexander Ilin

    Abstract: The ability to plan actions on multiple levels of abstraction enables intelligent agents to solve complex tasks effectively. However, learning the models for both low and high-level planning from demonstrations has proven challenging, especially with higher-dimensional inputs. To address this issue, we propose to use reinforcement learning to identify subgoals in expert trajectories by associating… ▽ More

    Submitted 29 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To appear at ICML 2023

  7. arXiv:2210.14139  [pdf, other

    cs.CV cs.LG

    Learning Explicit Object-Centric Representations with Vision Transformers

    Authors: Oscar Vikström, Alexander Ilin

    Abstract: With the recent successful adaptation of transformers to the vision domain, particularly when trained in a self-supervised fashion, it has been shown that vision transformers can learn impressive object-reasoning-like behaviour and features expressive for the task of object segmentation in images. In this paper, we build on the self-supervision task of masked autoencoding and explore its effective… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  8. arXiv:2210.13846  [pdf, other

    cs.LG cs.RO

    Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

    Authors: Yi Zhao, Rinu Boney, Alexander Ilin, Juho Kannala, Joni Pajarinen

    Abstract: Offline reinforcement learning, by learning from a fixed dataset, makes it possible to learn agent behaviors without interacting with the environment. However, depending on the quality of the offline dataset, such pre-trained agents may have limited performance and would further need to be fine-tuned online by interacting with the environment. During online fine-tuning, the performance of the pre-… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  9. arXiv:2210.01426  [pdf, other

    cs.AI cs.LG cs.RO

    Continuous Monte Carlo Graph Search

    Authors: Kalle Kujanpää, Amin Babadi, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen

    Abstract: Online planning is crucial for high performance in many complex sequential decision-making tasks. Monte Carlo Tree Search (MCTS) employs a principled mechanism for trading off exploration for exploitation for efficient online planning, and it outperforms comparison methods in many discrete decision-making domains such as Go, Chess, and Shogi. Subsequently, extensions of MCTS to continuous domains… ▽ More

    Submitted 7 February, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted at AAMAS 2024 (full paper & oral)

  10. arXiv:2207.02518  [pdf, other

    cs.CL cs.LG

    Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

    Authors: Sam Spilsbury, Alexander Ilin

    Abstract: We provide a study of how induced model sparsity can help achieve compositional generalization and better sample efficiency in grounded language learning problems. We consider simple language-conditioned navigation problems in a grid world environment with disentangled observations. We show that standard neural architectures do not always yield compositional generalization. To address this, we des… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 6 pages, 7 figures. Appears in NAACL-2022 SRW. Acknowledgements: Yonatan Bisk. Code: github.com/aalto-ai/sparse-compgen

  11. arXiv:2204.05108  [pdf, other

    cs.LG cs.AI stat.ML

    Improved Training of Physics-Informed Neural Networks with Model Ensembles

    Authors: Katsiaryna Haitsiukevich, Alexander Ilin

    Abstract: Learning the solution of partial differential equations (PDEs) with a neural network is an attractive alternative to traditional solvers due to its elegance, greater flexibility and the ease of incorporating observed data. However, training such physics-informed neural networks (PINNs) is notoriously difficult in practice since PINNs often converge to wrong solutions. In this paper, we address thi… ▽ More

    Submitted 11 January, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

  12. Learning Trajectories of Hamiltonian Systems with Neural Networks

    Authors: Katsiaryna Haitsiukevich, Alexander Ilin

    Abstract: Modeling of conservative systems with neural networks is an area of active research. A popular approach is to use Hamiltonian neural networks (HNNs) which rely on the assumptions that a conservative system is described with Hamilton's equations of motion. Many recent works focus on improving the integration schemes used when training HNNs. In this work, we propose to enhance HNNs with an estimatio… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Journal ref: ICANN 2022

  13. A Grid-Structured Model of Tubular Reactors

    Authors: Katsiaryna Haitsiukevich, Samuli Bergman, Cesar de Araujo Filho, Francesco Corona, Alexander Ilin

    Abstract: We propose a grid-like computational model of tubular reactors. The architecture is inspired by the computations performed by solvers of partial differential equations which describe the dynamics of the chemical process inside a tubular reactor. The proposed model may be entirely based on the known form of the partial differential equations or it may contain generic machine learning components suc… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 2021 IEEE 19th International Conference on Industrial Informatics (INDIN)

  14. A Relational Model for One-Shot Classification

    Authors: Arturs Polis, Alexander Ilin

    Abstract: We show that a deep learning model with built-in relational inductive bias can bring benefits to sample-efficient learning, without relying on extensive data augmentation. The proposed one-shot classification model performs relational matching of a pair of inputs in the form of local and pairwise attention. Our approach solves perfectly the one-shot image classification Omniglot challenge. Our mod… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: Published at ESANN 2021

  15. Automating Privilege Escalation with Deep Reinforcement Learning

    Authors: Kalle Kujanpää, Willie Victor, Alexander Ilin

    Abstract: AI-based defensive solutions are necessary to defend networks and information assets against intelligent automated attacks. Gathering enough realistic data for training machine learning-based defenses is a significant practical challenge. An intelligent red teaming agent capable of performing realistic attacks can alleviate this problem. However, there is little scientific evidence demonstrating t… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: To appear at AISec'21 (aisec.cc)

  16. arXiv:2110.01311  [pdf, ps, other

    cs.AI cs.LG

    Learning to Assist Agents by Observing Them

    Authors: Antti Keurulainen, Isak Westerlund, Samuel Kaski, Alexander Ilin

    Abstract: The ability of an AI agent to assist other agents, such as humans, is an important and challenging goal, which requires the assisting agent to reason about the behavior and infer the goals of the assisted agent. Training such an ability by using reinforcement learning usually requires large amounts of online training, which is difficult and costly. On the other hand, offline data about the behavio… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  17. arXiv:2110.01266  [pdf, ps, other

    cs.LG cs.MA

    Behaviour-conditioned policies for cooperative reinforcement learning tasks

    Authors: Antti Keurulainen, Isak Westerlund, Ariel Kwiatkowski, Samuel Kaski, Alexander Ilin

    Abstract: The cooperation among AI systems, and between AI systems and humans is becoming increasingly important. In various real-world tasks, an agent needs to cooperate with unknown partner agent types. This requires the agent to assess the behaviour of the partner agent during a cooperative task and to adjust its own policy to support the cooperation. Deep reinforcement learning models can be trained to… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  18. arXiv:2108.13672  [pdf, other

    cs.LG

    SANSformers: Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models

    Authors: Yogesh Kumar, Alexander Ilin, Henri Salo, Sangita Kulathinal, Maarit K. Leinonen, Pekka Marttinen

    Abstract: Despite the proven effectiveness of Transformer neural networks across multiple domains, their performance with Electronic Health Records (EHR) can be nuanced. The unique, multidimensional sequential nature of EHR data can sometimes make even simple linear models with carefully engineered features more competitive. Thus, the advantages of Transformers, such as efficient transfer learning and impro… ▽ More

    Submitted 10 November, 2023; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: 17 pages, 11 figures, 11 tables, Submitted to an IEEE journal

  19. arXiv:2106.07995  [pdf, other

    cs.LG cs.CV cs.RO

    Learning of feature points without additional supervision improves reinforcement learning from images

    Authors: Rinu Boney, Alexander Ilin, Juho Kannala

    Abstract: In many control problems that include vision, optimal controls can be inferred from the location of the objects in the scene. This information can be represented using feature points, which is a list of spatial locations in learned feature maps of an input image. Previous works show that feature points learned using unsupervised pre-training or human supervision can provide good features for contr… ▽ More

    Submitted 4 June, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

  20. arXiv:2104.02526  [pdf, ps, other

    eess.AS cs.CL cs.LG

    LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring

    Authors: Anton Mitrofanov, Mariya Korenevskaya, Ivan Podluzhny, Yuri Khokhlov, Aleksandr Laptev, Andrei Andrusenko, Aleksei Ilin, Maxim Korenevsky, Ivan Medennikov, Aleksei Romanenko

    Abstract: Neural network-based language models are commonly used in rescoring approaches to improve the quality of modern automatic speech recognition (ASR) systems. Most of the existing methods are computationally expensive since they use autoregressive language models. We propose a novel rescoring approach, which processes the entire lattice in a single call to the model. The key feature of our rescoring… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: Submitted to InterSpeech 2021

  21. arXiv:2012.12186  [pdf, other

    cs.AI

    Learning to Play Imperfect-Information Games by Imitating an Oracle Planner

    Authors: Rinu Boney, Alexander Ilin, Juho Kannala, Jarno Seppänen

    Abstract: We consider learning to play multiplayer imperfect-information games with simultaneous moves and large state-action spaces. Previous attempts to tackle such challenging games have largely focused on model-free learning methods, often requiring hundreds of years of experience to produce competitive agents. Our approach is based on model-based planning. We tackle the problem of partial observability… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  22. arXiv:2008.00715  [pdf, other

    cs.RO

    Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning

    Authors: Ari Viitala, Rinu Boney, Yi Zhao, Alexander Ilin, Juho Kannala

    Abstract: We present Learning to Drive (L2D), a low-cost benchmark for real-world reinforcement learning (RL). L2D involves a simple and reproducible experimental setup where an RL agent has to learn to drive a Donkey car around three miniature tracks, given only monocular image observations and speed of the car. The agent has to learn to drive from disengagements, which occurs when it drives off the track.… ▽ More

    Submitted 6 November, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  23. arXiv:2004.13764  [pdf, other

    eess.AS cs.SD

    Conditional Spoken Digit Generation with StyleGAN

    Authors: Kasperi Palkama, Lauri Juvela, Alexander Ilin

    Abstract: This paper adapts a StyleGAN model for speech generation with minimal or no conditioning on text. StyleGAN is a multi-scale convolutional GAN capable of hierarchically capturing data structure and latent variation on multiple spatial (or temporal) levels. The model has previously achieved impressive results on facial image generation, and it is appealing to audio applications due to similar multi-… ▽ More

    Submitted 15 September, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Interspeech2020 accepted version

  24. arXiv:1910.05527  [pdf, other

    cs.LG cs.RO stat.ML

    Regularizing Model-Based Planning with Energy-Based Models

    Authors: Rinu Boney, Juho Kannala, Alexander Ilin

    Abstract: Model-based reinforcement learning could enable sample-efficient learning by quickly acquiring rich knowledge about the world and using it to improve behaviour without additional data. Learned dynamics models can be directly used for planning actions but this has been challenging because of inaccuracies in the learned models. In this paper, we focus on planning with learned dynamics models and pro… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning 2019

  25. arXiv:1903.11981  [pdf, other

    cs.LG cs.RO stat.ML

    Regularizing Trajectory Optimization with Denoising Autoencoders

    Authors: Rinu Boney, Norman Di Palo, Mathias Berglund, Alexander Ilin, Juho Kannala, Antti Rasmus, Harri Valpola

    Abstract: Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize trajectory optimization by means of a denoising autoencoder that is trained on the same trajectories as the model of the environment. We show that the proposed reg… ▽ More

    Submitted 25 December, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: NeurIPS 2019

  26. arXiv:1711.10856  [pdf, other

    cs.LG stat.ML

    Semi-Supervised and Active Few-Shot Learning with Prototypical Networks

    Authors: Rinu Boney, Alexander Ilin

    Abstract: We consider the problem of semi-supervised few-shot classification where a classifier needs to adapt to new tasks using a few labeled examples and (potentially many) unlabeled examples. We propose a clustering approach to the problem. The features extracted with Prototypical Networks are clustered using $K$-means with the few labeled examples guiding the clustering process. We note that in many re… ▽ More

    Submitted 25 April, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

  27. arXiv:1707.09219  [pdf, other

    cs.NE cs.AI cs.LG stat.ML

    Recurrent Ladder Networks

    Authors: Isabeau Prémont-Schwarz, Alexander Ilin, Tele Hotloo Hao, Antti Rasmus, Rinu Boney, Harri Valpola

    Abstract: We propose a recurrent extension of the Ladder networks whose structure is motivated by the inference required in hierarchical latent variable models. We demonstrate that the recurrent Ladder is able to handle a wide variety of complex learning tasks that benefit from iterative inference and temporal modeling. The architecture shows close-to-optimal results on temporal modeling of video data, comp… ▽ More

    Submitted 18 December, 2017; v1 submitted 28 July, 2017; originally announced July 2017.

    Comments: 9 pages, 9 figures, 7-page appendix, fixed fig 9 (c)