-
Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications
Authors:
Yao Du,
Carlos M. Mateo,
Mirjana Maras,
Tsun-Hsuan Wang,
Marc Blanchon,
Alexander Amini,
Daniela Rus,
Omar Tahri
Abstract:
Unlike a traditional gyroscope, a visual gyroscope estimates camera rotation through images. The integration of omnidirectional cameras, offering a larger field of view compared to traditional RGB cameras, has proven to yield more accurate and robust results. However, challenges arise in situations that lack features, have substantial noise causing significant errors, and where certain features in…
▽ More
Unlike a traditional gyroscope, a visual gyroscope estimates camera rotation through images. The integration of omnidirectional cameras, offering a larger field of view compared to traditional RGB cameras, has proven to yield more accurate and robust results. However, challenges arise in situations that lack features, have substantial noise causing significant errors, and where certain features in the images lack sufficient strength, leading to less precise prediction results.
Here, we address these challenges by introducing a novel visual gyroscope, which combines an analytical method with a neural network approach to provide a more efficient and accurate rotation estimation from spherical images. The presented method relies on three key contributions: an adapted analytical approach to compute the spherical moments coefficients, introduction of masks for better global feature representation, and the use of a multilayer perceptron to adaptively choose the best combination of masks and filters. Experimental results demonstrate superior performance of the proposed approach in terms of accuracy. The paper emphasizes the advantages of integrating machine learning to optimize analytical solutions, discusses limitations, and suggests directions for future research.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Exploring Latent Pathways: Enhancing the Interpretability of Autonomous Driving with a Variational Autoencoder
Authors:
Anass Bairouk,
Mirjana Maras,
Simon Herlin,
Alexander Amini,
Marc Blanchon,
Ramin Hasani,
Patrick Chareyre,
Daniela Rus
Abstract:
Autonomous driving presents a complex challenge, which is usually addressed with artificial intelligence models that are end-to-end or modular in nature. Within the landscape of modular approaches, a bio-inspired neural circuit policy model has emerged as an innovative control module, offering a compact and inherently interpretable system to infer a steering wheel command from abstract visual feat…
▽ More
Autonomous driving presents a complex challenge, which is usually addressed with artificial intelligence models that are end-to-end or modular in nature. Within the landscape of modular approaches, a bio-inspired neural circuit policy model has emerged as an innovative control module, offering a compact and inherently interpretable system to infer a steering wheel command from abstract visual features. Here, we take a leap forward by integrating a variational autoencoder with the neural circuit policy controller, forming a solution that directly generates steering commands from input camera images. By substituting the traditional convolutional neural network approach to feature extraction with a variational autoencoder, we enhance the system's interpretability, enabling a more transparent and understandable decision-making process.
In addition to the architectural shift toward a variational autoencoder, this study introduces the automatic latent perturbation tool, a novel contribution designed to probe and elucidate the latent features within the variational autoencoder. The automatic latent perturbation tool automates the interpretability process, offering granular insights into how specific latent variables influence the overall model's behavior. Through a series of numerical experiments, we demonstrate the interpretative power of the variational autoencoder-neural circuit policy model and the utility of the automatic latent perturbation tool in making the inner workings of autonomous driving systems more transparent.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Fast and Knowledge-Free Deep Learning for General Game Playing (Student Abstract)
Authors:
Michał Maras,
Michał Kępa,
Jakub Kowalski,
Marek Szykuła
Abstract:
We develop a method of adapting the AlphaZero model to General Game Playing (GGP) that focuses on faster model generation and requires less knowledge to be extracted from the game rules. The dataset generation uses MCTS playing instead of self-play; only the value network is used, and attention layers replace the convolutional ones. This allows us to abandon any assumptions about the action space…
▽ More
We develop a method of adapting the AlphaZero model to General Game Playing (GGP) that focuses on faster model generation and requires less knowledge to be extracted from the game rules. The dataset generation uses MCTS playing instead of self-play; only the value network is used, and attention layers replace the convolutional ones. This allows us to abandon any assumptions about the action space and board topology. We implement the method within the Regular Boardgames GGP system and show that we can build models outperforming the UCT baseline for most games efficiently.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Develo** a Successful Bomberman Agent
Authors:
Dominik Kowalczyk,
Jakub Kowalski,
Hubert Obrzut,
Michał Maras,
Szymon Kosakowski,
Radosław Miernik
Abstract:
In this paper, we study AI approaches to successfully play a 2-4 players, full information, Bomberman variant published on the CodinGame platform. We compare the behavior of three search algorithms: Monte Carlo Tree Search, Rolling Horizon Evolution, and Beam Search. We present various enhancements leading to improve the agents' strength that concern search, opponent prediction, game state evaluat…
▽ More
In this paper, we study AI approaches to successfully play a 2-4 players, full information, Bomberman variant published on the CodinGame platform. We compare the behavior of three search algorithms: Monte Carlo Tree Search, Rolling Horizon Evolution, and Beam Search. We present various enhancements leading to improve the agents' strength that concern search, opponent prediction, game state evaluation, and game engine encoding. Our top agent variant is based on a Beam Search with low-level bit-based state representation and evaluation function heavy relying on pruning unpromising states based on simulation-based estimation of survival. It reached the top one position among the 2,300 AI agents submitted on the CodinGame arena.
△ Less
Submitted 17 March, 2022;
originally announced March 2022.
-
2D Phase Diagram for Minimizers of a Cahn-Hilliard Functional with Long-Range Interactions
Authors:
Rustum Choksi,
Mirjana Maras,
J. F. Williams
Abstract:
This paper presents a detailed asymptotic and numerical investigation of the phase diagram for global minimizers to a Cahn-Hilliard functional with long-range interactions in two space dimensions. We introduce a small parameter measuring perturbation from the minimal orderdisorder transition, and derive asymptotic estimates for stability regions as the parameter tends to zero. Based upon the H^-1…
▽ More
This paper presents a detailed asymptotic and numerical investigation of the phase diagram for global minimizers to a Cahn-Hilliard functional with long-range interactions in two space dimensions. We introduce a small parameter measuring perturbation from the minimal orderdisorder transition, and derive asymptotic estimates for stability regions as the parameter tends to zero. Based upon the H^-1 gradient ow, we introduce a hybrid numerical method to navigate through the complex energy landscape and access the ground state of the functional. We use this method to numerically compute the phase diagram. Our asymptotic predictions show surprisingly good agreement with our numerical results.
△ Less
Submitted 15 March, 2011;
originally announced March 2011.