-
CLIP-Loc: Multi-modal Landmark Association for Global Localization in Object-based Maps
Authors:
Shigemichi Matsuzaki,
Takuma Sugino,
Kazuhito Tanaka,
Zijun Sha,
Shintaro Nakaoka,
Shintaro Yoshizawa,
Kazuhiro Shintani
Abstract:
This paper describes a multi-modal data association method for global localization using object-based maps and camera images. In global localization, or relocalization, using object-based maps, existing methods typically resort to matching all possible combinations of detected objects and landmarks with the same object category, followed by inlier extraction using RANSAC or brute-force search. Thi…
▽ More
This paper describes a multi-modal data association method for global localization using object-based maps and camera images. In global localization, or relocalization, using object-based maps, existing methods typically resort to matching all possible combinations of detected objects and landmarks with the same object category, followed by inlier extraction using RANSAC or brute-force search. This approach becomes infeasible as the number of landmarks increases due to the exponential growth of correspondence candidates. In this paper, we propose labeling landmarks with natural language descriptions and extracting correspondences based on conceptual similarity with image observations using a Vision Language Model (VLM). By leveraging detailed text information, our approach efficiently extracts correspondences compared to methods using only object categories. Through experiments, we demonstrate that the proposed method enables more accurate global localization with fewer iterations compared to baseline methods, exhibiting its efficiency.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
The Pieri formulas and the Littlewood-Richardson rule for Schur multiple zeta functions
Authors:
Shutaro Nakaoka
Abstract:
We prove the Pieri formulas for Schur multiple zeta functions, which are generalizations of the Pieri formulas proved by Nakasuji and Takeda for hook type Schur multiple zeta functions. Moreover, we also prove the Littlewood-Richardson rule for Schur multiple zeta functions. In the course of their proofs, we regard the `truncated' version of Schur multiple zeta functions as series over…
▽ More
We prove the Pieri formulas for Schur multiple zeta functions, which are generalizations of the Pieri formulas proved by Nakasuji and Takeda for hook type Schur multiple zeta functions. Moreover, we also prove the Littlewood-Richardson rule for Schur multiple zeta functions. In the course of their proofs, we regard the `truncated' version of Schur multiple zeta functions as series over $\mathrm{GL}(N)$ crystals to arrive at the Littlewood-Richardson rule for the Schur multiple zeta functions.
△ Less
Submitted 1 June, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Computation harvesting in road traffic dynamics
Authors:
Hiroyasu Ando,
T. Okamoto,
H. Chang,
T. Noguchi,
Shinji Nakaoka
Abstract:
Owing to recent advances in artificial intelligence and internet of things (IoT) technologies, collected big data facilitates high computational performance, while its computational resources and energy cost are large. Moreover, data are often collected but not used. To solve these problems, we propose a framework for a computational model that follows a natural computational system, such as the h…
▽ More
Owing to recent advances in artificial intelligence and internet of things (IoT) technologies, collected big data facilitates high computational performance, while its computational resources and energy cost are large. Moreover, data are often collected but not used. To solve these problems, we propose a framework for a computational model that follows a natural computational system, such as the human brain, and does not rely heavily on electronic computers. In particular, we propose a methodology based on the concept of `computation harvesting', which uses IoT data collected from rich sensors and leaves most of the computational processes to real-world phenomena as collected data. This aspect assumes that large-scale computations can be fast and resilient. Herein, we perform prediction tasks using real-world road traffic data to show the feasibility of computation harvesting. First, we show that the substantial computation in traffic flow is resilient against sensor failure and real-time traffic changes due to several combinations of harvesting from spatiotemporal dynamics to synthesize specific patterns. Next, we show the practicality of this method as a real-time prediction because of its low computational cost. Finally, we show that, compared to conventional methods, our method requires lower resources while providing a comparable performance.
△ Less
Submitted 21 November, 2020;
originally announced November 2020.
-
Effect of shapes of activation functions on predictability in the echo state network
Authors:
Hanten Chang,
Shinji Nakaoka,
Hiroyasu Ando
Abstract:
We investigate prediction accuracy for time series of Echo state networks with respect to several kinds of activation functions. As a result, we found that some kinds of activation functions with an appropriate nonlinearity show high performance compared to the conventional sigmoid function.
We investigate prediction accuracy for time series of Echo state networks with respect to several kinds of activation functions. As a result, we found that some kinds of activation functions with an appropriate nonlinearity show high performance compared to the conventional sigmoid function.
△ Less
Submitted 22 May, 2019;
originally announced May 2019.