Skip to main content

Showing 1–50 of 66 results for author: Taniguchi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09647  [pdf, other

    cs.RO

    Object Instance Retrieval in Assistive Robotics: Leveraging Fine-Tuned SimSiam with Multi-View Images Based on 3D Semantic Map

    Authors: Taichi Sakaguchi, Akira Taniguchi, Yoshinobu Hagiwara, Lotfi El Hafi, Shoichi Hasegawa, Tadahiro Taniguchi

    Abstract: Robots that assist in daily life are required to locate specific instances of objects that match the user's desired object in the environment. This task is known as Instance-Specific Image Goal Navigation (InstanceImageNav), which requires a model capable of distinguishing between different instances within the same class. One significant challenge in robotics is that when a robot observes the sam… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: See website at https://emergentsystemlabstudent.github.io/MultiViewRetrieve/. Submitted to IROS2024

  2. arXiv:2404.09645  [pdf, other

    cs.RO cs.CL cs.CV

    Real-world Instance-specific Image Goal Navigation for Service Robots: Bridging the Domain Gap with Contrastive Learning

    Authors: Taichi Sakaguchi, Akira Taniguchi, Yoshinobu Hagiwara, Lotfi El Hafi, Shoichi Hasegawa, Tadahiro Taniguchi

    Abstract: Improving instance-specific image goal navigation (InstanceImageNav), which locates the identical object in a real-world environment from a query image, is essential for robotic systems to assist users in finding desired objects. The challenge lies in the domain gap between low-quality images observed by the moving robot, characterized by motion blur and low-resolution, and high-quality query imag… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: See website at https://emergentsystemlabstudent.github.io/DomainBridgingNav/. Submitted to IROS2024

  3. arXiv:2404.07717  [pdf, other

    cs.RO

    Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics

    Authors: Masashi Osada, Gustavo A. Garcia Ricardez, Yosuke Suzuki, Tadahiro Taniguchi

    Abstract: Large language models (LLMs) and vision-language models (VLMs) have been increasingly used in robotics for high-level cognition, but their use for low-level cognition, such as interpreting sensor information, remains underexplored. In robotic gras**, estimating the reflectance of objects is crucial for successful gras**, as it significantly impacts the distance measured by proximity sensors. W… ▽ More

    Submitted 12 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 16 pages, 10 figures, submitted to Advanced Robotics Special Issue on Real-World Robot Applications of the Foundation Models

  4. arXiv:2403.19129  [pdf, other

    cs.RO

    Stable Object Placing using Curl and Diff Features of Vision-based Tactile Sensors

    Authors: Kuniyuki Takahashi, Shimpei Masuda, Tadahiro Taniguchi

    Abstract: Ensuring stable object placement is crucial to prevent objects from toppling over, breaking, or causing spills. When an object makes initial contact to a surface, and some force is exerted, the moment of rotation caused by the instability of the object's placing can cause the object to rotate in a certain direction (henceforth referred to as direction of corrective rotation). Existing methods ofte… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 9 pages, 7 figures

  5. arXiv:2403.13221  [pdf, other

    cs.RO

    A Contact Model based on Denoising Diffusion to Learn Variable Impedance Control for Contact-rich Manipulation

    Authors: Masashi Okada, Mayumi Komatsu, Tadahiro Taniguchi

    Abstract: In this paper, a novel approach is proposed for learning robot control in contact-rich tasks such as wi**, by develo** Diffusion Contact Model (DCM). Previous methods of learning such tasks relied on impedance control with time-varying stiffness tuning by performing Bayesian optimization by trial-and-error with robots. The proposed approach aims to reduce the cost of robot operation by predict… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  6. arXiv:2312.01121  [pdf, other

    cs.DC cs.LG

    Virtual reservoir acceleration for CPU and GPU: Case study for coupled spin-torque oscillator reservoir

    Authors: Thomas Geert de Jong, Nozomi Akashi, Tomohiro Taniguchi, Hirofumi Notsu, Kohei Nakajima

    Abstract: We provide high-speed implementations for simulating reservoirs described by $N$-coupled spin-torque oscillators. Here $N$ also corresponds to the number of reservoir nodes. We benchmark a variety of implementations based on CPU and GPU. Our new methods are at least 2.6 times quicker than the baseline for $N$ in range $1$ to $10^4$. More specifically, over all implementations the best factor is 78… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  7. arXiv:2311.04453  [pdf, other

    cs.CL

    Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments

    Authors: Ryo Ueda, Tadahiro Taniguchi

    Abstract: As a sub-discipline of evolutionary and computational linguistics, emergent communication (EC) studies communication protocols, called emergent languages, arising in simulations where agents communicate. A key goal of EC is to give rise to languages that share statistical properties with natural languages. In this paper, we reinterpret Lewis's signaling game, a frequently used setting in EC, as be… ▽ More

    Submitted 2 April, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: ICLR2024 camera-ready

  8. arXiv:2309.04148  [pdf, other

    cs.CV

    Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning

    Authors: Hiroki Nakamura, Masashi Okada, Tadahiro Taniguchi

    Abstract: In this paper, we propose a new self-supervised learning (SSL) method for representations that enable logic operations. Representation learning has been applied to various tasks, such as image generation and retrieval. The logical controllability of representations is important for these tasks. Although some methods have been shown to enable the intuitive control of representations using natural l… ▽ More

    Submitted 5 February, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2308.02136  [pdf, other

    cs.RO

    World-Model-Based Control for Industrial box-packing of Multiple Objects using NewtonianVAE

    Authors: Yusuke Kato, Ryo Okumura, Tadahiro Taniguchi

    Abstract: The process of industrial box-packing, which involves the accurate placement of multiple objects, requires high-accuracy positioning and sequential actions. When a robot is tasked with placing an object at a specific location with high accuracy, it is important not only to have information about the location of the object to be placed, but also the posture of the object grasped by the robotic hand… ▽ More

    Submitted 3 April, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 7 pages, 8 figures

  10. arXiv:2307.15345  [pdf, other

    cs.RO eess.SY

    Learning Compliant Stiffness by Impedance Control-Aware Task Segmentation and Multi-objective Bayesian Optimization with Priors

    Authors: Masashi Okada, Mayumi Komatsu, Ryo Okumura, Tadahiro Taniguchi

    Abstract: Rather than traditional position control, impedance control is preferred to ensure the safe operation of industrial robots programmed from demonstrations. However, variable stiffness learning studies have focused on task performance rather than safety (or compliance). Thus, this paper proposes a novel stiffness learning method to satisfy both task performance and compliance requirements. The propo… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to IROS2023

  11. arXiv:2307.05004  [pdf, other

    cs.AI cs.LG cs.MA

    Control as Probabilistic Inference as an Emergent Communication Mechanism in Multi-Agent Reinforcement Learning

    Authors: Tomoaki Nakamura, Akira Taniguchi, Tadahiro Taniguchi

    Abstract: This paper proposes a generative probabilistic model integrating emergent communication and multi-agent reinforcement learning. The agents plan their actions by probabilistic inference, called control as inference, and communicate using messages that are latent variables and estimated based on the planned actions. Through these messages, each agent can send information about its actions and know i… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  12. arXiv:2306.15837  [pdf, other

    cs.CL cs.AI

    Symbol emergence as interpersonal cross-situational learning: the emergence of lexical knowledge with combinatoriality

    Authors: Yoshinobu Hagiwara, Kazuma Furukawa, Takafumi Horie, Akira Taniguchi, Tadahiro Taniguchi

    Abstract: We present a computational model for a symbol emergence system that enables the emergence of lexical knowledge with combinatoriality among agents through a Metropolis-Hastings naming game and cross-situational learning. Many computational models have been proposed to investigate combinatoriality in emergent communication and symbol emergence in cognitive and developmental robotics. However, existi… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  13. arXiv:2305.19936  [pdf, ps, other

    cs.CL cs.HC

    Metropolis-Hastings algorithm in joint-attention naming game: Experimental semiotics study

    Authors: Ryota Okumura, Tadahiro Taniguchi, Yosinobu Hagiwara, Akira Taniguchi

    Abstract: In this study, we explore the emergence of symbols during interactions between individuals through an experimental semiotic study. Previous studies investigate how humans organize symbol systems through communication using artificially designed subjective experiments. In this study, we have focused on a joint attention-naming game (JA-NG) in which participants independently categorize objects and… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  14. arXiv:2305.19761  [pdf, other

    cs.CL cs.LG cs.MA

    Recursive Metropolis-Hastings Naming Game: Symbol Emergence in a Multi-agent System based on Probabilistic Generative Models

    Authors: Jun Inukai, Tadahiro Taniguchi, Akira Taniguchi, Yoshinobu Hagiwara

    Abstract: In the studies on symbol emergence and emergent communication in a population of agents, a computational model was employed in which agents participate in various language games. Among these, the Metropolis-Hastings naming game (MHNG) possesses a notable mathematical property: symbol emergence through MHNG is proven to be a decentralized Bayesian inference of representations shared by the agents.… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  15. arXiv:2305.01790  [pdf

    cond-mat.mtrl-sci cs.ET eess.SP

    Cascaded Logic Gates Based on High-Performance Ambipolar Dual-Gate WSe2 Thin Film Transistors

    Authors: Xintong Li, Peng Zhou, Xuan Hu, Ethan Rivers, Kenji Watanabe, Takashi Taniguchi, Deji Akinwande, Joseph S. Friedman, Jean Anne C. Incorvia

    Abstract: Ambipolar dual-gate transistors based on two-dimensional (2D) materials, such as graphene, carbon nanotubes, black phosphorus, and certain transition metal dichalcogenides (TMDs), enable reconfigurable logic circuits with suppressed off-state current. These circuits achieve the same logical output as CMOS with fewer transistors and offer greater flexibility in design. The primary challenge lies in… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  16. arXiv:2302.01489  [pdf, other

    cs.RO cs.MA

    Online Re-Planning and Adaptive Parameter Update for Multi-Agent Path Finding with Stochastic Travel Times

    Authors: Atsuyoshi Kita, Nobuhiro Suenari, Masashi Okada, Tadahiro Taniguchi

    Abstract: This study explores the problem of Multi-Agent Path Finding with continuous and stochastic travel times whose probability distribution is unknown. Our purpose is to manage a group of automated robots that provide package delivery services in a building where pedestrians and a wide variety of robots coexist, such as delivery services in office buildings, hospitals, and apartments. It is often the c… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures

  17. arXiv:2301.11538  [pdf, other

    cs.RO

    Goal-Image Conditioned Dynamic Cable Manipulation through Bayesian Inference and Multi-Objective Black-Box Optimization

    Authors: Kuniyuki Takahashi, Tadahiro Taniguchi

    Abstract: To perform dynamic cable manipulation to realize the configuration specified by a target image, we formulate dynamic cable manipulation as a stochastic forward model. Then, we propose a method to handle uncertainty by maximizing the expectation, which also considers estimation errors of the trained model. To avoid issues like multiple local minima and requirement of differentiability by gradient-b… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 7 pages. Accepted at ICRA2023. An accompanying video is available at the following link: https://youtu.be/AMDTJRNEbek

  18. arXiv:2301.05832  [pdf, other

    cs.RO cs.AI cs.LG

    World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges

    Authors: Tadahiro Taniguchi, Shingo Murata, Masahiro Suzuki, Dimitri Ognibene, Pablo Lanillos, Emre Ugur, Lorenzo Jamone, Tomoaki Nakamura, Alejandra Ciria, Bruno Lara, Giovanni Pezzulo

    Abstract: Creating autonomous robots that can actively explore the environment, acquire knowledge and learn skills continuously is the ultimate achievement envisioned in cognitive and developmental robotics. Their learning processes should be based on interactions with their physical and social world in the manner of human learning and cognitive development. Based on this context, in this paper, we focus on… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 28 pages, 3 figures

  19. Active Exploration based on Information Gain by Particle Filter for Efficient Spatial Concept Formation

    Authors: Akira Taniguchi, Yoshiki Tabuchi, Tomochika Ishikawa, Lotfi El Hafi, Yoshinobu Hagiwara, Tadahiro Taniguchi

    Abstract: Autonomous robots need to learn the categories of various places by exploring their environments and interacting with users. However, preparing training datasets with linguistic instructions from users is time-consuming and labor-intensive. Moreover, effective exploration is essential for appropriate concept formation and rapid environmental coverage. To address this issue, we propose an active in… ▽ More

    Submitted 12 June, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted to Advanced Robotics

  20. arXiv:2207.02457  [pdf, other

    q-bio.NC cs.AI cs.CL

    Brain-inspired probabilistic generative model for double articulation analysis of spoken language

    Authors: Akira Taniguchi, Maoko Muro, Hiroshi Yamakawa, Tadahiro Taniguchi

    Abstract: The human brain, among its several functions, analyzes the double articulation structure in spoken language, i.e., double articulation analysis (DAA). A hierarchical structure in which words are connected to form a sentence and words are composed of phonemes or syllables is called a double articulation structure. Where and how DAA is performed in the human brain has not been established, although… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted to the 2022 IEEE International Conference on Development and Learning (ICDL 2022)

  21. arXiv:2206.04780  [pdf, other

    cs.SD cs.AI cs.MM eess.AS

    Speak Like a Dog: Human to Non-human creature Voice Conversion

    Authors: Kohei Suzuki, Shoki Sakamoto, Tadahiro Taniguchi, Hirokazu Kameoka

    Abstract: This paper proposes a new voice conversion (VC) task from human speech to dog-like speech while preserving linguistic information as an example of human to non-human creature voice conversion (H2NH-VC) tasks. Although most VC studies deal with human to human VC, H2NH-VC aims to convert human speech into non-human creature-like speech. Non-parallel VC allows us to develop H2NH-VC, because we cannot… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 5 pages, 4 figures

    Journal ref: 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (pp. 1388-1393)

  22. arXiv:2205.15027  [pdf, other

    cs.LG cs.AI cs.CL cs.NE

    Symbol Emergence as Inter-personal Categorization with Head-to-head Latent Word

    Authors: Kazuma Furukawa, Akira Taniguchi, Yoshinobu Hagiwara, Tadahiro Taniguchi

    Abstract: In this study, we propose a head-to-head type (H2H-type) inter-personal multimodal Dirichlet mixture (Inter-MDM) by modifying the original Inter-MDM, which is a probabilistic generative model that represents the symbol emergence between two agents as multiagent multimodal categorization. A Metropolis--Hastings method-based naming game based on the Inter-MDM enables two agents to collaboratively pe… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 7 pages, 4 figures, 5 tables

    Journal ref: IEEE International Conference on Development and Learning (ICDL 2022), 2022, 60-67

  23. arXiv:2205.12392  [pdf, other

    cs.AI cs.CL

    Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models

    Authors: Tadahiro Taniguchi, Yuto Yoshida, Akira Taniguchi, Yoshinobu Hagiwara

    Abstract: Constructive studies on symbol emergence systems seek to investigate computational models that can better explain human language evolution, the creation of symbol systems, and the construction of internal representations. This study provides a new model for emergent communication, which is based on a probabilistic generative model (PGM) instead of a discriminative model based on deep reinforcement… ▽ More

    Submitted 14 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 23 pages, 12 figures

  24. arXiv:2203.11437  [pdf, other

    cs.CV

    Representation Uncertainty in Self-Supervised Learning as Variational Inference

    Authors: Hiroki Nakamura, Masashi Okada, Tadahiro Taniguchi

    Abstract: In this study, a novel self-supervised learning (SSL) method is proposed, which considers SSL in terms of variational inference to learn not only representation but also representation uncertainties. SSL is a method of learning representations without labels by maximizing the similarity between image representations of different augmented views of an image. Meanwhile, variational autoencoder (VAE)… ▽ More

    Submitted 8 September, 2023; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted to ICCV 2023

  25. arXiv:2203.11024  [pdf, other

    cs.AI cs.RO eess.SY

    Multi-View Dreaming: Multi-View World Model with Contrastive Learning

    Authors: Akira Kinose, Masashi Okada, Ryo Okumura, Tadahiro Taniguchi

    Abstract: In this paper, we propose Multi-View Dreaming, a novel reinforcement learning agent for integrated recognition and control from multi-view observations by extending Dreaming. Most current reinforcement learning method assumes a single-view observation space, and this imposes limitations on the observed data, such as lack of spatial information and occlusions. This makes obtaining ideal observation… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 7 pages, 8 figures

  26. Hierarchical Path-planning from Speech Instructions with Spatial Concept-based Topometric Semantic Map**

    Authors: Akira Taniguchi, Shuya Ito, Tadahiro Taniguchi

    Abstract: Assisting individuals in their daily activities through autonomous mobile robots, especially for users without specialized knowledge, is crucial. Specifically, the capability of robots to navigate to destinations based on human speech instructions is essential. While robots can take different paths to the same goal, the shortest path is not always the best. A preferred approach is to accommodate w… ▽ More

    Submitted 20 June, 2024; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted to Frontiers in Robotics and AI

  27. arXiv:2203.05955  [pdf, other

    cs.RO cs.AI

    Tactile-Sensitive NewtonianVAE for High-Accuracy Industrial Connector Insertion

    Authors: Ryo Okumura, Nobuki Nishio, Tadahiro Taniguchi

    Abstract: An industrial connector insertion task requires submillimeter positioning and grasp pose compensation for a plug. Thus, highly accurate estimation of the relative pose between a plug and socket is fundamental for achieving the task. World models are promising technologies for visuomotor control because they obtain appropriate state representation to jointly optimize feature extraction and latent d… ▽ More

    Submitted 2 August, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 7 pages, 4 figures

  28. arXiv:2203.00494  [pdf, other

    cs.LG cs.AI eess.SY

    DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction

    Authors: Masashi Okada, Tadahiro Taniguchi

    Abstract: The present paper proposes a novel reinforcement learning method with world models, DreamingV2, a collaborative extension of DreamerV2 and Dreaming. DreamerV2 is a cutting-edge model-based reinforcement learning from pixels that uses discrete world models to represent latent states with categorical variables. Dreaming is also a form of reinforcement learning from pixels that attempts to avoid the… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: The code will be available soon

  29. Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues

    Authors: Akira Taniguchi, Hiroaki Murakami, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: Human infants acquire their verbal lexicon with minimal prior knowledge of language based on the statistical properties of phonological distributions and the co-occurrence of other sensory stimuli. This study proposes a novel fully unsupervised learning method for discovering speech units using phonological information as a distributional cue and object information as a co-occurrence cue. The prop… ▽ More

    Submitted 21 August, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE TRANSACTIONS ON COGNITIVE DEVELOPMENTAL SYSTEMS

  30. arXiv:2109.07194  [pdf, other

    cs.AI cs.CL

    Multiagent Multimodal Categorization for Symbol Emergence: Emergent Communication via Interpersonal Cross-modal Inference

    Authors: Yoshinobu Hagiwara, Kazuma Furukawa, Akira Taniguchi, Tadahiro Taniguchi

    Abstract: This paper describes a computational model of multiagent multimodal categorization that realizes emergent communication. We clarify whether the computational model can reproduce the following functions in a symbol emergence system, comprising two agents with different sensory modalities playing a naming game. (1) Function for forming a shared lexical system that comprises perceptual categories and… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 27 pages, 5 figures, 12 tables

  31. StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition

    Authors: Shoki Sakamoto, Akira Taniguchi, Tadahiro Taniguchi, Hirokazu Kameoka

    Abstract: Preserving the linguistic content of input speech is essential during voice conversion (VC). The star generative adversarial network-based VC method (StarGAN-VC) is a recently developed method that allows non-parallel many-to-many VC. Although this method is powerful, it can fail to preserve the linguistic content of input speech when the number of available training samples is extremely small. To… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 5 pages, 6 figures, Accepted to INTERSPEECH 2021

    Journal ref: INTERSPEECH 2021, 1359--1363

  32. arXiv:2106.08574  [pdf

    cs.AI cs.RO

    Unsupervised Lexical Acquisition of Relative Spatial Concepts Using Spoken User Utterances

    Authors: Rikunari Sagara, Ryo Taguchi, Akira Taniguchi, Tadahiro Taniguchi, Koosuke Hattori, Masahiro Hoguro, Taizo Umezaki

    Abstract: This paper proposes methods for unsupervised lexical acquisition for relative spatial concepts using spoken user utterances. A robot with a flexible spoken dialog system must be able to acquire linguistic representation and its meaning specific to an environment through interactions with humans as children do. Specifically, relative spatial concepts (e.g., front and right) are widely used in our d… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 27 pages, 12 figures, submitted to Advanced Robotics

    ACM Class: I.2.9

  33. arXiv:2104.01807  [pdf, other

    cs.SD cs.CL eess.AS

    StarGAN-based Emotional Voice Conversion for Japanese Phrases

    Authors: Asuka Moritani, Ryo Ozaki, Shoki Sakamoto, Hirokazu Kameoka, Tadahiro Taniguchi

    Abstract: This paper shows that StarGAN-VC, a spectral envelope transformation method for non-parallel many-to-many voice conversion (VC), is capable of emotional VC (EVC). Although StarGAN-VC has been shown to enable speaker identity conversion, its capability for EVC for Japanese phrases has not been clarified. In this paper, we describe the direct application of StarGAN-VC to an EVC task with minimal fun… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Submitted to Interspeech 2021

  34. Map completion from partial observation using the global structure of multiple environmental maps

    Authors: Yuki Katsumata, Akinori Kanechika, Akira Taniguchi, Lotfi El Hafi, Yoshinobu Hagiwara, Tadahiro Taniguchi

    Abstract: Using the spatial structure of various indoor environments as prior knowledge, the robot would construct the map more efficiently. Autonomous mobile robots generally apply simultaneous localization and map** (SLAM) methods to understand the reachable area in newly visited environments. However, conventional map** approaches are limited by only considering sensor observation and control signals… ▽ More

    Submitted 17 January, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted to Advanced Robotics

  35. Double Articulation Analyzer with Prosody for Unsupervised Word and Phoneme Discovery

    Authors: Yasuaki Okuda, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: Infants acquire words and phonemes from unsegmented speech signals using segmentation cues, such as distributional, prosodic, and co-occurrence cues. Many pre-existing computational models that represent the process tend to focus on distributional or prosodic cues. This paper proposes a nonparametric Bayesian probabilistic generative model called the prosodic hierarchical Dirichlet process-hidden… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 11 pages, Submitted to IEEE Transactions on Cognitive and Developmental Systems

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, 2022

  36. A Whole Brain Probabilistic Generative Model: Toward Realizing Cognitive Architectures for Developmental Robots

    Authors: Tadahiro Taniguchi, Hiroshi Yamakawa, Takayuki Nagai, Kenji Doya, Masamichi Sakagami, Masahiro Suzuki, Tomoaki Nakamura, Akira Taniguchi

    Abstract: Building a humanlike integrative artificial cognitive system, that is, an artificial general intelligence (AGI), is the holy grail of the artificial intelligence (AI) field. Furthermore, a computational model that enables an artificial system to achieve cognitive development will be an excellent reference for brain and cognitive science. This paper describes an approach to develop a cognitive arch… ▽ More

    Submitted 9 January, 2022; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: 62 pages, 9 figures, submitted to Neural Networks

    Journal ref: Neural Networks, 2022, Volume 150, 293-312

  37. arXiv:2103.06442  [pdf, other

    cs.RO cs.AI

    Hierarchical Bayesian Model for the Transfer of Knowledge on Spatial Concepts based on Multimodal Information

    Authors: Yoshinobu Hagiwara, Keishiro Taguchi, Satoshi Ishibushi, Akira Taniguchi, Tadahiro Taniguchi

    Abstract: This paper proposes a hierarchical Bayesian model based on spatial concepts that enables a robot to transfer the knowledge of places from experienced environments to a new environment. The transfer of knowledge based on spatial concepts is modeled as the calculation process of the posterior distribution based on the observations obtained in each environment with the parameters of spatial concepts… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: 17 pages, 12 figures, 6 tables

  38. Visual Exploration System for Analyzing Trends in Annual Recruitment Using Time-varying Graphs

    Authors: Toshiyuki T. Yokoyama, Masashi Okada, Tadahiro Taniguchi

    Abstract: Annual recruitment data of new graduates are manually analyzed by human resources specialists (HR) in industries, which signifies the need to evaluate the recruitment strategy of HR specialists. Every year, different applicants send in job applications to companies. The relationships between applicants' attributes (e.g., English skill or academic credential) can be used to analyze the changes in r… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

  39. arXiv:2007.14535  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction

    Authors: Masashi Okada, Tadahiro Taniguchi

    Abstract: In the present paper, we propose a decoder-free extension of Dreamer, a leading model-based reinforcement learning (MBRL) method from pixels. Dreamer is a sample- and cost-efficient solution to robot learning, as it is used to train latent state-space models based on a variational autoencoder and to conduct policy optimization by latent trajectory imagination. However, this autoencoding based appr… ▽ More

    Submitted 11 March, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted to ICRA2021. Camera ready version

  40. arXiv:2007.10204  [pdf, other

    cs.CR

    Graph Convolutional Network-based Suspicious Communication Pair Estimation for Industrial Control Systems

    Authors: Tatsumi Oba, Tadahiro Taniguchi

    Abstract: Whitelisting is considered an effective security monitoring method for networks used in industrial control systems, where the whitelists consist of observed tuples of the IP address of the server, the TCP/UDP port number, and IP address of the client (communication triplets). However, this method causes frequent false detections. To reduce false positives due to a simple whitelist-based judgment,… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 9 pages, 3 figures

    Journal ref: Proc. of the BlackHat Europe 2020 Conference, BlackHat EU 2020, Dec. 7-10, 2020

  41. arXiv:2003.00370  [pdf, other

    cs.LG cs.NE cs.RO stat.ML

    PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference

    Authors: Masashi Okada, Norio Kosaka, Tadahiro Taniguchi

    Abstract: In the present paper, we propose an extension of the Deep Planning Network (PlaNet), also referred to as PlaNet of the Bayesians (PlaNet-Bayes). There has been a growing demand in model predictive control (MPC) in partially observable environments in which complete information is unavailable because of, for example, lack of expensive sensors. PlaNet is a promising solution to realize such latent M… ▽ More

    Submitted 29 February, 2020; originally announced March 2020.

  42. Spatial Concept-Based Navigation with Human Speech Instructions via Probabilistic Inference on Bayesian Generative Model

    Authors: Akira Taniguchi, Yoshinobu Hagiwara, Tadahiro Taniguchi, Tetsunari Inamura

    Abstract: Robots are required to not only learn spatial concepts autonomously but also utilize such knowledge for various tasks in a domestic environment. Spatial concept represents a multimodal place category acquired from the robot's spatial experience including vision, speech-language, and self-position. The aim of this study is to enable a mobile robot to perform navigational tasks with human speech ins… ▽ More

    Submitted 26 August, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: Accepted to Advanced Robotics

  43. Autonomous Planning Based on Spatial Concepts to Tidy Up Home Environments with Service Robots

    Authors: Akira Taniguchi, Shota Isobe, Lotfi El Hafi, Yoshinobu Hagiwara, Tadahiro Taniguchi

    Abstract: Tidy-up tasks by service robots in home environments are challenging in robotics applications because they involve various interactions with the environment. In particular, robots are required not only to grasp, move, and release various home objects but also to plan the order and positions for placing the objects. In this paper, we propose a novel planning method that can efficiently estimate the… ▽ More

    Submitted 10 February, 2021; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: This paper has been accepted to Advanced Robotics

  44. arXiv:2001.11628  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Domain-Adversarial and Conditional State Space Model for Imitation Learning

    Authors: Ryo Okumura, Masashi Okada, Tadahiro Taniguchi

    Abstract: State representation learning (SRL) in partially observable Markov decision processes has been studied to learn abstract features of data useful for robot control tasks. For SRL, acquiring domain-agnostic states is essential for achieving efficient imitation learning. Without these states, imitation learning is hampered by domain-dependent information useless for control. However, existing methods… ▽ More

    Submitted 4 June, 2021; v1 submitted 30 January, 2020; originally announced January 2020.

    Comments: Published at IROS 2020

  45. Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models

    Authors: Tadahiro Taniguchi, Tomoaki Nakamura, Masahiro Suzuki, Ryo Kuniyasu, Kaede Hayashi, Akira Taniguchi, Takato Horii, Takayuki Nagai

    Abstract: This paper describes a framework for the development of an integrative cognitive system based on probabilistic generative models (PGMs) called Neuro-SERKET. Neuro-SERKET is an extension of SERKET, which can compose elemental PGMs developed in a distributed manner and provide a scheme that allows the composed PGMs to learn throughout the system in an unsupervised way. In addition to the head-to-tai… ▽ More

    Submitted 29 January, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: New Gener. Comput. (2020)

    Journal ref: New Generation Computing, 2020, volume 38, 23--48

  46. arXiv:1909.07031  [pdf, other

    cs.CV cs.LG

    Multi-person Pose Tracking using Sequential Monte Carlo with Probabilistic Neural Pose Predictor

    Authors: Masashi Okada, Shinji Takenaka, Tadahiro Taniguchi

    Abstract: It is an effective strategy for the multi-person pose tracking task in videos to employ prediction and pose matching in a frame-by-frame manner. For this type of approach, uncertainty-aware modeling is essential because precise prediction is impossible. However, previous studies have relied on only a single prediction without incorporating uncertainty, which can cause critical tracking errors if t… ▽ More

    Submitted 27 February, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: Accepted to ICRA2020; Camera-ready ver

  47. arXiv:1907.04202  [pdf, other

    cs.LG eess.SY stat.ML

    Variational Inference MPC for Bayesian Model-based Reinforcement Learning

    Authors: Masashi Okada, Tadahiro Taniguchi

    Abstract: In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy to enhance learning performance, making MBRLs competitive to cutting-edge model free methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which employs Bayesian inference to dynami… ▽ More

    Submitted 6 October, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: Accepted to CoRL2019. Camera-ready ver

  48. Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model

    Authors: Akira Kinose, Tadahiro Taniguchi

    Abstract: Integration of reinforcement learning and imitation learning is an important problem that has been studied for a long time in the field of intelligent robotics. Reinforcement learning optimizes policies to maximize the cumulative reward, whereas imitation learning attempts to extract general knowledge about the trajectories demonstrated by experts, i.e., demonstrators. Because each of them has the… ▽ More

    Submitted 16 October, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Submitted to Advanced Robotics

    Journal ref: Advanced Robotics, 2020, 34:16, 1055-1067

  49. Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias

    Authors: Ryo Nakashima, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: This paper describes a new unsupervised machine learning method for simultaneous phoneme and word discovery from multiple speakers. Human infants can acquire knowledge of phonemes and words from interactions with his/her mother as well as with others surrounding him/her. From a computational perspective, phoneme and word discovery from multiple speakers is a more challenging problem than that from… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 21 pages. Submitted

    Journal ref: Front. Robot. AI, 2019, 6:92

  50. arXiv:1905.13443  [pdf, other

    cs.CL

    Symbol Emergence as an Interpersonal Multimodal Categorization

    Authors: Yoshinobu Hagiwara, Hiroyoshi Kobayashi, Akira Taniguchi, Tadahiro Taniguchi

    Abstract: This study focuses on category formation for individual agents and the dynamics of symbol emergence in a multi-agent system through semiotic communication. Semiotic communication is defined, in this study, as the generation and interpretation of signs associated with the categories formed through the agent's own sensory experience or by exchange of signs with other agents. From the viewpoint of la… ▽ More

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: 21 pages, 12 figures