-
Explainable AI for Ship Collision Avoidance: Decoding Decision-Making Processes and Behavioral Intentions
Authors:
Hitoshi Yoshioka,
Hirotada Hashimoto
Abstract:
This study developed an explainable AI for ship collision avoidance. Initially, a critic network composed of sub-task critic networks was proposed to individually evaluate each sub-task in collision avoidance to clarify the AI decision-making processes involved. Additionally, an attempt was made to discern behavioral intentions through a Q-value analysis and an Attention mechanism. The former focused on interpreting intentions by examining the increment of the Q-value resulting from AI actions, while the latter incorporated the significance of other ships in the decision-making process for collision avoidance into the learning objective. The AI's behavioral intentions in collision avoidance were visualized by combining the perceived collision danger with the degree of attention to other ships. The proposed method was evaluated through a numerical experiment. The developed AI was confirmed to be able to safely avoid collisions under various congestion levels, and the AI's decision-making process was rendered comprehensible to humans. The proposed method not only facilitates the understanding of DRL-based controllers/systems in the ship collision avoidance task but also extends to any task comprising sub-tasks.
Submitted 19 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Reliability Quantification of Deep Reinforcement Learning-based Control
Authors:
Hitoshi Yoshioka,
Hirotada Hashimoto
Abstract:
Reliability quantification of deep reinforcement learning (DRL)-based control is a significant challenge for the practical application of artificial intelligence (AI) in safety-critical systems. This study proposes a method for quantifying the reliability of DRL-based control. First, an existing method, random noise distillation, was applied to the reliability evaluation to clarify the issues to be solved. Second, a novel method for reliability quantification was proposed to solve these issues. The reliability is quantified using two neural networks: reference and evaluator. They have the same structure with the same initial parameters. The outputs of the two networks were the same before training. During training, the evaluator network parameters were updated to maximize the difference between the reference and evaluator networks for trained data. Thus, the reliability of the DRL-based control for a state can be evaluated based on the difference in output between the two networks. The proposed method was applied to DQN-based control as an example of a simple task, and its effectiveness was demonstrated. Finally, the proposed method was applied to the problem of switching trained models depending on the state. Consequently, the performance of the DRL-based control was improved by switching the trained models according to their reliability.
Submitted 13 October, 2023; v1 submitted 29 September, 2023;
originally announced September 2023.
-
Optimal harvesting policy for biological resources with uncertain heterogeneity for application in fisheries management
Authors:
Hidekazu Yoshioka
Abstract:
Conventional harvesting problems for natural resources often assume physiological homogeneity of the body length/weight among individuals. However, such assumptions generally are not valid in real-world problems, where heterogeneity plays an essential role in the planning of biological resource harvesting. Furthermore, it is difficult to observe heterogeneity directly from the available data. This paper presents a novel optimal control framework for the cost-efficient harvesting of biological resources for application in fisheries management. The heterogeneity is incorporated into the resource dynamics, which is the population dynamics in this case, through a probability density that can be distorted from the reality. Subsequently, the distortion, which is the model uncertainty, is penalized through a divergence, leading to a non-standard dynamic differential game wherein the Hamilton-Jacobi-Bellman-Isaacs (HJBI) equation has a unique nonlinear partial differential term. Here, the existence and uniqueness results of the HJBI equation are presented along with an explicit monotone finite difference method. Finally, the proposed optimal control is applied to a harvesting problem with recreationally, economically, and ecologically important fish species using collected field data.
Submitted 9 December, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Determination of the Interface between Amorphous Insulator and Crystalline 4H-SiC in Transmission Electron Microscope Image by using Convolutional Neural Network
Authors:
Hironori Yoshioka,
Tomonori Honda
Abstract:
A rough interface seems to be one of the possible reasons for low channel mobility (conductivity) in SiC MOSFETs. To evaluate the mobility by interface roughness, we drew a boundary line between amorphous insulator and crystalline 4H-SiC in a cross-sectional image obtained by a transmission electron microscope (TEM), by using the deep learning approach of convolutional neural network (CNN). We show that the CNN model recognizes the interface very well, even when the interface is too rough to draw the boundary line manually. Power spectral density of interface roughness was calculated.
Submitted 14 October, 2020;
originally announced October 2020.
-
A random observation-based management model of population dynamics and its ecological application
Authors:
Hidekazu Yoshioka,
Yuta Yaegashi,
Motoh Tsujimura
Abstract:
A new stochastic control problem of population dynamics under partial observation is formulated and analyzed both mathematically and numerically, with an emphasis on environmental and ecological problems. The decision-maker can only randomly and time-discretely observe and impulsively intervene in the population dynamics governed by a regime-switching stochastic differential equation. The hybrid nature of the problem leads to an optimality equation containing an integro-differential equation and a static optimization problem. It is therefore different from the conventional Hamilton-Jacobi-Bellman equations. Existence and solvability issues of this optimality equation are analyzed in a viscosity sense. An exact solution for a reduced but still nontrivial model is derived as well. The model is finally applied to a realistic environmental management problem in a river using a finite difference scheme.
Submitted 9 April, 2020;
originally announced April 2020.