-
Deep Multi-stream Network for Video-based Calving Sign Detection
Authors:
Ryosuke Hyodo,
Teppei Nakano,
Tetsuji Ogawa
Abstract:
We have designed a deep multi-stream network for automatically detecting calving signs from video. Calving sign detection from a camera, which is a non-contact sensor, is expected to enable more efficient livestock management. As large-scale, well-developed data cannot generally be assumed when establishing calving detection systems, the basis for making the prediction needs to be presented to far…
▽ More
We have designed a deep multi-stream network for automatically detecting calving signs from video. Calving sign detection from a camera, which is a non-contact sensor, is expected to enable more efficient livestock management. As large-scale, well-developed data cannot generally be assumed when establishing calving detection systems, the basis for making the prediction needs to be presented to farmers during operation, so black-box modeling (also known as end-to-end modeling) is not appropriate. For practical operation of calving detection systems, the present study aims to incorporate expert knowledge into a deep neural network. To this end, we propose a multi-stream calving sign detection network in which multiple calving-related features are extracted from the corresponding feature extraction networks designed for each attribute with different characteristics, such as a cow's posture, rotation, and movement, known as calving signs, and are then integrated appropriately depending on the cow's situation. Experimental comparisons conducted using videos of 15 cows demonstrated that our multi-stream system yielded a significant improvement over the end-to-end system, and the multi-stream architecture significantly contributed to a reduction in detection errors. In addition, the distinctive mixture weights we observed helped provide interpretability of the system's behavior.
△ Less
Submitted 10 January, 2023;
originally announced February 2023.
-
Video Surveillance System Incorporating Expert Decision-making Process: A Case Study on Detecting Calving Signs in Cattle
Authors:
Ryosuke Hyodo,
Susumu Saito,
Teppei Nakano,
Makoto Akabane,
Ryoichi Kasuga,
Tetsuji Ogawa
Abstract:
Through a user study in the field of livestock farming, we verify the effectiveness of an XAI framework for video surveillance systems. The systems can be made interpretable by incorporating experts' decision-making processes. AI systems are becoming increasingly common in real-world applications, especially in fields related to human decision-making, and its interpretability is necessary. However…
▽ More
Through a user study in the field of livestock farming, we verify the effectiveness of an XAI framework for video surveillance systems. The systems can be made interpretable by incorporating experts' decision-making processes. AI systems are becoming increasingly common in real-world applications, especially in fields related to human decision-making, and its interpretability is necessary. However, there are still relatively few standard methods for assessing and addressing the interpretability of machine learning-based systems in real-world applications. In this study, we examine the framework of a video surveillance AI system that presents the reasoning behind predictions by incorporating experts' decision-making processes with rich domain knowledge of the notification target. While general black-box AI systems can only present final probability values, the proposed framework can present information relevant to experts' decisions, which is expected to be more helpful for their decision-making. In our case study, we designed a system for detecting signs of calving in cattle based on the proposed framework and evaluated the system through a user study (N=6) with people involved in livestock farming. A comparison with the black-box AI system revealed that many participants referred to the presented reasons for the prediction results, and five out of six participants selected the proposed system as the system they would like to use in the future. It became clear that we need to design a user interface that considers the reasons for the prediction results.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
Machine Learning model for gas-liquid interface reconstruction in CFD numerical simulations
Authors:
Tamon Nakano,
Alessandro Michele Bucci,
Jean-Marc Gratien,
Thibault Faney,
Guillaume Charpiat
Abstract:
The volume of fluid (VoF) method is widely used in multi-phase flow simulations to track and locate the interface between two immiscible fluids. A major bottleneck of the VoF method is the interface reconstruction step due to its high computational cost and low accuracy on unstructured grids. We propose a machine learning enhanced VoF method based on Graph Neural Networks (GNN) to accelerate the i…
▽ More
The volume of fluid (VoF) method is widely used in multi-phase flow simulations to track and locate the interface between two immiscible fluids. A major bottleneck of the VoF method is the interface reconstruction step due to its high computational cost and low accuracy on unstructured grids. We propose a machine learning enhanced VoF method based on Graph Neural Networks (GNN) to accelerate the interface reconstruction on general unstructured meshes. We first develop a methodology to generate a synthetic dataset based on paraboloid surfaces discretized on unstructured meshes. We then train a GNN based model and perform generalization tests. Our results demonstrate the efficiency of a GNN based approach for interface reconstruction in multi-phase flow simulations in the industrial context.
△ Less
Submitted 12 July, 2022;
originally announced July 2022.
-
Bayesian Optimization for Cascade-type Multi-stage Processes
Authors:
Shunya Kusakawa,
Shion Takeno,
Yu Inatsu,
Kentaro Kutsukake,
Shogo Iwazaki,
Takashi Nakano,
Toru Ujihara,
Masayuki Karasuyama,
Ichiro Takeuchi
Abstract:
Complex processes in science and engineering are often formulated as multistage decision-making problems. In this paper, we consider a type of multistage decision-making process called a cascade process. A cascade process is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the…
▽ More
Complex processes in science and engineering are often formulated as multistage decision-making problems. In this paper, we consider a type of multistage decision-making process called a cascade process. A cascade process is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the optimal controllable parameters for each stage exhaustively. To address this problem, we formulate the optimization of the cascade process as an extension of the Bayesian optimization framework and propose two types of acquisition functions based on credible intervals and expected improvement. We investigate the theoretical properties of the proposed acquisition functions and demonstrate their effectiveness through numerical experiments. In addition, we consider an extension called suspension setting in which we are allowed to suspend the cascade process at the middle of the multistage decision-making process that often arises in practical problems. We apply the proposed method in a test problem involving a solar cell simulator, which was the motivation for this study.
△ Less
Submitted 7 March, 2023; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Reduced-Lead ECG Classifier Model Trained with DivideMix and Model Ensemble
Authors:
Hiroshi Seki,
Takashi Nakano,
Koshiro Ikeda,
Shinji Hirooka,
Takaaki Kawasaki,
Mitsutomo Yamada,
Shumpei Saito,
Toshitaka Yamakawa,
Shimpei Ogawa
Abstract:
Automatic diagnosis of multiple cardiac abnormalities from reduced-lead electrocardiogram (ECG) data is challenging. One of the reasons for this is the difficulty of defining labels from standard 12-lead data. Reduced-lead ECG data usually do not have identical characteristics of cardiac abnormalities because of the noisy label problem. Thus, there is an inconsistency in the annotated labels betwe…
▽ More
Automatic diagnosis of multiple cardiac abnormalities from reduced-lead electrocardiogram (ECG) data is challenging. One of the reasons for this is the difficulty of defining labels from standard 12-lead data. Reduced-lead ECG data usually do not have identical characteristics of cardiac abnormalities because of the noisy label problem. Thus, there is an inconsistency in the annotated labels between the reduced-lead and 12-lead ECG data. To solve this, we propose deep neural network (DNN)-based ECG classifier models that incorporate DivideMix and stochastic weight averaging (SWA). DivideMix was used to refine the noisy label by using two separate models. Besides DivideMix, we used a model ensemble technique, SWA, which also focuses on the noisy label problem, to enhance the effect of the models generated by DivideMix. Our classifiers (ami_kagoshima) received scores of 0.49, 0.47, 0.48, 0.47, and 0.47 (ranked 9th, 10th, 10th, 11th, and 10th, respectively, out of 39 teams) for the 12-lead, 6-lead, 4-lead, 3-lead, and 2-lead versions, respectively, of the hidden test set with the challenge evaluation metric. We obtained the scores of 0.701, 0.686, 0.693, 0.693, and 0.685 on the 10-fold cross validation, and 0.623, 0.593, 0.606, 0.612, and 0.601 on the hidden validation set for each lead combination.
△ Less
Submitted 24 September, 2021;
originally announced September 2021.
-
Exploring Effectiveness of Inter-Microtask Qualification Tests in Crowdsourcing
Authors:
Masaya Morinaga,
Susumu Saito,
Teppei Nakano,
Tetsunori Kobayashi,
Tetsuji Ogawa
Abstract:
Qualification tests in crowdsourcing are often used to pre-filter workers by measuring their ability in executing microtasks.While creating qualification tests for each task type is considered as a common and reasonable way, this study investigates into its worker-filtering performance when the same qualification test is used across multiple types of tasks.On Amazon Mechanical Turk, we tested the…
▽ More
Qualification tests in crowdsourcing are often used to pre-filter workers by measuring their ability in executing microtasks.While creating qualification tests for each task type is considered as a common and reasonable way, this study investigates into its worker-filtering performance when the same qualification test is used across multiple types of tasks.On Amazon Mechanical Turk, we tested the annotation accuracy in six different cases where tasks consisted of two different difficulty levels, arising from the identical real-world domain: four combinatory cases in which the qualification test and the actual task were the same or different from each other, as well as two other cases where workers with Masters Qualification were asked to perform the actual task only.The experimental results demonstrated the two following findings: i) Workers that were assigned to a difficult qualification test scored better annotation accuracy regardless of the difficulty of the actual task; ii) Workers with Masters Qualification scored better annotation accuracy on the low-difficulty task, but were not as accurate as those who passed a qualification test on the high-difficulty task.
△ Less
Submitted 20 December, 2020;
originally announced December 2020.
-
Estimating Blink Probability for Highlight Detection in Figure Skating Videos
Authors:
Tamami Nakano,
Atsuya Sakata,
Akihiro Kishimoto
Abstract:
Highlight detection in sports videos has a broad viewership and huge commercial potential. It is thus imperative to detect highlight scenes more suitably for human interest with high temporal accuracy. Since people instinctively suppress blinks during attention-grabbing events and synchronously generate blinks at attention break points in videos, the instantaneous blink rate can be utilized as a h…
▽ More
Highlight detection in sports videos has a broad viewership and huge commercial potential. It is thus imperative to detect highlight scenes more suitably for human interest with high temporal accuracy. Since people instinctively suppress blinks during attention-grabbing events and synchronously generate blinks at attention break points in videos, the instantaneous blink rate can be utilized as a highly accurate temporal indicator of human interest. Therefore, in this study, we propose a novel, automatic highlight detection method based on the blink rate. The method trains a one-dimensional convolution network (1D-CNN) to assess blink rates at each video frame from the spatio-temporal pose features of figure skating videos. Experiments show that the method successfully estimates the blink rate in 94% of the video clips and predicts the temporal change in the blink rate around a jump event with high accuracy. Moreover, the method detects not only the representative athletic action, but also the distinctive artistic expression of figure skating performance as key frames. This suggests that the blink-rate-based supervised learning approach enables high-accuracy highlight detection that more closely matches human sensibility.
△ Less
Submitted 2 July, 2020;
originally announced July 2020.
-
TurkScanner: Predicting the Hourly Wage of Microtasks
Authors:
Susumu Saito,
Chun-Wei Chiang,
Saiph Savage,
Teppei Nakano,
Tetsunori Kobayashi,
Jeffrey Bigham
Abstract:
Workers in crowd markets struggle to earn a living. One reason for this is that it is difficult for workers to accurately gauge the hourly wages of microtasks, and they consequently end up performing labor with little pay. In general, workers are provided with little information about tasks, and are left to rely on noisy signals, such as textual description of the task or rating of the requester.…
▽ More
Workers in crowd markets struggle to earn a living. One reason for this is that it is difficult for workers to accurately gauge the hourly wages of microtasks, and they consequently end up performing labor with little pay. In general, workers are provided with little information about tasks, and are left to rely on noisy signals, such as textual description of the task or rating of the requester. This study explores various computational methods for predicting the working times (and thus hourly wages) required for tasks based on data collected from other workers completing crowd work. We provide the following contributions. (i) A data collection method for gathering real-world training data on crowd-work tasks and the times required for workers to complete them; (ii) TurkScanner: a machine learning approach that predicts the necessary working time to complete a task (and can thus implicitly provide the expected hourly wage). We collected 9,155 data records using a web browser extension installed by 84 Amazon Mechanical Turk workers, and explored the challenge of accurately recording working times both automatically and by asking workers. TurkScanner was created using ~150 derived features, and was able to predict the hourly wages of 69.6% of all the tested microtasks within a 75% error. Directions for future research include observing the effects of tools on people's working practices, adapting this approach to a requester tool for better price setting, and predicting other elements of work (e.g., the acceptance likelihood and worker task preferences.)
△ Less
Submitted 17 March, 2019;
originally announced March 2019.
-
Fault Tolerance of Random Graphs with respect to Connectivity: Mean-field Approximation for Semi-dense Random Graphs
Authors:
Satoshi Takabe,
Takafumi Nakano,
Tadashi Wadayama
Abstract:
The fault tolerance of random graphs with unbounded degrees with respect to connectivity is investigated, which relates to the reliability of wireless sensor networks with unreliable relay nodes. The model evaluates the network breakdown probability that a graph is disconnected after stochastic node removal. To establish a mean-field approximation for the model, we propose the cavity method for fi…
▽ More
The fault tolerance of random graphs with unbounded degrees with respect to connectivity is investigated, which relates to the reliability of wireless sensor networks with unreliable relay nodes. The model evaluates the network breakdown probability that a graph is disconnected after stochastic node removal. To establish a mean-field approximation for the model, we propose the cavity method for finite systems. The analysis enables us to obtain an approximation formula for random graphs with any number of nodes and an arbitrary degree distribution. In addition, its asymptotic analysis reveals that the phase transition occurs in semi-dense random graphs whose average degree grows logarithmically. These results, which are supported by numerical simulations, coincide with the mathematical results, indicating successful predictions by mean-field approximation for unbounded but not dense random graphs.
△ Less
Submitted 9 May, 2019; v1 submitted 21 December, 2017;
originally announced December 2017.
-
On Zero Error Capacity of Nearest Neighbor Error Channels with Multilevel Alphabet
Authors:
Takafumi Nakano,
Tadashi Wadayama
Abstract:
This paper studies the zero error capacity of the Nearest Neighbor Error (NNE) channels with a multilevel alphabet. In the NNE channels, a transmitted symbol is a $d$-tuple of elements in $\{0,1,2,\dots, n-1 \}$. It is assumed that only one element error to a nearest neighbor element in a transmitted symbol can occur. The NNE channels can be considered as a special type of limited magnitude error…
▽ More
This paper studies the zero error capacity of the Nearest Neighbor Error (NNE) channels with a multilevel alphabet. In the NNE channels, a transmitted symbol is a $d$-tuple of elements in $\{0,1,2,\dots, n-1 \}$. It is assumed that only one element error to a nearest neighbor element in a transmitted symbol can occur. The NNE channels can be considered as a special type of limited magnitude error channels, and it is closely related to error models for flash memories. In this paper, we derive a lower bound of the zero error capacity of the NNE channels based on a result of the perfect Lee codes. An upper bound of the zero error capacity of the NNE channels is also derived from a feasible solution of a linear programming problem defined based on the confusion graphs of the NNE channels. As a result, a concise formula of the zero error capacity is obtained using the lower and upper bounds.
△ Less
Submitted 30 January, 2017;
originally announced January 2017.
-
Analysis of Breakdown Probability of Wireless Sensor Networks with Unreliable Relay Nodes
Authors:
Takayuki Nozaki,
Takafumi Nakano,
Tadashi Wadayama
Abstract:
In the present paper, we derive an upper bound of the average network breakdown probability of packet networks with unreliable relay nodes. We here assume that relay nodes get independently broken with a given node breakdown probability. A survivor graph is the induced subgraph obtained by removing the broken relay nodes and their connecting edges from the original graph. If the survivor network i…
▽ More
In the present paper, we derive an upper bound of the average network breakdown probability of packet networks with unreliable relay nodes. We here assume that relay nodes get independently broken with a given node breakdown probability. A survivor graph is the induced subgraph obtained by removing the broken relay nodes and their connecting edges from the original graph. If the survivor network is disconnected, we consider a network breakdown happens. The primal contribution of the paper is to derive an upper bound of the average network breakdown probability, where the expectation is taken over a regular graph ensemble. The proof of the bound is based on a natural one-to-one correspondence between a regular graph and a regular bipartite graph, and also on enumeration of bipartite graphs satisfying certain conditions. This proof argument is inspired by the analysis of weight distribution for low-density parity-check codes. Compared with estimates of the average network breakdown probability obtained by computer experiments, it is observed that the upper bound provides the values which are not only upper bounds but also precise estimates of the network breakdown probability when the node breakdown probability is small.
△ Less
Submitted 24 January, 2017;
originally announced January 2017.
-
RNA as a Nanoscale Data Transmission Medium: Error Analysis
Authors:
Andrew W. Eckford,
Taro Furbayashi,
Tadashi Nakano
Abstract:
RNA can be used as a high-density medium for data storage and transmission; however, an important RNA process -- replication -- is noisy. This paper presents an error analysis for RNA as a data transmission medium, analyzing how deletion errors increase in a collection of replicated DNA strands over time.
RNA can be used as a high-density medium for data storage and transmission; however, an important RNA process -- replication -- is noisy. This paper presents an error analysis for RNA as a data transmission medium, analyzing how deletion errors increase in a collection of replicated DNA strands over time.
△ Less
Submitted 23 September, 2016;
originally announced September 2016.
-
TCP-like molecular communications
Authors:
Luca Felicetti,
Mauro Femminella,
Gianluca Reali,
Tadashi Nakano,
Athanasios V. Vasilakos
Abstract:
In this paper, we present a communication protocol between a pair of biological nanomachines, transmitter and receiver, built upon molecular communications in an aqueous environment. In our proposal, the receiver, acting as a control node, sends a connection setup signal to the transmitter, which stokes molecules, to start molecule transmission. The molecules transmitted by the transmitter propaga…
▽ More
In this paper, we present a communication protocol between a pair of biological nanomachines, transmitter and receiver, built upon molecular communications in an aqueous environment. In our proposal, the receiver, acting as a control node, sends a connection setup signal to the transmitter, which stokes molecules, to start molecule transmission. The molecules transmitted by the transmitter propagate in the environment and are absorbed by the receiver through its receptors. When the receiver absorbs the desired quantity of molecules, it releases a tear-down signal to notify the transmitter to stop the transmission. The proposed protocol implements a bidirectional communication by using a number of techniques originally designed for the TCP. In fact, the proposed protocol is connection-oriented, and uses the TCP-like probing to find a suitable transmission rate between transmitter and receiver so as to avoid receiver congestion. Unlike the TCP, however, explicit acknowledgments are not used, since they would degrade the communication throughput due to the large delay, a characteristic feature of molecular communications. Thus, the proposed protocol uses implicit acknowledgments, and feedback signals are sent by the receiver to throttle the transmission rate at the transmitter, i.e., explicit negative feedbacks. We also present the results of an extensive simulation campaign, used to validate the proposed protocol and to properly dimension the main protocol parameters.
△ Less
Submitted 17 June, 2014;
originally announced June 2014.