Search | arXiv e-print repository

Deep Multi-stream Network for Video-based Calving Sign Detection

Authors: Ryosuke Hyodo, Teppei Nakano, Tetsuji Ogawa

Abstract: We have designed a deep multi-stream network for automatically detecting calving signs from video. Calving sign detection from a camera, which is a non-contact sensor, is expected to enable more efficient livestock management. As large-scale, well-developed data cannot generally be assumed when establishing calving detection systems, the basis for making the prediction needs to be presented to far… ▽ More We have designed a deep multi-stream network for automatically detecting calving signs from video. Calving sign detection from a camera, which is a non-contact sensor, is expected to enable more efficient livestock management. As large-scale, well-developed data cannot generally be assumed when establishing calving detection systems, the basis for making the prediction needs to be presented to farmers during operation, so black-box modeling (also known as end-to-end modeling) is not appropriate. For practical operation of calving detection systems, the present study aims to incorporate expert knowledge into a deep neural network. To this end, we propose a multi-stream calving sign detection network in which multiple calving-related features are extracted from the corresponding feature extraction networks designed for each attribute with different characteristics, such as a cow's posture, rotation, and movement, known as calving signs, and are then integrated appropriately depending on the cow's situation. Experimental comparisons conducted using videos of 15 cows demonstrated that our multi-stream system yielded a significant improvement over the end-to-end system, and the multi-stream architecture significantly contributed to a reduction in detection errors. In addition, the distinctive mixture weights we observed helped provide interpretability of the system's behavior. △ Less

Submitted 10 January, 2023; originally announced February 2023.

arXiv:2301.03926 [pdf, other]

Video Surveillance System Incorporating Expert Decision-making Process: A Case Study on Detecting Calving Signs in Cattle

Authors: Ryosuke Hyodo, Susumu Saito, Teppei Nakano, Makoto Akabane, Ryoichi Kasuga, Tetsuji Ogawa

Abstract: Through a user study in the field of livestock farming, we verify the effectiveness of an XAI framework for video surveillance systems. The systems can be made interpretable by incorporating experts' decision-making processes. AI systems are becoming increasingly common in real-world applications, especially in fields related to human decision-making, and its interpretability is necessary. However… ▽ More Through a user study in the field of livestock farming, we verify the effectiveness of an XAI framework for video surveillance systems. The systems can be made interpretable by incorporating experts' decision-making processes. AI systems are becoming increasingly common in real-world applications, especially in fields related to human decision-making, and its interpretability is necessary. However, there are still relatively few standard methods for assessing and addressing the interpretability of machine learning-based systems in real-world applications. In this study, we examine the framework of a video surveillance AI system that presents the reasoning behind predictions by incorporating experts' decision-making processes with rich domain knowledge of the notification target. While general black-box AI systems can only present final probability values, the proposed framework can present information relevant to experts' decisions, which is expected to be more helpful for their decision-making. In our case study, we designed a system for detecting signs of calving in cattle based on the proposed framework and evaluated the system through a user study (N=6) with people involved in livestock farming. A comparison with the black-box AI system revealed that many participants referred to the presented reasons for the prediction results, and five out of six participants selected the proposed system as the system they would like to use in the future. It became clear that we need to design a user interface that considers the reasons for the prediction results. △ Less

Submitted 10 January, 2023; originally announced January 2023.

arXiv:2207.05684 [pdf, other]

Machine Learning model for gas-liquid interface reconstruction in CFD numerical simulations

Authors: Tamon Nakano, Alessandro Michele Bucci, Jean-Marc Gratien, Thibault Faney, Guillaume Charpiat

Abstract: The volume of fluid (VoF) method is widely used in multi-phase flow simulations to track and locate the interface between two immiscible fluids. A major bottleneck of the VoF method is the interface reconstruction step due to its high computational cost and low accuracy on unstructured grids. We propose a machine learning enhanced VoF method based on Graph Neural Networks (GNN) to accelerate the i… ▽ More The volume of fluid (VoF) method is widely used in multi-phase flow simulations to track and locate the interface between two immiscible fluids. A major bottleneck of the VoF method is the interface reconstruction step due to its high computational cost and low accuracy on unstructured grids. We propose a machine learning enhanced VoF method based on Graph Neural Networks (GNN) to accelerate the interface reconstruction on general unstructured meshes. We first develop a methodology to generate a synthetic dataset based on paraboloid surfaces discretized on unstructured meshes. We then train a GNN based model and perform generalization tests. Our results demonstrate the efficiency of a GNN based approach for interface reconstruction in multi-phase flow simulations in the industrial context. △ Less

Submitted 12 July, 2022; originally announced July 2022.

Comments: 12 pages, fullpaper of ECCOMAS2022

arXiv:2111.08330 [pdf, ps, other]

doi 10.1162/neco_a_01550

Bayesian Optimization for Cascade-type Multi-stage Processes

Authors: Shunya Kusakawa, Shion Takeno, Yu Inatsu, Kentaro Kutsukake, Shogo Iwazaki, Takashi Nakano, Toru Ujihara, Masayuki Karasuyama, Ichiro Takeuchi

Abstract: Complex processes in science and engineering are often formulated as multistage decision-making problems. In this paper, we consider a type of multistage decision-making process called a cascade process. A cascade process is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the… ▽ More Complex processes in science and engineering are often formulated as multistage decision-making problems. In this paper, we consider a type of multistage decision-making process called a cascade process. A cascade process is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the optimal controllable parameters for each stage exhaustively. To address this problem, we formulate the optimization of the cascade process as an extension of the Bayesian optimization framework and propose two types of acquisition functions based on credible intervals and expected improvement. We investigate the theoretical properties of the proposed acquisition functions and demonstrate their effectiveness through numerical experiments. In addition, we consider an extension called suspension setting in which we are allowed to suspend the cascade process at the middle of the multistage decision-making process that often arises in practical problems. We apply the proposed method in a test problem involving a solar cell simulator, which was the motivation for this study. △ Less

Submitted 7 March, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

Comments: 70pages, 7 figures

Journal ref: Neural Computation (2022) 34 (12): 2408-2431

arXiv:2109.12063 [pdf, ps, other]

Reduced-Lead ECG Classifier Model Trained with DivideMix and Model Ensemble

Authors: Hiroshi Seki, Takashi Nakano, Koshiro Ikeda, Shinji Hirooka, Takaaki Kawasaki, Mitsutomo Yamada, Shumpei Saito, Toshitaka Yamakawa, Shimpei Ogawa

Abstract: Automatic diagnosis of multiple cardiac abnormalities from reduced-lead electrocardiogram (ECG) data is challenging. One of the reasons for this is the difficulty of defining labels from standard 12-lead data. Reduced-lead ECG data usually do not have identical characteristics of cardiac abnormalities because of the noisy label problem. Thus, there is an inconsistency in the annotated labels betwe… ▽ More Automatic diagnosis of multiple cardiac abnormalities from reduced-lead electrocardiogram (ECG) data is challenging. One of the reasons for this is the difficulty of defining labels from standard 12-lead data. Reduced-lead ECG data usually do not have identical characteristics of cardiac abnormalities because of the noisy label problem. Thus, there is an inconsistency in the annotated labels between the reduced-lead and 12-lead ECG data. To solve this, we propose deep neural network (DNN)-based ECG classifier models that incorporate DivideMix and stochastic weight averaging (SWA). DivideMix was used to refine the noisy label by using two separate models. Besides DivideMix, we used a model ensemble technique, SWA, which also focuses on the noisy label problem, to enhance the effect of the models generated by DivideMix. Our classifiers (ami_kagoshima) received scores of 0.49, 0.47, 0.48, 0.47, and 0.47 (ranked 9th, 10th, 10th, 11th, and 10th, respectively, out of 39 teams) for the 12-lead, 6-lead, 4-lead, 3-lead, and 2-lead versions, respectively, of the hidden test set with the challenge evaluation metric. We obtained the scores of 0.701, 0.686, 0.693, 0.693, and 0.685 on the 10-fold cross validation, and 0.623, 0.593, 0.606, 0.612, and 0.601 on the hidden validation set for each lead combination. △ Less

Submitted 24 September, 2021; originally announced September 2021.

Comments: 4 pages, in Computing in Cardiology (CinC) 2021, PhysioNet/CinC Challenge 2021 submission

arXiv:2012.10999 [pdf, other]

Exploring Effectiveness of Inter-Microtask Qualification Tests in Crowdsourcing

Authors: Masaya Morinaga, Susumu Saito, Teppei Nakano, Tetsunori Kobayashi, Tetsuji Ogawa

Abstract: Qualification tests in crowdsourcing are often used to pre-filter workers by measuring their ability in executing microtasks.While creating qualification tests for each task type is considered as a common and reasonable way, this study investigates into its worker-filtering performance when the same qualification test is used across multiple types of tasks.On Amazon Mechanical Turk, we tested the… ▽ More Qualification tests in crowdsourcing are often used to pre-filter workers by measuring their ability in executing microtasks.While creating qualification tests for each task type is considered as a common and reasonable way, this study investigates into its worker-filtering performance when the same qualification test is used across multiple types of tasks.On Amazon Mechanical Turk, we tested the annotation accuracy in six different cases where tasks consisted of two different difficulty levels, arising from the identical real-world domain: four combinatory cases in which the qualification test and the actual task were the same or different from each other, as well as two other cases where workers with Masters Qualification were asked to perform the actual task only.The experimental results demonstrated the two following findings: i) Workers that were assigned to a difficult qualification test scored better annotation accuracy regardless of the difficulty of the actual task; ii) Workers with Masters Qualification scored better annotation accuracy on the low-difficulty task, but were not as accurate as those who passed a qualification test on the high-difficulty task. △ Less

Submitted 20 December, 2020; originally announced December 2020.

arXiv:2007.01089 [pdf]

Estimating Blink Probability for Highlight Detection in Figure Skating Videos

Authors: Tamami Nakano, Atsuya Sakata, Akihiro Kishimoto

Abstract: Highlight detection in sports videos has a broad viewership and huge commercial potential. It is thus imperative to detect highlight scenes more suitably for human interest with high temporal accuracy. Since people instinctively suppress blinks during attention-grabbing events and synchronously generate blinks at attention break points in videos, the instantaneous blink rate can be utilized as a h… ▽ More Highlight detection in sports videos has a broad viewership and huge commercial potential. It is thus imperative to detect highlight scenes more suitably for human interest with high temporal accuracy. Since people instinctively suppress blinks during attention-grabbing events and synchronously generate blinks at attention break points in videos, the instantaneous blink rate can be utilized as a highly accurate temporal indicator of human interest. Therefore, in this study, we propose a novel, automatic highlight detection method based on the blink rate. The method trains a one-dimensional convolution network (1D-CNN) to assess blink rates at each video frame from the spatio-temporal pose features of figure skating videos. Experiments show that the method successfully estimates the blink rate in 94% of the video clips and predicts the temporal change in the blink rate around a jump event with high accuracy. Moreover, the method detects not only the representative athletic action, but also the distinctive artistic expression of figure skating performance as key frames. This suggests that the blink-rate-based supervised learning approach enables high-accuracy highlight detection that more closely matches human sensibility. △ Less

Submitted 2 July, 2020; originally announced July 2020.

arXiv:1903.07032 [pdf, other]

doi 10.1145/3308558.3313716

TurkScanner: Predicting the Hourly Wage of Microtasks

Authors: Susumu Saito, Chun-Wei Chiang, Saiph Savage, Teppei Nakano, Tetsunori Kobayashi, Jeffrey Bigham

Abstract: Workers in crowd markets struggle to earn a living. One reason for this is that it is difficult for workers to accurately gauge the hourly wages of microtasks, and they consequently end up performing labor with little pay. In general, workers are provided with little information about tasks, and are left to rely on noisy signals, such as textual description of the task or rating of the requester.… ▽ More Workers in crowd markets struggle to earn a living. One reason for this is that it is difficult for workers to accurately gauge the hourly wages of microtasks, and they consequently end up performing labor with little pay. In general, workers are provided with little information about tasks, and are left to rely on noisy signals, such as textual description of the task or rating of the requester. This study explores various computational methods for predicting the working times (and thus hourly wages) required for tasks based on data collected from other workers completing crowd work. We provide the following contributions. (i) A data collection method for gathering real-world training data on crowd-work tasks and the times required for workers to complete them; (ii) TurkScanner: a machine learning approach that predicts the necessary working time to complete a task (and can thus implicitly provide the expected hourly wage). We collected 9,155 data records using a web browser extension installed by 84 Amazon Mechanical Turk workers, and explored the challenge of accurately recording working times both automatically and by asking workers. TurkScanner was created using ~150 derived features, and was able to predict the hourly wages of 69.6% of all the tested microtasks within a 75% error. Directions for future research include observing the effects of tools on people's working practices, adapting this approach to a requester tool for better price setting, and predicting other elements of work (e.g., the acceptance likelihood and worker task preferences.) △ Less

Submitted 17 March, 2019; originally announced March 2019.

Comments: Proceedings of the 28th International Conference on World Wide Web (WWW '19), San Francisco, CA, USA, May 13-17, 2019

arXiv:1712.07807 [pdf, other]

doi 10.1103/PhysRevE.99.050304

Fault Tolerance of Random Graphs with respect to Connectivity: Mean-field Approximation for Semi-dense Random Graphs

Authors: Satoshi Takabe, Takafumi Nakano, Tadashi Wadayama

Abstract: The fault tolerance of random graphs with unbounded degrees with respect to connectivity is investigated, which relates to the reliability of wireless sensor networks with unreliable relay nodes. The model evaluates the network breakdown probability that a graph is disconnected after stochastic node removal. To establish a mean-field approximation for the model, we propose the cavity method for fi… ▽ More The fault tolerance of random graphs with unbounded degrees with respect to connectivity is investigated, which relates to the reliability of wireless sensor networks with unreliable relay nodes. The model evaluates the network breakdown probability that a graph is disconnected after stochastic node removal. To establish a mean-field approximation for the model, we propose the cavity method for finite systems. The analysis enables us to obtain an approximation formula for random graphs with any number of nodes and an arbitrary degree distribution. In addition, its asymptotic analysis reveals that the phase transition occurs in semi-dense random graphs whose average degree grows logarithmically. These results, which are supported by numerical simulations, coincide with the mathematical results, indicating successful predictions by mean-field approximation for unbounded but not dense random graphs. △ Less

Submitted 9 May, 2019; v1 submitted 21 December, 2017; originally announced December 2017.

Comments: 5 pages, 3 figures

Journal ref: Phys. Rev. E 99, 050304 (2019)

arXiv:1701.08492 [pdf, ps, other]

doi 10.1587/transfun.E100.A.2647

On Zero Error Capacity of Nearest Neighbor Error Channels with Multilevel Alphabet

Authors: Takafumi Nakano, Tadashi Wadayama

Abstract: This paper studies the zero error capacity of the Nearest Neighbor Error (NNE) channels with a multilevel alphabet. In the NNE channels, a transmitted symbol is a $d$-tuple of elements in $\{0,1,2,\dots, n-1 \}$. It is assumed that only one element error to a nearest neighbor element in a transmitted symbol can occur. The NNE channels can be considered as a special type of limited magnitude error… ▽ More This paper studies the zero error capacity of the Nearest Neighbor Error (NNE) channels with a multilevel alphabet. In the NNE channels, a transmitted symbol is a $d$-tuple of elements in $\{0,1,2,\dots, n-1 \}$. It is assumed that only one element error to a nearest neighbor element in a transmitted symbol can occur. The NNE channels can be considered as a special type of limited magnitude error channels, and it is closely related to error models for flash memories. In this paper, we derive a lower bound of the zero error capacity of the NNE channels based on a result of the perfect Lee codes. An upper bound of the zero error capacity of the NNE channels is also derived from a feasible solution of a linear programming problem defined based on the confusion graphs of the NNE channels. As a result, a concise formula of the zero error capacity is obtained using the lower and upper bounds. △ Less

Submitted 30 January, 2017; originally announced January 2017.

arXiv:1701.06770 [pdf, ps, other]

Analysis of Breakdown Probability of Wireless Sensor Networks with Unreliable Relay Nodes

Authors: Takayuki Nozaki, Takafumi Nakano, Tadashi Wadayama

Abstract: In the present paper, we derive an upper bound of the average network breakdown probability of packet networks with unreliable relay nodes. We here assume that relay nodes get independently broken with a given node breakdown probability. A survivor graph is the induced subgraph obtained by removing the broken relay nodes and their connecting edges from the original graph. If the survivor network i… ▽ More In the present paper, we derive an upper bound of the average network breakdown probability of packet networks with unreliable relay nodes. We here assume that relay nodes get independently broken with a given node breakdown probability. A survivor graph is the induced subgraph obtained by removing the broken relay nodes and their connecting edges from the original graph. If the survivor network is disconnected, we consider a network breakdown happens. The primal contribution of the paper is to derive an upper bound of the average network breakdown probability, where the expectation is taken over a regular graph ensemble. The proof of the bound is based on a natural one-to-one correspondence between a regular graph and a regular bipartite graph, and also on enumeration of bipartite graphs satisfying certain conditions. This proof argument is inspired by the analysis of weight distribution for low-density parity-check codes. Compared with estimates of the average network breakdown probability obtained by computer experiments, it is observed that the upper bound provides the values which are not only upper bounds but also precise estimates of the network breakdown probability when the node breakdown probability is small. △ Less

Submitted 24 January, 2017; originally announced January 2017.

Comments: 5 pages, 4 figures, submitted to ISIT2017

arXiv:1609.07329 [pdf, other]

RNA as a Nanoscale Data Transmission Medium: Error Analysis

Authors: Andrew W. Eckford, Taro Furbayashi, Tadashi Nakano

Abstract: RNA can be used as a high-density medium for data storage and transmission; however, an important RNA process -- replication -- is noisy. This paper presents an error analysis for RNA as a data transmission medium, analyzing how deletion errors increase in a collection of replicated DNA strands over time. RNA can be used as a high-density medium for data storage and transmission; however, an important RNA process -- replication -- is noisy. This paper presents an error analysis for RNA as a data transmission medium, analyzing how deletion errors increase in a collection of replicated DNA strands over time. △ Less

Submitted 23 September, 2016; originally announced September 2016.

Comments: Accepted for publication in the 2016 IEEE International Conference on Nanotechnology (IEEE NANO), Sendai, Japan

arXiv:1406.4259 [pdf, ps, other]

doi 10.1109/JSAC.2014.2367653

TCP-like molecular communications

Authors: Luca Felicetti, Mauro Femminella, Gianluca Reali, Tadashi Nakano, Athanasios V. Vasilakos

Abstract: In this paper, we present a communication protocol between a pair of biological nanomachines, transmitter and receiver, built upon molecular communications in an aqueous environment. In our proposal, the receiver, acting as a control node, sends a connection setup signal to the transmitter, which stokes molecules, to start molecule transmission. The molecules transmitted by the transmitter propaga… ▽ More In this paper, we present a communication protocol between a pair of biological nanomachines, transmitter and receiver, built upon molecular communications in an aqueous environment. In our proposal, the receiver, acting as a control node, sends a connection setup signal to the transmitter, which stokes molecules, to start molecule transmission. The molecules transmitted by the transmitter propagate in the environment and are absorbed by the receiver through its receptors. When the receiver absorbs the desired quantity of molecules, it releases a tear-down signal to notify the transmitter to stop the transmission. The proposed protocol implements a bidirectional communication by using a number of techniques originally designed for the TCP. In fact, the proposed protocol is connection-oriented, and uses the TCP-like probing to find a suitable transmission rate between transmitter and receiver so as to avoid receiver congestion. Unlike the TCP, however, explicit acknowledgments are not used, since they would degrade the communication throughput due to the large delay, a characteristic feature of molecular communications. Thus, the proposed protocol uses implicit acknowledgments, and feedback signals are sent by the receiver to throttle the transmission rate at the transmitter, i.e., explicit negative feedbacks. We also present the results of an extensive simulation campaign, used to validate the proposed protocol and to properly dimension the main protocol parameters. △ Less

Submitted 17 June, 2014; originally announced June 2014.

Comments: Accepted for publication, IEEE Journal on Selected Areas in Communications, 2014

Showing 1–13 of 13 results for author: Nakano, T