Search | arXiv e-print repository

Instrument-tissue Interaction Detection Framework for Surgical Video Understanding

Authors: Wenjun Lin, Yan Hu, Huazhu Fu, Mingming Yang, Chin-Boon Chng, Ryo Kawasaki, Cheekong Chui, Jiang Liu

Abstract: Instrument-tissue interaction detection task, which helps understand surgical activities, is vital for constructing computer-assisted surgery systems but with many challenges. Firstly, most models represent instrument-tissue interaction in a coarse-grained way which only focuses on classification and lacks the ability to automatically detect instruments and tissues. Secondly, existing works do not… ▽ More Instrument-tissue interaction detection task, which helps understand surgical activities, is vital for constructing computer-assisted surgery systems but with many challenges. Firstly, most models represent instrument-tissue interaction in a coarse-grained way which only focuses on classification and lacks the ability to automatically detect instruments and tissues. Secondly, existing works do not fully consider relations between intra- and inter-frame of instruments and tissues. In the paper, we propose to represent instrument-tissue interaction as <instrument class, instrument bounding box, tissue class, tissue bounding box, action class> quintuple and present an Instrument-Tissue Interaction Detection Network (ITIDNet) to detect the quintuple for surgery videos understanding. Specifically, we propose a Snippet Consecutive Feature (SCF) Layer to enhance features by modeling relationships of proposals in the current frame using global context information in the video snippet. We also propose a Spatial Corresponding Attention (SCA) Layer to incorporate features of proposals between adjacent frames through spatial encoding. To reason relationships between instruments and tissues, a Temporal Graph (TG) Layer is proposed with intra-frame connections to exploit relationships between instruments and tissues in the same frame and inter-frame connections to model the temporal information for the same instance. For evaluation, we build a cataract surgery video (PhacoQ) dataset and a cholecystectomy surgery video (CholecQ) dataset. Experimental results demonstrate the promising performance of our model, which outperforms other state-of-the-art models on both datasets. △ Less

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2011.12527 [pdf, other]

Match Them Up: Visually Explainable Few-shot Image Classification

Authors: Bowen Wang, Liangzhi Li, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara

Abstract: Few-shot learning (FSL) approaches are usually based on an assumption that the pre-trained knowledge can be obtained from base (seen) categories and can be well transferred to novel (unseen) categories. However, there is no guarantee, especially for the latter part. This issue leads to the unknown nature of the inference process in most FSL methods, which hampers its application in some risk-sensi… ▽ More Few-shot learning (FSL) approaches are usually based on an assumption that the pre-trained knowledge can be obtained from base (seen) categories and can be well transferred to novel (unseen) categories. However, there is no guarantee, especially for the latter part. This issue leads to the unknown nature of the inference process in most FSL methods, which hampers its application in some risk-sensitive areas. In this paper, we reveal a new way to perform FSL for image classification, using visual representations from the backbone model and weights generated by a newly-emerged explainable classifier. The weighted representations only include a minimum number of distinguishable features and the visualized weights can serve as an informative hint for the FSL process. Finally, a discriminator will compare the representations of each pair of the images in the support set and the query set. Pairs with the highest scores will decide the classification results. Experimental results prove that the proposed method can achieve both good accuracy and satisfactory explainability on three mainstream datasets. △ Less

Submitted 25 November, 2020; originally announced November 2020.

arXiv:2011.03772 [pdf, other]

Automated Grading System of Retinal Arterio-venous Crossing Patterns: A Deep Learning Approach Replicating Ophthalmologist's Diagnostic Process of Arteriolosclerosis

Authors: Liangzhi Li, Manisha Verma, Bowen Wang, Yuta Nakashima, Hajime Nagahara, Ryo Kawasaki

Abstract: The status of retinal arteriovenous crossing is of great significance for clinical evaluation of arteriolosclerosis and systemic hypertension. As an ophthalmology diagnostic criteria, Scheie's classification has been used to grade the severity of arteriolosclerosis. In this paper, we propose a deep learning approach to support the diagnosis process, which, to the best of our knowledge, is one of t… ▽ More The status of retinal arteriovenous crossing is of great significance for clinical evaluation of arteriolosclerosis and systemic hypertension. As an ophthalmology diagnostic criteria, Scheie's classification has been used to grade the severity of arteriolosclerosis. In this paper, we propose a deep learning approach to support the diagnosis process, which, to the best of our knowledge, is one of the earliest attempts in medical imaging. The proposed pipeline is three-fold. First, we adopt segmentation and classification models to automatically obtain vessels in a retinal image with the corresponding artery/vein labels and find candidate arteriovenous crossing points. Second, we use a classification model to validate the true crossing point. At last, the grade of severity for the vessel crossings is classified. To better address the problem of label ambiguity and imbalanced label distribution, we propose a new model, named multi-diagnosis team network (MDTNet), in which the sub-models with different structures or different loss functions provide different decisions. MDTNet unifies these diverse theories to give the final decision with high accuracy. Our severity grading method was able to validate crossing points with precision and recall of 96.3% and 96.3%, respectively. Among correctly detected crossing points, the kappa value for the agreement between the grading by a retina specialist and the estimated score was 0.85, with an accuracy of 0.92. The numerical results demonstrate that our method can achieve a good performance in both arteriovenous crossing validation and severity grading tasks. By the proposed models, we could build a pipeline reproducing retina specialist's subjective grading without feature extractions. The code is available for reproducibility. △ Less

Submitted 1 December, 2022; v1 submitted 7 November, 2020; originally announced November 2020.

Comments: Accepted in PLOS Digital Health

arXiv:2010.09466 [pdf, other]

Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Authors: Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara, Yasushi Yagi

Abstract: Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner, with convolutional LSTMs (ConvLSTMs) to leverage the temporal coherency in video frames. We also present a simple yet effective training strategy, which replaces a frame in video sequence with noises. This strategy spoils the tempora… ▽ More Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner, with convolutional LSTMs (ConvLSTMs) to leverage the temporal coherency in video frames. We also present a simple yet effective training strategy, which replaces a frame in video sequence with noises. This strategy spoils the temporal coherency in video frames during training and thus makes the temporal links in ConvLSTMs unreliable, which may consequently improve feature extraction from video frames, as well as serve as a regularizer to avoid overfitting, without requiring extra data annotation or computational costs. Experimental results demonstrate that the proposed model can achieve state-of-the-art performances in both the CityScapes and EndoVis2018 datasets. △ Less

Submitted 19 October, 2020; originally announced October 2020.

arXiv:2009.06138 [pdf, other]

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

Authors: Liangzhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara

Abstract: Explainable artificial intelligence has been gaining attention in the past few years. However, most existing methods are based on gradients or intermediate features, which are not directly involved in the decision-making process of the classifier. In this paper, we propose a slot attention-based classifier called SCOUTER for transparent yet accurate classification. Two major differences from other… ▽ More Explainable artificial intelligence has been gaining attention in the past few years. However, most existing methods are based on gradients or intermediate features, which are not directly involved in the decision-making process of the classifier. In this paper, we propose a slot attention-based classifier called SCOUTER for transparent yet accurate classification. Two major differences from other attention-based methods include: (a) SCOUTER's explanation is involved in the final confidence for each category, offering more intuitive interpretation, and (b) all the categories have their corresponding positive or negative explanation, which tells "why the image is of a certain category" or "why the image is not of a certain category." We design a new loss tailored for SCOUTER that controls the model's behavior to switch between positive and negative explanations, as well as the size of explanatory regions. Experimental results show that SCOUTER can give better visual explanations in terms of various metrics while kee** good accuracy on small and medium-sized datasets. △ Less

Submitted 20 August, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

arXiv:2005.13337 [pdf, other]

Joint Learning of Vessel Segmentation and Artery/Vein Classification with Post-processing

Authors: Liangzhi Li, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara

Abstract: Retinal imaging serves as a valuable tool for diagnosis of various diseases. However, reading retinal images is a difficult and time-consuming task even for experienced specialists. The fundamental step towards automated retinal image analysis is vessel segmentation and artery/vein classification, which provide various information on potential disorders. To improve the performance of the existing… ▽ More Retinal imaging serves as a valuable tool for diagnosis of various diseases. However, reading retinal images is a difficult and time-consuming task even for experienced specialists. The fundamental step towards automated retinal image analysis is vessel segmentation and artery/vein classification, which provide various information on potential disorders. To improve the performance of the existing automated methods for retinal image analysis, we propose a two-step vessel classification. We adopt a UNet-based model, SeqNet, to accurately segment vessels from the background and make prediction on the vessel type. Our model does segmentation and classification sequentially, which alleviates the problem of label distribution bias and facilitates training. To further refine classification results, we post-process them considering the structural information among vessels to propagate highly confident prediction to surrounding vessels. Our experiments show that our method improves AUC to 0.98 for segmentation and the accuracy to 0.92 in classification over DRIVE dataset. △ Less

Submitted 27 May, 2020; originally announced May 2020.

Comments: Accepted in Medical Imaging with Deep Learning (MIDL) 2020

arXiv:2004.04367 [pdf]

Network Analysis of Attitudes towards Immigrants in Asia

Authors: Rachael Kei Kawasaki, Yuichi Ikeda

Abstract: This study models cross-national attitudes towards immigrants in East and Southeast Asia as a signed and weighted bipartite network of countries and evaluative reactions to a variety of political issues, or determinants. This network is then projected into two one-mode networks, one of countries and one of determinants, and community detection methods are applied. The paper aims to fill two defici… ▽ More This study models cross-national attitudes towards immigrants in East and Southeast Asia as a signed and weighted bipartite network of countries and evaluative reactions to a variety of political issues, or determinants. This network is then projected into two one-mode networks, one of countries and one of determinants, and community detection methods are applied. The paper aims to fill two deficiencies in the current research on attitudes towards immigrants: 1) the lack of cross-national studies in Asia, a region where migration is growing, and 2) the tendency of researchers to treat determinants as uncorrelated, despite the interdependent nature of evaluative reactions. The results show that the nine countries in the sample are a cohesive clique, showing greater similarities than differences in the determinants of their attitudes. A blockmodeling approach was employed to identify eight determinants in attitudes towards immigrants, namely views on independence and social dependencies, group identities, absolute or relative moral orientation, attitudes towards democracy, science and technology, prejudice and stigma, and two determinants related to religion. However, the findings of this survey yielded some surprising results when compared with the literature review. First, education was not found to be a significant determinants of attitudes towards immigrants, despite its strong and consistent predictive power in European models. Second, prejudice appears to be mediated in part by religion, especially in religious identification and belief in God. Group identity and prejudice also appear to be related, though only weakly. Finally, anxiety appears in clusters related to social norms, suggesting that fears regarding immigrants relates closely to expectations of others' behavior. △ Less

Submitted 9 April, 2020; originally announced April 2020.

Comments: 36 pages, 14 figures, submitted to Applied Network Science

arXiv:1912.05763 [pdf, other]

IterNet: Retinal Image Segmentation Utilizing Structural Redundancy in Vessel Networks

Authors: Liangzhi Li, Manisha Verma, Yuta Nakashima, Hajime Nagahara, Ryo Kawasaki

Abstract: Retinal vessel segmentation is of great interest for diagnosis of retinal vascular diseases. To further improve the performance of vessel segmentation, we propose IterNet, a new model based on UNet, with the ability to find obscured details of the vessel from the segmented vessel image itself, rather than the raw input image. IterNet consists of multiple iterations of a mini-UNet, which can be 4… ▽ More Retinal vessel segmentation is of great interest for diagnosis of retinal vascular diseases. To further improve the performance of vessel segmentation, we propose IterNet, a new model based on UNet, with the ability to find obscured details of the vessel from the segmented vessel image itself, rather than the raw input image. IterNet consists of multiple iterations of a mini-UNet, which can be 4$\times$ deeper than the common UNet. IterNet also adopts the weight-sharing and skip-connection features to facilitate training; therefore, even with such a large architecture, IterNet can still learn from merely 10$\sim$20 labeled images, without pre-training or any prior knowledge. IterNet achieves AUCs of 0.9816, 0.9851, and 0.9881 on three mainstream datasets, namely DRIVE, CHASE-DB1, and STARE, respectively, which currently are the best scores in the literature. The source code is available. △ Less

Submitted 11 December, 2019; originally announced December 2019.

Comments: Accepted in 2020 Winter Conference on Applications of Computer Vision (WACV 20)

arXiv:1306.5080 [pdf, ps, other]

doi 10.1103/PhysRevD.88.033019

Quark sector CP violation of the universal seesaw model

Authors: Ryomu Kawasaki, Takuya Morozumi, Hiroyuki Umeeda

Abstract: We study the CP violation of universal seesaw model, especially its quark sector. The model is based on SU(2)_L \times SU(2)_R \times U(1)_{Y^\prime}. In order to count the number of parameters in quark sector, we use the degree of freedom of weak basis transformation. For N(3)-generation model, the number of CP violating phase in quark sector is identified as 3N^2-3N+1 (19). We also construct nin… ▽ More We study the CP violation of universal seesaw model, especially its quark sector. The model is based on SU(2)_L \times SU(2)_R \times U(1)_{Y^\prime}. In order to count the number of parameters in quark sector, we use the degree of freedom of weak basis transformation. For N(3)-generation model, the number of CP violating phase in quark sector is identified as 3N^2-3N+1 (19). We also construct nineteen CP violating weak basis invariants of Yukawa coupling matrices and SU(2) singlet quark mass matrices in the three-generation universal seesaw model. The quark interaction terms induced by neutral currents are given as an exact formula. Both of the charged current and the neutral current are expressed in terms of the mass basis by finding the transformations from weak basis to mass basis. Finally, we calculate the mixing matrix element approximately assuming that the SU(2)_R breaking scale v_R is much larger than the electro-weak breaking scale v_L. △ Less

Submitted 6 September, 2013; v1 submitted 21 June, 2013; originally announced June 2013.

Comments: 32pages

Report number: HUPD1305

arXiv:0711.1903 [pdf, ps, other]

doi 10.1016/j.jcrysgro.2007.11.033

Shape of Heteroepitaxial Island Determined by Asymmetric Detachment

Authors: Yukio Saito, Ryo Kawasaki

Abstract: Square lattice gas models for heteroepitaxial growth are studied by means of kinetic Monte Carlo simulations, in order to find a possible origin of anisotropic island shape observed in growth experiments of long organic molecules. When deposited molecules form clusters irreversibly at their encounter during surface diffusion, islands grow in a ramified dendritic shape, similar to DLA. Introducti… ▽ More Square lattice gas models for heteroepitaxial growth are studied by means of kinetic Monte Carlo simulations, in order to find a possible origin of anisotropic island shape observed in growth experiments of long organic molecules. When deposited molecules form clusters irreversibly at their encounter during surface diffusion, islands grow in a ramified dendritic shape, similar to DLA. Introduction of molecular detachment from edges makes islands compact with smooth edges. Tilting of adsorbed long molecules or steps in a vicinal substrate may induce orientation-dependence in the detachment rate of edge molecules from an island. In simulations with orientation-dependent detachment rates, a clear anisotropy in an island shape is observed. Shape anisotropy on a vicinal substrate is enhanced as steps get dense, in agreement to the experimental observation. △ Less

Submitted 12 November, 2007; originally announced November 2007.

Comments: 5 pages, 6 figures

arXiv:0705.3730 [pdf, ps, other]

doi 10.1143/JPSJ.76.074604

Two-Dimensional Island Shape Determined by Detachment

Authors: Yukio Saito, Ryo Kawasaki

Abstract: Effect of an anisotropic detachment on a heteroepitaxial island shape is studied by means of a kinetic Monte Carlo simulation of a square lattice gas model. Only with molecular deposition followed by surface diffusion, islands grow in a ramified dendritic shape, similar to DLA. Introduction of molecular detachment from edges makes islands compact. To understand an anisotropic island shape observ… ▽ More Effect of an anisotropic detachment on a heteroepitaxial island shape is studied by means of a kinetic Monte Carlo simulation of a square lattice gas model. Only with molecular deposition followed by surface diffusion, islands grow in a ramified dendritic shape, similar to DLA. Introduction of molecular detachment from edges makes islands compact. To understand an anisotropic island shape observed in the experiment of pentacene growth on a hydrogen-terminated Si(111) vicinal surface, asymmetry in detachment around the substrate step is assumed. Edge molecules detach more to the higher terrace than to the lower terrace. The island edge from which molecules are easy to detach is smooth and the one hard to detach is dendritic. If islands are close to each other, islands tend to align in a line, since detached molecules from the smooth edge of the right island are fed to the dendritic and fast growing edge of the left island. △ Less

Submitted 25 May, 2007; originally announced May 2007.

Comments: 13 pages, 5 figures

Journal ref: J. Phys. Soc. Jpn. 76 (2007) 074604/1-6

arXiv:cond-mat/0511252 [pdf, ps, other]

doi 10.1016/j.ssc.2005.11.016

Enhancement of the anomalous Hall effect and spin glass behavior in the bilayered manganite La(2-2x)Sr(1+2x)Mn2O7

Authors: Y. Hirobe, Y. Ashikawa, R. Kawasaki, K. Noda, D. Akahoshi, H. Kuwahara

Abstract: The Hall resistivity and magnetization have been investigated in the ferromagnetic state of the bilayered manganite La(2-2x)Sr(1+2x)Mn2O7 (x=0.36). The Hall resistivity shows an increase in both the ordinary and anomalous Hall coefficients at low temperatures below 50K, a region in which experimental evidence for the spin glass state has been found in a low magnetic field of 1mT. The origin of t… ▽ More The Hall resistivity and magnetization have been investigated in the ferromagnetic state of the bilayered manganite La(2-2x)Sr(1+2x)Mn2O7 (x=0.36). The Hall resistivity shows an increase in both the ordinary and anomalous Hall coefficients at low temperatures below 50K, a region in which experimental evidence for the spin glass state has been found in a low magnetic field of 1mT. The origin of the anomalous behavior of the Hall resistivity relevant to magnetic states may lie in the intrinsic microscopic inhomogeneity in a quasi-two-dimensional electron system. △ Less

Submitted 10 November, 2005; originally announced November 2005.

Comments: 7 pages, 4 figures, Solid State Communications (in press)

Journal ref: Solid state communications Volume 137, pages 191-195, 2006

Showing 1–12 of 12 results for author: Kawasaki, R