Skip to main content

Showing 1–30 of 30 results for author: Iyatomi, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.11431  [pdf, other

    cs.CL

    Majority or Minority: Data Imbalance Learning Method for Named Entity Recognition

    Authors: Sota Nemoto, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Data imbalance presents a significant challenge in various machine learning (ML) tasks, particularly named entity recognition (NER) within natural language processing (NLP). NER exhibits a data imbalance with a long-tail distribution, featuring numerous minority classes (i.e., entity classes) and a single majority class (i.e., O-class). This imbalance leads to misclassifications of the entity clas… ▽ More

    Submitted 16 March, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 5 pages, 1 figures, 3 tables. Accepted at Practical ML for Low Resource Settings (PML4LRS) Workshop @ ICLR 2024

  2. arXiv:2309.01903  [pdf, other

    cs.CV

    Towards Robust Plant Disease Diagnosis with Hard-sample Re-mining Strategy

    Authors: Quan Huu Cap, Atsushi Fukuda, Satoshi Kagiwada, Hiroyuki Uga, Nobusuke Iwasaki, Hitoshi Iyatomi

    Abstract: With rich annotation information, object detection-based automated plant disease diagnosis systems (e.g., YOLO-based systems) often provide advantages over classification-based systems (e.g., EfficientNet-based), such as the ability to detect disease locations and superior classification performance. One drawback of these detection systems is dealing with unannotated healthy data with no real symp… ▽ More

    Submitted 30 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  3. arXiv:2304.01864  [pdf, other

    eess.IV cs.CV

    A Practical Framework for Unsupervised Structure Preservation Medical Image Enhancement

    Authors: Quan Huu Cap, Atsushi Fukuda, Hitoshi Iyatomi

    Abstract: Medical images are extremely valuable for supporting medical diagnoses. However, in practice, low-quality (LQ) medical images, such as images that are hazy/blurry, have uneven illumination, or are out of focus, among others, are often obtained during data acquisition. This leads to difficulties in the screening and diagnosis of medical diseases. Several generative adversarial networks (GAN)-based… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 11 pages, 7 figures

  4. arXiv:2211.09427  [pdf, other

    cs.CV cs.AI cs.CL cs.HC cs.LG

    Feedback is Needed for Retakes: An Explainable Poor Image Notification Framework for the Visually Impaired

    Authors: Kazuya Ohata, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: We propose a simple yet effective image captioning framework that can determine the quality of an image and notify the user of the reasons for any flaws in the image. Our framework first determines the quality of images and then generates captions using only those images that are determined to be of high quality. The user is notified by the flaws feature to retake if image quality is low, and this… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures. Accepted at 2022 IEEE 19th International Conference on Smart Communities: Improving Quality of Life Using ICT, IoT and AI (HONET) as a full paper

  5. arXiv:2210.00506  [pdf, other

    eess.IV cs.CV cs.IR cs.LG q-bio.NC

    Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval

    Authors: Kei Nishimaki, Kumpei Ikuta, Yuto Onga, Hitoshi Iyatomi, Kenichi Oishi

    Abstract: Content-based image retrieval (CBIR) systems are an emerging technology that supports reading and interpreting medical images. Since 3D brain MR images are high dimensional, dimensionality reduction is necessary for CBIR using machine learning techniques. In addition, for a reliable CBIR system, each dimension in the resulting low-dimensional representation must be associated with a neurologically… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: 6 pages, 6 figures. Accepted at the International Conference on Systems, Man, and Cybernetics (IEEE SMC '22)

  6. arXiv:2209.03126  [pdf, other

    cs.MM cs.AI cs.CL cs.CV cs.LG

    DM$^2$S$^2$: Deep Multi-Modal Sequence Sets with Hierarchical Modality Attention

    Authors: Shunsuke Kitada, Yuki Iwazaki, Riku Togashi, Hitoshi Iyatomi

    Abstract: There is increasing interest in the use of multimodal data in various web applications, such as digital advertising and e-commerce. Typical methods for extracting important information from multimodal data rely on a mid-fusion architecture that combines the feature representations from multiple encoders. However, as the number of modalities increases, several potential problems with the mid-fusion… ▽ More

    Submitted 22 November, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 12 pages, 3 figures. Accepted by IEEE Access on Nov. 3, 2022

    Journal ref: in IEEE Access, vol. 10, pp. 120023-120034, 2022

  7. arXiv:2208.14244  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    Expressions Causing Differences in Emotion Recognition in Social Networking Service Documents

    Authors: Tsubasa Nakagawa, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: It is often difficult to correctly infer a writer's emotion from text exchanged online, and differences in recognition between writers and readers can be problematic. In this paper, we propose a new framework for detecting sentences that create differences in emotion recognition between the writer and the reader and for detecting the kinds of expressions that cause such differences. The proposed f… ▽ More

    Submitted 3 September, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: 5 pages, 3 figures. Accepted at the 31st ACM International Conference on Information and Knowledge Management (CIKM '22) as a short paper

    Journal ref: Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM'22), October 17--21, 2022, Atlanta, GA, USA

  8. arXiv:2204.11588  [pdf, other

    cs.IR cs.AI cs.CL cs.CV cs.LG

    Ad Creative Discontinuation Prediction with Multi-Modal Multi-Task Neural Survival Networks

    Authors: Shunsuke Kitada, Hitoshi Iyatomi, Yoshifumi Seki

    Abstract: Discontinuing ad creatives at an appropriate time is one of the most important ad operations that can have a significant impact on sales. Such operational support for ineffective ads has been less explored than that for effective ads. After pre-analyzing 1,000,000 real-world ad creatives, we found that there are two types of discontinuation: short-term (i.e., cut-out) and long-term (i.e., wear-out… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: 23 pages, 5 figures. Accepted by Appl. Sci. on March 29th, 2022

    Journal ref: Appl. Sci. 2022, 12(7), 3594

  9. arXiv:2108.08158  [pdf, other

    eess.IV cs.CV

    Practical X-ray Gastric Cancer Screening Using Refined Stochastic Data Augmentation and Hard Boundary Box Training

    Authors: Hideaki Okamoto, Takakiyo Nomura, Kazuhito Nabeshima, Jun Hashimoto, Hitoshi Iyatomi

    Abstract: In gastric cancer screening, X-rays can be performed by radiographers, allowing them to see far more patients than endoscopy, which can only be performed by physicians. However, due to subsequent diagnostic difficulties, the sensitivity of gastric X-ray is only 85.5%, and little research has been done on automated diagnostic aids that directly target gastric cancer. This paper proposes a practical… ▽ More

    Submitted 22 March, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 10 pages, 6 figures

  10. Disease-oriented image embedding with pseudo-scanner standardization for content-based image retrieval on 3D brain MRI

    Authors: Hayato Arai, Yuto Onga, Kumpei Ikuta, Yusuke Chayama, Hitoshi Iyatomi, Kenichi Oishi

    Abstract: To build a robust and practical content-based image retrieval (CBIR) system that is applicable to a clinical brain MRI database, we propose a new framework -- Disease-oriented image embedding with pseudo-scanner standardization (DI-PSS) -- that consists of two core techniques, data harmonization and a dimension reduction algorithm. Our DI-PSS uses skull strip** and CycleGAN-based image transform… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: 13 pages, 7 figures

  11. Making Attention Mechanisms More Robust and Interpretable with Virtual Adversarial Training

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Although attention mechanisms have become fundamental components of deep learning models, they are vulnerable to perturbations, which may degrade the prediction performance and model interpretability. Adversarial training (AT) for attention mechanisms has successfully reduced such drawbacks by considering adversarial perturbations. However, this technique requires label information, and thus, its… ▽ More

    Submitted 25 December, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: 18 pages, 3 figures. Accepted for publication in Springer Applied Intelligence (APIN)

    Journal ref: Applied Intelligence, Springer, 2022

  12. arXiv:2103.02198  [pdf, other

    cs.CV cs.AI cs.LG

    Bulk Production Augmentation Towards Explainable Melanoma Diagnosis

    Authors: Kasumi Obi, Quan Huu Cap, Noriko Umegaki-Arao, Masaru Tanaka, Hitoshi Iyatomi

    Abstract: Although highly accurate automated diagnostic techniques for melanoma have been reported, the realization of a system capable of providing diagnostic evidence based on medical indices remains an open issue because of difficulties in obtaining reliable training data. In this paper, we propose bulk production augmentation (BPA) to generate high-quality, diverse pseudo-skin tumor images with the desi… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES2020), Best Paper Award Student Category in Biomedical Imaging and Image Processing

  13. arXiv:2011.14132  [pdf, other

    eess.IV cs.CV

    MIINet: An Image Quality Improvement Framework for Supporting Medical Diagnosis

    Authors: Quan Huu Cap, Hitoshi Iyatomi, Atsushi Fukuda

    Abstract: Medical images have been indispensable and useful tools for supporting medical experts in making diagnostic decisions. However, taken medical images especially throat and endoscopy images are normally hazy, lack of focus, or uneven illumination. Thus, these could difficult the diagnosis process for doctors. In this paper, we propose MIINet, a novel image-to-image translation network for improving… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted at the ICPR2020 Workshops

  14. arXiv:2011.04184  [pdf, other

    cs.CL cs.AI cs.LG

    Text Classification through Glyph-aware Disentangled Character Embedding and Semantic Sub-character Augmentation

    Authors: Takumi Aoki, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: We propose a new character-based text classification framework for non-alphabetic languages, such as Chinese and Japanese. Our framework consists of a variational character encoder (VCE) and character-level text classifier. The VCE is composed of a $β$-variational auto-encoder ($β$-VAE) that learns the proposed glyph-aware disentangled character embedding (GDCE). Since our GDCE provides zero-mean… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: 6 pages, 3 figures, Accepted at AACL-IJCNLP 2020: Student Research Workshop

  15. arXiv:2010.06499  [pdf, other

    cs.CV eess.IV

    LASSR: Effective Super-Resolution Method for Plant Disease Diagnosis

    Authors: Quan Huu Cap, Hiroki Tani, Hiroyuki Uga, Satoshi Kagiwada, Hitoshi Iyatomi

    Abstract: The collection of high-resolution training data is crucial in building robust plant disease diagnosis systems, since such data have a significant impact on diagnostic performance. However, they are very difficult to obtain and are not always available in practice. Deep learning-based techniques, and particularly generative adversarial networks (GANs), can be applied to generate high-quality super-… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  16. Attention Meets Perturbations: Robust and Interpretable Attention with Adversarial Training

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Although attention mechanisms have been applied to a variety of deep learning models and have been shown to improve the prediction performance, it has been reported to be vulnerable to perturbations to the mechanism. To overcome the vulnerability to perturbations in the mechanism, we are inspired by adversarial training (AT), which is a powerful regularization technique for enhancing the robustnes… ▽ More

    Submitted 30 June, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 12 pages, 4 figures. Accepted by IEEE Access on Jun. 21, 2021

    Journal ref: in IEEE Access, vol. 9, pp. 92974-92985, 2021

  17. arXiv:2006.11586  [pdf, other

    cs.CL

    AraDIC: Arabic Document Classification using Image-Based Character Embeddings and Class-Balanced Loss

    Authors: Mahmoud Daif, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Classical and some deep learning techniques for Arabic text classification often depend on complex morphological analysis, word segmentation, and hand-crafted feature engineering. These could be eliminated by using character-level features. We propose a novel end-to-end Arabic document classification framework, Arabic document image-based classifier (AraDIC), inspired by the work on image-based ch… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

  18. arXiv:2002.10100  [pdf, other

    cs.CV

    LeafGAN: An Effective Data Augmentation Method for Practical Plant Disease Diagnosis

    Authors: Quan Huu Cap, Hiroyuki Uga, Satoshi Kagiwada, Hitoshi Iyatomi

    Abstract: Many applications for the automated diagnosis of plant disease have been developed based on the success of deep learning techniques. However, these applications often suffer from overfitting, and the diagnostic performance is drastically decreased when used on test datasets from new environments. In this paper, we propose LeafGAN, a novel image-to-image translation system with own attention mechan… ▽ More

    Submitted 27 November, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted as a regular paper in the IEEE Transactions on Automation Science and Engineering (T-ASE)

  19. arXiv:1912.01824  [pdf, ps, other

    eess.IV cs.CV

    Efficient feature embedding of 3D brain MRI images for content-based image retrieval with deep metric learning

    Authors: Yuto Onga, Shingo Fujiyama, Hayato Arai, Yusuke Chayama, Hitoshi Iyatomi, Kenichi Oishi

    Abstract: Increasing numbers of MRI brain scans, improvements in image resolution, and advancements in MRI acquisition technology are causing significant increases in the demand for and burden on radiologists' efforts in terms of reading and interpreting brain MRIs. Content-based image retrieval (CBIR) is an emerging technology for reducing this burden by supporting the reading of medical images. High dimen… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: To appear in the IEEE BigData 2019 Workshop on Advances in High Dimensional (AdHD) Big Data

  20. arXiv:1911.11341  [pdf, other

    eess.IV cs.CV

    Super-Resolution for Practical Automated Plant Disease Diagnosis System

    Authors: Quan Huu Cap, Hiroki Tani, Hiroyuki Uga, Satoshi Kagiwada, Hitoshi Iyatomi

    Abstract: Automated plant diagnosis using images taken from a distance is often insufficient in resolution and degrades diagnostic accuracy since the important external characteristics of symptoms are lost. In this paper, we first propose an effective pre-processing method for improving the performance of automated plant disease diagnosis systems using super-resolution techniques. We investigate the efficie… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: Published as a conference paper at CISS 2019, Baltimore, MD, USA

  21. arXiv:1911.10727  [pdf, ps, other

    cs.CV

    AOP: An Anti-overfitting Pretreatment for Practical Image-based Plant Diagnosis

    Authors: Takumi Saikawa, Quan Huu Cap, Satoshi Kagiwada, Hiroyuki Uga, Hitoshi Iyatomi

    Abstract: In image-based plant diagnosis, clues related to diagnosis are often unclear, and the other factors such as image backgrounds often have a significant impact on the final decision. As a result, overfitting due to latent similarities in the dataset often occurs, and the diagnostic performance on real unseen data (e,g. images from other farms) is usually dropped significantly. However, this problem… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: To appear in the IEEE BigData 2019 Workshop on Big Food and Nutrition Data Management and Analysis

  22. arXiv:1910.11506  [pdf, other

    cs.CV

    A comparable study: Intrinsic difficulties of practical plant diagnosis from wide-angle images

    Authors: Katsumasa Suwa, Quan Huu Cap, Ryunosuke Kotani, Hiroyuki Uga, Satoshi Kagiwada, Hitoshi Iyatomi

    Abstract: Practical automated detection and diagnosis of plant disease from wide-angle images (i.e. in-field images containing multiple leaves using a fixed-position camera) is a very important application for large-scale farm management, in view of the need to ensure global food security. However, develo** automated systems for disease diagnosis is often difficult, because labeling a reliable wide-angle… ▽ More

    Submitted 22 November, 2019; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: 7 pages, 3 figures

  23. Conversion Prediction Using Multi-task Conditional Attention Networks to Support the Creation of Effective Ad Creative

    Authors: Shunsuke Kitada, Hitoshi Iyatomi, Yoshifumi Seki

    Abstract: Accurately predicting conversions in advertisements is generally a challenging task, because such conversions do not occur frequently. In this paper, we propose a new framework to support creating high-performing ad creatives, including the accurate prediction of ad creative text conversions before delivering to the consumer. The proposed framework includes three key ideas: multi-task learning, co… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 9 pages, 6 figures. Accepted at The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2019) as an applied data science paper

    Journal ref: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '19), August 4--8, 2019, Anchorage, AK, USA

  24. End-to-End Text Classification via Image-based Embedding using Character-level Networks

    Authors: Shunsuke Kitada, Ryunosuke Kotani, Hitoshi Iyatomi

    Abstract: For analysing and/or understanding languages having no word boundaries based on morphological analysis such as Japanese, Chinese, and Thai, it is desirable to perform appropriate word segmentation before word embeddings. But it is inherently difficult in these languages. In recent years, various language models based on deep learning have made remarkable progress, and some of these methodologies u… ▽ More

    Submitted 10 October, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: To appear in IEEE Applied Imagery Pattern Recognition (AIPR) 2018 workshop

  25. arXiv:1809.02568  [pdf, ps, other

    cs.CV

    Skin lesion classification with ensemble of squeeze-and-excitation networks and semi-supervised learning

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: In this report, we introduce the outline of our system in Task 3: Disease Classification of ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection. We fine-tuned multiple pre-trained neural network models based on Squeeze-and-Excitation Networks (SENet) which achieved state-of-the-art results in the field of image recognition. In addition, we used the mean teachers as a semi-supervised learnin… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 6 pages, 4 figures, ISIC2018

  26. Lesion Border Detection in Dermoscopy Images Using Ensembles of Thresholding Methods

    Authors: M. Emre Celebi, Quan Wen, Sae Hwang, Hitoshi Iyatomi, Gerald Schaefer

    Abstract: Dermoscopy is one of the major imaging modalities used in the diagnosis of melanoma and other pigmented skin lesions. Due to the difficulty and subjectivity of human interpretation, automated analysis of dermoscopy images has become an important research area. Border detection is often the first step in this analysis. In many cases, the lesion can be roughly separated from the background skin usin… ▽ More

    Submitted 26 December, 2013; originally announced December 2013.

    Comments: 8 pages, 3 figures, 2 tables. arXiv admin note: substantial text overlap with arXiv:1009.1362

    ACM Class: I.4.6

    Journal ref: Skin Research and Technology 19 (2013) e252--e258

  27. Lesion Border Detection in Dermoscopy Images

    Authors: M. Emre Celebi, Hitoshi Iyatomi, Gerald Schaefer, William V. Stoecker

    Abstract: Background: Dermoscopy is one of the major imaging modalities used in the diagnosis of melanoma and other pigmented skin lesions. Due to the difficulty and subjectivity of human interpretation, computerized analysis of dermoscopy images has become an important research area. One of the most important steps in dermoscopy image analysis is the automated detection of lesion borders. Methods: In thi… ▽ More

    Submitted 30 October, 2010; originally announced November 2010.

    Comments: 10 pages, 1 figure, 3 tables

    ACM Class: I.4.6

    Journal ref: Computerized Medical Imaging and Graphics 33 (2009) 148--153

  28. Approximate Lesion Localization in Dermoscopy Images

    Authors: M. Emre Celebi, Hitoshi Iyatomi, Gerald Schaefer, William V. Stoecker

    Abstract: Background: Dermoscopy is one of the major imaging modalities used in the diagnosis of melanoma and other pigmented skin lesions. Due to the difficulty and subjectivity of human interpretation, automated analysis of dermoscopy images has become an important research area. Border detection is often the first step in this analysis. Methods: In this article, we present an approximate lesion localizat… ▽ More

    Submitted 6 September, 2010; originally announced September 2010.

    ACM Class: I.4.6

    Journal ref: Skin Research and Technology 15 (2009) 314-322

  29. An Improved Objective Evaluation Measure for Border Detection in Dermoscopy Images

    Authors: M. Emre Celebi, Gerald Schaefer, Hitoshi Iyatomi, William V. Stoecker, Joseph M. Malters, James M. Grichnik

    Abstract: Background: Dermoscopy is one of the major imaging modalities used in the diagnosis of melanoma and other pigmented skin lesions. Due to the difficulty and subjectivity of human interpretation, dermoscopy image analysis has become an important research area. One of the most important steps in dermoscopy image analysis is the automated detection of lesion borders. Although numerous methods have bee… ▽ More

    Submitted 6 September, 2010; originally announced September 2010.

    ACM Class: I.4.6

    Journal ref: Skin Research and Technology 15 (2009) 444-450

  30. Automatic Detection of Blue-White Veil and Related Structures in Dermoscopy Images

    Authors: M. Emre Celebi, Hitoshi Iyatomi, William V. Stoecker, Randy H. Moss, Harold S. Rabinovitz, Giuseppe Argenziano, H. Peter Soyer

    Abstract: Dermoscopy is a non-invasive skin imaging technique, which permits visualization of features of pigmented melanocytic neoplasms that are not discernable by examination with the naked eye. One of the most important features for the diagnosis of melanoma in dermoscopy images is the blue-white veil (irregular, structureless areas of confluent blue pigmentation with an overlying white "ground-glass"… ▽ More

    Submitted 6 September, 2010; originally announced September 2010.

    ACM Class: I.4.7; I.4.9

    Journal ref: Computerized Medical Imaging and Graphics 32 (2008) 670-677