Skip to main content

Showing 1–13 of 13 results for author: Kitada, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.11431  [pdf, other

    cs.CL

    Majority or Minority: Data Imbalance Learning Method for Named Entity Recognition

    Authors: Sota Nemoto, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Data imbalance presents a significant challenge in various machine learning (ML) tasks, particularly named entity recognition (NER) within natural language processing (NLP). NER exhibits a data imbalance with a long-tail distribution, featuring numerous minority classes (i.e., entity classes) and a single majority class (i.e., O-class). This imbalance leads to misclassifications of the entity clas… ▽ More

    Submitted 16 March, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 5 pages, 1 figures, 3 tables. Accepted at Practical ML for Low Resource Settings (PML4LRS) Workshop @ ICLR 2024

  2. arXiv:2303.14116  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.MM

    Improving Prediction Performance and Model Interpretability through Attention Mechanisms from Basic and Applied Research Perspectives

    Authors: Shunsuke Kitada

    Abstract: With the dramatic advances in deep learning technology, machine learning research is focusing on improving the interpretability of model predictions as well as prediction performance in both basic and applied research. While deep learning models have much higher prediction performance than traditional machine learning models, the specific prediction process is still difficult to interpret and/or e… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: The bulletin of Graduate School of Science and Engineering, Hosei University, Vol.64 (03/2023). This article draws heavily from arxiv:2009.12064, arxiv:2104.08763, arxiv:1905.07289, and arxiv:2204.11588

  3. arXiv:2211.09427  [pdf, other

    cs.CV cs.AI cs.CL cs.HC cs.LG

    Feedback is Needed for Retakes: An Explainable Poor Image Notification Framework for the Visually Impaired

    Authors: Kazuya Ohata, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: We propose a simple yet effective image captioning framework that can determine the quality of an image and notify the user of the reasons for any flaws in the image. Our framework first determines the quality of images and then generates captions using only those images that are determined to be of high quality. The user is notified by the flaws feature to retake if image quality is low, and this… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures. Accepted at 2022 IEEE 19th International Conference on Smart Communities: Improving Quality of Life Using ICT, IoT and AI (HONET) as a full paper

  4. arXiv:2209.03126  [pdf, other

    cs.MM cs.AI cs.CL cs.CV cs.LG

    DM$^2$S$^2$: Deep Multi-Modal Sequence Sets with Hierarchical Modality Attention

    Authors: Shunsuke Kitada, Yuki Iwazaki, Riku Togashi, Hitoshi Iyatomi

    Abstract: There is increasing interest in the use of multimodal data in various web applications, such as digital advertising and e-commerce. Typical methods for extracting important information from multimodal data rely on a mid-fusion architecture that combines the feature representations from multiple encoders. However, as the number of modalities increases, several potential problems with the mid-fusion… ▽ More

    Submitted 22 November, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 12 pages, 3 figures. Accepted by IEEE Access on Nov. 3, 2022

    Journal ref: in IEEE Access, vol. 10, pp. 120023-120034, 2022

  5. arXiv:2208.14244  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    Expressions Causing Differences in Emotion Recognition in Social Networking Service Documents

    Authors: Tsubasa Nakagawa, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: It is often difficult to correctly infer a writer's emotion from text exchanged online, and differences in recognition between writers and readers can be problematic. In this paper, we propose a new framework for detecting sentences that create differences in emotion recognition between the writer and the reader and for detecting the kinds of expressions that cause such differences. The proposed f… ▽ More

    Submitted 3 September, 2022; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: 5 pages, 3 figures. Accepted at the 31st ACM International Conference on Information and Knowledge Management (CIKM '22) as a short paper

    Journal ref: Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM'22), October 17--21, 2022, Atlanta, GA, USA

  6. arXiv:2204.11588  [pdf, other

    cs.IR cs.AI cs.CL cs.CV cs.LG

    Ad Creative Discontinuation Prediction with Multi-Modal Multi-Task Neural Survival Networks

    Authors: Shunsuke Kitada, Hitoshi Iyatomi, Yoshifumi Seki

    Abstract: Discontinuing ad creatives at an appropriate time is one of the most important ad operations that can have a significant impact on sales. Such operational support for ineffective ads has been less explored than that for effective ads. After pre-analyzing 1,000,000 real-world ad creatives, we found that there are two types of discontinuation: short-term (i.e., cut-out) and long-term (i.e., wear-out… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: 23 pages, 5 figures. Accepted by Appl. Sci. on March 29th, 2022

    Journal ref: Appl. Sci. 2022, 12(7), 3594

  7. Making Attention Mechanisms More Robust and Interpretable with Virtual Adversarial Training

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Although attention mechanisms have become fundamental components of deep learning models, they are vulnerable to perturbations, which may degrade the prediction performance and model interpretability. Adversarial training (AT) for attention mechanisms has successfully reduced such drawbacks by considering adversarial perturbations. However, this technique requires label information, and thus, its… ▽ More

    Submitted 25 December, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: 18 pages, 3 figures. Accepted for publication in Springer Applied Intelligence (APIN)

    Journal ref: Applied Intelligence, Springer, 2022

  8. arXiv:2011.04184  [pdf, other

    cs.CL cs.AI cs.LG

    Text Classification through Glyph-aware Disentangled Character Embedding and Semantic Sub-character Augmentation

    Authors: Takumi Aoki, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: We propose a new character-based text classification framework for non-alphabetic languages, such as Chinese and Japanese. Our framework consists of a variational character encoder (VCE) and character-level text classifier. The VCE is composed of a $β$-variational auto-encoder ($β$-VAE) that learns the proposed glyph-aware disentangled character embedding (GDCE). Since our GDCE provides zero-mean… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: 6 pages, 3 figures, Accepted at AACL-IJCNLP 2020: Student Research Workshop

  9. Attention Meets Perturbations: Robust and Interpretable Attention with Adversarial Training

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Although attention mechanisms have been applied to a variety of deep learning models and have been shown to improve the prediction performance, it has been reported to be vulnerable to perturbations to the mechanism. To overcome the vulnerability to perturbations in the mechanism, we are inspired by adversarial training (AT), which is a powerful regularization technique for enhancing the robustnes… ▽ More

    Submitted 30 June, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 12 pages, 4 figures. Accepted by IEEE Access on Jun. 21, 2021

    Journal ref: in IEEE Access, vol. 9, pp. 92974-92985, 2021

  10. arXiv:2006.11586  [pdf, other

    cs.CL

    AraDIC: Arabic Document Classification using Image-Based Character Embeddings and Class-Balanced Loss

    Authors: Mahmoud Daif, Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: Classical and some deep learning techniques for Arabic text classification often depend on complex morphological analysis, word segmentation, and hand-crafted feature engineering. These could be eliminated by using character-level features. We propose a novel end-to-end Arabic document classification framework, Arabic document image-based classifier (AraDIC), inspired by the work on image-based ch… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

  11. Conversion Prediction Using Multi-task Conditional Attention Networks to Support the Creation of Effective Ad Creative

    Authors: Shunsuke Kitada, Hitoshi Iyatomi, Yoshifumi Seki

    Abstract: Accurately predicting conversions in advertisements is generally a challenging task, because such conversions do not occur frequently. In this paper, we propose a new framework to support creating high-performing ad creatives, including the accurate prediction of ad creative text conversions before delivering to the consumer. The proposed framework includes three key ideas: multi-task learning, co… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 9 pages, 6 figures. Accepted at The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2019) as an applied data science paper

    Journal ref: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '19), August 4--8, 2019, Anchorage, AK, USA

  12. End-to-End Text Classification via Image-based Embedding using Character-level Networks

    Authors: Shunsuke Kitada, Ryunosuke Kotani, Hitoshi Iyatomi

    Abstract: For analysing and/or understanding languages having no word boundaries based on morphological analysis such as Japanese, Chinese, and Thai, it is desirable to perform appropriate word segmentation before word embeddings. But it is inherently difficult in these languages. In recent years, various language models based on deep learning have made remarkable progress, and some of these methodologies u… ▽ More

    Submitted 10 October, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: To appear in IEEE Applied Imagery Pattern Recognition (AIPR) 2018 workshop

  13. arXiv:1809.02568  [pdf, ps, other

    cs.CV

    Skin lesion classification with ensemble of squeeze-and-excitation networks and semi-supervised learning

    Authors: Shunsuke Kitada, Hitoshi Iyatomi

    Abstract: In this report, we introduce the outline of our system in Task 3: Disease Classification of ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection. We fine-tuned multiple pre-trained neural network models based on Squeeze-and-Excitation Networks (SENet) which achieved state-of-the-art results in the field of image recognition. In addition, we used the mean teachers as a semi-supervised learnin… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: 6 pages, 4 figures, ISIC2018