Skip to main content

Showing 1–15 of 15 results for author: Iizuka, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17319  [pdf, other

    cs.CL cs.AI

    JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset

    Authors: Atsumoto Ohashi, Ryu Hirai, Shinya Iizuka, Ryuichiro Higashinaka

    Abstract: Dialogue datasets are crucial for deep learning-based task-oriented dialogue system research. While numerous English language multi-domain task-oriented dialogue datasets have been developed and contributed to significant advancements in task-oriented dialogue systems, such a dataset does not exist in Japanese, and research in this area is limited compared to that in English. In this study, toward… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  2. arXiv:2312.13816  [pdf, other

    cs.CL cs.AI cs.RO

    Team Flow at DRC2023: Building Common Ground and Text-based Turn-taking in a Travel Agent Spoken Dialogue System

    Authors: Ryu Hirai, Shinya Iizuka, Haruhisa Iseno, Ao Guo, **g**g Jiang, Atsumoto Ohashi, Ryuichiro Higashinaka

    Abstract: At the Dialogue Robot Competition 2023 (DRC2023), which was held to improve the capability of dialogue robots, our team developed a system that could build common ground and take more natural turns based on user utterance texts. Our system generated queries for sightseeing spot searches using the common ground and engaged in dialogue while waiting for user comprehension.

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: This paper is part of the proceedings of the Dialogue Robot Competition 2023

  3. Visual Grounding of Whole Radiology Reports for 3D CT Images

    Authors: Akimichi Ichinose, Taro Hatsutani, Keigo Nakamura, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Shoji Kido, Noriyuki Tomiyama

    Abstract: Building a large-scale training dataset is an essential problem in the development of medical image recognition systems. Visual grounding techniques, which automatically associate objects in images with corresponding descriptions, can facilitate labeling of large number of images. However, visual grounding of radiology reports for CT images remains challenging, because so many kinds of anomalies a… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures. Accepted at MICCAI 2023

    Journal ref: Medical Image Computing and Computer Assisted Intervention Lecture Notes in Computer Science 14224 (2023) 611-621

  4. arXiv:2312.04779  [pdf, other

    eess.IV cs.CV cs.LG

    Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging

    Authors: Saeko Sasuga, Akira Kudo, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Atsushi Hamabe, Masayuki Ishii, Ichiro Takemasa

    Abstract: Rectal cancer is one of the most common diseases and a major cause of mortality. For deciding rectal cancer treatment plans, T-staging is important. However, evaluating the index from preoperative MRI images requires high radiologists' skill and experience. Therefore, the aim of this study is to segment the mesorectum, rectum, and rectal cancer region so that the system can predict T-stage from se… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 10 pages, 7 figures, Accepted to Data Augmentation, Labeling, and Imperfections (DALI) at MICCAI 2022

  5. arXiv:2309.14759  [pdf, other

    cs.GR cs.CV

    Diffusion-based Holistic Texture Rectification and Synthesis

    Authors: Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui

    Abstract: We present a novel framework for rectifying occlusions and distortions in degraded texture samples from natural images. Traditional texture synthesis approaches focus on generating textures from pristine samples, which necessitate meticulous preparation by humans and are often unattainable in most natural images. These challenges stem from the frequent occlusions and distortions of texture samples… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH Asia 2023 Conference Paper

  6. arXiv:2308.10111  [pdf, other

    cs.CV cs.GR

    Controllable Multi-domain Semantic Artwork Synthesis

    Authors: Yuantian Huang, Satoshi Iizuka, Edgar Simo-Serra, Kazuhiro Fukui

    Abstract: We present a novel framework for multi-domain synthesis of artwork from semantic layouts. One of the main limitations of this challenging task is the lack of publicly available segmentation datasets for art synthesis. To address this problem, we propose a dataset, which we call ArtSem, that contains 40,000 images of artwork from 4 different domains with their corresponding semantic label maps. We… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: 15 pages, accepted by CVMJ, to appear

  7. arXiv:2212.00567  [pdf, other

    cs.CV cs.RO

    P2Net: A Post-Processing Network for Refining Semantic Segmentation of LiDAR Point Cloud based on Consistency of Consecutive Frames

    Authors: Yutaka Momma, Weimin Wang, Edgar Simo-Serra, Satoshi Iizuka, Ryosuke Nakamura, Hiroshi Ishikawa

    Abstract: We present a lightweight post-processing method to refine the semantic segmentation results of point cloud sequences. Most existing methods usually segment frame by frame and encounter the inherent ambiguity of the problem: based on a measurement in a single frame, labels are sometimes difficult to predict even for humans. To remedy this problem, we propose to explicitly train a network to refine… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

  8. arXiv:2210.09518  [pdf, other

    cs.CL cs.AI cs.RO

    Team Flow at DRC2022: Pipeline System for Travel Destination Recommendation Task in Spoken Dialogue

    Authors: Ryu Hirai, Atsumoto Ohashi, Ao Guo, Hideki Shiroma, Xulin Zhou, Yukihiko Tone, Shinya Iizuka, Ryuichiro Higashinaka

    Abstract: To improve the interactive capabilities of a dialogue system, e.g., to adapt to different customers, the Dialogue Robot Competition (DRC2022) was held. As one of the teams, we built a dialogue system with a pipeline structure containing four modules. The natural language understanding (NLU) and natural language generation (NLG) modules were GPT-2 based models, and the dialogue state tracking (DST)… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: This paper is part of the proceedings of the Dialogue Robot Competition 2022

  9. arXiv:2207.12859  [pdf, other

    cs.CV

    Adaptive occlusion sensitivity analysis for visually explaining video recognition networks

    Authors: Tomoki Uchiyama, Naoya Sogi, Satoshi Iizuka, Koichiro Niinuma, Kazuhiro Fukui

    Abstract: This paper proposes a method for visually explaining the decision-making process of video recognition networks with a temporal extension of occlusion sensitivity analysis, called Adaptive Occlusion Sensitivity Analysis (AOSA). The key idea here is to occlude a specific volume of data by a 3D mask in an input 3D temporal-spatial data space and then measure the change degree in the output score. The… ▽ More

    Submitted 17 August, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: 11 pages

  10. arXiv:2009.13798  [pdf, other

    eess.IV cs.AI

    Automatic Segmentation, Localization, and Identification of Vertebrae in 3D CT Images Using Cascaded Convolutional Neural Networks

    Authors: Naoto Masuzawa, Yoshiro Kitamura, Keigo Nakamura, Satoshi Iizuka, Edgar Simo-Serra

    Abstract: This paper presents a method for automatic segmentation, localization, and identification of vertebrae in arbitrary 3D CT images. Many previous works do not perform the three tasks simultaneously even though requiring a priori knowledge of which part of the anatomy is visible in the 3D CT images. Our method tackles all these tasks in a single multi-stage framework without any assumptions. In the f… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  11. arXiv:2009.08692  [pdf, other

    cs.CV cs.GR

    DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement

    Authors: Satoshi Iizuka, Edgar Simo-Serra

    Abstract: The remastering of vintage film comprises of a diversity of sub-tasks including super-resolution, noise removal, and contrast enhancement which aim to restore the deteriorated film medium to its original state. Additionally, due to the technical limitations of the time, most vintage film is either recorded in black and white, or has low quality colors, for which colorization becomes necessary. In… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted to SIGGRAPH Asia 2019. Project page: http://iizuka.cs.tsukuba.ac.jp/projects/remastering/

  12. arXiv:2009.08674  [pdf, other

    cs.CV

    TopNet: Topology Preserving Metric Learning for Vessel Tree Reconstruction and Labelling

    Authors: Deepak Keshwani, Yoshiro Kitamura, Satoshi Ihara, Satoshi Iizuka, Edgar Simo-Serra

    Abstract: Reconstructing Portal Vein and Hepatic Vein trees from contrast enhanced abdominal CT scans is a prerequisite for preoperative liver surgery simulation. Existing deep learning based methods treat vascular tree reconstruction as a semantic segmentation problem. However, vessels such as hepatic and portal vein look very similar locally and need to be traced to their source for robust label assignmen… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted in MICCAI 2020

    Report number: 603

  13. arXiv:2003.11211  [pdf, other

    cs.CV

    Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval

    Authors: Shuhei Yokoo, Kohei Ozaki, Edgar Simo-Serra, Satoshi Iizuka

    Abstract: We propose an efficient pipeline for large-scale landmark image retrieval that addresses the diversity of the dataset through two-stage discriminative re-ranking. Our approach is based on embedding the images in a feature-space using a convolutional neural network trained with a cosine softmax loss. Due to the variance of the images, which include extreme viewpoint changes such as having to retrie… ▽ More

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: 10 pages, 5 figures

  14. arXiv:1908.11506  [pdf, other

    eess.IV cs.CV

    Virtual Thin Slice: 3D Conditional GAN-based Super-resolution for CT Slice Interval

    Authors: Akira Kudo, Yoshiro Kitamura, Yuanzhong Li, Satoshi Iizuka, Edgar Simo-Serra

    Abstract: Many CT slice images are stored with large slice intervals to reduce storage size in clinical practice. This leads to low resolution perpendicular to the slice images (i.e., z-axis), which is insufficient for 3D visualization or image analysis. In this paper, we present a novel architecture based on conditional Generative Adversarial Networks (cGANs) with the goal of generating high resolution ima… ▽ More

    Submitted 1 September, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: 10 pages, 6 figures, Accepted to Machine Learning for Medical Image Reconstruction (MLMIR) at MICCAI 2019

  15. arXiv:1703.08966  [pdf, other

    cs.CV

    Mastering Sketching: Adversarial Augmentation for Structured Prediction

    Authors: Edgar Simo-Serra, Satoshi Iizuka, Hiroshi Ishikawa

    Abstract: We present an integral framework for training sketch simplification networks that convert challenging rough sketches into clean line drawings. Our approach augments a simplification network with a discriminator network, training both networks jointly so that the discriminator network discerns whether a line drawing is a real training data or the output of the simplification network, which in turn… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: 12 pages, 14 figures