Showing 1–29 of 29 results for author: Suzuki, S

Searching in archive cs.
  1. arXiv:2405.18863  [pdf, other]

    cs.CV

    Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy

    Authors: Zijie Jiang, Yusuke Monno, Masatoshi Okutomi, Sho Suzuki, Kenji Miki

    Abstract: Enabling the synthesis of arbitrarily novel viewpoint images within a patient's stomach from pre-captured monocular gastroscopic images is a promising topic in stomach diagnosis. Typical methods to achieve this objective integrate traditional 3D reconstruction techniques, including structure-from-motion (SfM) and Poisson surface reconstruction. These methods produce explicit 3D representations, su…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted for EMBC 2024

  2. arXiv:2404.10999  [pdf]

    cs.RO

    Machine-Learning-Enhanced Soft Robotic System Inspired by Rectal Functions for Investigating Fecal incontinence

    Authors: Zebing Mao, Sota Suzuki, Hiroyuki Nabae, Shoko Miyagawa, Koichi Suzumori, Shingo Maeda

    Abstract: Fecal incontinence, arising from a myriad of pathogenic mechanisms, has attracted considerable global attention. Despite its significance, the replication of the defecatory system for studying fecal incontinence mechanisms remains limited largely due to social stigma and taboos. Inspired by the rectum's functionalities, we have developed a soft robotic system, encompassing a power supply, pressure…

    Submitted 1 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2403.17423  [pdf, other]

    cs.CV stat.ML

    Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching

    Authors: Shohei Enomoto, Naoya Hasegawa, Kazuki Adachi, Taku Sasaki, Shin'ya Yamaguchi, Satoshi Suzuki, Takeharu Eda

    Abstract: Deep neural networks have achieved remarkable success in a variety of computer vision applications. However, there is a problem of degrading accuracy when the data distribution shifts between training and testing. As a solution of this problem, Test-time Adaptation (TTA) has been well studied because of its practicality. Although TTA methods increase accuracy under distribution shift by updating t…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN2024

  4. arXiv:2311.13460  [pdf, other]

    cs.LG stat.ML

    Multi-Objective Bayesian Optimization with Active Preference Learning

    Authors: Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki, Shinya Suzuki, Shion Takeno, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: There are a lot of real-world black-box optimization problems that need to optimize multiple criteria simultaneously. However, in a multi-objective optimization (MOO) problem, identifying the whole Pareto front requires the prohibitive search cost, while in many practical scenarios, the decision maker (DM) only needs a specific solution among the set of the Pareto optimal solutions. We propose a B…

    Submitted 22 November, 2023; originally announced November 2023.

  5. arXiv:2308.16454  [pdf, other]

    cs.CV cs.LG

    Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff

    Authors: Satoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Sekitoshi Kanai, Naoki Makishima, Atsushi Ando, Ryo Masumura

    Abstract: This paper addresses the tradeoff between standard accuracy on clean examples and robustness against adversarial examples in deep neural networks (DNNs). Although adversarial training (AT) improves robustness, it degrades the standard accuracy, thus yielding the tradeoff. To mitigate this tradeoff, we propose a novel AT method called ARREST, which comprises three components: (i) adversarial finetu…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by International Conference on Computer Vision (ICCV) 2023

  6. arXiv:2306.02273  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    End-to-End Joint Target and Non-Target Speakers ASR

    Authors: Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando

    Abstract: This paper proposes a novel automatic speech recognition (ASR) system that can transcribe individual speaker's speech while identifying whether they are target or non-target speakers from multi-talker overlapped speech. Target-speaker ASR systems are a promising way to only transcribe a target speaker's speech by enrolling the target speaker's information. However, in conversational ASR applicatio…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted at Interspeech 2023

  7. arXiv:2304.11413  [pdf, other]

    cs.HC

    Three-dimensional hand guidance by midair haptic display

    Authors: Koya Hiura, Shun Suzuki, Tao Morisaki, Masahiro Fujiwara, Yasutoshi Makino, Hiroyuki Shinoda

    Abstract: Guiding human movements using tactile information is one of the promising applications of haptics. Using midair ultrasonic haptic stimulation, it is possible to guide a hand without visual information. However, the information of movement shown by conventional methods was partial. It has not been shown a method to guide a hand to an arbitrary point in three dimensional space. In this study, we prop…

    Submitted 22 April, 2023; originally announced April 2023.

  8. arXiv:2210.15937  [pdf, other]

    cs.CL cs.SD eess.AS

    On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis

    Authors: Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato

    Abstract: This paper investigates the effectiveness and implementation of modality-specific large-scale pre-trained encoders for multimodal sentiment analysis (MSA). Although the effectiveness of pre-trained encoders in various fields has been reported, conventional MSA methods employ them for only linguistic modality, and their application has not been investigated. This paper compares the features yielded…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted to SLT 2022

  9. arXiv:2207.04659  [pdf, other]

    cs.SD eess.AS

    Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data

    Authors: Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura

    Abstract: In this paper, we investigate the semi-supervised joint training of text to speech (TTS) and automatic speech recognition (ASR), where a small amount of paired data and a large amount of unpaired text data are available. Conventional studies form a cycle called the TTS-ASR pipeline, where the multispeaker TTS model synthesizes speech from text with a reference speech and the ASR model reconstructs…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted to INTERSPEECH 2022

  10. arXiv:2203.03119  [pdf, other]

    cs.DC cs.CR cs.CY

    Fabchain: Managing Audit-able 3D Print Job over Blockchain

    Authors: Ryosuke Abe, Shigeya Suzuki, Kenji Saito, Hiroya Tanaka, Osamu Nakamura, Jun Murai

    Abstract: Improvements in fabrication devices such as 3D printers are becoming possible for personal fabrication to freely fabricate any products. To clarify who is liable for the product, the fabricator should keep the fabrication history in an immutable and sustainably accessible manner. In this paper, we propose a new scheme, "Fabchain," that can record the fabrication history in such a manner. By utiliz…

    Submitted 6 March, 2022; originally announced March 2022.

  11. arXiv:2112.07093  [pdf, other]

    quant-ph cs.NI cs.SE

    QuISP: a Quantum Internet Simulation Package

    Authors: Ryosuke Satoh, Michal Hajdušek, Naphan Benchasattabuse, Shota Nagayama, Kentaro Teramoto, Takaaki Matsuo, Sara Ayman Metwalli, Takahiko Satoh, Shigeya Suzuki, Rodney Van Meter

    Abstract: We present an event-driven simulation package called QuISP for large-scale quantum networks built on top of the OMNeT++ discrete event simulation framework. Although the behavior of quantum networking devices have been revealed by recent research, it is still an open question how they will work in networks of a practical size. QuISP is designed to simulate large-scale quantum networks to investiga…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 17 pages, 12 figures

    Journal ref: 2022 IEEE International Conference on Quantum Computing and Engineering (QCE), pp 353-364 (2022)

  12. A Quantum Internet Architecture

    Authors: Rodney Van Meter, Ryosuke Satoh, Naphan Benchasattabuse, Takaaki Matsuo, Michal Hajdušek, Takahiko Satoh, Shota Nagayama, Shigeya Suzuki

    Abstract: Entangled quantum communication is advancing rapidly, with laboratory and metropolitan testbeds under development, but to date there is no unifying Quantum Internet architecture. We propose a Quantum Internet architecture centered around the Quantum Recursive Network Architecture (QRNA), using RuleSet-based connections established using a two-pass connection setup. Scalability and internetworking…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 17 pages, 7 numbered figures

    Journal ref: 2022 IEEE International Conference on Quantum Computing and Engineering (QCE), pp. 341-352 (2022)

  13. arXiv:2108.11018  [pdf, other]

    cs.LG cs.CV

    A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?

    Authors: Hiroaki Mikami, Kenji Fukumizu, Shogo Murai, Shuji Suzuki, Yuta Kikuchi, Taiji Suzuki, Shin-ichi Maeda, Kohei Hayashi

    Abstract: Synthetic-to-real transfer learning is a framework in which a synthetically generated dataset is used to pre-train a model to improve its performance on real vision tasks. The most significant advantage of using synthetic images is that the ground-truth labels are automatically available, enabling unlimited expansion of the data size without human cost. However, synthetic data may have a huge doma…

    Submitted 8 October, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

  14. arXiv:2107.13263  [pdf, other]

    cs.CV

    Learning-Based Depth and Pose Estimation for Monocular Endoscope with Loss Generalization

    Authors: Aji Resindra Widya, Yusuke Monno, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

    Abstract: Gastroendoscopy has been a clinical standard for diagnosing and treating conditions that affect a part of a patient's digestive system, such as the stomach. Despite the fact that gastroendoscopy has a lot of advantages for patients, there exist some challenges for practitioners, such as the lack of 3D perception, including the depth and the endoscope pose information. Such challenges make navigati…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Accepted for EMBC 2021

  15. arXiv:2008.01523  [pdf, other]

    cs.CL

    A System for Worldwide COVID-19 Information Aggregation

    Authors: Akiko Aizawa, Frederic Bergeron, Junjie Chen, Fei Cheng, Katsuhiko Hayashi, Kentaro Inui, Hiroyoshi Ito, Daisuke Kawahara, Masaru Kitsuregawa, Hirokazu Kiyomaru, Masaki Kobayashi, Takashi Kodama, Sadao Kurohashi, Qianying Liu, Masaki Matsubara, Yusuke Miyao, Atsuyuki Morishima, Yugo Murawaki, Kazumasa Omura, Haiyue Song, Eiichiro Sumita, Shinji Suzuki, Ribeka Tanaka, Yu Tanaka, Masashi Toyoda, et al. (4 additional authors not shown)

    Abstract: The global pandemic of COVID-19 has made the public pay close attention to related news, covering various domains, such as sanitation, treatment, and effects on education. Meanwhile, the COVID-19 condition is very different among the countries (e.g., policies and development of the epidemic), and thus citizens would be interested in news in foreign countries. We build a system for worldwide COVID-…

    Submitted 11 October, 2020; v1 submitted 27 July, 2020; originally announced August 2020.

    Comments: Accepted to EMNLP 2020 Workshop NLP-COVID

  16. An Inductive Transfer Learning Approach using Cycle-consistent Adversarial Domain Adaptation with Application to Brain Tumor Segmentation

    Authors: Yuta Tokuoka, Shuji Suzuki, Yohei Sugawara

    Abstract: With recent advances in supervised machine learning for medical image analysis applications, the annotated medical image datasets of various domains are being shared extensively. Given that the annotation labelling requires medical expertise, such labels should be applied to as many learning tasks as possible. However, the multi-modal nature of each annotated image renders it difficult to share th…

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: Proceedings of the 2019 6th International Conference on Biomedical and Bioinformatics Engineering, November 2019, Pages 44-48

  17. arXiv:2004.12288  [pdf, other]

    cs.CV

    Stomach 3D Reconstruction Based on Virtual Chromoendoscopic Image Generation

    Authors: Aji Resindra Widya, Yusuke Monno, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

    Abstract: Gastric endoscopy is a standard clinical process that enables medical practitioners to diagnose various lesions inside a patient's stomach. If any lesion is found, it is very important to perceive the location of the lesion relative to the global view of the stomach. Our previous research showed that this could be addressed by reconstructing the whole stomach shape from chromoendoscopic images usi…

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: Accepted for main conference in EMBC 2020

  18. arXiv:2002.02635  [pdf, other]

    cs.HC

    Noncontact Thermal and Vibrotactile Display Using Focused Airborne Ultrasound

    Authors: Takaaki Kamigaki, Shun Suzuki, Hiroyuki Shinoda

    Abstract: In a typical mid-air haptics system, focused airborne ultrasound provides vibrotactile sensations to localized areas on a bare skin. Herein, a method for displaying thermal sensations to hands where mesh fabric gloves are worn is proposed. The gloves employed in this study are commercially available mesh fabric gloves with sound absorption characteristics, such as cotton work gloves without any ad…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 6 pages

  19. arXiv:1910.11534  [pdf, other]

    cs.CV

    Team PFDet's Methods for Open Images Challenge 2019

    Authors: Yusuke Niitani, Toru Ogawa, Shuji Suzuki, Takuya Akiba, Tommi Kerola, Kohei Ozaki, Shotaro Sano

    Abstract: We present the instance segmentation and the object detection method used by team PFDet for Open Images Challenge 2019. We tackle a massive dataset size, huge class imbalance and federated annotations. Using this method, the team PFDet achieved 3rd and 4th place in the instance segmentation and the object detection track, respectively.

    Submitted 25 October, 2019; originally announced October 2019.

  20. arXiv:1908.00213  [pdf, other]

    cs.LG cs.CV cs.DC stat.ML

    Chainer: A Deep Learning Framework for Accelerating the Research Cycle

    Authors: Seiya Tokui, Ryosuke Okuta, Takuya Akiba, Yusuke Niitani, Toru Ogawa, Shunta Saito, Shuji Suzuki, Kota Uenishi, Brian Vogel, Hiroyuki Yamazaki Vincent

    Abstract: Software frameworks for neural networks play a key role in the development and application of deep learning methods. In this paper, we introduce the Chainer framework, which intends to provide a flexible, intuitive, and high performance means of implementing the full range of deep learning models needed by researchers and practitioners. Chainer provides acceleration using Graphics Processing Units…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: Accepted for Applied Data Science Track in KDD'19

  21. arXiv:1906.00127  [pdf, other]

    cs.LG stat.ML

    Multi-objective Bayesian Optimization using Pareto-frontier Entropy

    Authors: Shinya Suzuki, Shion Takeno, Tomoyuki Tamura, Kazuki Shitara, Masayuki Karasuyama

    Abstract: This paper studies an entropy-based multi-objective Bayesian optimization (MBO). The entropy search is successful approach to Bayesian optimization. However, for MBO, existing entropy-based methods ignore trade-off among objectives or introduce unreliable approximations. We propose a novel entropy-based MBO called Pareto-frontier entropy search (PFES) by considering the entropy of Pareto-frontier,…

    Submitted 10 February, 2020; v1 submitted 31 May, 2019; originally announced June 2019.

  22. arXiv:1905.12988  [pdf, other]

    cs.CV eess.IV

    3D Reconstruction of Whole Stomach from Endoscope Video Using Structure-from-Motion

    Authors: Aji Resindra Widya, Yusuke Monno, Kosuke Imahori, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

    Abstract: Gastric endoscopy is a common clinical practice that enables medical doctors to diagnose the stomach inside a body. In order to identify a gastric lesion's location such as early gastric cancer within the stomach, this work addressed to reconstruct the 3D shape of a whole stomach with color texture information generated from a standard monocular endoscope video. Previous works have tried to recons…

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 5 pages, 4 figures, accepted in EMBC 2019

  23. arXiv:1811.10862  [pdf, other]

    cs.CV

    Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects

    Authors: Yusuke Niitani, Takuya Akiba, Tommi Kerola, Toru Ogawa, Shotaro Sano, Shuji Suzuki

    Abstract: Efficient and reliable methods for training of object detectors are in higher demand than ever, and more and more data relevant to the field is becoming available. However, large datasets like Open Images Dataset v4 (OID) are sparsely annotated, and some measure must be taken in order to ensure the training of a reliable detector. In order to take the incompleteness of these datasets into account,…

    Submitted 21 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

    Comments: CVPR2019 oral

  24. arXiv:1809.00778  [pdf, other]

    cs.CV

    PFDet: 2nd Place Solution to Open Images Challenge 2018 Object Detection Track

    Authors: Takuya Akiba, Tommi Kerola, Yusuke Niitani, Toru Ogawa, Shotaro Sano, Shuji Suzuki

    Abstract: We present a large-scale object detection system by team PFDet. Our system enables training with huge datasets using 512 GPUs, handles sparsely verified classes, and massive class imbalance. Using our method, we achieved 2nd place in the Google AI Open Images Object Detection Track 2018 on Kaggle.

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Technical report for Open Images Challenge 2018 Object Detection Track

  25. arXiv:1801.00464  [pdf]

    cs.MA

    Comparative Analysis of Human Movement Prediction: Space Syntax and Inverse Reinforcement Learning

    Authors: Soma Suzuki

    Abstract: Space syntax matrix has been the main approach for human movement prediction in the urban environment. An alternative, relatively new methodology is an agent-based pedestrian model constructed using machine learning techniques. Even though both approaches have been studied intensively, the quantitative comparison between them has not been conducted. In this paper, comparative analysis of space syn…

    Submitted 25 January, 2018; v1 submitted 1 January, 2018; originally announced January 2018.

  26. arXiv:1712.07887  [pdf]

    cs.MA cs.AI

    Multiagent-based Participatory Urban Simulation through Inverse Reinforcement Learning

    Authors: Soma Suzuki

    Abstract: The multiagent-based participatory simulation features prominently in urban planning as the acquired model is considered as the hybrid system of the domain and the local knowledge. However, the key problem of generating realistic agents for particular social phenomena invariably remains. The existing models have attempted to dictate the factors involving human behavior, which appeared to be intrac…

    Submitted 21 December, 2017; originally announced December 2017.

  27. arXiv:1711.04325  [pdf, other]

    cs.DC cs.CV cs.LG

    Extremely Large Minibatch SGD: Training ResNet-50 on ImageNet in 15 Minutes

    Authors: Takuya Akiba, Shuji Suzuki, Keisuke Fukuda

    Abstract: We demonstrate that training ResNet-50 on ImageNet for 90 epochs can be achieved in 15 minutes with 1024 Tesla P100 GPUs. This was made possible by using a large minibatch size of 32k. To maintain accuracy with this large minibatch size, we employed several techniques such as RMSprop warm-up, batch normalization without moving averages, and a slow-start learning rate schedule. This paper also desc…

    Submitted 12 November, 2017; originally announced November 2017.

    Comments: NIPS'17 Workshop: Deep Learning at Supercomputer Scale

  28. arXiv:1710.11351  [pdf, other]

    cs.DC cs.LG cs.NE

    ChainerMN: Scalable Distributed Deep Learning Framework

    Authors: Takuya Akiba, Keisuke Fukuda, Shuji Suzuki

    Abstract: One of the keys for deep learning to have made a breakthrough in various fields was to utilize high computing powers centering around GPUs. Enabling the use of further computing abilities by distributed processing is essential not only to make the deep learning bigger and faster but also to tackle unsolved challenges. We present the design, implementation, and evaluation of ChainerMN, the distribu…

    Submitted 31 October, 2017; originally announced October 2017.

  29. arXiv:1109.4357  [pdf, ps, other]

    cs.LO

    Argument filterings and usable rules in higher-order rewrite systems

    Authors: Sho Suzuki, Keiichirou Kusakari, Frédéric Blanqui

    Abstract: The static dependency pair method is a method for proving the termination of higher-order rewrite systems a la Nipkow. It combines the dependency pair method introduced for first-order rewrite systems with the notion of strong computability introduced for typed lambda-calculi. Argument filterings and usable rules are two important methods of the dependency pair framework used by current state-of-t…

    Submitted 20 September, 2011; originally announced September 2011.

    Journal ref: IPSJ Transactions on Programming 4, 2 (2011) 1-12