Showing 1–8 of 8 results for author: Kikuchi, Y

  1. arXiv:2405.05336

    eess.IV cs.AI cs.CV

    Joint semi-supervised and contrastive learning enables zero-shot domain-adaptation and multi-domain segmentation

    Authors: Alvaro Gomariz, Yusuke Kikuchi, Yun Yvonna Li, Thomas Albrecht, Andreas Maunz, Daniela Ferrara, Huanxiang Lu, Orcun Goksel

    Abstract: Despite their effectiveness, current deep learning models face challenges with images coming from different domains with varying appearance and content. We introduce SegCLR, a versatile framework designed to segment volumetric images across different domains, employing supervised and contrastive learning simultaneously to effectively learn from both labeled and unlabeled data. We demonstrate the s…

    Submitted 8 May, 2024; originally announced May 2024.

  2. arXiv:2201.01250

    cs.LG cs.CV

    Transfer Learning for Retinal Vascular Disease Detection: A Pilot Study with Diabetic Retinopathy and Retinopathy of Prematurity

    Authors: Guan Wang, Yusuke Kikuchi, Jinglin Yi, Qiong Zou, Rui Zhou, Xin Guo

    Abstract: Retinal vascular diseases affect the well-being of human body and sometimes provide vital signs of otherwise undetected bodily damage. Recently, deep learning techniques have been successfully applied for detection of diabetic retinopathy (DR). The main obstacle of applying deep learning techniques to detect most other retinal vascular diseases is the limited amount of data available. In this pape…

    Submitted 4 January, 2022; originally announced January 2022.

  3. arXiv:2108.11018

    cs.LG cs.CV

    A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?

    Authors: Hiroaki Mikami, Kenji Fukumizu, Shogo Murai, Shuji Suzuki, Yuta Kikuchi, Taiji Suzuki, Shin-ichi Maeda, Kohei Hayashi

    Abstract: Synthetic-to-real transfer learning is a framework in which a synthetically generated dataset is used to pre-train a model to improve its performance on real vision tasks. The most significant advantage of using synthetic images is that the ground-truth labels are automatically available, enabling unlimited expansion of the data size without human cost. However, synthetic data may have a huge doma…

    Submitted 8 October, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

  4. arXiv:2009.13331

    cs.CV

    Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score

    Authors: He Huang, Shunta Saito, Yuta Kikuchi, Eiichi Matsumoto, Wei Tang, Philip S. Yu

    Abstract: Scene graph parsing aims to detect objects in an image scene and recognize their relations. Recent approaches have achieved high average scores on some popular benchmarks, but fail in detecting rare relations, as the highly long-tailed distribution of data biases the learning towards frequent labels. Motivated by the fact that detecting these rare relations can be critical in real-world applicatio…

    Submitted 5 October, 2020; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: ACCV 2020

  5. arXiv:2006.06968

    eess.IV cs.CV cs.LG

    Early Detection of Retinopathy of Prematurity (ROP) in Retinal Fundus Images Via Convolutional Neural Networks

    Authors: Xin Guo, Yusuke Kikuchi, Guan Wang, Jinglin Yi, Qiong Zou, Rui Zhou

    Abstract: Retinopathy of prematurity (ROP) is an abnormal blood vessel development in the retina of a prematurely-born infant or an infant with low birth weight. ROP is one of the leading causes for infant blindness globally. Early detection of ROP is critical to slow down and avert the progression to vision impairment caused by ROP. Yet there is limited awareness of ROP even among medical professionals. Co…

    Submitted 12 June, 2020; originally announced June 2020.

  6. arXiv:1710.06280

    cs.RO cs.CL

    Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions

    Authors: Jun Hatori, Yuta Kikuchi, Sosuke Kobayashi, Kuniyuki Takahashi, Yuta Tsuboi, Yuya Unno, Wilson Ko, Jethro Tan

    Abstract: Comprehension of spoken natural language is an essential component for robots to communicate with human effectively. However, handling unconstrained spoken instructions is challenging due to (1) complex structures including a wide variety of expressions used in spoken language and (2) inherent ambiguity in interpretation of human instructions. In this paper, we propose the first comprehensive syst…

    Submitted 27 March, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: 9 pages. International Conference on Robotics and Automation (ICRA) 2018. Accompanying videos are available at the following links: https://youtu.be/_Uyv1XIUqhk (the system submitted to ICRA-2018) and http://youtu.be/DGJazkyw0Ws (with improvements after ICRA-2018 submission)

  7. arXiv:1706.10031

    stat.ML cs.LG

    Neural Sequence Model Training via $α$-divergence Minimization

    Authors: Sotetsu Koyamada, Yuta Kikuchi, Atsunori Kanemura, Shin-ichi Maeda, Shin Ishii

    Abstract: We propose a new neural sequence model training method in which the objective function is defined by $α$-divergence. We demonstrate that the objective function generalizes the maximum-likelihood (ML)-based and reinforcement learning (RL)-based objective functions as special cases (i.e., ML corresponds to $α\to 0$ and RL to $α\to1$). We also show that the gradient of the objective function can be c…

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: 2017 ICML Workshop on Learning to Generate Natural Language (LGNL 2017)

  8. arXiv:1609.09552

    cs.CL

    Controlling Output Length in Neural Encoder-Decoders

    Authors: Yuta Kikuchi, Graham Neubig, Ryohei Sasano, Hiroya Takamura, Manabu Okumura

    Abstract: Neural encoder-decoder models have shown great success in many sequence generation tasks. However, previous work has not investigated situations in which we would like to control the length of encoder-decoder outputs. This capability is crucial for applications such as text summarization, in which we have to generate concise summaries with a desired length. In this paper, we propose methods for co…

    Submitted 29 September, 2016; originally announced September 2016.

    Comments: 11 pages. To appear in EMNLP 2016