Showing 1–15 of 15 results for author: Omachi, S

  1. Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model

    Authors: Shoma Iwai, Tomo Miyazaki, Shinichiro Omachi

    Abstract: In recent years, neural network-driven image compression (NIC) has gained significant attention. Some works adopt deep generative models such as GANs and diffusion models to enhance perceptual quality (realism). A critical obstacle of these generative NIC methods is that each model is optimized for a single bit rate. Consequently, multiple models are required to compress images to different bit ra…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: WACV2024 Oral. Code is at https://github.com/iwa-shi/CRDR

  2. arXiv:2405.09873

    cs.CV eess.IV

    IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model

    Authors: Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi

    Abstract: Infrared (IR) image super-resolution faces challenges from homogeneous background pixel distributions and sparse target regions, requiring models that effectively handle long-range dependencies and capture detailed local-global information. Recent advancements in Mamba-based (Selective Structured State Space Model) models, employing state space models, have shown significant potential in visual ta…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2403.16553

    cond-mat.str-el

    Imaging quantum interference in a monolayer Kitaev quantum spin liquid candidate

    Authors: Y. Kohsaka, S. Akutagawa, S. Omachi, Y. Iwamichi, T. Ono, I. Tanaka, S. Tateishi, H. Murayama, S. Suetsugu, K. Hashimoto, T. Shibauchi, M. O. Takahashi, M. G. Yamada, S. Nikolaev, T. Mizushima, S. Fujimoto, T. Terashima, T. Asaba, Y. Kasahara, Y. Matsuda

    Abstract: Single atomic defects are prominent windows to look into host quantum states because collective responses from the host states emerge as localized states around the defects. Friedel oscillations and Kondo clouds in Fermi liquids are quintessential examples. However, the situation is quite different for quantum spin liquid (QSL), an exotic state of matter with fractionalized quasiparticles and topo…

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2312.16455

    eess.IV cs.CV cs.LG

    Learn From Orientation Prior for Radiograph Super-Resolution: Orientation Operator Transformer

    Authors: Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Kaiyuan Jiang, Zhengmi Tang, Shinichiro Omachi

    Abstract: Background and objective: High-resolution radiographic images play a pivotal role in the early diagnosis and treatment of skeletal muscle-related diseases. It is promising to enhance image quality by introducing single-image super-resolution (SISR) model into the radiology image field. However, the conventional image pipeline, which can learn a mixed mapping between SR and denoising from the color…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted by Computer Methods and Programs in Biomedicine

  5. arXiv:2312.00689

    eess.IV cs.CV

    Infrared Image Super-Resolution via GAN

    Authors: Yongsong Huang, Shinichiro Omachi

    Abstract: The ability of generative models to accurately fit data distributions has resulted in their widespread adoption and success in fields such as computer vision and natural language processing. In this chapter, we provide a brief overview of the application of generative models in the domain of infrared (IR) image super-resolution, including a discussion of the various challenges and adversarial trai…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Applications of Generative AI, Chapter 28

  6. arXiv:2311.08816

    eess.IV cs.CV

    Target-oriented Domain Adaptation for Infrared Image Super-Resolution

    Authors: Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Yafei Dong, Shinichiro Omachi

    Abstract: Recent efforts have explored leveraging visible light images to enrich texture details in infrared (IR) super-resolution. However, this direct adaptation approach often becomes a double-edged sword, as it improves texture at the cost of introducing noise and blurring artifacts. To address these challenges, we propose the Target-oriented Domain Adaptation SRGAN (DASRGAN), an innovative framework sp…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 11 pages, 9 figures

  7. Deep Image Compression Using Scene Text Quality Assessment

    Authors: Shohei Uchigasaki, Tomo Miyazaki, Shinichiro Omachi

    Abstract: Image compression is a fundamental technology for Internet communication engineering. However, a high compression rate with general methods may degrade images, resulting in unreadable texts. In this paper, we propose an image compression method for maintaining text quality. We developed a scene text image quality assessment model to assess text quality in compressed images. The assessment model it…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted by Pattern Recognition, 2023

    Journal ref: Pattern Recognition, 2023

  8. arXiv:2212.12322

    eess.IV cs.CV cs.LG

    Infrared Image Super-Resolution: Systematic Review, and Future Trends

    Authors: Yongsong Huang, Tomo Miyazaki, Xiaofeng Liu, Shinichiro Omachi

    Abstract: Image Super-Resolution (SR) is essential for a wide range of computer vision and image processing tasks. Investigating infrared (IR) image (or thermal images) super-resolution is a continuing concern within the development of deep learning. This survey aims to provide a comprehensive perspective of IR image super-resolution, including its applications, hardware imaging system dilemmas, and taxonom…

    Submitted 15 November, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Submitted to IEEE TNNLS

  9. arXiv:2209.02397

    cs.CV

    A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data

    Authors: Zhengmi Tang, Tomo Miyazaki, Shinichiro Omachi

    Abstract: Scene-text image synthesis techniques that aim to naturally compose text instances on background scene images are very appealing for training deep neural networks due to their ability to provide accurate and comprehensive annotation information. Prior studies have explored generating synthetic text images on two-dimensional and three-dimensional surfaces using rules derived from real-world observa…

    Submitted 17 October, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

  10. arXiv:2208.03008

    eess.IV cs.CV cs.LG

    Rethinking Degradation: Radiograph Super-Resolution via AID-SRGAN

    Authors: Yongsong Huang, Qingzhong Wang, Shinichiro Omachi

    Abstract: In this paper, we present a medical AttentIon Denoising Super Resolution Generative Adversarial Network (AID-SRGAN) for radiographic image super-resolution. First, we present a medical practical degradation model that considers various degradation factors beyond downsampling. To the best of our knowledge, this is the first composite degradation model proposed for radiographic images. Furthermore, we…

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI 2022 Workshop. Code: https://github.com/yongsongH/AIDSRGAN-MICCAI2022

  11. Stroke-Based Scene Text Erasing Using Synthetic Data for Training

    Authors: Zhengmi Tang, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

    Abstract: Scene text erasing, which replaces text regions with reasonable content in natural images, has drawn significant attention in the computer vision community in recent years. There are two potential subtasks in scene text erasing: text detection and image inpainting. Both subtasks require considerable data to achieve better performance; however, the lack of a large-scale real-world scene-text remova…

    Submitted 3 December, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Journal ref: IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 30, 2021, 9306-9320

  12. Fidelity-Controllable Extreme Image Compression with Generative Adversarial Networks

    Authors: Shoma Iwai, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

    Abstract: We propose a GAN-based image compression method working at extremely low bitrates below 0.1bpp. Most existing learned image compression methods suffer from blur at extremely low bitrates. Although GAN can help to reconstruct sharp images, there are two drawbacks. First, GAN makes training unstable. Second, the reconstructions often contain unpleasing noise or artifacts. To address both of the draw…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: 8 pages, 11 figures

    Journal ref: ICPR, 2020

  13. Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence

    Authors: Huy Manh Nguyen, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

    Abstract: Visual-semantic embedding aims to learn a joint embedding space where related video and sentence instances are located close to each other. Most existing methods put instances in a single embedding space. However, they struggle to embed instances due to the difficulty of matching visual dynamics in videos to textual features in sentences. A single space is not enough to accommodate various videos…

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: 8 pages, 5 figures

    Journal ref: Applied Sciences, 2021

  14. Structural Data Recognition with Graph Model Boosting

    Authors: Tomo Miyazaki, Shinichiro Omachi

    Abstract: This paper presents a novel method for structural data recognition using a large number of graph models. In general, prevalent methods for structural data recognition have two shortcomings: 1) Only a single model is used to capture structural variation. 2) Naive recognition methods are used, such as the nearest neighbor method. In this paper, we propose strengthening the recognition performance of…

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: 8 pages

    Journal ref: IEEE Access, 2018

  15. Automatic Generation of Typographic Font from a Small Font Subset

    Authors: Tomo Miyazaki, Tatsunori Tsuchiya, Yoshihiro Sugaya, Shinichiro Omachi, Masakazu Iwamura, Seiichi Uchida, Koichi Kise

    Abstract: This paper addresses the automatic generation of a typographic font from a subset of characters. Specifically, we use a subset of a typographic font to extrapolate additional characters. Consequently, we obtain a complete font containing a number of characters sufficient for daily use. The automated generation of Japanese fonts is in high demand because a Japanese font requires over 1,000 characte…

    Submitted 20 January, 2017; originally announced January 2017.

    Comments: 12 pages, 17 figures

    Journal ref: IEEE Computer Graphics and Applications, 2019