Showing 1–22 of 22 results for author: Endo, Y

Searching in archive cs.
  1. arXiv:2407.02419  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum Curriculum Learning

    Authors: Quoc Hoan Tran, Yasuhiro Endo, Hirotaka Oshima

    Abstract: Quantum machine learning (QML) requires significant quantum resources to achieve quantum advantage. Research should prioritize both the efficient design of quantum architectures and the development of learning strategies to optimize resource usage. We propose a framework called quantum curriculum learning (Q-CurL) for quantum data, where the curriculum introduces simpler tasks or data to the learn…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: main 5 pages, supplementary materials 6 pages

  2. arXiv:2406.18316  [pdf, other]

    quant-ph cs.LG

    Trade-off between Gradient Measurement Efficiency and Expressivity in Deep Quantum Neural Networks

    Authors: Koki Chinzei, Shinichiro Yamano, Quoc Hoan Tran, Yasuhiro Endo, Hirotaka Oshima

    Abstract: Quantum neural networks (QNNs) require an efficient training algorithm to achieve practical quantum advantages. A promising approach is the use of gradient-based optimization algorithms, where gradients are estimated through quantum measurements. However, it is generally difficult to efficiently measure gradients in QNNs because the quantum state collapses upon measurement. In this work, we prove…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 32 pages, 11 figures

  3. arXiv:2405.16443  [pdf, other]

    cs.CV cs.GR

    3D View Optimization for Improving Image Aesthetics

    Authors: Taichi Uchida, Yoshihiro Kanamori, Yuki Endo

    Abstract: Achieving aesthetically pleasing photography necessitates attention to multiple factors, including composition and capture conditions, which pose challenges to novices. Prior research has explored the enhancement of photo aesthetics post-capture through 2D manipulation techniques; however, these approaches offer limited search space for aesthetics. We introduce a pioneering method that employs 3D…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 10 pages

  4. arXiv:2403.17761  [pdf, other]

    cs.CV cs.GR

    Makeup Prior Models for 3D Facial Makeup Estimation and Applications

    Authors: Xingchao Yang, Takafumi Taketomi, Yuki Endo, Yoshihiro Kanamori

    Abstract: In this work, we introduce two types of makeup prior models to extend existing 3D face prior models: PCA-based and StyleGAN2-based priors. The PCA-based prior model is a linear model that is easy to construct and is computationally efficient. However, it retains only low-frequency information. Conversely, the StyleGAN2-based model can represent high-frequency information with relatively higher com…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CVPR2024. Project: https://yangxingchao.github.io/makeup-priors-page

  5. arXiv:2401.02804  [pdf, other]

    cs.CV cs.GR

    DiffBody: Diffusion-based Pose and Shape Editing of Human Images

    Authors: Yuta Okuyama, Yuki Endo, Yoshihiro Kanamori

    Abstract: Pose and body shape editing in a human image has received increasing attention. However, current methods often struggle with dataset biases and deteriorate realism and the person's identity when users make large edits. We propose a one-shot approach that enables large edits with identity preservation. To enable large edits, we fit a 3D body model, project the input image onto the 3D model, and cha…

    Submitted 7 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted to WACV 2024, project page: https://www.cgg.cs.tsukuba.ac.jp/~okuyama/pub/diffbody/

  6. arXiv:2312.08809  [pdf, ps, other]

    q-bio.NC cond-mat.dis-nn cs.LG stat.ML

    Performance evaluation of matrix factorization for fMRI data

    Authors: Yusuke Endo, Koujin Takeda

    Abstract: In the study of the brain, there is a hypothesis that sparse coding is realized in information representation of external stimuli, which is experimentally confirmed for visual stimulus recently. However, unlike the specific functional region in the brain, sparse coding in information processing in the whole brain has not been clarified sufficiently. In this study, we investigate the validity of sp…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 22 pages, 8 figures

    Journal ref: Neural Computation (2024) 36 (1) 128-150

  7. arXiv:2308.06027  [pdf, other]

    cs.CV cs.GR

    Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation

    Authors: Yuki Endo

    Abstract: Text-to-image synthesis has achieved high-quality results with recent advances in diffusion models. However, text input alone has high spatial ambiguity and limited user controllability. Most existing methods allow spatial control through additional visual guidance (e.g., sketches and semantic masks) but require additional training with annotated images. In this paper, we propose a method for spat…

    Submitted 30 October, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted to The Visual Computer, code: https://github.com/endo-yuki-t/MAG

  8. arXiv:2305.16759  [pdf, other]

    cs.CV cs.GR

    StyleHumanCLIP: Text-guided Garment Manipulation for StyleGAN-Human

    Authors: Takato Yoshikawa, Yuki Endo, Yoshihiro Kanamori

    Abstract: This paper tackles text-guided control of StyleGAN for editing garments in full-body human images. Existing StyleGAN-based methods suffer from handling the rich diversity of garments and body shapes and poses. We propose a framework for text-guided full-body human image synthesis via an attention-based latent code mapper, which enables more disentangled control of StyleGAN than existing mappers. O…

    Submitted 20 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: VISIAPP 2024, project page: https://www.cgg.cs.tsukuba.ac.jp/~yoshikawa/pub/style_human_clip/

  9. arXiv:2208.12408  [pdf, other]

    cs.CV cs.GR

    User-Controllable Latent Transformer for StyleGAN Image Layout Editing

    Authors: Yuki Endo

    Abstract: Latent space exploration is a technique that discovers interpretable latent directions and manipulates latent codes to edit various attributes in images generated by generative adversarial networks (GANs). However, in previous work, spatial control is limited to simple transformations (e.g., translation and rotation), and it is laborious to identify appropriate latent directions and adjust their p…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted to Pacific Graphics 2022, project page: http://www.cgg.cs.tsukuba.ac.jp/~endo/projects/UserControllableLT

  10. arXiv:2206.05433  [pdf, ps, other]

    physics.optics cs.ET

    Gigahertz-rate random speckle projection for high-speed single-pixel image classification

    Authors: Jinsei Hanawa, Tomoaki Niiyama, Yutaka Endo, Satoshi Sunada

    Abstract: Imaging techniques based on single-pixel detection, such as ghost imaging, can reconstruct or recognize a target scene from multiple measurements using a sequence of random mask patterns. However, the processing speed is limited by the low rate of the pattern generation. In this study, we propose an ultrafast method for random speckle pattern generation, which has the potential to overcome the lim…

    Submitted 11 June, 2022; originally announced June 2022.

    Comments: 7 pages, 7 figures

    Journal ref: Optics Express Vol. 30, Issue 13, pp. 22911-22921 (2022)

  11. arXiv:2110.07272  [pdf, other]

    cs.GR

    Relighting Humans in the Wild: Monocular Full-Body Human Relighting with Domain Adaptation

    Authors: Daichi Tajima, Yoshihiro Kanamori, Yuki Endo

    Abstract: The modern supervised approaches for human image relighting rely on training data generated from 3D human models. However, such datasets are often small (e.g., Light Stage data with a small number of individuals) or limited to diffuse materials (e.g., commercial 3D scanned human models). Thus, the human relighting techniques suffer from the poor generalization capability and synthetic-to-real doma…

    Submitted 14 October, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted to Pacific Graphics 2021, project page: http://www.cgg.cs.tsukuba.ac.jp/~tajima/pub/relighting_in_the_wild/

  12. Diversifying Semantic Image Synthesis and Editing via Class- and Layer-wise VAEs

    Authors: Yuki Endo, Yoshihiro Kanamori

    Abstract: Semantic image synthesis is a process for generating photorealistic images from a single semantic mask. To enrich the diversity of multimodal image synthesis, previous methods have controlled the global appearance of an output image by learning a single latent space. However, a single latent code is often insufficient for capturing various object styles because object appearance depends on multipl…

    Submitted 29 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Accepted to Pacific Graphics 2020, codes available at https://github.com/endo-yuki-t/DiversifyingSMIS

  13. arXiv:2103.14877  [pdf, other]

    cs.CV cs.GR

    Few-shot Semantic Image Synthesis Using StyleGAN Prior

    Authors: Yuki Endo, Yoshihiro Kanamori

    Abstract: This paper tackles a challenging problem of generating photorealistic images from semantic layouts in few-shot scenarios where annotated training pairs are hardly available but pixel-wise annotation is quite costly. We present a training strategy that performs pseudo labeling of semantic masks using the StyleGAN prior. Our key idea is to construct a simple mapping between the StyleGAN feature and…

    Submitted 12 May, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: The source codes are available at https://github.com/endo-yuki-t/Fewshot-SMIS

  14. arXiv:1910.07192  [pdf, other]

    cs.GR cs.CV

    Animating Landscape: Self-Supervised Learning of Decoupled Motion and Appearance for Single-Image Video Synthesis

    Authors: Yuki Endo, Yoshihiro Kanamori, Shigeru Kuriyama

    Abstract: Automatic generation of a high-quality video from a single image remains a challenging task despite the recent advances in deep generative models. This paper proposes a method that can create a high-resolution, long-term animation using convolutional neural networks (CNNs) from a single landscape image where we mainly focus on skies and waters. Our key observation is that the motion (e.g., moving…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Published at SIGGRAPH Asia 2019 (ACM Transactions on Graphics)

  15. Relighting Humans: Occlusion-Aware Inverse Rendering for Full-Body Human Images

    Authors: Yoshihiro Kanamori, Yuki Endo

    Abstract: Relighting of human images has various applications in image synthesis. For relighting, we must infer albedo, shape, and illumination from a human portrait. Previous techniques rely on human faces for this inference, based on spherical harmonics (SH) lighting. However, because they often ignore light occlusion, inferred shapes are biased and relit images are unnaturally bright particularly at holl…

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: Published at SIGGRAPH Asia 2018 (ACM Transactions on Graphics). Project page with codes, pretrained models, and human model lists is at http://kanamori.cs.tsukuba.ac.jp/projects/relighting_human/

  16. Digital holographic particle volume reconstruction using a deep neural network

    Authors: Tomoyoshi Shimobaba, Takayuki Takahashi, Yota Yamamoto, Yutaka Endo, Atsushi Shiraki, Takashi Nishitsuji, Naoto Hoshikawa, Takashi Kakue, Tomoyoshi Ito

    Abstract: This paper proposes a particle volume reconstruction directly from an in-line hologram using a deep neural network. Digital holographic volume reconstruction conventionally uses multiple diffraction calculations to obtain sectional reconstructed images from an in-line hologram, followed by detection of the lateral and axial positions, and the sizes of particles by using focus metrics. However, the…

    Submitted 21 October, 2018; originally announced October 2018.

  17. arXiv:1710.08343  [pdf, ps, other]

    cs.CV physics.optics

    Computational ghost imaging using deep learning

    Authors: Tomoyoshi Shimobaba, Yutaka Endo, Takashi Nishitsuji, Takayuki Takahashi, Yuki Nagahama, Satoki Hasegawa, Marie Sano, Ryuji Hirayama, Takashi Kakue, Atsushi Shiraki, Tomoyoshi Ito

    Abstract: Computational ghost imaging (CGI) is a single-pixel imaging technique that exploits the correlation between known random patterns and the measured intensity of light transmitted (or reflected) by an object. Although CGI can obtain two- or three- dimensional images with a single or a few bucket detectors, the quality of the reconstructed images is reduced by noise due to the reconstruction of image…

    Submitted 18 October, 2017; originally announced October 2017.

  18. arXiv:1612.03959  [pdf, other]

    cs.CV physics.optics

    Autoencoder-based holographic image restoration

    Authors: Tomoyoshi Shimobaba, Yutaka Endo, Ryuji Hirayama, Yuki Nagahama, Takayuki Takahashi, Takashi Nishitsuji, Takashi Kakue, Atsushi Shiraki, Naoki Takada, Nobuyuki Masuda, Tomoyoshi Ito

    Abstract: We propose a holographic image restoration method using an autoencoder, which is an artificial neural network. Because holographic reconstructed images are often contaminated by direct light, conjugate light, and speckle noise, the discrimination of reconstructed images may be difficult. In this paper, we demonstrate the restoration of reconstructed images from holograms that record page data in h…

    Submitted 12 December, 2016; originally announced December 2016.

  19. arXiv:1504.01424  [pdf, ps, other]

    physics.optics cs.MM

    Improvement of the image quality of random phase-free holography using an iterative method

    Authors: Tomoyoshi Shimobaba, Takashi Kakue, Yutaka Endo, Ryuji Hirayama, Daisuke Hiyama, Satoki Hasegawa, Yuki Nagahama, Marie Sano, Minoru Oikawa, Takashige Sugie, Tomoyoshi Ito

    Abstract: Our proposed method of random phase-free holography using virtual convergence light can obtain large reconstructed images exceeding the size of the hologram, without the assistance of random phase. The reconstructed images have low-speckle noise in the amplitude and phase-only holograms (kinoforms); however, in low-resolution holograms, we obtain a degraded image quality compared to the original i…

    Submitted 6 April, 2015; originally announced April 2015.

  20. arXiv:1503.00360  [pdf, ps, other]

    physics.optics cs.CR

    Optical encryption for large-sized images using random phase-free method

    Authors: Tomoyoshi Shimobaba, Takashi Kakue, Yutaka Endo, Ryuji Hirayama, Daisuke Hiyama, Satoki Hasegawa, Yuki Nagahama, Marie Sano, Takashige Sugie, Tomoyoshi Ito

    Abstract: We propose an optical encryption framework that can encrypt and decrypt large-sized images beyond the size of the encrypted image using our two methods: random phase-free method and scaled diffraction. In order to record the entire image information on the encrypted image, the large-sized images require the random phase to widely diffuse the object light over the encrypted image; however, the rand…

    Submitted 1 March, 2015; originally announced March 2015.

  21. arXiv:1407.2971  [pdf, ps, other]

    physics.optics cs.GR cs.MM

    Numerical investigation of lensless zoomable holographic multiple projections to tilted planes

    Authors: Tomoyoshi Shimobaba, Michal Makowski, Takashi Kakue, Naohisa Okada, Yutaka Endo, Ryuji Hirayama, Daisuke Hiyama, Satoki Hasegawa, Yuki Nagahama, Tomoyoshi Ito

    Abstract: This paper numerically investigates the feasibility of lensless zoomable holographic multiple projections to tilted planes. We have already developed lensless zoomable holographic single projection using scaled diffraction, which calculates diffraction between parallel planes with different sampling pitches. The structure of this zoomable holographic projection is very simple because it does not n…

    Submitted 10 July, 2014; originally announced July 2014.

  22. arXiv:1308.0376  [pdf, ps, other]

    physics.optics cs.GR

    Calculation reduction method for color computer-generated hologram using color space conversion

    Authors: Tomoyoshi Shimobaba, Takashi Kakue, Minoru Oikawa, Naoki Takada, Naohisa Okada, Yutaka Endo, Ryuji Hirayama, Tomoyoshi Ito

    Abstract: We report a calculation reduction method for color computer-generated holograms (CGHs) using color space conversion. Color CGHs are generally calculated on RGB space. In this paper, we calculate color CGHs in other color spaces: for example, YCbCr color space. In YCbCr color space, a RGB image is converted to the luminance component (Y), blue-difference chroma (Cb) and red-difference chroma (Cr) c…

    Submitted 1 August, 2013; originally announced August 2013.