Skip to main content

Showing 1–25 of 25 results for author: Yamaguchi, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08232  [pdf, other

    cs.CV cs.GR

    OpenCOLE: Towards Reproducible Automatic Graphic Design Generation

    Authors: Naoto Inoue, Kento Masui, Wataru Shimoda, Kota Yamaguchi

    Abstract: Automatic generation of graphic designs has recently received considerable attention. However, the state-of-the-art approaches are complex and rely on proprietary datasets, which creates reproducibility barriers. In this paper, we propose an open framework for automatic graphic design called OpenCOLE, where we build a modified version of the pioneering COLE and train our model exclusively on publi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: To appear as an extended abstract (EA) in Workshop on Graphic Design Understanding and Generation (in CVPR2024), code: https://github.com/CyberAgentAILab/OpenCOLE

  2. arXiv:2403.12784  [pdf, other

    cs.CV

    Total Disentanglement of Font Images into Style and Character Class Features

    Authors: Daichi Haraguchi, Wataru Shimoda, Kota Yamaguchi, Seiichi Uchida

    Abstract: In this paper, we demonstrate a total disentanglement of font images. Total disentanglement is a neural network-based method for decomposing each font image nonlinearly and completely into its style and content (i.e., character class) features. It uses a simple but careful training procedure to extract the common style feature from all `A'-`Z' images in the same font and the common content feature… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  3. arXiv:2311.13602  [pdf, other

    cs.CV

    Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

    Authors: Daichi Horita, Naoto Inoue, Kotaro Kikuchi, Kota Yamaguchi, Kiyoharu Aizawa

    Abstract: Content-aware graphic layout generation aims to automatically arrange visual elements along with a given content, such as an e-commerce product image. In this paper, we argue that the current layout generation approaches suffer from the limited training data for the high-dimensional layout structure. We show that a simple retrieval augmentation can significantly improve the generation quality. Our… ▽ More

    Submitted 15 April, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR 2024 (Oral), Project website: https://udonda.github.io/RALF/ , GitHub: https://github.com/CyberAgentAILab/RALF

  4. arXiv:2309.02099  [pdf, other

    cs.CV cs.MM

    Towards Diverse and Consistent Typography Generation

    Authors: Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi

    Abstract: In this work, we consider the typography generation task that aims at producing diverse typographic styling for the given graphic document. We formulate typography generation as a fine-grained attribute generation for multiple text elements and build an autoregressive model to generate diverse typography that matches the input design context. We further propose a simple yet effective sampling appr… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  5. arXiv:2303.18248  [pdf, other

    cs.CV

    Towards Flexible Multi-modal Document Models

    Authors: Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

    Abstract: Creative workflows for generating graphical documents involve complex inter-related tasks, such as aligning elements, choosing appropriate fonts, or employing aesthetically harmonious colors. In this work, we attempt at building a holistic model that can jointly solve many different design tasks. Our model, which we denote by FlexDM, treats vector graphic documents as a set of multi-modal elements… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: To be published in CVPR2023 (highlight), project page: https://cyberagentailab.github.io/flex-dm

  6. arXiv:2303.08137  [pdf, other

    cs.CV cs.GR

    LayoutDM: Discrete Diffusion Model for Controllable Layout Generation

    Authors: Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

    Abstract: Controllable layout generation aims at synthesizing plausible arrangement of element bounding boxes with optional constraints, such as type or position of a specific element. In this work, we try to solve a broad range of layout generation tasks in a single model that is based on discrete state-space diffusion models. Our model, named LayoutDM, naturally handles the structured layout data in the d… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: To be published in CVPR2023, project page: https://cyberagentailab.github.io/layout-dm/

  7. arXiv:2303.01308  [pdf, ps, other

    cs.HC

    In-the-wild vibrotactile sensation: Perceptual transformation of vibrations from smartphones

    Authors: Keiko Yamaguchi, Satoshi Takahashi

    Abstract: Vibrations emitted by smartphones have become a part of our daily lives. The vibrations can add various meanings to the information people obtain from the screen. Hence, it is worth understanding the perceptual transformation of vibration with ordinary devices to evaluate the possibility of enriched vibrotactile communication via smartphones. This study assessed the reproducibility of vibrotactile… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 8 pages, 9 figures

  8. arXiv:2212.11541  [pdf, other

    cs.CV cs.MM

    Generative Colorization of Structured Mobile Web Pages

    Authors: Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

    Abstract: Color is a critical design factor for web pages, affecting important factors such as viewer emotions and the overall trust and satisfaction of a website. Effective coloring requires design knowledge and expertise, but if this process could be automated through data-driven modeling, efficient exploration and alternative workflows would be possible. However, this direction remains underexplored due… ▽ More

    Submitted 23 January, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted to WACV 2023

  9. arXiv:2205.03549  [pdf

    physics.ins-det cs.CV eess.IV physics.app-ph

    Deep Learning-enabled Detection and Classification of Bacterial Colonies using a Thin Film Transistor (TFT) Image Sensor

    Authors: Yuzhu Li, Tairan Liu, Hatice Ceylan Koydemir, Hongda Wang, Keelan O'Riordan, Bijie Bai, Yuta Haga, Junji Kobashi, Hitoshi Tanaka, Takaya Tamaru, Kazunori Yamaguchi, Aydogan Ozcan

    Abstract: Early detection and identification of pathogenic bacteria such as Escherichia coli (E. coli) is an essential task for public health. The conventional culture-based methods for bacterial colony detection usually take >24 hours to get the final read-out. Here, we demonstrate a bacterial colony-forming-unit (CFU) detection system exploiting a thin-film-transistor (TFT)-based image sensor array that s… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

    Comments: 18 Pages, 6 Figures

    Journal ref: ACS Photonics (2022)

  10. arXiv:2201.06674  [pdf, other

    cs.CL

    TYPIC: A Corpus of Template-Based Diagnostic Comments on Argumentation

    Authors: Shoichi Naito, Shintaro Sawada, Chihiro Nakagawa, Naoya Inoue, Kenshi Yamaguchi, Iori Shimizu, Farjana Sultana Mim, Keshav Singh, Kentaro Inui

    Abstract: Providing feedback on the argumentation of the learner is essential for develo** critical thinking skills, however, it requires a lot of time and effort. To mitigate the overload on teachers, we aim to automate a process of providing feedback, especially giving diagnostic comments which point out the weaknesses inherent in the argumentation. It is recommended to give specific diagnostic comments… ▽ More

    Submitted 21 June, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    Comments: LREC2022. The dataset is available at https://github.com/cl-tohoku/TYPIC

  11. arXiv:2110.01890  [pdf, other

    cs.CV

    De-rendering Stylized Texts

    Authors: Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi

    Abstract: Editing raster text is a promising but challenging task. We propose to apply text vectorization for the task of raster text editing in display media, such as posters, web pages, or advertisements. In our approach, instead of applying image transformation or generation in the raster domain, we learn a text vectorization model to parse all the rendering parameters including text, location, size, fon… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted to ICCV 2021. Codes: https://github.com/CyberAgentAILab/derendering-text

  12. arXiv:2108.01249  [pdf, other

    cs.CV

    CanvasVAE: Learning to Generate Vector Graphic Documents

    Authors: Kota Yamaguchi

    Abstract: Vector graphic documents present visual elements in a resolution free, compact format and are often seen in creative applications. In this work, we attempt to learn a generative model of vector graphic documents. We define vector graphic documents by a multi-modal set of attributes associated to a canvas and a sequence of visual elements such as shapes, images, or texts, and train variational auto… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: to be published in ICCV 2021

  13. Constrained Graphic Layout Generation via Latent Optimization

    Authors: Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

    Abstract: It is common in graphic design humans visually arrange various elements according to their design intent and semantics. For example, a title text almost always appears on top of other elements in a document. In this work, we generate graphic layouts that can flexibly incorporate such design semantics, either specified implicitly or explicitly by a user. We optimize using the latent space of an off… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted by ACM Multimedia 2021

  14. A Novel Approach to Analyze Fashion Digital Archive from Humanities

    Authors: Satoshi Takahashi, Keiko Yamaguchi, Asuka Watanabe

    Abstract: Fashion styles adopted every day are an important aspect of culture, and style trend analysis helps provide a deeper understanding of our societies and cultures. To analyze everyday fashion trends from the humanities perspective, we need a digital archive that includes images of what people wore in their daily lives over an extended period. In fashion research, building digital fashion image archi… ▽ More

    Submitted 10 September, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: In Proceedings of 'The 23rd International Conference on Asia-Pacific Digital Libraries' 17 pages, 8 figures. arXiv admin note: text overlap with arXiv:2009.13395

    Journal ref: In International Conference on Asian Digital Libraries (pp. 179-194). Springer, Cham (2021)

  15. arXiv:2011.01428  [pdf, other

    cs.RO

    Leaf-like Origami with Bistability for Self-Adaptive Gras** Motions

    Authors: Hiromi Yasuda, Kyle Johnson, Vicente Arroyos, Koshiro Yamaguchi, Jordan R. Raney, **kyu Yang

    Abstract: The leaf-like origami structure was inspired by geometric patterns found in nature, exhibiting unique transitions between open and closed shapes. With a bistable energy landscape, leaf-like origami is able to replicate the autonomous gras** of objects observed in biological systems like the Venus flytrap. We show uniform gras** motions of the leaf-like origami, as well as various non-uniform g… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  16. arXiv:2009.13395  [pdf, ps, other

    cs.CV cs.DB cs.DL

    CAT STREET: Chronicle Archive of Tokyo Street-fashion

    Authors: Satoshi Takahashi, Keiko Yamaguchi, Asuka Watanabe

    Abstract: The analysis of daily-life fashion trends can provide us a profound understanding of our societies and cultures. However, no appropriate digital archive exists that includes images illustrating what people wore in their daily lives over an extended period. In this study, we propose a new fashion image archive, Chronicle Archive of Tokyo Street-fashion (CAT STREET), to shed light on daily-life fash… ▽ More

    Submitted 29 April, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 19 pages, 17 figures

  17. arXiv:1906.10269  [pdf, ps, other

    cs.CV

    Serif or Sans: Visual Font Analytics on Book Covers and Online Advertisements

    Authors: Yuto Shinahara, Takuro Karamatsu, Daisuke Harada, Kota Yamaguchi, Seiichi Uchida

    Abstract: In this paper, we conduct a large-scale study of font statistics in book covers and online advertisements. Through the statistical study, we try to understand how graphic designers relate fonts and content genres and identify the relationship between font styles, colors, and genres. We propose an automatic approach to extract font information from graphic designs by applying a sequence of characte… ▽ More

    Submitted 29 June, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: Accepted by ICDAR2019

  18. arXiv:1906.01196  [pdf

    quant-ph cs.ET

    Convolution filter embedded quantum gate autoencoder

    Authors: Kodai Shiba, Katsuyoshi Sakamoto, Koichi Yamaguchi, Dinesh Bahadur Malla, Tomah Sogabe

    Abstract: The autoencoder is one of machine learning algorithms used for feature extraction by dimension reduction of input data, denoising of images, and prior learning of neural networks. At the same time, autoencoders using quantum computers are also being developed. However, current quantum computers have a limited number of qubits, which makes it difficult to calculate big data. In this paper, as a sol… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 8 pages, 7 figures

  19. arXiv:1810.10258  [pdf, ps, other

    cs.DS

    A Maximum Edge-Weight Clique Extraction Algorithm Based on Branch-and-Bound

    Authors: Satoshi Shimizu, Kazuaki Yamaguchi, Sumio Masuda

    Abstract: The maximum edge-weight clique problem is to find a clique whose sum of edge-weight is the maximum for a given edge-weighted undirected graph. The problem is NP-hard and some branch-and-bound algorithms have been proposed. In this paper, we propose a new exact algorithm based on branch-and-bound. It assigns edge-weights to vertices and calculates upper bounds using vertex coloring. By some computa… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

  20. arXiv:1804.09979  [pdf, other

    cs.CV

    Recommending Outfits from Personal Closet

    Authors: Pongsate Tangseng, Kota Yamaguchi, Takayuki Okatani

    Abstract: We consider grading a fashion outfit for recommendation, where we assume that users have a closet of items and we aim at producing a score for an arbitrary combination of items in the closet. The challenge in outfit grading is that the input to the system is a bag of item pictures that are unordered and vary in size. We build a deep neural network-based system that can take variable-length items a… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.

  21. arXiv:1710.08049  [pdf, other

    cs.CV

    Feedback-prop: Convolutional Neural Network Inference under Partial Evidence

    Authors: Tianlu Wang, Kota Yamaguchi, Vicente Ordonez

    Abstract: We propose an inference procedure for deep convolutional neural networks (CNNs) when partial evidence is available. Our method consists of a general feedback-based propagation approach (feedback-prop) that boosts the prediction accuracy for an arbitrary set of unknown target labels when the values for a non-overlap** arbitrary set of target labels are known. We show that existing models trained… ▽ More

    Submitted 29 March, 2018; v1 submitted 22 October, 2017; originally announced October 2017.

    Comments: Accepted to CVPR 2018

  22. arXiv:1708.01892  [pdf, other

    cs.CV

    End-to-end learning potentials for structured attribute prediction

    Authors: Kota Yamaguchi, Takayuki Okatani, Takayuki Umeda, Kazuhiko Murasaki, Kyoko Sudo

    Abstract: We present a structured inference approach in deep neural networks for multiple attribute prediction. In attribute prediction, a common approach is to learn independent classifiers on top of a good feature representation. However, such classifiers assume conditional independence on features and do not explicitly consider the dependency between attributes in the inference process. We propose to for… ▽ More

    Submitted 6 August, 2017; originally announced August 2017.

  23. arXiv:1703.01386  [pdf, other

    cs.CV

    Looking at Outfit to Parse Clothing

    Authors: Pongsate Tangseng, Zhipeng Wu, Kota Yamaguchi

    Abstract: This paper extends fully-convolutional neural networks (FCN) for the clothing parsing problem. Clothing parsing requires higher-level knowledge on clothing semantics and contextual cues to disambiguate fine-grained categories. We extend FCN architecture with a side-branch network which we refer outfit encoder to predict a consistent set of clothing labels to encourage combinatorial preference, and… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

  24. arXiv:1607.07262  [pdf, other

    cs.CV

    Automatic Attribute Discovery with Neural Activations

    Authors: Sirion Vittayakorn, Takayuki Umeda, Kazuhiko Murasaki, Kyoko Sudo, Takayuki Okatani, Kota Yamaguchi

    Abstract: How can a machine learn to recognize visual attributes emerging out of online community without a definitive supervised dataset? This paper proposes an automatic approach to discover and analyze visual attributes from a noisy collection of image-text data on the Web. Our approach is based on the relationship between attributes and neural activations in the deep network. We characterize the visual… ▽ More

    Submitted 25 July, 2016; originally announced July 2016.

    Comments: ECCV 2016

  25. arXiv:1204.1393  [pdf, other

    cs.CV

    Continuous Markov Random Fields for Robust Stereo Estimation

    Authors: Koichiro Yamaguchi, Tamir Hazan, David McAllester, Raquel Urtasun

    Abstract: In this paper we present a novel slanted-plane MRF model which reasons jointly about occlusion boundaries as well as depth. We formulate the problem as the one of inference in a hybrid MRF composed of both continuous (i.e., slanted 3D planes) and discrete (i.e., occlusion boundaries) random variables. This allows us to define potentials encoding the ownership of the pixels that compose the boundar… ▽ More

    Submitted 5 April, 2012; originally announced April 2012.

    ACM Class: I.2.10; I.4.8