Showing 1–50 of 69 results for author: Tanaka, M

  1. arXiv:2406.18820  [pdf, other]

    cs.DC cs.LG

    Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training

    Authors: Xinyu Lian, Sam Ade Jacobs, Lev Kurilenko, Masahiro Tanaka, Stas Bekman, Olatunji Ruwase, Minjia Zhang

    Abstract: Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state across multiple accelerators, a requirement for model scaling. Consolidating distributed model state into a single checkpoint unacceptably slows down training, and is impractical at extreme scales. Distributed checkpoints, in contrast, are t…

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.14329  [pdf, other]

    cs.LG eess.IV

    Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization

    Authors: Tanapat Ratchatorn, Masayuki Tanaka

    Abstract: Recent advancements in learning algorithms have demonstrated that the sharpness of the loss surface is an effective measure for improving the generalization gap. Building upon this concept, Sharpness-Aware Minimization (SAM) was proposed to enhance model generalization and achieved state-of-the-art performance. SAM consists of two main steps, the weight perturbation step and the weight updating st…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted in ICIP2024. The project page can be accessed at http://www.vip.sc.e.titech.ac.jp/proj/AACE

  3. arXiv:2405.14146  [pdf, other]

    cs.CV

    Hyperspectral Image Dataset for Individual Penguin Identification

    Authors: Youta Noboru, Yuko Ozasa, Masayuki Tanaka

    Abstract: Remote individual animal identification is important for food safety, sport, and animal conservation. Numerous existing remote individual animal identification studies have focused on RGB images. In this paper, we tackle individual penguin identification using hyperspectral (HS) images. To the best of our knowledge, it is the first work to analyze spectral differences between penguin individuals u…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted by 2024 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2024)

  4. arXiv:2405.04771  [pdf, other]

    cs.CV

    Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches

    Authors: Qing Yu, Mikihiro Tanaka, Kent Fujiwara

    Abstract: To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial. However, unlike the abundance of image data, the scarcity of motion data has limited the performance of existing motion-language models. To counter this, we introduce "motion patches", a new representation of motion sequences, and propose using Vision Trans…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR 2024, Project website: https://yu1ut.com/MotionPatches-HP/

  5. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  6. arXiv:2403.11517  [pdf, other]

    q-bio.NC cs.HC

    Inter-individual and inter-site neural code conversion and image reconstruction without shared stimuli

    Authors: Haibao Wang, Jun Kai Ho, Fan L. Cheng, Shuntaro C. Aoki, Yusuke Muraki, Misato Tanaka, Yukiyasu Kamitani

    Abstract: The human brain demonstrates substantial inter-individual variability in fine-grained functional topography, posing challenges in identifying common neural representations across individuals. Functional alignment has the potential to harmonize these individual differences. However, it typically requires an identical set of stimuli presented to different individuals, which is often unavailable. To…

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2403.00363  [pdf, other]

    quant-ph cs.AR

    SFQ counter-based precomputation for large-scale cryogenic VQE machines

    Authors: Yosuke Ueno, Satoshi Imamura, Yuna Tomida, Teruo Tanimoto, Masamitsu Tanaka, Yutaka Tabuchi, Koji Inoue, Hiroshi Nakamura

    Abstract: The variational quantum eigensolver (VQE) is a promising candidate that brings practical benefits from quantum computing. However, the required bandwidth in/out of a cryostat is a limiting factor to scale cryogenic quantum computers. We propose a tailored counter-based module with single flux quantum circuits in 4-K stage which precomputes a part of VQE calculation and reduces the amount of inter-…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 3 tables. Accepted by DAC'24 WIP poster session

  8. arXiv:2401.13868  [pdf, other]

    cs.CE

    Shell topology optimization based on level set method

    Authors: Hiroki Kobayashi, Katsuya Nomura, Yuqing Zhou, Masato Tanaka, Atsushi Kawamoto, Tsuyoshi Nomura

    Abstract: This paper proposes a level set-based method for optimizing shell structures with large design changes in shape and topology. Conventional shell optimization methods, whether parametric or nonparametric, often only allow limited design changes in shape. In the proposed method, the shell structure is defined as the isosurface of a level set function. The level set function is iteratively updated ba…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 13 pages, 13 figures

  9. arXiv:2401.08671  [pdf, other]

    cs.PF cs.LG

    DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

    Authors: Connor Holmes, Masahiro Tanaka, Michael Wyatt, Ammar Ahmad Awan, Jeff Rasley, Samyam Rajbhandari, Reza Yazdani Aminabadi, Heyang Qin, Arash Bakhtiari, Lev Kurilenko, Yuxiong He

    Abstract: The deployment and scaling of large language models (LLMs) have become critical as they permeate various applications, demanding high-throughput and low-latency serving systems. Existing frameworks struggle to balance these requirements, especially for workloads with long prompts. This paper introduces DeepSpeed-FastGen, a system that employs Dynamic SplitFuse, a novel prompt and generation compos…

    Submitted 9 January, 2024; originally announced January 2024.

  10. arXiv:2310.14581  [pdf, other]

    cs.CV cs.AI

    Leveraging Image-Text Similarity and Caption Modification for the DataComp Challenge: Filtering Track and BYOD Track

    Authors: Shuhei Yokoo, Peifei Zhu, Yuchi Ishikawa, Mikihiro Tanaka, Masayoshi Kondo, Hirokatsu Kataoka

    Abstract: Large web crawl datasets have already played an important role in learning multimodal features with high generalization capabilities. However, there are still very limited studies investigating the details or improvements of data design. Recently, a DataComp challenge has been designed to propose the best training data with the fixed models. This paper presents our solution to both filtering track…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at the ICCV 2023 Workshop on Towards the Next Generation of Computer Vision Datasets: DataComp Track

  11. arXiv:2310.10985  [pdf]

    cs.CE

    Computational synthesis of locomoting soft robots by topology optimization

    Authors: Hiroki Kobayashi, Farzad Gholami, S. Macrae Montgomery, Masato Tanaka, Liang Yue, Changyoung Yuhn, Yuki Sato, Atsushi Kawamoto, H. Jerry Qi, Tsuyoshi Nomura

    Abstract: Biological organisms have acquired sophisticated body shapes for walking or climbing through million-year evolutionary processes. In contrast, the components of locomoting soft robots, such as legs and arms, are designed in trial-and-error loops guided by a priori knowledge and experience, which leaves considerable room for improvement. Here, we present optimized soft robots that performed a speci…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 22 total pages (19 pages, 3 supplementary pages), 4 figures, 4 supplementary figures, 1 supplementary table

  12. arXiv:2310.04610  [pdf, other]

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri, et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique…

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  13. Inter-temperature Bandwidth Reduction in Cryogenic QAOA Machines

    Authors: Yosuke Ueno, Yuna Tomida, Teruo Tanimoto, Masamitsu Tanaka, Yutaka Tabuchi, Koji Inoue, Hiroshi Nakamura

    Abstract: The bandwidth limit between cryogenic and room-temperature environments is a critical bottleneck in superconducting noisy intermediate-scale quantum computers. This paper presents the first trial of algorithm-aware system-level optimization to solve this issue by targeting the quantum approximate optimization algorithm. Our counter-based cryogenic architecture using single-flux quantum logic shows…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 4 pages, 5 figures, 1 table. Accepted by IEEE Computer Architecture Letters

  14. arXiv:2309.14509  [pdf, other]

    cs.LG cs.CL cs.DC

    DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

    Authors: Sam Ade Jacobs, Masahiro Tanaka, Chengming Zhang, Minjia Zhang, Shuaiwen Leon Song, Samyam Rajbhandari, Yuxiong He

    Abstract: Computation in a typical Transformer-based large language model (LLM) can be characterized by batch size, hidden dimension, number of layers, and sequence length. Until now, system works for accelerating LLM training have focused on the first three dimensions: data parallelism for batch size, tensor parallelism for hidden size and pipeline parallelism for model depth or layers. These widely studie…

    Submitted 4 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

  15. arXiv:2308.01320  [pdf, other]

    cs.LG cs.AI cs.CL

    DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

    Authors: Zhewei Yao, Reza Yazdani Aminabadi, Olatunji Ruwase, Samyam Rajbhandari, Xiaoxia Wu, Ammar Ahmad Awan, Jeff Rasley, Minjia Zhang, Conglong Li, Connor Holmes, Zhongzhu Zhou, Michael Wyatt, Molly Smith, Lev Kurilenko, Heyang Qin, Masahiro Tanaka, Shuai Che, Shuaiwen Leon Song, Yuxiong He

    Abstract: ChatGPT-like models have revolutionized various applications in artificial intelligence, from summarization and coding to translation, matching or even surpassing human performance. However, the current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF (Reinforcement Learning with Human Feedback) training pipeline for these powerful models, particularly when training at…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 14 pages, 7 figures

  16. arXiv:2307.13985  [pdf, other]

    cs.CR cs.CV

    Enhanced Security against Adversarial Examples Using a Random Ensemble of Encrypted Vision Transformer Models

    Authors: Ryota Iijima, Miki Tanaka, Sayaka Shiota, Hitoshi Kiya

    Abstract: Deep neural networks (DNNs) are well known to be vulnerable to adversarial examples (AEs). In addition, AEs have adversarial transferability, which means AEs generated for a source model can fool another black-box model (target model) with a non-trivial probability. In previous studies, it was confirmed that the vision transformer (ViT) is more robust against the property of adversarial transferab…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 4 pages, 3 figures

  17. arXiv:2306.11629  [pdf, other]

    cs.SD cs.HC eess.AS

    Sound reconstruction from human brain activity via a generative model with brain-like auditory features

    Authors: Jong-Yun Park, Mitsuaki Tsukamoto, Misato Tanaka, Yukiyasu Kamitani

    Abstract: The successful reconstruction of perceptual experiences from human brain activity has provided insights into the neural representations of sensory experiences. However, reconstructing arbitrary sounds has been avoided due to the complexity of temporal sequences in sounds and the limited resolution of neuroimaging modalities. To overcome these challenges, leveraging the hierarchical nature of brain…

    Submitted 20 June, 2023; originally announced June 2023.

  18. arXiv:2303.05763  [pdf, other]

    cs.CV cs.AI cs.HC

    Automatic Detection and Rectification of Paper Receipts on Smartphones

    Authors: Edward Whittaker, Masashi Tanaka, Ikuo Kitagishi

    Abstract: We describe the development of a real-time smartphone app that allows the user to digitize paper receipts in a novel way by "waving" their phone over the receipts and letting the app automatically detect and rectify the receipts for subsequent text recognition. We show that traditional computer vision algorithms for edge and corner detection do not robustly detect the non-linear and discontinuou…

    Submitted 10 March, 2023; originally announced March 2023.

  19. arXiv:2209.08724  [pdf, other]

    cs.LG

    On the Adversarial Transferability of ConvMixer Models

    Authors: Ryota Iijima, Miki Tanaka, Isao Echizen, Hitoshi Kiya

    Abstract: Deep neural networks (DNNs) are well known to be vulnerable to adversarial examples (AEs). In addition, AEs have adversarial transferability, which means AEs generated for a source model can fool another black-box model (target model) with a non-trivial probability. In this paper, we investigate the property of adversarial transferability between models including ConvMixer, which is an isotropic n…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 pages, 5 figures, 5 tables. arXiv admin note: substantial text overlap with arXiv:2209.02997

  20. arXiv:2209.06027  [pdf, other]

    eess.IV cs.CV

    Two-Step Color-Polarization Demosaicking Network

    Authors: Vy Nguyen, Masayuki Tanaka, Yusuke Monno, Masatoshi Okutomi

    Abstract: Polarization information of light in a scene is valuable for various image processing and computer vision tasks. A division-of-focal-plane polarimeter is a promising approach to capture the polarization images of different orientations in one shot, while it requires color-polarization demosaicking. In this paper, we propose a two-step color-polarization demosaicking network (TCPDNet), which consis…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted in ICIP2022. Project page: http://www.ok.sc.e.titech.ac.jp/res/PolarDem/TCPDNet.html

  21. arXiv:2209.02997  [pdf, other]

    cs.CV

    On the Transferability of Adversarial Examples between Encrypted Models

    Authors: Miki Tanaka, Isao Echizen, Hitoshi Kiya

    Abstract: Deep neural networks (DNNs) are well known to be vulnerable to adversarial examples (AEs). In addition, AEs have adversarial transferability, namely, AEs generated for a source model fool other (target) models. In this paper, we investigate the transferability of models encrypted for adversarially robust defense for the first time. To objectively verify the property of transferability, the robustn…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: To appear in ISPACS 2022

  22. arXiv:2208.05758  [pdf, other]

    quant-ph cs.AR

    NEO-QEC: Neural Network Enhanced Online Superconducting Decoder for Surface Codes

    Authors: Yosuke Ueno, Masaaki Kondo, Masamitsu Tanaka, Yasunari Suzuki, Yutaka Tabuchi

    Abstract: Quantum error correction (QEC) is essential for quantum computing to mitigate the effect of errors on qubits, and surface code (SC) is one of the most promising QEC methods. Decoding SCs is the most computational expensive task in the control device of quantum computers (QCs), and many works focus on accurate decoding algorithms for SCs, including ones with neural networks (NNs). Practical QCs als…

    Submitted 1 September, 2022; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 13 pages, 9 figures, 5 tables

  23. arXiv:2208.05198  [pdf, other]

    cs.CV

    A Detection Method of Temporally Operated Videos Using Robust Hashing

    Authors: Shoko Niwa, Miki Tanaka, Hitoshi Kiya

    Abstract: SNS providers are known to carry out the recompression and resizing of uploaded videos/images, but most conventional methods for detecting tampered videos/images are not robust enough against such operations. In addition, videos are temporally operated such as the insertion of new frames and the permutation of frames, of which operations are difficult to be detected by using conventional methods.…

    Submitted 11 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: To appear in 2022 IEEE 11th Global Conference on Consumer Electronics (GCCE 2022)

  24. arXiv:2207.01847  [pdf, other]

    cs.LG

    PoF: Post-Training of Feature Extractor for Improving Generalization

    Authors: Ikuro Sato, Ryota Yamada, Masayuki Tanaka, Nakamasa Inoue, Rei Kawakami

    Abstract: It has been intensively investigated that the local shape, especially flatness, of the loss landscape near a minimum plays an important role for generalization of deep models. We developed a training algorithm called PoF: Post-Training of Feature Extractor that updates the feature extractor part of an already-trained deep model to search a flatter minimum. The characteristics are two-fold: 1) Feat…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML2022. Contains a link to the code

  25. arXiv:2205.13344  [pdf, other]

    cs.RO eess.SY

    A neural network based controller for underwater robotic vehicles

    Authors: Josiane Maria Macedo Fernandes, Marcelo Costa Tanaka, Raimundo Carlos Silvério Freire Júnior, Wallace Moreira Bessa

    Abstract: Due to the enormous technological improvements obtained in the last decades it is possible to use robotic vehicles for underwater exploration. This work describes the development of a dynamic positioning system for remotely operated underwater vehicles based. The adopted approach is developed using Lyapunov Stability Theory and enhanced by a neural network based algorithm for uncertainty and distu…

    Submitted 17 June, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: References added. This is a slightly updated version of the work presented at the COBEM 2011 - 21st Congress of Mechanical Engineering, 2011, Natal Brazil

  26. arXiv:2109.14348  [pdf, ps, other]

    cs.CR eess.SY

    Smart-home anomaly detection using combination of in-home situation and user behavior

    Authors: Masaaki Yamauchi, Masahiro Tanaka, Yuichi Ohsita, Masayuki Murata, Kensuke Ueda, Yoshiaki Kato

    Abstract: Internet-of-things (IoT) devices are vulnerable to malicious operations by attackers, which can cause physical and economic harm to users; therefore, we previously proposed a sequence-based method that modeled user behavior as sequences of in-home events and a base home state to detect anomalous operations. However, that method modeled users' home states based on the time of day; hence, attackers…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 13 pages, 22 figures

  27. arXiv:2108.01892  [pdf, other]

    cs.CV eess.IV

    A universal detector of CNN-generated images using properties of checkerboard artifacts in the frequency domain

    Authors: Miki Tanaka, Sayaka Shiota, Hitoshi Kiya

    Abstract: We propose a novel universal detector for detecting images generated by using CNNs. In this paper, properties of checkerboard artifacts in CNN-generated images are considered, and the spectrum of images is enhanced in accordance with the properties. Next, a classifier is trained by using the enhanced spectrums to judge a query image to be a CNN-generated ones or not. In addition, an ensemble of th…

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: To appear in GCCE 2021

  28. arXiv:2107.11196  [pdf, other]

    cs.CV

    Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU

    Authors: Napat Wanchaitanawong, Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

    Abstract: The combined use of multiple modalities enables accurate pedestrian detection under poor lighting conditions by using the high visibility areas from these modalities together. The vital assumption for the combination use is that there is no or only a weak misalignment between the two modalities. In general, however, this assumption often breaks in actual situations. Due to this assumption's breakd…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: Accepted by MVA2021

  29. arXiv:2107.10524  [pdf, other]

    cs.CV

    Geometric Data Augmentation Based on Feature Map Ensemble

    Authors: Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi

    Abstract: Deep convolutional networks have become the mainstream in computer vision applications. Although CNNs have been successful in many computer vision tasks, it is not free from drawbacks. The performance of CNN is dramatically degraded by geometric transformation, such as large rotations. In this paper, we propose a novel CNN architecture that can improve the robustness against geometric transformati…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted to ICIP2021

  30. arXiv:2105.13954  [pdf, other]

    math.OC cs.LG

    A Gradient Method for Multilevel Optimization

    Authors: Ryo Sato, Mirai Tanaka, Akiko Takeda

    Abstract: Although application examples of multilevel optimization have already been discussed since the 1990s, the development of solution methods was almost limited to bilevel cases due to the difficulty of the problem. In recent years, in machine learning, Franceschi et al. have proposed a method for solving bilevel optimization problems by replacing their lower-level problems with the $T$ steepest desce…

    Submitted 26 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021 camera-ready, 27 pages

  31. arXiv:2103.16063  [pdf, ps, other]

    cs.LG cs.DC

    Automatic Graph Partitioning for Very Large-scale Deep Learning

    Authors: Masahiro Tanaka, Kenjiro Taura, Toshihiro Hanawa, Kentaro Torisawa

    Abstract: This work proposes RaNNC (Rapid Neural Network Connector) as middleware for automatic hybrid parallelism. In recent deep learning research, as exemplified by T5 and GPT-3, the size of neural network models continues to grow. Since such models do not fit into the memory of accelerator devices, they need to be partitioned by model parallelism techniques. Moreover, to accelerate training for huge tra…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted to the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2021), May 2021

  32. arXiv:2103.02198  [pdf, other]

    cs.CV cs.AI cs.LG

    Bulk Production Augmentation Towards Explainable Melanoma Diagnosis

    Authors: Kasumi Obi, Quan Huu Cap, Noriko Umegaki-Arao, Masaru Tanaka, Hitoshi Iyatomi

    Abstract: Although highly accurate automated diagnostic techniques for melanoma have been reported, the realization of a system capable of providing diagnostic evidence based on medical indices remains an open issue because of difficulties in obtaining reliable training data. In this paper, we propose bulk production augmentation (BPA) to generate high-quality, diverse pseudo-skin tumor images with the desi…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES2020), Best Paper Award Student Category in Biomedical Imaging and Image Processing

  33. arXiv:2102.01313  [pdf, other]

    cs.MM cs.CV

    Fake-image detection with Robust Hashing

    Authors: Miki Tanaka, Hitoshi Kiya

    Abstract: In this paper, we investigate whether robust hashing has a possibility to robustly detect fake-images even when multiple manipulation techniques such as JPEG compression are applied to images for the first time. In an experiment, the proposed fake detection with robust hashing is demonstrated to outperform state-of-the-art one under the use of various datasets including fake images generated with…

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: To appear in Life Tech 2021

  34. arXiv:2102.00691  [pdf, other]

    cs.DM

    New Formulation for Coloring Circle Graphs and its Application to Capacitated Stowage Stack Minimization

    Authors: Masato Tanaka, Tomomi Matsui

    Abstract: A circle graph is a graph in which the adjacency of vertices can be represented as the intersection of chords of a circle. The problem of calculating the chromatic number is known to be NP-complete, even on circle graphs. In this paper, we propose a new integer linear programming formulation for a coloring problem on circle graphs. We also show that the linear relaxation problem of our formulation…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: 23 pages, 5 figures

    MSC Class: 05C15; 90C10; 90C27; 90C35; 05C72

  35. Pseudo Polynomial Size LP Formulation for Calculating the Least Core Value of Weighted Voting Games

    Authors: Masato Tanaka, Tomomi Matsui

    Abstract: In this paper, we propose a pseudo polynomial size LP formulation for finding a payoff vector in the least core of a weighted voting game. The numbers of variables and constraints in our formulation are both bounded by $\mbox{O}(n W_+)$, where $n$ is the number of players and $W_+$ is the total sum of (integer) voting weights. When we employ our formulation, a commercial LP solver calculates a pay…

    Submitted 23 August, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: 14 pages, 1 figure

    MSC Class: 91B12; 90C05

    Journal ref: Mathematical Social Sciences, Volume 115, 2022, Pages 47-51

  36. Monte Carlo Methods for Calculating Shapley-Shubik Power Index in Weighted Majority Games

    Authors: Yuto Ushioda, Masato Tanaka, Tomomi Matsui

    Abstract: This paper addresses Monte Carlo algorithms for calculating the Shapley-Shubik power index in weighted majority games. First, we analyze a naive Monte Carlo algorithm and discuss the required number of samples. We then propose an efficient Monte Carlo algorithm and show that our algorithm reduces the required number of samples as compared to the naive algorithm.

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 19 pages

    MSC Class: 91-08; 90-08; 65C05

    Journal ref: Games 2022, 13(3), 44

  37. arXiv:2012.00287  [pdf, other]

    cs.CV eess.IV

    CycleGAN without checkerboard artifacts for counter-forensics of fake-image detection

    Authors: Takayuki Osakabe, Miki Tanaka, Yuma Kinoshita, Hitoshi Kiya

    Abstract: In this paper, we propose a novel CycleGAN without checkerboard artifacts for counter-forensics of fake-image detection. Recent rapid advances in image manipulation tools and deep image synthesis techniques, such as Generative Adversarial Networks (GANs) have easily generated fake images, so detecting manipulated images has become an urgent issue. Most state-of-the-art forgery detection methods as…

    Submitted 1 December, 2020; originally announced December 2020.

  38. arXiv:2011.10232  [pdf, other]

    cs.CV cs.GR eess.IV

    Deep Snapshot HDR Imaging Using Multi-Exposure Color Filter Array

    Authors: Takeru Suda, Masayuki Tanaka, Yusuke Monno, Masatoshi Okutomi

    Abstract: In this paper, we propose a deep snapshot high dynamic range (HDR) imaging framework that can effectively reconstruct an HDR image from the RAW data captured using a multi-exposure color filter array (ME-CFA), which consists of a mosaic pattern of RGB filters with different exposure levels. To effectively learn the HDR image reconstruction network, we introduce the idea of luminance normalization…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: Accepted at ACCV2020 (Oral). Project page: http://www.ok.sc.e.titech.ac.jp/res/DSHDR/

  39. arXiv:2011.06788  [pdf, other]

    cs.CV

    Adaptive Future Frame Prediction with Ensemble Network

    Authors: Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi, Yoko Sasaki

    Abstract: Future frame prediction in videos is a challenging problem because videos include complicated movements and large appearance changes. Learning-based future frame prediction approaches have been proposed in kinds of literature. A common limitation of the existing learning-based approaches is a mismatch of training data and test data. In the future frame prediction task, we can obtain the ground tru…

    Submitted 15 November, 2020; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Accepted at 25th International Conference on Pattern Recognition Workshop (ICPRW 2020)

  40. arXiv:2010.08092  [pdf, other]

    cs.CV

    Human Segmentation with Dynamic LiDAR Data

    Authors: Tao Zhong, Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi

    Abstract: Consecutive LiDAR scans compose dynamic 3D sequences, which contain more abundant information than a single frame. Similar to the development history of image and video perception, dynamic 3D sequence perception starts to come into sight after inspiring research on static 3D data perception. This work proposes a spatio-temporal neural network for human segmentation with the dynamic LiDAR point clo…

    Submitted 15 October, 2020; originally announced October 2020.

  41. arXiv:2009.11558  [pdf, other]

    cs.DB

    An Analysis of Concurrency Control Protocols for In-Memory Databases with CCBench (Extended Version)

    Authors: Takayuki Tanabe, Takashi Hoshino, Hideyuki Kawashima, Jun Nemoto, Masahiro Tanaka, Osamu Tatebe

    Abstract: This paper presents yet another concurrency control analysis platform, CCBench. CCBench supports seven protocols (Silo, TicToc, MOCC, Cicada, SI, SI with latch-free SSN, 2PL) and seven versatile optimization methods and enables the configuration of seven workload parameters. We analyzed the protocols and optimization methods using various workload parameters and a thread count of 224. Previous stu…

    Submitted 18 August, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: A short version is accepted at VLDB 2020 (PVLDB Volume 13, Issue 13). Code is at https://github.com/thawk105/ccbench

    ACM Class: H.2.4

  42. arXiv:2007.14292  [pdf, other]

    eess.IV cs.CV

    Monochrome and Color Polarization Demosaicking Using Edge-Aware Residual Interpolation

    Authors: Miki Morimatsu, Yusuke Monno, Masayuki Tanaka, Masatoshi Okutomi

    Abstract: A division-of-focal-plane or microgrid image polarimeter enables us to acquire a set of polarization images in one shot. Since the polarimeter consists of an image sensor equipped with a monochrome or color polarization filter array (MPFA or CPFA), the demosaicking process to interpolate missing pixel values plays a crucial role in obtaining high-quality polarization images. In this paper, we prop…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted in ICIP2020. Dataset and code are available at http://www.ok.sc.e.titech.ac.jp/res/PolarDem/index.html

  43. Unsupervised Learning of Image Segmentation Based on Differentiable Feature Clustering

    Authors: Wonjik Kim, Asako Kanezaki, Masayuki Tanaka

    Abstract: The usage of convolutional neural networks (CNNs) for unsupervised image segmentation was investigated in this study. In the proposed approach, label prediction and network parameter learning are alternately iterated to meet the following criteria: (a) pixels of similar features should be assigned the same label, (b) spatially continuous pixels should be assigned the same label, and (c) the number…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: IEEE Transactions on Image Processing, Accepted in July, 2020

  44. arXiv:2006.08145  [pdf, other]

    cs.CV eess.IV

    Classifying degraded images over various levels of degradation

    Authors: Kazuki Endo, Masayuki Tanaka, Masatoshi Okutomi

    Abstract: Classification for degraded images having various levels of degradation is very important in practical applications. This paper proposes a convolutional neural network to classify degraded images by using a restoration network and an ensemble learning. The results demonstrate that the proposed network can classify degraded images over various levels of degradation well. This paper also reveals how…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted by the 27th IEEE International Conference on Image Processing (ICIP 2020)

  45. arXiv:2003.05093  [pdf, other]

    cs.CV

    Learning-Based Human Segmentation and Velocity Estimation Using Automatic Labeled LiDAR Sequence for Training

    Authors: Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi, Yoko Sasaki

    Abstract: In this paper, we propose an automatic labeled sequential data generation pipeline for human segmentation and velocity estimation with point clouds. Considering the impact of deep neural networks, state-of-the-art network architectures have been proposed for human recognition using point clouds captured by Light Detection and Ranging (LiDAR). However, one disadvantage is that legacy datasets may o…

    Submitted 10 March, 2020; originally announced March 2020.

    Comments: Please check the following URL for more information. http://www.ok.sc.e.titech.ac.jp/res/LHD/

  46. arXiv:2003.03305  [pdf, other]

    cs.CV

    Captioning Images with Novel Objects via Online Vocabulary Expansion

    Authors: Mikihiro Tanaka, Tatsuya Harada

    Abstract: In this study, we introduce a low cost method for generating descriptions from images containing novel objects. Generally, constructing a model, which can explain images with novel objects, is costly because of the following: (1) collecting a large amount of data for each category, and (2) retraining the entire system. If humans see a small number of novel objects, they are able to estimate their…

    Submitted 6 March, 2020; originally announced March 2020.

  47. arXiv:2001.07761  [pdf, other]

    cs.CV cs.LG eess.IV

    Block-wise Scrambled Image Recognition Using Adaptation Network

    Authors: Koki Madono, Masayuki Tanaka, Masaki Onishi, Tetsuji Ogawa

    Abstract: In this study, a perceptually hidden object-recognition method is investigated to generate secure images recognizable by humans but not machines. Hence, both the perceptual information hiding and the corresponding object recognition methods should be developed. Block-wise image scrambling is introduced to hide perceptual information from a third party. In addition, an adaptation network is propose…

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: 6 pages. Artificial Intelligence of Things (AAAI-2020 WS)

  48. arXiv:1909.07156  [pdf, other]

    cs.LG cs.AI stat.ML

    New Perspective of Interpretability of Deep Neural Networks

    Authors: Masanari Kimura, Masayuki Tanaka

    Abstract: Deep neural networks (DNNs) are known as black-box models. In other words, it is difficult to interpret the internal state of the model. Improving the interpretability of DNNs is one of the hot research topics. However, at present, the definition of interpretability for DNNs is vague, and the question of what is a highly explanatory model is still controversial. To address this issue, we provide t…

    Submitted 12 September, 2019; originally announced September 2019.

  49. arXiv:1906.04868  [pdf, other]

    cs.LG stat.ML

    Semi-flat minima and saddle points by embedding neural networks to overparameterization

    Authors: Kenji Fukumizu, Shoichiro Yamaguchi, Yoh-ichi Mototake, Mirai Tanaka

    Abstract: We theoretically study the landscape of the training error for neural networks in overparameterized cases. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum point of the narrower network gives a minimum or saddle point of the wider one. Our results show that the networks with smooth and ReLU activation have different part…

    Submitted 14 June, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 38 pages, 4 figures

  50. arXiv:1906.01150  [pdf, other]

    cs.LG stat.ML

    Breaking Inter-Layer Co-Adaptation by Classifier Anonymization

    Authors: Ikuro Sato, Kohta Ishikawa, Guoqing Liu, Masayuki Tanaka

    Abstract: This study addresses an issue of co-adaptation between a feature extractor and a classifier in a neural network. A naive joint optimization of a feature extractor and a classifier often brings situations in which an excessively complex feature distribution adapted to a very specific classifier degrades the test performance. We introduce a method called Feature-extractor Optimization through Classi…

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 9 pages. Accepted to ICML 2019