Skip to main content

Showing 1–15 of 15 results for author: Ohashi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16807  [pdf, other

    cs.CV cs.AI cs.GR cs.MM

    Extreme Compression of Adaptive Neural Images

    Authors: Leo Hoshikawa, Marcos V. Conde, Takeshi Ohashi, Atsushi Irie

    Abstract: Implicit Neural Representations (INRs) and Neural Fields are a novel paradigm for signal representation, from images and audio to 3D scenes and videos. The fundamental idea is to represent a signal as a continuous and differentiable neural network. This idea offers unprecedented benefits such as continuous resolution and memory efficiency, enabling new compression techniques. However, representing… ▽ More

    Submitted 4 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Technical Report. Work in progress

  2. arXiv:2404.03256  [pdf, other

    cs.CV

    Multi Positive Contrastive Learning with Pose-Consistent Generated Images

    Authors: Sho Inayoshi, Aji Resindra Widya, Satoshi Ozaki, Junji Otsuka, Takeshi Ohashi

    Abstract: Model pre-training has become essential in various recognition tasks. Meanwhile, with the remarkable advancements in image generation models, pre-training methods utilizing generated images have also emerged given their ability to produce unlimited training data. However, while existing methods utilizing generated images excel in classification, they fall short in more practical tasks, such as hum… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2403.20080  [pdf, other

    cs.CV

    Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter

    Authors: Yuiko Sakuma, Masakazu Yoshimura, Junji Otsuka, Atsushi Irie, Takeshi Ohashi

    Abstract: Compression of large and performant vision foundation models (VFMs) into arbitrary bit-wise operations (BitOPs) allows their deployment on various hardware. We propose to fine-tune a VFM to a mixed-precision quantized supernet. The supernet-based neural architecture search (NAS) can be adopted for this purpose, which trains a supernet, and then subnets within arbitrary hardware budgets can be extr… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  4. arXiv:2403.10091  [pdf, other

    eess.IV cs.CV

    PQDynamicISP: Dynamically Controlled Image Signal Processor for Any Image Sensors Pursuing Perceptual Quality

    Authors: Masakazu Yoshimura, Junji Otsuka, Takeshi Ohashi

    Abstract: Full DNN-based image signal processors (ISPs) have been actively studied and have achieved superior image quality compared to conventional ISPs. In contrast to this trend, we propose a lightweight ISP that consists of simple conventional ISP functions but achieves high image quality by increasing expressiveness. Specifically, instead of tuning the parameters of the ISP, we propose to control them… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Keywords: image signal processor, ISP, image enhancement, tone map**

  5. arXiv:2307.03338  [pdf

    cs.HC

    From Conservatism to Innovation: The Sequential and Iterative Process of Smart Livestock Technology Adoption in Japanese Small-Farm Systems

    Authors: Takumi Ohashi, Miki Saijo, Kento Suzuki, Shinsuke Arafuka

    Abstract: As global demand for animal products is projected to increase significantly by 2050, driven by population growth and increased incomes, smart livestock technologies are essential for improving efficiency, animal welfare, and environmental sustainability. Conducted within the unique agricultural context of Japan, characterized by small-scale, family-run farms and strong government protection polici… ▽ More

    Submitted 17 June, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 58 pages, 3 figures

    MSC Class: 91C99 ACM Class: J.4

  6. arXiv:2303.13916  [pdf, other

    cs.CV eess.IV

    Self-Supervised Reversed Image Signal Processing via Reference-Guided Dynamic Parameter Selection

    Authors: Junji Otsuka, Masakazu Yoshimura, Takeshi Ohashi

    Abstract: Unprocessed sensor outputs (RAW images) potentially improve both low-level and high-level computer vision algorithms, but the lack of large-scale RAW image datasets is a barrier to research. Thus, reversed Image Signal Processing (ISP) which converts existing RGB images into RAW images has been studied. However, most existing methods require camera-specific metadata or paired RGB and RAW images to… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 19 pages, 12 figures

  7. arXiv:2303.12293  [pdf

    cs.CY cs.HC

    Designing the Metaverse: A Sco** Review to Map Current Research Effort on Ethical Implications

    Authors: Matteo Zallio, Takumi Ohashi, P. John Clarkson

    Abstract: The metaverse and digital, virtual environments have been part of recent history as places in which people can socialize, work and spend time playing games. However, the infancy of the development of these digital, virtual environments brings some challenges that are still not fully depicted. With this article, we seek to identify and map the currently available knowledge and scientific effort to… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 9 pages, 2 figures

  8. arXiv:2211.05654  [pdf, other

    cs.CV

    Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer

    Authors: Siddharth Sagar Nijhawan, Leo Hoshikawa, Atsushi Irie, Masakazu Yoshimura, Junji Otsuka, Takeshi Ohashi

    Abstract: We propose a light-weight and highly efficient Joint Detection and Tracking pipeline for the task of Multi-Object Tracking using a fully-transformer architecture. It is a modified version of TransTrack, which overcomes the computational bottleneck associated with its design, and at the same time, achieves state-of-the-art MOTA score of 73.20%. The model design is driven by a transformer based back… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  9. arXiv:2211.01146  [pdf, other

    cs.CV cs.AI eess.SY

    DynamicISP: Dynamically Controlled Image Signal Processor for Image Recognition

    Authors: Masakazu Yoshimura, Junji Otsuka, Atsushi Irie, Takeshi Ohashi

    Abstract: Image Signal Processors (ISPs) play important roles in image recognition tasks as well as in the perceptual quality of captured images. In most cases, experts make a lot of effort to manually tune many parameters of ISPs, but the parameters are sub-optimal. In the literature, two types of techniques have been actively studied: a machine learning-based parameter tuning technique and a DNN-based ISP… ▽ More

    Submitted 27 August, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to ICCV2023. Several updates from v2 including additional experiments and modification of typos in Auto Gain equation

  10. arXiv:2210.16046  [pdf, other

    cs.CV eess.IV

    Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments

    Authors: Masakazu Yoshimura, Junji Otsuka, Atsushi Irie, Takeshi Ohashi

    Abstract: Image recognition models that work in challenging environments (e.g., extremely dark, blurry, or high dynamic range conditions) must be useful. However, creating training datasets for such environments is expensive and hard due to the difficulties of data collection and annotation. It is desirable if we could get a robust model without the need for hard-to-obtain datasets. One simple approach is t… ▽ More

    Submitted 27 March, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted to CVPR2023

  11. arXiv:2201.07152  [pdf

    cs.HC

    The Evolution of Assistive Technology: A Literature Review of Technology Developments and Applications

    Authors: Matteo Zallio, Takumi Ohashi

    Abstract: The term Assistive Technology has evolved over the years and identifies equipment or product systems, whether acquired, modified, or customized, that are used to increase, maintain, or improve functional capabilities of individuals with disabilities. Considering the advances that have been made, what trends can be identified to provide evidence of the evolution of AT as devices that foster accessi… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 9 pages, 4 figures

  12. arXiv:2103.12993  [pdf, other

    cs.IT math.PR

    Analysis of QoS in Heterogeneous Networks with Clustered Deployment and Caching Aware Capacity Allocation

    Authors: Takehiro Ohashi

    Abstract: In cellular networks, the densification of connected devices and base stations engender the ever-growing traffic intensity, and caching popular contents with smart management is a promising way to alleviate such consequences. Our research extends the previously proposed analysis of three-tier cache enabled Heterogeneous Networks (HetNets). The main contributions are threefold. We consider the more… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  13. Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild

    Authors: Takuya Ohashi, Yosuke Ikegami, Yoshihiko Nakamura

    Abstract: Although many studies have investigated markerless motion capture, the technology has not been applied to real sports or concerts. In this paper, we propose a markerless motion capture method with spatiotemporal accuracy and smoothness from multiple cameras in wide-space and multi-person environments. The proposed method predicts each person's 3D pose and determines the bounding box of multi-camer… ▽ More

    Submitted 14 October, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Journal ref: Image and Vision Computing, Volume 104, 2020

  14. arXiv:1912.03880  [pdf, other

    cs.RO cs.CV

    Video Motion Capture from the Part Confidence Maps of Multi-Camera Images by Spatiotemporal Filtering Using the Human Skeletal Model

    Authors: Takuya Ohashi, Yosuke Ikegami, Kazuki Yamamoto, Wataru Takano, Yoshihiko Nakamura

    Abstract: This paper discusses video motion capture, namely, 3D reconstruction of human motion from multi-camera images. After the Part Confidence Maps are computed from each camera image, the proposed spatiotemporal filter is applied to deliver the human motion data with accuracy and smoothness for human motion analysis. The spatiotemporal filter uses the human skeleton and mixes temporal smoothing in two-… ▽ More

    Submitted 10 December, 2019; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: International Conference on Intelligent Robots and Systems (IROS), 2018

  15. arXiv:1901.09792  [pdf, other

    cs.RO cs.AI cs.LG

    Sensorimotor learning for artificial body perception

    Authors: German Diez-Valencia, Takuya Ohashi, Pablo Lanillos, Gordon Cheng

    Abstract: Artificial self-perception is the machine ability to perceive its own body, i.e., the mastery of modal and intermodal contingencies of performing an action with a specific sensors/actuators body configuration. In other words, the spatio-temporal patterns that relate its sensors (e.g. visual, proprioceptive, tactile, etc.), its actions and its body latent variables are responsible of the distinctio… ▽ More

    Submitted 15 January, 2019; originally announced January 2019.

    Comments: Workshop on Crossmodal Learning for Intelligent Robotics. IEEE Int. Conference on Intelligent Robots and Systems (IROS 2018)