Showing 1–12 of 12 results for author: Ikehata, S

  1. arXiv:2405.05924  [pdf]

    cs.HC

    Privacy Protection and Video Manipulation in Immersive Media

    Authors: Leslie Wöhler, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In comparison to traditional footage, 360° videos can convey engaging, immersive experiences and even be utilized to create interactive virtual environments. Like regular recordings, these videos need to consider the privacy of recorded people and could be targets for video manipulations. However, due to their properties like enhanced presence, the effects on users might differ from traditional, n…

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: This is an accepted position statement of CHI 2024 Workshop (Novel Approaches for Understanding and Mitigating Emerging New Harms in Immersive and Embodied Virtual Spaces: A Workshop at CHI 2024)

  2. arXiv:2403.16141  [pdf, other]

    cs.CV

    Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes

    Authors: Takashi Otonari, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Recent advancements in the study of Neural Radiance Fields (NeRF) for dynamic scenes often involve explicit modeling of scene dynamics. However, this approach faces challenges in modeling scene dynamics in urban environments, where moving objects of various categories and scales are present. In such settings, it becomes crucial to effectively eliminate moving objects to accurately reconstruct stat…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), Project website: https://otonari726.github.io/entitynerf/

  3. arXiv:2303.15724  [pdf, other]

    cs.CV cs.GR

    Scalable, Detailed and Mask-Free Universal Photometric Stereo

    Authors: Satoshi Ikehata

    Abstract: In this paper, we introduce SDM-UniPS, a groundbreaking Scalable, Detailed, Mask-free, and Universal Photometric Stereo network. Our approach can recover astonishingly intricate surface normal maps, rivaling the quality of 3D scanners, even when images are captured under unknown, spatially-varying lighting conditions in uncontrolled environments. We have extended previous universal photometric ste…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 (Highlight). The source code will be available at https://github.com/satoshi-ikehata/SDM-UniPS-CVPR2023

  4. arXiv:2212.03635  [pdf, other]

    cs.CV cs.GR

    Non-uniform Sampling Strategies for NeRF on 360° images

    Authors: Takashi Otonari, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: In recent years, the performance of novel view synthesis using perspective images has dramatically improved with the advent of neural radiance fields (NeRF). This study proposes two novel techniques that effectively build NeRF for 360° omnidirectional images. Due to the characteristics of a 360° image of ERP format that has spatial distortion in their high latitude regions…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: Accepted at the 33rd British Machine Vision Conference (BMVC) 2022

  5. arXiv:2211.11386  [pdf, other]

    cs.CV

    PS-Transformer: Learning Sparse Photometric Stereo Network using Self-Attention Mechanism

    Authors: Satoshi Ikehata

    Abstract: Existing deep calibrated photometric stereo networks basically aggregate observations under different lights based on pre-defined operations such as linear projection and max pooling. While they are effective with dense capture, simple first-order operations often fail to capture the high-order interactions among observations under a small number of different lights. To tackle this issue, th…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: BMVC2021. Code and Supplementary are available at https://github.com/satoshi-ikehata/PS-Transformer-BMVC2021

    Journal ref: BMVC. Vol. 2. No. 4. 2021

  6. Saliency-based Multiple Region of Interest Detection from a Single 360° image

    Authors: Yuuki Sawabe, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: 360° images are informative: they contain omnidirectional visual information around the camera. However, the area that a 360° image covers is much larger than the human field of view, so important information in different view directions is easily overlooked. To tackle this issue, we propose a method for predicting the optimal set of Region of Interest (RoI) from a single 360° image usin…

    Submitted 8 September, 2022; originally announced September 2022.

    Journal ref: in IEEE Access, vol. 10, pp. 89124-89133, 2022

  7. arXiv:2206.02452  [pdf, other]

    cs.CV eess.IV

    Universal Photometric Stereo Network using Global Lighting Contexts

    Authors: Satoshi Ikehata

    Abstract: This paper tackles a new photometric stereo task, named universal photometric stereo. Unlike existing tasks that assume specific physical lighting models, and hence drastically limit their usability, a solution algorithm for this task is supposed to work for objects with diverse shapes and materials under arbitrary lighting variations without assuming any specific models. To solve this extremely ch…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR2022. Code and Dataset at https://satoshi-ikehata.github.io/cvpr2022/univps_cvpr2022.html

  8. arXiv:2204.04634  [pdf, other]

    cs.CV cs.MM

    Intersection Prediction from Single 360° Image via Deep Detection of Possible Direction of Travel

    Authors: Naoki Sugimoto, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Movie-Map, an interactive first-person-view map that engages the user in a simulated walking experience, comprises short 360° video segments separated by traffic intersections that are seamlessly connected according to the viewer's direction of travel. However, in wide urban-scale areas with numerous intersecting roads, manual intersection segmentation requires significant human effort. Therefore,…

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in BMVC

  9. arXiv:2202.03176  [pdf, other]

    cs.CV

    Field-of-View IoU for Object Detection in 360° Images

    Authors: Miao Cao, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: 360° cameras have gained popularity over the last few years. In this paper, we propose two fundamental techniques, Field-of-View IoU (FoV-IoU) and 360Augmentation, for object detection in 360° images. Although most object detection neural networks designed for the perspective images are applicable to 360° images in equirectangular projection (ERP) format, their performance deteriorates owing to t…

    Submitted 22 September, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  10. arXiv:1808.10093  [pdf, other]

    cs.CV

    CNN-PS: CNN-based Photometric Stereo for General Non-Convex Surfaces

    Authors: Satoshi Ikehata

    Abstract: Most conventional photometric stereo algorithms inversely solve a BRDF-based image formation model. However, the actual imaging process is often far more complex due to the global light transport on the non-convex surfaces. This paper presents a photometric stereo network that directly learns relationships between the photometric stereo input and surface normals of a scene. For handling unordered,…

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Accepted in ECCV 2018 (ECCV2018). Source code and supplementary are available at https://github.com/satoshi-ikehata/CNN-PS

  11. arXiv:1808.08544  [pdf, other]

    cs.CV

    Scale Drift Correction of Camera Geo-Localization using Geo-Tagged Images

    Authors: Kazuya Iwami, Satoshi Ikehata, Kiyoharu Aizawa

    Abstract: Camera geo-localization from a monocular video is a fundamental task for video analysis and autonomous navigation. Although 3D reconstruction is a key technique to obtain camera poses, monocular 3D reconstruction in a large environment tends to result in the accumulation of errors in rotation, translation, and especially in scale: a problem known as scale drift. To overcome these errors, we propos…

    Submitted 26 August, 2018; originally announced August 2018.

    Comments: ECCV Workshop CVRSUAD

  12. arXiv:1612.01256  [pdf, other]

    cs.CV

    Panoramic Structure from Motion via Geometric Relationship Detection

    Authors: Satoshi Ikehata, Ivaylo Boyadzhiev, Qi Shan, Yasutaka Furukawa

    Abstract: This paper addresses the problem of Structure from Motion (SfM) for indoor panoramic image streams, extremely challenging even for the state-of-the-art due to the lack of textures and minimal parallax. The key idea is the fusion of single-view and multi-view reconstruction techniques via geometric relationship detection (e.g., detecting 2D lines as coplanar in 3D). Rough geometry suffices to perfo…

    Submitted 5 December, 2016; originally announced December 2016.