Skip to main content

Showing 1–20 of 20 results for author: Nobuhara, S

.
  1. arXiv:2312.04553  [pdf, other

    cs.CV eess.IV

    SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing

    Authors: Tomoki Ichikawa, Shohei Nobuhara, Ko Nishino

    Abstract: Can we capture shape and reflectance in stealth? Such capability would be valuable for many application domains in vision, xR, robotics, and HCI. We introduce structured polarization for invisible depth and reflectance sensing (SPIDeRS), the first depth and reflectance sensing method using patterns of polarized light. The key idea is to modulate the angle of linear polarization (AoLP) of projected… ▽ More

    Submitted 31 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: to be published in CVPR 2024

  2. arXiv:2310.17632  [pdf, other

    cs.CV

    DeepShaRM: Multi-View Shape and Reflectance Map Recovery Under Unknown Lighting

    Authors: Kohei Yamashita, Shohei Nobuhara, Ko Nishino

    Abstract: Geometry reconstruction of textureless, non-Lambertian objects under unknown natural illumination (i.e., in the wild) remains challenging as correspondences cannot be established and the reflectance cannot be expressed in simple analytical forms. We derive a novel multi-view method, DeepShaRM, that achieves state-of-the-art accuracy on this challenging task. Unlike past methods that formulate this… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 3DV 2024

  3. arXiv:2304.06977  [pdf, other

    cs.CV

    DeePoint: Visual Pointing Recognition and Direction Estimation

    Authors: Shu Nakamura, Yasutomo Kawanishi, Shohei Nobuhara, Ko Nishino

    Abstract: In this paper, we realize automatic visual recognition and direction estimation of pointing. We introduce the first neural pointing understanding method based on two key contributions. The first is the introduction of a first-of-its-kind large-scale dataset for pointing recognition and direction estimation, which we refer to as the DP Dataset. DP Dataset consists of more than 2 million frames of 3… ▽ More

    Submitted 11 September, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: to be published in ICCV 2023

  4. arXiv:2303.17890  [pdf, other

    cs.CV

    Fooling Polarization-based Vision using Locally Controllable Polarizing Projection

    Authors: Zhuoxiao Li, Zhihang Zhong, Shohei Nobuhara, Ko Nishino, Yinqiang Zheng

    Abstract: Polarization is a fundamental property of light that encodes abundant information regarding surface shape, material, illumination and viewing geometry. The computer vision community has witnessed a blossom of polarization-based vision applications, such as reflection removal, shape-from-polarization, transparent object segmentation and color constancy, partially due to the emergence of single-chip… ▽ More

    Submitted 19 June, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

  5. arXiv:2303.13477  [pdf, other

    cs.CV

    TransPoser: Transformer as an Optimizer for Joint Object Shape and Pose Estimation

    Authors: Yuta Yoshitake, Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We propose a novel method for joint estimation of shape and pose of rigid objects from their sequentially observed RGB-D images. In sharp contrast to past approaches that rely on complex non-linear optimization, we propose to formulate it as a neural optimization that learns to efficiently estimate the shape and pose. We introduce Deep Directional Distance Function (DeepDDF), a neural network that… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  6. arXiv:2303.09534  [pdf, other

    cs.CV

    InCrowdFormer: On-Ground Pedestrian World Model From Egocentric Views

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce an on-ground Pedestrian World Model, a computational model that can predict how pedestrians move around an observer in the crowd on the ground plane, but from just the egocentric-views of the observer. Our model, InCrowdFormer, fully leverages the Transformer architecture by modeling pedestrian interaction and egocentric to top-down view transformation with attention, and autoregressi… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  7. arXiv:2212.04483  [pdf, other

    cs.CV

    Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection

    Authors: Tomoki Ichikawa, Yoshiki Fukao, Shohei Nobuhara, Ko Nishino

    Abstract: Computer vision applications have heavily relied on the linear combination of Lambertian diffuse and microfacet specular reflection models for representing reflected radiance, which turns out to be physically incompatible and limited in applicability. In this paper, we derive a novel analytical reflectance model, which we refer to as Fresnel Microfacet BRDF model, that is physically accurate and g… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  8. arXiv:2210.06332  [pdf, other

    cs.CV

    ViewBirdiformer: Learning to recover ground-plane crowd trajectories and ego-motion from a single ego-centric view

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel learning-based method for view birdification, the task of recovering ground-plane trajectories of pedestrians of a crowd and their observer in the same crowd just from the observed ego-centric video. View birdification becomes essential for mobile robot navigation and localization in dense crowds where the static background is hard to see and reliably track. It is challenging… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  9. arXiv:2207.11876  [pdf, other

    cs.CV

    nLMVS-Net: Deep Non-Lambertian Multi-View Stereo

    Authors: Kohei Yamashita, Yuto Enyo, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel multi-view stereo (MVS) method that can simultaneously recover not just per-pixel depth but also surface normals, together with the reflectance of textureless, complex non-Lambertian surfaces captured under known but natural illumination. Our key idea is to formulate MVS as an end-to-end learnable network, which we refer to as nLMVS-Net, that seamlessly integrates radiometric… ▽ More

    Submitted 10 November, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted to WACV 2023

  10. arXiv:2207.03870  [pdf, other

    cs.CV

    BlindSpotNet: Seeing Where We Cannot See

    Authors: Taichi Fukuda, Kotaro Hasegawa, Shinya Ishizaki, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce 2D blind spot estimation as a critical visual task for road scene understanding. By automatically detecting road regions that are occluded from the vehicle's vantage point, we can proactively alert a manual driver or a self-driving system to potential causes of accidents (e.g., draw attention to a road region from which a child may spring out). Detecting blind spots in full 3D would b… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  11. arXiv:2111.05060  [pdf, other

    cs.CV

    View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce view birdification, the problem of recovering ground-plane movements of people in a crowd from an ego-centric video captured from an observer (e.g., a person or a vehicle) also moving in the crowd. Recovered ground-plane movements would provide a sound basis for situational understanding and benefit downstream applications in computer vision and robotics. In this paper, we formulate v… ▽ More

    Submitted 25 October, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Extended journal version of the original paper at BMVC 2021

  12. arXiv:2103.15501  [pdf, other

    cs.CV

    Structure of Multiple Mirror System from Kaleidoscopic Projections of Single 3D Point

    Authors: Kosuke Takahashi, Shohei Nobuhara

    Abstract: This paper proposes a novel algorithm of discovering the structure of a kaleidoscopic imaging system that consists of multiple planar mirrors and a camera. The kaleidoscopic imaging system can be recognized as the virtual multi-camera system and has strong advantages in that the virtual cameras are strictly synchronized and have the same intrinsic parameters. In this paper, we focus on the extrins… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  13. arXiv:2008.07049  [pdf, other

    cs.CV

    Video Region Annotation with Sparse Bounding Boxes

    Authors: Yuzheng Xu, Yang Wu, Nur Sabrina binti Zuraimi, Shohei Nobuhara, Ko Nishino

    Abstract: Video analysis has been moving towards more detailed interpretation (e.g. segmentation) with encouraging progresses. These tasks, however, increasingly rely on densely annotated training data both in space and time. Since such annotation is labour-intensive, few densely annotated video data with detailed region boundaries exist. This work aims to resolve this dilemma by learning to automatically g… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted for publication in BMVC 2020 (Oral)

  14. arXiv:2008.04030  [pdf, other

    cs.CV

    Invertible Neural BRDF for Object Inverse Rendering

    Authors: Zhe Chen, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel neural network-based BRDF model and a Bayesian framework for object inverse rendering, i.e., joint estimation of reflectance and natural illumination from a single image of an object of known geometry. The BRDF is expressed with an invertible neural network, namely, normalizing flow, which provides the expressive power of a high-dimensional representation, computational simpli… ▽ More

    Submitted 11 August, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: accepted to ECCV 2020 as spotlight

  15. arXiv:2003.04260  [pdf, other

    cs.CV cs.RO eess.IV

    SOIC: Semantic Online Initialization and Calibration for LiDAR and Camera

    Authors: Weimin Wang, Shohei Nobuhara, Ryosuke Nakamura, Ken Sakurada

    Abstract: This paper presents a novel semantic-based online extrinsic calibration approach, SOIC (so, I see), for Light Detection and Ranging (LiDAR) and camera sensors. Previous online calibration methods usually need prior knowledge of rough initial values for optimization. The proposed approach removes this limitation by converting the initialization problem to a Perspective-n-Point (PnP) problem with th… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  16. arXiv:1912.04663  [pdf, other

    cs.CV

    3D-GMNet: Single-View 3D Shape Recovery as A Gaussian Mixture

    Authors: Kohei Yamashita, Shohei Nobuhara, Ko Nishino

    Abstract: In this paper, we introduce 3D-GMNet, a deep neural network for 3D object shape reconstruction from a single image. As the name suggests, 3D-GMNet recovers 3D shape as a Gaussian mixture. In contrast to voxels, point clouds, or meshes, a Gaussian mixture representation provides an analytical expression with a small memory footprint while accurately representing the target 3D shape. At the same tim… ▽ More

    Submitted 15 August, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: BMVC 2020

  17. arXiv:1906.10284  [pdf, other

    cs.CV

    Appearance and Shape from Water Reflection

    Authors: Ryo Kawahara, Meng-Yu Jennifer Kuo, Shohei Nobuhara, Ko Nishino

    Abstract: This paper introduces single-image geometric and appearance reconstruction from water reflection photography, i.e., images capturing direct and water-reflected real-world scenes. Water reflection offers an additional viewpoint to the direct sight, collectively forming a stereo pair. The water-reflected scene, however, includes internally scattered and reflected environmental illumination in additi… ▽ More

    Submitted 7 January, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: WACV 2020

  18. arXiv:1810.06327  [pdf, other

    cs.CV

    Deep Photovoltaic Nowcasting

    Authors: **song Zhang, Rodrigo Verschae, Shohei Nobuhara, Jean-François Lalonde

    Abstract: Predicting the short-term power output of a photovoltaic panel is an important task for the efficient management of smart grids. Short-term forecasting at the minute scale, also known as nowcasting, can benefit from sky images captured by regular cameras and installed close to the solar panel. However, estimating the weather conditions from these images---sun intensity, cloud appearance and moveme… ▽ More

    Submitted 15 October, 2018; originally announced October 2018.

    Comments: 28 pages, 10 figure, 4 tables, preprint accepted to Solar Energy

  19. arXiv:1703.02826  [pdf, other

    cs.CV

    A Linear Extrinsic Calibration of Kaleidoscopic Imaging System from Single 3D Point

    Authors: Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama

    Abstract: This paper proposes a new extrinsic calibration of kaleidoscopic imaging system by estimating normals and distances of the mirrors. The problem to be solved in this paper is a simultaneous estimation of all mirror parameters consistent throughout multiple reflections. Unlike conventional methods utilizing a pair of direct and mirrored images of a reference 3D object to estimate the parameters on a… ▽ More

    Submitted 27 May, 2017; v1 submitted 8 March, 2017; originally announced March 2017.

    Comments: to appear in CVPR 2017

  20. arXiv:1612.03153  [pdf, other

    cs.CV

    Panoptic Studio: A Massively Multiview System for Social Interaction Capture

    Authors: Hanbyul Joo, Tomas Simon, Xulong Li, Hao Liu, Lei Tan, Lin Gui, Sean Banerjee, Timothy Godisart, Bart Nabbe, Iain Matthews, Takeo Kanade, Shohei Nobuhara, Yaser Sheikh

    Abstract: We present an approach to capture the 3D motion of a group of people engaged in a social interaction. The core challenges in capturing social interactions are: (1) occlusion is functional and frequent; (2) subtle motion needs to be measured over a space large enough to host a social group; (3) human appearance and configuration variation is immense; and (4) attaching markers to the body may prime… ▽ More

    Submitted 9 December, 2016; originally announced December 2016.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence