Skip to main content

Showing 1–28 of 28 results for author: Nishino, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14530  [pdf, other

    cs.CV

    Multistable Shape from Shading Emerges from Patch Diffusion

    Authors: Xinran Nicole Han, Todd Zickler, Ko Nishino

    Abstract: Models for monocular shape reconstruction of surfaces with diffuse reflection -- shape from shading -- ought to produce distributions of outputs, because there are fundamental mathematical ambiguities of both continuous (e.g., bas-relief) and discrete (e.g., convex/concave) varieties which are also experienced by humans. Yet, the outputs of current models are limited to point estimates or tight di… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2312.04553  [pdf, other

    cs.CV eess.IV

    SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing

    Authors: Tomoki Ichikawa, Shohei Nobuhara, Ko Nishino

    Abstract: Can we capture shape and reflectance in stealth? Such capability would be valuable for many application domains in vision, xR, robotics, and HCI. We introduce structured polarization for invisible depth and reflectance sensing (SPIDeRS), the first depth and reflectance sensing method using patterns of polarized light. The key idea is to modulate the angle of linear polarization (AoLP) of projected… ▽ More

    Submitted 31 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: to be published in CVPR 2024

  3. arXiv:2312.04530  [pdf, other

    cs.CV cs.RO

    Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation

    Authors: Genki Kinoshita, Ko Nishino

    Abstract: In this paper, we introduce a novel training method for making any monocular depth network learn absolute scale and estimate metric road-scene depth just from regular training data, i.e., driving videos. We refer to this training framework as StableCamH. The key idea is to leverage cars found on the road as sources of scale supervision but to incorporate them in the training robustly. StableCamH d… ▽ More

    Submitted 20 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

  4. arXiv:2312.04529  [pdf, other

    cs.CV

    Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectance

    Authors: Yuto Enyo, Ko Nishino

    Abstract: Reflectance bounds the frequency spectrum of illumination in the object appearance. In this paper, we introduce the first stochastic inverse rendering method, which recovers the attenuated frequency spectrum of an illumination jointly with the reflectance of an object of known geometry from a single image. Our key idea is to solve this blind inverse problem in the reflectance map, an appearance re… ▽ More

    Submitted 26 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: to be published in CVPR 2024

  5. arXiv:2312.04527  [pdf, other

    cs.CV

    Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection

    Authors: Kohei Yamashita, Vincent Lepetit, Ko Nishino

    Abstract: Computer vision has long relied on two kinds of correspondences: pixel correspondences in images and 3D correspondences on object surfaces. Is there another kind, and if there is, what can they do for us? In this paper, we introduce correspondences of the third kind we call reflection correspondences and show that they can help estimate camera pose by just looking at objects without relying on the… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  6. arXiv:2310.17632  [pdf, other

    cs.CV

    DeepShaRM: Multi-View Shape and Reflectance Map Recovery Under Unknown Lighting

    Authors: Kohei Yamashita, Shohei Nobuhara, Ko Nishino

    Abstract: Geometry reconstruction of textureless, non-Lambertian objects under unknown natural illumination (i.e., in the wild) remains challenging as correspondences cannot be established and the reflectance cannot be expressed in simple analytical forms. We derive a novel multi-view method, DeepShaRM, that achieves state-of-the-art accuracy on this challenging task. Unlike past methods that formulate this… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 3DV 2024

  7. arXiv:2304.06977  [pdf, other

    cs.CV

    DeePoint: Visual Pointing Recognition and Direction Estimation

    Authors: Shu Nakamura, Yasutomo Kawanishi, Shohei Nobuhara, Ko Nishino

    Abstract: In this paper, we realize automatic visual recognition and direction estimation of pointing. We introduce the first neural pointing understanding method based on two key contributions. The first is the introduction of a first-of-its-kind large-scale dataset for pointing recognition and direction estimation, which we refer to as the DP Dataset. DP Dataset consists of more than 2 million frames of 3… ▽ More

    Submitted 11 September, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: to be published in ICCV 2023

  8. arXiv:2303.17890  [pdf, other

    cs.CV

    Fooling Polarization-based Vision using Locally Controllable Polarizing Projection

    Authors: Zhuoxiao Li, Zhihang Zhong, Shohei Nobuhara, Ko Nishino, Yinqiang Zheng

    Abstract: Polarization is a fundamental property of light that encodes abundant information regarding surface shape, material, illumination and viewing geometry. The computer vision community has witnessed a blossom of polarization-based vision applications, such as reflection removal, shape-from-polarization, transparent object segmentation and color constancy, partially due to the emergence of single-chip… ▽ More

    Submitted 19 June, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

  9. arXiv:2303.13477  [pdf, other

    cs.CV

    TransPoser: Transformer as an Optimizer for Joint Object Shape and Pose Estimation

    Authors: Yuta Yoshitake, Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We propose a novel method for joint estimation of shape and pose of rigid objects from their sequentially observed RGB-D images. In sharp contrast to past approaches that rely on complex non-linear optimization, we propose to formulate it as a neural optimization that learns to efficiently estimate the shape and pose. We introduce Deep Directional Distance Function (DeepDDF), a neural network that… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  10. arXiv:2303.09534  [pdf, other

    cs.CV

    InCrowdFormer: On-Ground Pedestrian World Model From Egocentric Views

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce an on-ground Pedestrian World Model, a computational model that can predict how pedestrians move around an observer in the crowd on the ground plane, but from just the egocentric-views of the observer. Our model, InCrowdFormer, fully leverages the Transformer architecture by modeling pedestrian interaction and egocentric to top-down view transformation with attention, and autoregressi… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  11. arXiv:2212.04483  [pdf, other

    cs.CV

    Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection

    Authors: Tomoki Ichikawa, Yoshiki Fukao, Shohei Nobuhara, Ko Nishino

    Abstract: Computer vision applications have heavily relied on the linear combination of Lambertian diffuse and microfacet specular reflection models for representing reflected radiance, which turns out to be physically incompatible and limited in applicability. In this paper, we derive a novel analytical reflectance model, which we refer to as Fresnel Microfacet BRDF model, that is physically accurate and g… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  12. arXiv:2210.06332  [pdf, other

    cs.CV

    ViewBirdiformer: Learning to recover ground-plane crowd trajectories and ego-motion from a single ego-centric view

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel learning-based method for view birdification, the task of recovering ground-plane trajectories of pedestrians of a crowd and their observer in the same crowd just from the observed ego-centric video. View birdification becomes essential for mobile robot navigation and localization in dense crowds where the static background is hard to see and reliably track. It is challenging… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  13. arXiv:2207.11876  [pdf, other

    cs.CV

    nLMVS-Net: Deep Non-Lambertian Multi-View Stereo

    Authors: Kohei Yamashita, Yuto Enyo, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel multi-view stereo (MVS) method that can simultaneously recover not just per-pixel depth but also surface normals, together with the reflectance of textureless, complex non-Lambertian surfaces captured under known but natural illumination. Our key idea is to formulate MVS as an end-to-end learnable network, which we refer to as nLMVS-Net, that seamlessly integrates radiometric… ▽ More

    Submitted 10 November, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted to WACV 2023

  14. arXiv:2207.03870  [pdf, other

    cs.CV

    BlindSpotNet: Seeing Where We Cannot See

    Authors: Taichi Fukuda, Kotaro Hasegawa, Shinya Ishizaki, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce 2D blind spot estimation as a critical visual task for road scene understanding. By automatically detecting road regions that are occluded from the vehicle's vantage point, we can proactively alert a manual driver or a self-driving system to potential causes of accidents (e.g., draw attention to a road region from which a child may spring out). Detecting blind spots in full 3D would b… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  15. arXiv:2111.05060  [pdf, other

    cs.CV

    View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements

    Authors: Mai Nishimura, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce view birdification, the problem of recovering ground-plane movements of people in a crowd from an ego-centric video captured from an observer (e.g., a person or a vehicle) also moving in the crowd. Recovered ground-plane movements would provide a sound basis for situational understanding and benefit downstream applications in computer vision and robotics. In this paper, we formulate v… ▽ More

    Submitted 25 October, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Extended journal version of the original paper at BMVC 2021

  16. arXiv:2009.11072  [pdf, other

    cs.CV

    Differential Viewpoints for Ground Terrain Material Recognition

    Authors: Jia Xue, Hang Zhang, Ko Nishino, Kristin J. Dana

    Abstract: Computational surface modeling that underlies material recognition has transitioned from reflectance modeling using in-lab controlled radiometric measurements to image-based representations based on internet-mined single-view images captured in the scene. We take a middle-ground approach for material recognition that takes advantage of both rich radiometric cues and flexible image capture. A key c… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: substantial text overlap with arXiv:1612.02372

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2020

  17. arXiv:2008.07049  [pdf, other

    cs.CV

    Video Region Annotation with Sparse Bounding Boxes

    Authors: Yuzheng Xu, Yang Wu, Nur Sabrina binti Zuraimi, Shohei Nobuhara, Ko Nishino

    Abstract: Video analysis has been moving towards more detailed interpretation (e.g. segmentation) with encouraging progresses. These tasks, however, increasingly rely on densely annotated training data both in space and time. Since such annotation is labour-intensive, few densely annotated video data with detailed region boundaries exist. This work aims to resolve this dilemma by learning to automatically g… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted for publication in BMVC 2020 (Oral)

  18. arXiv:2008.04030  [pdf, other

    cs.CV

    Invertible Neural BRDF for Object Inverse Rendering

    Authors: Zhe Chen, Shohei Nobuhara, Ko Nishino

    Abstract: We introduce a novel neural network-based BRDF model and a Bayesian framework for object inverse rendering, i.e., joint estimation of reflectance and natural illumination from a single image of an object of known geometry. The BRDF is expressed with an invertible neural network, namely, normalizing flow, which provides the expressive power of a high-dimensional representation, computational simpli… ▽ More

    Submitted 11 August, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: accepted to ECCV 2020 as spotlight

  19. arXiv:1912.04663  [pdf, other

    cs.CV

    3D-GMNet: Single-View 3D Shape Recovery as A Gaussian Mixture

    Authors: Kohei Yamashita, Shohei Nobuhara, Ko Nishino

    Abstract: In this paper, we introduce 3D-GMNet, a deep neural network for 3D object shape reconstruction from a single image. As the name suggests, 3D-GMNet recovers 3D shape as a Gaussian mixture. In contrast to voxels, point clouds, or meshes, a Gaussian mixture representation provides an analytical expression with a small memory footprint while accurately representing the target 3D shape. At the same tim… ▽ More

    Submitted 15 August, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: BMVC 2020

  20. arXiv:1906.10284  [pdf, other

    cs.CV

    Appearance and Shape from Water Reflection

    Authors: Ryo Kawahara, Meng-Yu Jennifer Kuo, Shohei Nobuhara, Ko Nishino

    Abstract: This paper introduces single-image geometric and appearance reconstruction from water reflection photography, i.e., images capturing direct and water-reflected real-world scenes. Water reflection offers an additional viewpoint to the direct sight, collectively forming a stereo pair. The water-reflected scene, however, includes internally scattered and reflected environmental illumination in additi… ▽ More

    Submitted 7 January, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: WACV 2020

  21. arXiv:1811.03331  [pdf, other

    cs.CV

    Improving Multi-Person Pose Estimation using Label Correction

    Authors: Naoki Kato, Tianqi Li, Kohei Nishino, Yusuke Uchida

    Abstract: Significant attention is being paid to multi-person pose estimation methods recently, as there has been rapid progress in the field owing to convolutional neural networks. Especially, recent method which exploits part confidence maps and Part Affinity Fields (PAFs) has achieved accurate real-time prediction of multi-person keypoints. However, human annotated labels are sometimes inappropriate for… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

  22. arXiv:1801.03127  [pdf, other

    cs.CV

    Recognizing Material Properties from Images

    Authors: Gabriel Schwartz, Ko Nishino

    Abstract: Humans rely on properties of the materials that make up objects to guide our interactions with them. Gras** smooth materials, for example, requires care, and softness is an ideal property for fabric used in bedding. Even when these properties are not visual (e.g. softness is a physical property), we may still infer their presence visually. We refer to such material properties as visual material… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

  23. arXiv:1612.02372  [pdf, other

    cs.CV

    Differential Angular Imaging for Material Recognition

    Authors: Jia Xue, Hang Zhang, Kristin Dana, Ko Nishino

    Abstract: Material recognition for real-world outdoor surfaces has become increasingly important for computer vision to support its operation "in the wild." Computational surface modeling that underlies material recognition has transitioned from reflectance modeling using in-lab controlled radiometric measurements to image-based representations based on internet-mined images of materials captured in the sce… ▽ More

    Submitted 13 July, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

  24. arXiv:1611.09394  [pdf, other

    cs.CV

    Material Recognition from Local Appearance in Global Context

    Authors: Gabriel Schwartz, Ko Nishino

    Abstract: Recognition of materials has proven to be a challenging problem due to the wide variation in appearance within and between categories. Global image context, such as where the material is or what object it makes up, can be crucial to recognizing the material. Existing methods, however, operate on an implicit fusion of materials and context by using large receptive fields as input (i.e., large image… ▽ More

    Submitted 12 April, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

  25. arXiv:1604.01354  [pdf, other

    cs.CV

    Radiometric Scene Decomposition: Scene Reflectance, Illumination, and Geometry from RGB-D Images

    Authors: Stephen Lombardi, Ko Nishino

    Abstract: Recovering the radiometric properties of a scene (i.e., the reflectance, illumination, and geometry) is a long-sought ability of computer vision that can provide invaluable information for a wide range of applications. Deciphering the radiometric ingredients from the appearance of a real-world scene, as opposed to a single isolated object, is particularly challenging as it generally consists of va… ▽ More

    Submitted 5 April, 2016; originally announced April 2016.

    Comments: 16 pages

  26. arXiv:1604.01345  [pdf, other

    cs.CV

    Integrating Local Material Recognition with Large-Scale Perceptual Attribute Discovery

    Authors: Gabriel Schwartz, Ko Nishino

    Abstract: Material attributes have been shown to provide a discriminative intermediate representation for recognizing materials, especially for the challenging task of recognition from local material appearance (i.e., regardless of object and scene context). In the past, however, material attributes have been recognized separately preceding category recognition. In contrast, neuroscience studies on material… ▽ More

    Submitted 12 April, 2017; v1 submitted 5 April, 2016; originally announced April 2016.

  27. arXiv:1603.07998  [pdf, other

    cs.CV

    Friction from Reflectance: Deep Reflectance Codes for Predicting Physical Surface Properties from One-Shot In-Field Reflectance

    Authors: Hang Zhang, Kristin Dana, Ko Nishino

    Abstract: Images are the standard input for vision algorithms, but one-shot infield reflectance measurements are creating new opportunities for recognition and scene understanding. In this work, we address the question of what reflectance can reveal about materials in an efficient manner. We go beyond the question of recognition and labeling and ask the question: What intrinsic physical properties of the su… ▽ More

    Submitted 10 July, 2016; v1 submitted 25 March, 2016; originally announced March 2016.

  28. arXiv:1502.02092  [pdf, other

    cs.CV

    Reflectance Hashing for Material Recognition

    Authors: Hang Zhang, Kristin Dana, Ko Nishino

    Abstract: We introduce a novel method for using reflectance to identify materials. Reflectance offers a unique signature of the material but is challenging to measure and use for recognizing materials due to its high-dimensionality. In this work, one-shot reflectance is captured using a unique optical camera measuring {\it reflectance disks} where the pixel coordinates correspond to surface viewing angles.… ▽ More

    Submitted 6 February, 2015; originally announced February 2015.