Skip to main content

Showing 51–100 of 137 results for author: Lee, G H

.
  1. arXiv:2207.09332  [pdf, other

    cs.CV

    Rethinking IoU-based Optimization for Single-stage 3D Object Detection

    Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee

    Abstract: Since Intersection-over-Union (IoU) based optimization maintains the consistency of the final IoU prediction metric and losses, it has been widely used in both regression and classification branches of single-stage 2D object detectors. Recently, several 3D object detection methods adopt IoU-based optimization and directly replace the 2D IoU with 3D IoU. However, such a direct computation in 3D is… ▽ More

    Submitted 20 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV2022. The code is available at https://github.com/hlsheng1/RDIoU

  2. arXiv:2207.04892  [pdf, other

    cs.CV

    Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation

    Authors: Zhun Zhong, Yuyang Zhao, Gim Hee Lee, Nicu Sebe

    Abstract: In this paper, we consider the problem of domain generalization in semantic segmentation, which aims to learn a robust model using only labeled synthetic (source) data. The model is expected to perform well on unseen real (target) domains. Our study finds that the image style variation can largely influence the model's performance and the style features can be well represented by the channel-wise… ▽ More

    Submitted 12 October, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022

  3. arXiv:2205.13579  [pdf, other

    cs.CV

    CA-UDA: Class-Aware Unsupervised Domain Adaptation with Optimal Assignment and Pseudo-Label Refinement

    Authors: Can Zhang, Gim Hee Lee

    Abstract: Recent works on unsupervised domain adaptation (UDA) focus on the selection of good pseudo-labels as surrogates for the missing labels in the target data. However, source domain bias that deteriorates the pseudo-labels can still exist since the shared network of the source and target domains are typically used for the pseudo-label selections. The suboptimal feature space source-to-target domain al… ▽ More

    Submitted 30 May, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  4. arXiv:2205.09068  [pdf, other

    cs.CV cs.MM

    VRAG: Region Attention Graphs for Content-Based Video Retrieval

    Authors: Kennard Ng, Ser-Nam Lim, Gim Hee Lee

    Abstract: Content-based Video Retrieval (CBVR) is used on media-sharing platforms for applications such as video recommendation and filtering. To manage databases that scale to billions of videos, video-level approaches that use fixed-size embeddings are preferred due to their efficiency. In this paper, we introduce Video Region Attention Graph Networks (VRAG) that improves the state-of-the-art of video-lev… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  5. arXiv:2205.04042  [pdf, other

    cs.CV

    Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning

    Authors: Na Dong, Yongqiang Zhang, Mingli Ding, Gim Hee Lee

    Abstract: Incremental few-shot object detection aims at detecting novel classes without forgetting knowledge of the base classes with only a few labeled training data from the novel classes. Most related prior works are on incremental object detection that rely on the availability of abundant training samples per novel class that substantially limits the scalability to real-world setting where novel data ca… ▽ More

    Submitted 27 February, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted by AAAI2023

  6. arXiv:2204.02548  [pdf, other

    cs.CV

    Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation

    Authors: Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee

    Abstract: In this paper, we study the task of synthetic-to-real domain generalized semantic segmentation, which aims to learn a model that is robust to unseen real-world scenes using only synthetic data. The large domain shift between synthetic and real-world data, including the limited source environmental variations and the large distribution gap between synthetic and real-world data, significantly hinder… ▽ More

    Submitted 19 July, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: ECCV 2022

  7. arXiv:2203.14517  [pdf, other

    cs.CV

    REGTR: End-to-end Point Cloud Correspondences with Transformers

    Authors: Zi Jian Yew, Gim Hee Lee

    Abstract: Despite recent success in incorporating learning into point cloud registration, many works focus on learning feature descriptors and continue to rely on nearest-neighbor feature matching and outlier filtering through RANSAC to obtain the final set of correspondences for pose estimation. In this work, we conjecture that attention mechanisms can replace the role of explicit feature matching and RANS… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 15 pages, 11 figures, CVPR2022

  8. arXiv:2203.03498  [pdf, other

    cs.CV

    Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation

    Authors: Meng Tian, Gim Hee Lee

    Abstract: State-of-the-art approaches for 6D object pose estimation require large amounts of labeled data to train the deep networks. However, the acquisition of 6D object pose annotations is tedious and labor-intensive in large quantity. To alleviate this problem, we propose a weakly supervised 6D object pose estimation approach based on 2D keypoint detection. Our method trains only on image pairs with kno… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  9. arXiv:2112.07241  [pdf, other

    cs.CV

    Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection

    Authors: Na Zhao, Gim Hee Lee

    Abstract: Deep learning-based approaches have shown remarkable performance in the 3D object detection task. However, they suffer from a catastrophic performance drop on the originally trained classes when incrementally learning new classes without revisiting the old data. This "catastrophic forgetting" phenomenon impedes the deployment of 3D object detection approaches in real-world scenarios, where continu… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022

  10. arXiv:2112.01900  [pdf, other

    cs.CV

    Novel Class Discovery in Semantic Segmentation

    Authors: Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee

    Abstract: We introduce a new setting of Novel Class Discovery in Semantic Segmentation (NCDSS), which aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes. In contrast to existing approaches that look at novel class discovery in image classification, we focus on the more challenging semantic segmentation. In NCDSS, we need to distinguish the… ▽ More

    Submitted 28 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: CVPR 2022

  11. arXiv:2111.10946  [pdf, other

    cs.RO

    A General Framework for Lifelong Localization and Map** in Changing Environment

    Authors: Min Zhao, Xin Guo, Le Song, Baoxing Qin, Xuesong Shi, Gim Hee Lee, Guanghui Sun

    Abstract: The environment of most real-world scenarios such as malls and supermarkets changes at all times. A pre-built map that does not account for these changes becomes out-of-date easily. Therefore, it is necessary to have an up-to-date model of the environment to facilitate long-term operation of a robot. To this end, this paper presents a general lifelong simultaneous localization and map** (SLAM) f… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

  12. arXiv:2111.08176  [pdf, other

    cs.CV cs.LG

    Coarse-to-fine Animal Pose and Shape Estimation

    Authors: Chen Li, Gim Hee Lee

    Abstract: Most existing animal pose and shape estimation approaches reconstruct animal meshes with a parametric SMAL model. This is because the low-dimensional pose and shape parameters of the SMAL model makes it easier for deep networks to learn the high-dimensional animal meshes. However, the SMAL model is learned from scans of toy animals with limited pose and shape variations, and thus may not be able t… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted by Neurips2021

  13. arXiv:2111.00728  [pdf, other

    cs.CV

    Learning Iterative Robust Transformation Synchronization

    Authors: Zi Jian Yew, Gim Hee Lee

    Abstract: Transformation Synchronization is the problem of recovering absolute transformations from a given set of pairwise relative motions. Despite its usefulness, the problem remains challenging due to the influences from noisy and outlier relative motions, and the difficulty to model analytically and suppress them with high fidelity. In this work, we avoid handcrafting robust loss functions, and propose… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: To appear in 3DV2021

  14. arXiv:2110.15017  [pdf, other

    cs.CV

    Bridging Non Co-occurrence with Unlabeled In-the-wild Data for Incremental Object Detection

    Authors: Na Dong, Yongqiang Zhang, Mingli Ding, Gim Hee Lee

    Abstract: Deep networks have shown remarkable results in the task of object detection. However, their performance suffers critical drops when they are subsequently trained on novel classes without any sample from the base classes originally used to train the model. This phenomenon is known as catastrophic forgetting. Recently, several incremental learning methods are proposed to mitigate catastrophic forget… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted paper at NeurIPS 2021

  15. arXiv:2108.09936  [pdf, other

    cs.CV cs.AI

    Voxel-based Network for Shape Completion by Leveraging Edge Generation

    Authors: Xiaogang Wang, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: Deep learning technique has yielded significant improvements in point cloud completion with the aim of completing missing object shapes from partial inputs. However, most existing methods fail to recover realistic structures due to over-smoothing of fine-grained details. In this paper, we develop a voxel-based network for point cloud completion by leveraging edge generation (VE-PCN). We first embe… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  16. arXiv:2106.03422  [pdf, other

    cs.CV

    Source-Free Open Compound Domain Adaptation in Semantic Segmentation

    Authors: Yuyang Zhao, Zhun Zhong, Zhiming Luo, Gim Hee Lee, Nicu Sebe

    Abstract: In this work, we introduce a new concept, named source-free open compound domain adaptation (SF-OCDA), and study it in semantic segmentation. SF-OCDA is more challenging than the traditional domain adaptation but it is more practical. It jointly considers (1) the issues of data privacy and data storage and (2) the scenario of multiple target domains and unseen open domains. In SF-OCDA, only the so… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  17. arXiv:2105.11636  [pdf, other

    cs.CV cs.LG

    FILTRA: Rethinking Steerable CNN by Filter Transform

    Authors: Bo Li, Qili Wang, Gim Hee Lee

    Abstract: Steerable CNN imposes the prior knowledge of transformation invariance or equivariance in the network architecture to enhance the the network robustness on geometry transformation of data and reduce overfitting. It has been an intuitive and widely used technique to construct a steerable filter by augmenting a filter with its transformed copies in the past decades, which is named as filter transfor… ▽ More

    Submitted 15 February, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: ICML 2021

  18. arXiv:2104.03501  [pdf, other

    cs.CV cs.AI

    DeepI2P: Image-to-Point Cloud Registration via Deep Classification

    Authors: Jiaxin Li, Gim Hee Lee

    Abstract: This paper presents DeepI2P: a novel approach for cross-modality registration between an image and a point cloud. Given an image (e.g. from a rgb-camera) and a general point cloud (e.g. from a 3D Lidar scanner) captured at different locations in the same scene, our method estimates the relative rigid transformation between the coordinate frames of the camera and Lidar. Learning common feature desc… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: CVPR 2021. Main paper and supplementary materials

  19. arXiv:2104.02385  [pdf, other

    cs.CV

    Learning Spatial Context with Graph Neural Network for Multi-Person Pose Grou**

    Authors: Jiahao Lin, Gim Hee Lee

    Abstract: Bottom-up approaches for image-based multi-person pose estimation consist of two stages: (1) keypoint detection and (2) grou** of the detected keypoints to form person instances. Current grou** approaches rely on learned embedding from only visual features that completely ignore the spatial configuration of human poses. In this work, we formulate the grou** task as a graph partitioning probl… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 7 pages, 4 figures. Accepted in ICRA 2021

  20. arXiv:2104.02273  [pdf, other

    cs.CV

    Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo

    Authors: Jiahao Lin, Gim Hee Lee

    Abstract: Existing approaches for multi-view multi-person 3D pose estimation explicitly establish cross-view correspondences to group 2D pose detections from multiple camera views and solve for the 3D pose estimation for each person. Establishing cross-view correspondences is challenging in multi-person scenes, and incorrect correspondences will lead to sub-optimal performance for the multi-stage pipeline.… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 10 pages, 5 figures. Accepted in CVPR 2021

  21. arXiv:2103.14910  [pdf, other

    cs.CV cs.GR cs.LG

    MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis

    Authors: Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee

    Abstract: In this paper, we propose MINE to perform novel view synthesis and depth estimation via dense 3D reconstruction from a single image. Our approach is a continuous depth generalization of the Multiplane Images (MPI) by introducing the NEural radiance fields (NeRF). Given a single image as input, MINE predicts a 4-channel image (RGB and volume density) at arbitrary depth values to jointly reconstruct… ▽ More

    Submitted 30 July, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: ICCV 2021. Main paper and supplementary materials

  22. arXiv:2103.14843  [pdf, other

    cs.CV

    From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation

    Authors: Chen Li, Gim Hee Lee

    Abstract: Animal pose estimation is an important field that has received increasing attention in the recent years. The main challenge for this task is the lack of labeled data. Existing works circumvent this problem with pseudo labels generated from data of other easily accessible domains such as synthetic data. However, these pseudo labels are noisy even with consistency check or confidence-based filtering… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

    Comments: CVPR2021

  23. arXiv:2103.14314  [pdf, other

    cs.CV

    City-scale Scene Change Detection using Point Clouds

    Authors: Zi Jian Yew, Gim Hee Lee

    Abstract: We propose a method for detecting structural changes in a city using images captured from vehicular mounted cameras over traversals at two different times. We first generate 3D point clouds for each traversal from the images and approximate GNSS/INS readings using Structure-from-Motion (SfM). A direct comparison of the two point clouds for change detection is not ideal due to inaccurate geo-locati… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 8 pages, 10 figures. To be presented at ICRA2021

  24. arXiv:2010.08719  [pdf, other

    cs.CV cs.AI

    Cascaded Refinement Network for Point Cloud Completion with Self-supervision

    Authors: Xiaogang Wang, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: Point clouds are often sparse and incomplete, which imposes difficulties for real-world applications. Existing shape completion methods tend to generate rough shapes without fine-grained details. Considering this, we introduce a two-branch network for shape completion. The first branch is a cascaded shape completion sub-network to synthesize complete objects, where we propose to use the partial in… ▽ More

    Submitted 26 August, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: Accepted by PAMI. Extended version of the following paper: Cascaded Refinement Network for Point Cloud Completion. CVPR 2020. arXiv link: arXiv:2004.03327

  25. arXiv:2008.05770  [pdf, other

    cs.CV

    Weakly Supervised Generative Network for Multiple 3D Human Pose Hypotheses

    Authors: Chen Li, Gim Hee Lee

    Abstract: 3D human pose estimation from a single image is an inverse problem due to the inherent ambiguity of the missing depth. Several previous works addressed the inverse problem by generating multiple hypotheses. However, these works are strongly supervised and require ground truth 2D-to-3D correspondences which can be difficult to obtain. In this paper, we propose a weakly supervised deep generative ne… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted to BMVC2020

  26. arXiv:2008.00394  [pdf, other

    cs.CV cs.LG eess.IV

    Point Cloud Completion by Learning Shape Priors

    Authors: Xiaogang Wang, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: In view of the difficulty in reconstructing object details in point cloud completion, we propose a shape prior learning method for object completion. The shape priors include geometric information in both complete and the partial point clouds. We design a feature alignment strategy to learn the shape prior from complete points, and a coarse to fine strategy to incorporate partial prior in the fine… ▽ More

    Submitted 15 July, 2021; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: IROS 2020

  27. arXiv:2007.10986  [pdf, other

    cs.CV

    Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry

    Authors: He Chen, Pengfei Guo, Pengfei Li, Gim Hee Lee, Gregory Chirikjian

    Abstract: Epipolar constraints are at the core of feature matching and depth estimation in current multi-person multi-camera 3D human pose estimation methods. Despite the satisfactory performance of this formulation in sparser crowd scenes, its effectiveness is frequently challenged under denser crowd circumstances mainly due to two sources of ambiguity. The first is the mismatch of human joints resulting f… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  28. arXiv:2007.08943  [pdf, other

    cs.CV

    HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization

    Authors: Jiahao Lin, Gim Hee Lee

    Abstract: Current works on multi-person 3D pose estimation mainly focus on the estimation of the 3D joint locations relative to the root joint and ignore the absolute locations of each pose. In this paper, we propose the Human Depth Estimation Network (HDNet), an end-to-end framework for absolute root joint localization in the camera coordinate space. Our HDNet first estimates the 2D human pose with heatmap… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 16 pages, 5 figures. Accepted in ECCV 2020

  29. arXiv:2007.08454  [pdf, other

    cs.CV

    Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation

    Authors: Meng Tian, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image. To handle the intra-class shape variation, we propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior. Additionally, our network infers the dense correspondences between the depth observation of th… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted at ECCV 2020

  30. arXiv:2007.07686  [pdf, other

    cs.RO cs.CV

    Relative Pose Estimation of Calibrated Cameras with Known $\mathrm{SE}(3)$ Invariants

    Authors: Bo Li, Evgeniy Martyushev, Gim Hee Lee

    Abstract: The $\mathrm{SE}(3)$ invariants of a pose include its rotation angle and screw translation. In this paper, we present a complete comprehensive study of the relative pose estimation problem for a calibrated camera constrained by known $\mathrm{SE}(3)$ invariant, which involves 5 minimal problems in total. These problems reduces the minimal number of point pairs for relative pose estimation and impr… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

  31. arXiv:2007.00860  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Enhanced graphitic domains of unreduced graphene oxide and the interplay of hydration behaviour and catalytic activity

    Authors: Tobias Foller, Rahman Daiyan, Xiaoheng **, Joshua Leverett, Hangyel Kim, Richard Webster, Jeaniffer E. Yap, Xinyue Wen, Aditya Rawal, K. Kanishka H. DeSilva, Masamichi Yoshimura, Heriberto Bustamante, Shery L. Y. Chang, Priyank Kumar, Yi You, Gwan Hyoung Lee, Rose Amal, Rakesh Joshi

    Abstract: Previous studies indicate that the properties of graphene oxide (GO) can be significantly improved by enhancing its graphitic domain size through thermal diffusion and clustering of functional groups. Remarkably, this transition takes place below the decomposition temperature of the functional groups and thus allows fine-tuning of graphitic domains without compromising with the functionality of GO… ▽ More

    Submitted 21 May, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

  32. arXiv:2006.12052  [pdf, other

    cs.CV

    Few-shot 3D Point Cloud Semantic Segmentation

    Authors: Na Zhao, Tat-Seng Chua, Gim Hee Lee

    Abstract: Many existing approaches for 3D point cloud semantic segmentation are fully supervised. These fully supervised approaches heavily rely on large amounts of labeled training data that are difficult to obtain and cannot segment new classes after training. To mitigate these limitations, we propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method to… ▽ More

    Submitted 29 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: CVPR 2021

    ACM Class: I.2.10; I.4.6

  33. Production of $π^0$ and $η$ mesons in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV

    Authors: U. Acharya, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, J. Alexander, K. Aoki, N. Apadula, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, X. Bai, B. Bannier, K. N. Barish, S. Bathe, V. Baublis, C. Baumann, S. Baumgart, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov , et al. (378 additional authors not shown)

    Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider measured $Ï€^0$ and $η$ mesons at midrapidity in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV in a wide transverse momentum range. Measurements were performed in the $Ï€^0(η)\rightarrowγγ$ decay modes. A strong suppression of $Ï€^0$ and $η$ meson production at high transverse momentum was observed in central U$+$U collisions relative to b… ▽ More

    Submitted 13 November, 2020; v1 submitted 29 May, 2020; originally announced May 2020.

    Comments: 403 authors from 72 institutions, 13 pages, 6 figures, 7 tables, 2012 data. v2 is version accepted by Physical Review C. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

    Journal ref: Phys. Rev. C 102, 064905 (2020)

  34. arXiv:2004.04091  [pdf, other

    cs.CV

    Weakly Supervised Semantic Point Cloud Segmentation:Towards 10X Fewer Labels

    Authors: Xun Xu, Gim Hee Lee

    Abstract: Point cloud analysis has received much attention recently; and segmentation is one of the most important tasks. The success of existing approaches is attributed to deep network design and large amount of labelled training data, where the latter is assumed to be always available. However, obtaining 3d point cloud segmentation labels is often very costly in practice. In this work, we propose a weakl… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: CVPR2020

  35. arXiv:2004.03327  [pdf, other

    cs.CV

    Cascaded Refinement Network for Point Cloud Completion

    Authors: Xiaogang Wang, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: Point clouds are often sparse and incomplete. Existing shape completion methods are incapable of generating details of objects or learning the complex point distributions. To this end, we propose a cascaded refinement network together with a coarse-to-fine strategy to synthesize the detailed object shapes. Considering the local details of partial input with the global shape information together, w… ▽ More

    Submitted 5 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: CVPR2020

  36. RPM-Net: Robust Point Matching using Learned Features

    Authors: Zi Jian Yew, Gim Hee Lee

    Abstract: Iterative Closest Point (ICP) solves the rigid point cloud registration problem iteratively in two steps: (1) make hard assignments of spatially closest point correspondences, and then (2) find the least-squares rigid transformation. The hard assignments of closest point correspondences based on spatial distances are sensitive to the initial rigid transformation and noisy/outlier points, which oft… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 10 pages, 4 figures. To appear in CVPR2020

  37. arXiv:2003.13255  [pdf, other

    eess.SP cs.IT

    Joint Orthogonal Band and Power Allocation for Energy Fairness in WPT System with Nonlinear Logarithmic Energy Harvesting Model

    Authors: Jaeseob Han, Gyeong Ho Lee, Sangdon Park, Jun Kyun Choi

    Abstract: Wireless power transmission (WPT) is expected to play an important role in the Internet of Things services by providing the perpetual operation of IoT sensors. However, to prolong the IoT network's lifetime, the efficient resource allocation algorithm is required, in particular, the energy fairness issue among IoT sensors has been a critical challenge of the WPT system. In this paper, considering… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 12 pages, 27 figures

  38. arXiv:2003.00188  [pdf, other

    cs.CV

    Robust 6D Object Pose Estimation by Learning RGB-D Features

    Authors: Meng Tian, Liang Pan, Marcelo H Ang Jr, Gim Hee Lee

    Abstract: Accurate 6D object pose estimation is fundamental to robotic manipulation and gras**. Previous methods follow a local optimization approach which minimizes the distance between closest point pairs to handle the rotation ambiguity of symmetric objects. In this work, we propose a novel discrete-continuous formulation for rotation regression to resolve this local-optimum problem. We uniformly sampl… ▽ More

    Submitted 9 March, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: Accepted at ICRA 2020

  39. arXiv:1912.11803  [pdf, other

    cs.CV

    SESS: Self-Ensembling Semi-Supervised 3D Object Detection

    Authors: Na Zhao, Tat-Seng Chua, Gim Hee Lee

    Abstract: The performance of existing point cloud-based 3D object detection methods heavily relies on large-scale high-quality 3D annotations. However, such annotations are often tedious and expensive to collect. Semi-supervised learning is a good alternative to mitigate the data annotation issue, but has remained largely unexplored in 3D object detection. Inspired by the recent success of self-ensembling t… ▽ More

    Submitted 17 March, 2021; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 Oral

  40. arXiv:1909.12555  [pdf, other

    cs.LG stat.ML

    Identifying through Flows for Recovering Latent Representations

    Authors: Shen Li, Bryan Hooi, Gim Hee Lee

    Abstract: Identifiability, or recovery of the true latent representations from which the observed data originates, is de facto a fundamental goal of representation learning. Yet, most deep generative models do not address the question of identifiability, and thus fail to deliver on the promise of the recovery of the true latent sources that generate the observations. Recent work proposed identifiable genera… ▽ More

    Submitted 26 April, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  41. arXiv:1908.08289  [pdf, other

    cs.CV

    Trajectory Space Factorization for Deep Video-Based 3D Human Pose Estimation

    Authors: Jiahao Lin, Gim Hee Lee

    Abstract: Existing deep learning approaches on 3d human pose estimation for videos are either based on Recurrent or Convolutional Neural Networks (RNNs or CNNs). However, RNN-based frameworks can only tackle sequences with limited frames because sequential models are sensitive to bad frames and tend to drift over long sequences. Although existing CNN-based temporal frameworks attempt to address the sensitiv… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: 13 pages, 5 figures. Accepted in BMVC 2019

  42. PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation

    Authors: Na Zhao, Tat-Seng Chua, Gim Hee Lee

    Abstract: In this paper, we present the PS^2-Net -- a locally and globally aware deep learning framework for semantic segmentation on 3D scene-level point clouds. In order to deeply incorporate local structures and global context to support 3D scene segmentation, our network is built on four repeatedly stacked encoders, where each encoder has two basic components: EdgeConv that captures local structures and… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

  43. arXiv:1907.13185  [pdf, other

    cs.CV

    Degeneracy in Self-Calibration Revisited and a Deep Learning Solution for Uncalibrated SLAM

    Authors: Bingbing Zhuang, Quoc-Huy Tran, Pan Ji, Gim Hee Lee, Loong Fah Cheong, Manmohan Chandraker

    Abstract: Self-calibration of camera intrinsics and radial distortion has a long history of research in the computer vision community. However, it remains rare to see real applications of such techniques to modern Simultaneous Localization And Map** (SLAM) systems, especially in driving scenarios. In this paper, we revisit the geometric approach to this problem, and provide a theoretical proof that explic… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: To appear at IROS 2019

  44. arXiv:1907.09798  [pdf, other

    cs.CV

    PointAtrousGraph: Deep Hierarchical Encoder-Decoder with Point Atrous Convolution for Unorganized 3D Points

    Authors: Liang Pan, Chee-Meng Chew, Gim Hee Lee

    Abstract: Motivated by the success of encoding multi-scale contextual information for image analysis, we propose our PointAtrousGraph (PAG) - a deep permutation-invariant hierarchical encoder-decoder for efficiently exploiting multi-scale edge features in point clouds. Our PAG is constructed by several novel modules, such as Point Atrous Convolution (PAC), Edge-preserved Pooling (EP) and Edge-preserved Unpo… ▽ More

    Submitted 13 September, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: 11 pages, 10 figures

  45. arXiv:1905.09634  [pdf, other

    cs.CV cs.RO

    Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes

    Authors: Ziquan Lan, Zi Jian Yew, Gim Hee Lee

    Abstract: Outlier feature matches and loop-closures that survived front-end data association can lead to catastrophic failures in the back-end optimization of large-scale point cloud based 3D reconstruction. To alleviate this problem, we propose a probabilistic approach for robust back-end optimization in the presence of outliers. More specifically, we model the problem as a Bayesian network and solve it us… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: CVPR 2019, 8 pages, 5 figures

  46. arXiv:1904.10300  [pdf, other

    cs.CV

    Transferable Semi-supervised 3D Object Detection from RGB-D Data

    Authors: Yew Siang Tang, Gim Hee Lee

    Abstract: We investigate the direction of training a 3D object detector for new object classes from only 2D bounding box labels of these new classes, while simultaneously transferring information from 3D bounding box labels of the existing classes. To this end, we propose a transferable semi-supervised 3D object detection model that learns a 3D object detector network from training data with two disjoint se… ▽ More

    Submitted 23 April, 2019; originally announced April 2019.

  47. arXiv:1904.09742  [pdf, other

    cs.CV

    2D3D-MatchNet: Learning to Match Keypoints Across 2D Image and 3D Point Cloud

    Authors: Mengdan Feng, Sixing Hu, Marcelo Ang, Gim Hee Lee

    Abstract: Large-scale point cloud generated from 3D sensors is more accurate than its image-based counterpart. However, it is seldom used in visual pose estimation due to the difficulty in obtaining 2D-3D image to point cloud correspondences. In this paper, we propose the 2D3D-MatchNet - an end-to-end deep network architecture to jointly learn the descriptors for 2D and 3D keypoint from image and point clou… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

  48. arXiv:1904.05547  [pdf, other

    cs.CV

    Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network

    Authors: Chen Li, Gim Hee Lee

    Abstract: 3D human pose estimation from a monocular image or 2D joints is an ill-posed problem because of depth ambiguity and occluded joints. We argue that 3D human pose estimation from a monocular input is an inverse problem where multiple feasible solutions can exist. In this paper, we propose a novel approach to generate multiple feasible hypotheses of the 3D pose from 2D joints.In contrast to existing… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: CVPR 2019

  49. arXiv:1904.00319  [pdf, other

    cs.CV

    Discrete Rotation Equivariance for Point Cloud Recognition

    Authors: Jiaxin Li, Yingcai Bi, Gim Hee Lee

    Abstract: Despite the recent active research on processing point clouds with deep networks, few attention has been on the sensitivity of the networks to rotations. In this paper, we propose a deep learning architecture that achieves discrete $\mathbf{SO}(2)$/$\mathbf{SO}(3)$ rotation equivariance for point cloud recognition. Specifically, the rotation of an input point cloud with elements of a rotation grou… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

    Comments: The 2019 International Conference on Robotics and Automation (ICRA)

  50. arXiv:1904.00229  [pdf, other

    cs.CV

    USIP: Unsupervised Stable Interest Point Detection from 3D Point Clouds

    Authors: Jiaxin Li, Gim Hee Lee

    Abstract: In this paper, we propose the USIP detector: an Unsupervised Stable Interest Point detector that can detect highly repeatable and accurately localized keypoints from 3D point clouds under arbitrary transformations without the need for any ground truth training data. Our USIP detector consists of a feature proposal network that learns stable keypoints from input 3D point clouds and their respective… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

    Comments: 19 pages