Skip to main content

Showing 1–48 of 48 results for author: Hua, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18581  [pdf, other

    cs.CV cs.GR

    Dream-in-Style: Text-to-3D Generation using Stylized Score Distillation

    Authors: Hubert Kompanowski, Binh-Son Hua

    Abstract: We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the content aligning with the text prompt and the style following the reference image. To simultaneously generate the 3D object and perform style transfer in one go, we propose a stylized score distillation… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2404.08590  [pdf, other

    cs.CV cs.AI

    Improving Referring Image Segmentation using Vision-Aware Text Features

    Authors: Hai Nguyen-Truong, E-Ro Nguyen, Tuan-Anh Vu, Minh-Triet Tran, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Referring image segmentation is a challenging task that involves generating pixel-wise segmentation masks based on natural language descriptions. Existing methods have relied mostly on visual features to generate the segmentation masks while treating text features as supporting components. This over-reliance on visual features can lead to suboptimal results, especially in complex scenarios where t… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 30 pages including supplementary

  3. arXiv:2401.13937  [pdf, other

    cs.CV

    Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

    Authors: Quang-Trung Truong, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Video object segmentation is a fundamental research problem in computer vision. Recent techniques have often applied attention mechanism to object representation learning from video sequences. However, due to temporal changes in the video data, attention maps may not well align with the objects of interest across video frames, causing accumulated errors in long-term video processing. In addition,… ▽ More

    Submitted 18 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: under review

  4. arXiv:2312.17505  [pdf, other

    cs.CV cs.AI cs.CL

    Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation

    Authors: Tuan-Anh Vu, Duc Thanh Nguyen, Qing Guo, Binh-Son Hua, Nhat Minh Chung, Ivor W. Tsang, Sai-Kit Yeung

    Abstract: Text-to-image diffusion techniques have shown exceptional capability of producing high-quality images from text descriptions. This indicates that there exists a strong correlation between the visual and textual domains. In addition, text-image discriminative models such as CLIP excel in image labelling from text prompts, thanks to the rich and diverse information available from open concepts. In t… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: This work is under review

  5. arXiv:2312.02192  [pdf, other

    cs.CV

    DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding

    Authors: Uy Dieu Tran, Minh Luu, Phong Nguyen, Janne Heikkila, Khoi Nguyen, Binh-Son Hua

    Abstract: Text-to-3D synthesis has recently emerged as a new approach to sampling 3D models by adopting pretrained text-to-image models as guiding visual priors. An intriguing but underexplored problem with existing text-to-3D methods is that 3D models obtained from the sampling-by-optimization procedure tend to have mode collapses, and hence poor diversity in their results. In this paper, we provide an ana… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  6. arXiv:2311.18328  [pdf, other

    cs.CV cs.AI cs.GR

    Advances in 3D Neural Stylization: A Survey

    Authors: Yingshu Chen, Guocheng Shao, Ka Chun Shum, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Modern artificial intelligence offers a novel and transformative approach to creating digital art across diverse styles and modalities like images, videos and 3D data, unleashing the power of creativity and revolutionizing the way that we perceive and interact with visual content. This paper reports on recent advances in stylized 3D asset creation and manipulation with the expressive power of neur… ▽ More

    Submitted 18 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  7. arXiv:2311.13152  [pdf, other

    cs.CV

    Test-Time Augmentation for 3D Point Cloud Classification and Segmentation

    Authors: Tuan-Anh Vu, Srinjay Sarkar, Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Data augmentation is a powerful technique to enhance the performance of a deep learning task but has received less attention in 3D deep learning. It is well known that when 3D shapes are sparsely represented with low point density, the performance of the downstream tasks drops significantly. This work explores test-time augmentation (TTA) for 3D point clouds. We are inspired by the recent revoluti… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: This paper is accepted in 3DV 2024

  8. arXiv:2309.12668  [pdf, other

    cs.RO

    UWA360CAM: A 360$^{\circ}$ 24/7 Real-Time Streaming Camera System for Underwater Applications

    Authors: Quan-Dung Pham, Yipeng Zhu, Tan-Sang Ha, K. H. Long Nguyen, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Omnidirectional camera is a cost-effective and information-rich sensor highly suitable for many marine applications and the ocean scientific community, encompassing several domains such as augmented reality, map**, motion estimation, visual surveillance, and simultaneous localization and map**. However, designing and constructing such a high-quality 360$^{\circ}$ real-time streaming camera sys… ▽ More

    Submitted 30 September, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  9. arXiv:2309.11281  [pdf, other

    cs.CV

    Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

    Authors: Ka Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: Neural radiance field is an emerging rendering method that generates high-quality multi-view consistent images from a neural scene representation and volume rendering. Although neural radiance field-based techniques are robust for scene reconstruction, their ability to add or remove objects remains limited. This paper proposes a new language-driven approach for object manipulation with neural radi… ▽ More

    Submitted 31 March, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: CVPR 2024

  10. arXiv:2309.10684  [pdf, other

    cs.CV cs.GR

    Locally Stylized Neural Radiance Fields

    Authors: Hong-Wing Pang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: In recent years, there has been increasing interest in applying stylization on 3D scenes from a reference style image, in particular onto neural radiance fields (NeRF). While performing stylization directly on NeRF guarantees appearance consistency over arbitrary novel views, it is a challenging problem to guide the transfer of patterns from the style image onto different parts of the NeRF scene.… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  11. arXiv:2307.14395  [pdf, other

    cs.LG cs.AI

    Learning to simulate partially known spatio-temporal dynamics with trainable difference operators

    Authors: Xiang Huang, Zhuoyuan Li, Hongsheng Liu, Zidong Wang, Hongye Zhou, Bin Dong, Bei Hua

    Abstract: Recently, using neural networks to simulate spatio-temporal dynamics has received a lot of attention. However, most existing methods adopt pure data-driven black-box models, which have limited accuracy and interpretability. By combining trainable difference operators with black-box models, we propose a new hybrid architecture explicitly embedded with partial prior knowledge of the underlying PDEs… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  12. arXiv:2307.13251  [pdf, other

    cs.CV cs.AI

    GaPro: Box-Supervised 3D Point Cloud Instance Segmentation Using Gaussian Processes as Pseudo Labelers

    Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

    Abstract: Instance segmentation on 3D point clouds (3DIS) is a longstanding challenge in computer vision, where state-of-the-art methods are mainly based on full supervision. As annotating ground truth dense instance masks is tedious and expensive, solving 3DIS with weak supervision has become more practical. In this paper, we propose GaPro, a new instance segmentation for 3D point clouds using axis-aligned… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  13. arXiv:2307.09621  [pdf, other

    cs.CV

    Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration

    Authors: Ka Chun Shum, Hong-Wing Pang, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: In this paper, we address the problem of conditional scene decoration for 360-degree images. Our method takes a 360-degree background photograph of an indoor scene and generates decorated images of the same scene in the panorama view. To do this, we develop a 360-aware object layout generator that learns latent object vectors in the 360-degree view to enable a variety of furniture arrangements for… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: ICCV2023

  14. arXiv:2303.00246  [pdf, other

    cs.CV

    ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

    Authors: Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

    Abstract: Existing 3D instance segmentation methods are predominated by the bottom-up design -- manually fine-tuned algorithm to group points into clusters followed by a refinement network. However, by relying on the quality of the clusters, these methods generate susceptible results when (1) nearby objects with the same semantic class are packed together, or (2) large objects with loosely connected regions… ▽ More

    Submitted 26 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  15. arXiv:2212.13535  [pdf, other

    cs.CV cs.AI

    From Single-Visit to Multi-Visit Image-Based Models: Single-Visit Models are Enough to Predict Obstructive Hydronephrosis

    Authors: Stanley Bryan Z. Hua, Mandy Rickard, John Weaver, Alice Xiang, Daniel Alvarez, Kyla N. Velear, Kunj Sheth, Gregory E. Tasian, Armando J. Lorenzo, Anna Goldenberg, Lauren Erdman

    Abstract: Previous work has shown the potential of deep learning to predict renal obstruction using kidney ultrasound images. However, these image-based classifiers have been trained with the goal of single-visit inference in mind. We compare methods from video action recognition (i.e. convolutional pooling, LSTM, TSM) to adapt single-visit convolutional models to handle multiple visit inference. We demonst… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: Paper accepted to SIPAIM 2022 (in Valparaiso, Chile)

  16. arXiv:2211.08702  [pdf, other

    cs.CV cs.AI cs.GR

    PointInverter: Point Cloud Reconstruction and Editing via a Generative Model with Shape Priors

    Authors: Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: In this paper, we propose a new method for map** a 3D point cloud to the latent space of a 3D generative adversarial network. Our generative model for 3D point clouds is based on SP-GAN, a state-of-the-art sphere-guided 3D point cloud generator. We derive an efficient way to encode an input 3D point cloud to the latent space of the SP-GAN. Our point cloud encoder can resolve the point ordering i… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: WACV 2023 paper. 8 pages of main content, 2 pages of references, 7 pages of supplementary material

  17. arXiv:2211.07422  [pdf, other

    cs.GR

    Regression-based Monte Carlo Integration

    Authors: Corentin Salaün, Adrien Gruson, Binh-Son Hua, Toshiya Hachisuka, Gurprit Singh

    Abstract: Monte Carlo integration is typically interpreted as an estimator of the expected value using stochastic samples. There exists an alternative interpretation in calculus where Monte Carlo integration can be seen as estimating a \emph{constant} function -- from the stochastic evaluations of the integrand -- that integrates to the original integral. The integral mean value theorem states that this \em… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 14 pages, 16 figures, ACM Trans. Graph., Vol. 41, No. 4, Article 79. Publication date: July 2022

    Journal ref: ACM Trans. Graph., Vol. 41, No. 4, Article 79. Publication date: July 2022

  18. arXiv:2210.15904  [pdf, other

    cs.CV cs.AI cs.GR

    Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis

    Authors: Bach Tran, Binh-Son Hua, Anh Tuan Tran, Minh Hoai

    Abstract: Recently, great progress has been made in 3D deep learning with the emergence of deep neural networks specifically designed for 3D point clouds. These networks are often trained from scratch or from pre-trained models learned purely from point cloud data. Inspired by the success of deep learning in the image domain, we devise a novel pre-training technique for better model initialization by utiliz… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: ACCV 2022 paper. 14 pages of content, 4 pages of references, 6 pages of supplementary material

  19. arXiv:2210.15897  [pdf, other

    eess.IV cs.CV cs.GR

    Single-Image HDR Reconstruction by Multi-Exposure Generation

    Authors: Phuoc-Hieu Le, Quynh Le, Rang Nguyen, Binh-Son Hua

    Abstract: High dynamic range (HDR) imaging is an indispensable technique in modern photography. Traditional methods focus on HDR reconstruction from multiple images, solving the core problems of image alignment, fusion, and tone map**, yet having a perfect solution due to ghosting and other visual artifacts in the reconstruction. Recent attempts at single-image HDR reconstruction show a promising alternat… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: WACV 2023 paper. 8 pages of content, 2 pages of references, 8 pages of supplementary material

  20. arXiv:2209.05800  [pdf, other

    cs.CV cs.GR cs.MM

    Time-of-Day Neural Style Transfer for Architectural Photographs

    Authors: Yingshu Chen, Tuan-Anh Vu, Ka-Chun Shum, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Architectural photography is a genre of photography that focuses on capturing a building or structure in the foreground with dramatic lighting in the background. Inspired by recent successes in image-to-image translation methods, we aim to perform style transfer for architectural photographs. However, the special composition in architectural photography poses great challenges for style transfer in… ▽ More

    Submitted 27 October, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: Updated version with corrected equations. Paper published at the International Conference on Computational Photography (ICCP) 2022. 12 pages of content with 6 pages of supplementary materials

  21. arXiv:2207.10785  [pdf, other

    cs.CV

    Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

    Authors: Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen

    Abstract: We present a novel method for few-shot video classification, which performs appearance and temporal alignments. In particular, given a pair of query and support videos, we conduct appearance alignment via frame-level feature matching to achieve the appearance similarity score between the videos, while utilizing temporal order-preserving priors for obtaining the temporal similarity score between th… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  22. arXiv:2206.04679  [pdf, other

    cs.LG cs.CV

    POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples

    Authors: Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

    Abstract: In this work, we propose to use out-of-distribution samples, i.e., unlabeled samples coming from outside the target classes, to improve few-shot learning. Specifically, we exploit the easily available out-of-distribution samples to drive the classifier to avoid irrelevant features by maximizing the distance from prototypes to out-of-distribution samples while minimizing that of in-distribution sam… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2021 (First two authors contribute equally)

  23. arXiv:2203.16482  [pdf, other

    cs.CV

    RFNet-4D++: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds with Cross-Attention Spatio-Temporal Features

    Authors: Tuan-Anh Vu, Duc Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham, Sai-Kit Yeung

    Abstract: Object reconstruction from 3D point clouds has been a long-standing research problem in computer vision and computer graphics, and achieved impressive progress. However, reconstruction from time-varying point clouds (a.k.a. 4D point clouds) is generally overlooked. In this paper, we propose a new network architecture, namely RFNet-4D++, that jointly reconstructs objects and their motion flows from… ▽ More

    Submitted 17 October, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: TPAMI journal extension of ECCV 2022 arXiv:2203.16482

  24. arXiv:2203.08965  [pdf, other

    eess.IV cs.CV cs.LG

    3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation

    Authors: Tan Nguyen, Binh-Son Hua, Ngan Le

    Abstract: Medical image segmentation has been so far achieving promising results with Convolutional Neural Networks (CNNs). However, it is arguable that in traditional CNNs, its pooling layer tends to discard important information such as positions. Moreover, CNNs are sensitive to rotation and affine transformation. Capsule network is a data-efficient network design proposed to overcome such limitations by… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted in MICCAI 2021

  25. arXiv:2203.08964  [pdf, other

    cs.CV

    Point-Unet: A Context-aware Point-based Neural Network for Volumetric Segmentation

    Authors: Ngoc-Vuong Ho, Tan Nguyen, Gia-Han Diep, Ngan Le, Binh-Son Hua

    Abstract: Medical image analysis using deep learning has recently been prevalent, showing great performance for various downstream tasks including medical image segmentation and its sibling, volumetric image segmentation. Particularly, a typical volumetric segmentation network strongly relies on a voxel grid representation which treats volumetric data as a stack of individual voxel `slices', which allows le… ▽ More

    Submitted 28 February, 2024; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted in MICCAI 2021

  26. arXiv:2202.13094  [pdf, other

    cs.CV cs.AI

    RIConv++: Effective Rotation Invariant Convolutions for 3D Point Clouds Deep Learning

    Authors: Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: 3D point clouds deep learning is a promising field of research that allows a neural network to learn features of point clouds directly, making it a robust tool for solving 3D scene understanding tasks. While recent works show that point cloud convolutions can be invariant to translation and point permutation, investigations of the rotation invariance property for point cloud convolution has been s… ▽ More

    Submitted 20 March, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: Authors' version. Accepted to International Journal of Computer Vision (IJCV) 2022

  27. arXiv:2201.05905  [pdf, other

    eess.IV cs.CV

    SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical Segmentation on Less Labeled Data

    Authors: Minh Tran, Loi Ly, Binh-Son Hua, Ngan Le

    Abstract: Capsule network is a recent new deep network architecture that has been applied successfully for medical image segmentation tasks. This work extends capsule networks for volumetric medical image segmentation with self-supervised learning. To improve on the problem of weight initialization compared to previous capsule networks, we leverage self-supervised learning for capsule networks pre-training,… ▽ More

    Submitted 28 March, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: Accepted to ISBI 2022

  28. arXiv:2112.01398  [pdf, other

    cs.CV

    TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

    Authors: Tan M. Dinh, Rang Nguyen, Binh-Son Hua

    Abstract: In this paper, we conduct a study on the state-of-the-art methods for text-to-image synthesis and propose a framework to evaluate these methods. We consider syntheses where an image contains a single or multiple objects. Our study outlines several issues in the current evaluation pipeline: (i) for image quality assessment, a commonly used metric, e.g., Inception Score (IS), is often either miscali… ▽ More

    Submitted 19 July, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to ECCV 2022; TISE toolbox is available at https://github.com/VinAIResearch/tise-toolbox

  29. arXiv:2112.00719  [pdf, other

    cs.CV

    HyperInverter: Improving StyleGAN Inversion via Hypernetwork

    Authors: Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua

    Abstract: Real-world image manipulation has achieved fantastic progress in recent years as a result of the exploration and utilization of GAN latent spaces. GAN inversion is the first step in this pipeline, which aims to map the real image to the latent code faithfully. Unfortunately, the majority of existing GAN inversion methods fail to meet at least one of the three requirements listed below: high recons… ▽ More

    Submitted 4 April, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022; Project page is located at https://di-mi-ta.github.io/HyperInverter/

  30. arXiv:2111.11646  [pdf, other

    cs.CV cs.AI q-bio.QM

    CytoImageNet: A large-scale pretraining dataset for bioimage transfer learning

    Authors: Stanley Bryan Z. Hua, Alex X. Lu, Alan M. Moses

    Abstract: Motivation: In recent years, image-based biological assays have steadily become high-throughput, sparking a need for fast automated methods to extract biologically-meaningful information from hundreds of thousands of images. Taking inspiration from the success of ImageNet, we curate CytoImageNet, a large-scale dataset of openly-sourced and weakly-labeled microscopy images (890K images, 894 classes… ▽ More

    Submitted 23 November, 2021; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted paper at NeurIPS 2021 Learning Meaningful Representations for Life (LMRL) Workshop

  31. arXiv:2111.08823  [pdf, other

    cs.LG cs.AI physics.comp-ph

    Meta-Auto-Decoder for Solving Parametric Partial Differential Equations

    Authors: Xiang Huang, Zhanhong Ye, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, Fan Yu, Bei Hua, Lei Chen, Bin Dong

    Abstract: Many important problems in science and engineering require solving the so-called parametric partial differential equations (PDEs), i.e., PDEs with different physical parameters, boundary conditions, shapes of computation domains, etc. Recently, building learning-based numerical solvers for parametric PDEs has become an emerging new field. One category of methods such as the Deep Galerkin Method (D… ▽ More

    Submitted 18 November, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

  32. arXiv:2111.01394  [pdf, other

    cs.LG cs.AI physics.comp-ph

    Solving Partial Differential Equations with Point Source Based on Physics-Informed Neural Networks

    Authors: Xiang Huang, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Bingya Weng, Min Wang, Haotian Chu, **g Zhou, Fan Yu, Bei Hua, Lei Chen, Bin Dong

    Abstract: In recent years, deep learning technology has been used to solve partial differential equations (PDEs), among which the physics-informed neural networks (PINNs) emerges to be a promising method for solving both forward and inverse PDE problems. PDEs with a point source that is expressed as a Dirac delta function in the governing equations are mathematical models of many physical processes. However… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  33. arXiv:2108.12471  [pdf, other

    q-bio.QM cs.LG

    Machine learning on DNA-encoded library count data using an uncertainty-aware probabilistic loss function

    Authors: Katherine S. Lim, Andrew G. Reidenbach, Bruce K. Hua, Jeremy W. Mason, Christopher J. Gerry, Paul A. Clemons, Connor W. Coley

    Abstract: DNA-encoded library (DEL) screening and quantitative structure-activity relationship (QSAR) modeling are two techniques used in drug discovery to find small molecules that bind a protein target. Applying QSAR modeling to DEL data can facilitate the selection of compounds for off-DNA synthesis and evaluation. Such a combined approach has been shown recently by training binary classifiers to learn D… ▽ More

    Submitted 27 April, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

  34. arXiv:2108.01806  [pdf, other

    cs.CV cs.GR

    Neural Scene Decoration from a Single Photograph

    Authors: Hong-Wing Pang, Yingshu Chen, Phuoc-Hieu Le, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: Furnishing and rendering indoor scenes has been a long-standing task for interior design, where artists create a conceptual design for the space, build a 3D model of the space, decorate, and then perform rendering. Although the task is important, it is tedious and requires tremendous effort. In this paper, we introduce a new problem of domain-specific indoor scene image synthesis, namely neural sc… ▽ More

    Submitted 25 July, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: ECCV 2022 paper. 14 pages of main content, 4 pages of references, and 11 pages of appendix

  35. arXiv:2107.13368  [pdf

    cs.CY

    Evaluating the weight sensitivity in AHP-based flood risk estimation models

    Authors: Hong** Zhang, Zhenfeng Shao, Bin Hua, Xiao Huang, **qi Zhao, Wenfu Wu, Yewen Fan

    Abstract: In the analytic hierarchy process (AHP) based flood risk estimation models, it is widely acknowledged that different weighting criteria can lead to different results. In this study, we evaluated and discussed the sensitivity of flood risk estimation brought by judgment matrix definition by investigating the performance of pixel-based and sub-watershed-based AHP models. Taking a flood event that oc… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: 42pages, 12 figures, 7 tables. It is about sensitivity analyzing of flood risk estimation using pixels or sub-watershed as basic unit

    MSC Class: 86A05 ACM Class: I.6.0

  36. arXiv:2105.03193  [pdf, other

    cs.LG cs.AI

    Network Pruning That Matters: A Case Study on Retraining Variants

    Authors: Duong H. Le, Binh-Son Hua

    Abstract: Network pruning is an effective method to reduce the computational expense of over-parameterized neural networks for deployment on low-resource systems. Recent state-of-the-art techniques for retraining pruned networks such as weight rewinding and learning rate rewinding have been shown to outperform the traditional fine-tuning technique in recovering the lost accuracy (Renda et al., 2020), but so… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted at ICLR 2021 (Poster)

  37. arXiv:2102.04014  [pdf, other

    cs.CV

    Point-set Distances for Learning Representations of 3D Point Clouds

    Authors: Trung Nguyen, Quang-Hieu Pham, Tam Le, Tung Pham, Nhat Ho, Binh-Son Hua

    Abstract: Learning an effective representation of 3D point clouds requires a good metric to measure the discrepancy between two 3D point sets, which is non-trivial due to their irregularity. Most of the previous works resort to using the Chamfer discrepancy or Earth Mover's distance, but those metrics are either ineffective in measuring the differences between point clouds or computationally expensive. In t… ▽ More

    Submitted 14 September, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: ICCV 2021 camera-ready paper (8 pages) with supplementary (3.5 pages)

  38. arXiv:2008.12066  [pdf, other

    cs.CV

    Minimal Adversarial Examples for Deep Learning on 3D Point Clouds

    Authors: Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: With recent developments of convolutional neural networks, deep learning for 3D point clouds has shown significant progress in various 3D scene understanding tasks, e.g., object recognition, semantic segmentation. In a safety-critical environment, it is however not well understood how such deep learning models are vulnerable to adversarial examples. In this work, we explore adversarial attacks for… ▽ More

    Submitted 17 September, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: ICCV 2021 camera-ready paper (8 pages)

  39. arXiv:2008.02986  [pdf, other

    cs.CV

    Global Context Aware Convolutions for 3D Point Cloud Understanding

    Authors: Zhiyuan Zhang, Binh-Son Hua, Wei Chen, Yibin Tian, Sai-Kit Yeung

    Abstract: Recent advances in deep learning for 3D point clouds have shown great promises in scene understanding tasks thanks to the introduction of convolution operators to consume 3D point clouds directly in a neural network. Point cloud data, however, could have arbitrary rotations, especially those acquired from 3D scanning. Recent works show that it is possible to design point cloud convolutions with ro… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

  40. arXiv:1911.09326  [pdf, other

    cs.CV

    LCD: Learned Cross-Domain Descriptors for 2D-3D Matching

    Authors: Quang-Hieu Pham, Mikaela Angelina Uy, Binh-Son Hua, Duc Thanh Nguyen, Gemma Roig, Sai-Kit Yeung

    Abstract: In this work, we present a novel method to learn a local cross-domain descriptor for 2D image and 3D point cloud matching. Our proposed method is a dual auto-encoder neural network that maps 2D and 3D input into a shared latent space representation. We show that such local cross-domain descriptors in the shared embedding are more discriminative than those obtained from individual training in 2D an… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: Accepted to AAAI 2020 (Oral)

  41. arXiv:1908.06297  [pdf, other

    cs.CV

    Rotation Invariant Convolutions for 3D Point Clouds Deep Learning

    Authors: Zhiyuan Zhang, Binh-Son Hua, David W. Rosen, Sai-Kit Yeung

    Abstract: Recent progresses in 3D deep learning has shown that it is possible to design special convolution operators to consume point cloud data. However, a typical drawback is that rotation invariance is often not guaranteed, resulting in networks being trained with data augmented with rotations. In this paper, we introduce a novel convolution operator for point clouds that achieves rotation invariance. O… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

    Comments: International Conference on 3D Vision (3DV) 2019

  42. arXiv:1908.06295  [pdf, other

    cs.CV

    ShellNet: Efficient Point Cloud Convolutional Neural Networks using Concentric Shells Statistics

    Authors: Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Deep learning with 3D data has progressed significantly since the introduction of convolutional neural networks that can handle point order ambiguity in point cloud data. While being able to achieve good accuracies in various scene understanding tasks, previous methods often have low training speed and complex network architecture. In this paper, we address these problems by proposing an efficient… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

    Comments: International Conference on Computer Vision (ICCV) 2019 Oral

  43. arXiv:1908.04616  [pdf, other

    cs.CV

    Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data

    Authors: Mikaela Angelina Uy, Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: Deep learning techniques for point cloud data have demonstrated great potentials in solving classical problems in 3D computer vision such as 3D object classification and segmentation. Several recent 3D object classification methods have reported state-of-the-art performance on CAD model datasets such as ModelNet40 with high accuracy (~92%). Despite such impressive results, in this paper, we argue… ▽ More

    Submitted 19 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: ICCV 2019 Oral

  44. arXiv:1904.00699  [pdf, other

    cs.CV

    JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields

    Authors: Quang-Hieu Pham, Duc Thanh Nguyen, Binh-Son Hua, Gemma Roig, Sai-Kit Yeung

    Abstract: Deep learning techniques have become the to-go models for most vision-related tasks on 2D images. However, their power has not been fully realised on several tasks in 3D space, e.g., 3D scene understanding. In this work, we jointly address the problems of semantic and instance segmentation of 3D point clouds. Specifically, we develop a multi-task pointwise network that simultaneously performs two… ▽ More

    Submitted 5 April, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: CVPR 2019 (Oral). More information at https://pqhieu.github.io/cvpr19.html

  45. arXiv:1804.00257  [pdf, other

    cs.CV

    Real-time Progressive 3D Semantic Segmentation for Indoor Scene

    Authors: Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: The widespread adoption of autonomous systems such as drones and assistant robots has created a need for real-time high-quality semantic scene segmentation. In this paper, we propose an efficient yet robust technique for on-the-fly dense reconstruction and semantic segmentation of 3D indoor scenes. To guarantee (near) real-time performance, our method is built atop an efficient super-voxel cluster… ▽ More

    Submitted 5 April, 2019; v1 submitted 1 April, 2018; originally announced April 2018.

    Comments: WACV 2019. More information at https://pqhieu.github.io/wacv19.html

  46. arXiv:1712.05245  [pdf, other

    cs.CV cs.LG

    Pointwise Convolutional Neural Networks

    Authors: Binh-Son Hua, Minh-Khoi Tran, Sai-Kit Yeung

    Abstract: Deep learning with 3D data such as reconstructed point clouds and CAD models has received great research interests recently. However, the capability of using point clouds with convolutional neural network has been so far not fully explored. In this paper, we present a convolutional neural network for semantic segmentation and object recognition with 3D point clouds. At the core of our network is p… ▽ More

    Submitted 29 March, 2018; v1 submitted 14 December, 2017; originally announced December 2017.

    Comments: 10 pages, 6 figures, 10 tables. Paper accepted to CVPR 2018

  47. Calibration of depth cameras using denoised depth images

    Authors: Ramanpreet Singh Pahwa, Minh N. Do, Tian Tsong Ng, Binh-Son Hua

    Abstract: Depth sensing devices have created various new applications in scientific and commercial research with the advent of Microsoft Kinect and PMD (Photon Mixing Device) cameras. Most of these applications require the depth cameras to be pre-calibrated. However, traditional calibration methods using a checkerboard do not work very well for depth cameras due to the low image resolution. In this paper, w… ▽ More

    Submitted 8 September, 2017; originally announced September 2017.

    Comments: 5 pages, 3 figures, conference

    Journal ref: 2014 IEEE International Conference on Image Processing (ICIP), Paris, 2014, pp. 3459-3463

  48. arXiv:1610.05883  [pdf, other

    cs.CV

    A Robust 3D-2D Interactive Tool for Scene Segmentation and Annotation

    Authors: Duc Thanh Nguyen, Binh-Son Hua, Lap-Fai Yu, Sai-Kit Yeung

    Abstract: Recent advances of 3D acquisition devices have enabled large-scale acquisition of 3D scene data. Such data, if completely and well annotated, can serve as useful ingredients for a wide spectrum of computer vision and graphics works such as data-driven modeling and scene understanding, object detection and recognition. However, annotating a vast amount of 3D scene data remains challenging due to th… ▽ More

    Submitted 19 October, 2016; originally announced October 2016.

    Comments: 14 pages