Skip to main content

Showing 1–50 of 187 results for author: Lau, W

.
  1. arXiv:2406.10652  [pdf, other

    cs.CV

    MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images

    Authors: Tao Yan, Weijiang He, Chenglong Wang, Xiangjie Zhu, Yinghui Wang, Rynson W. H. Lau

    Abstract: Since rainy weather always degrades image quality and poses significant challenges to most computer vision-based intelligent systems, image de-raining has been a hot research topic. Fortunately, in a rainy light field (LF) image, background obscured by rain streaks in one sub-view may be visible in the other sub-views, and implicit depth information and recorded 4D structural information may benef… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 13 pages, 13 figures, 4 tables

  2. arXiv:2406.01476  [pdf, other

    cs.CV

    DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors

    Authors: Tianyu Huang, Yihan Zeng, Hui Li, Wangmeng Zuo, Rynson W. H. Lau

    Abstract: Dynamic 3D interaction has witnessed great interest in recent works, while creating such 4D content remains challenging. One solution is to animate 3D scenes with physics-based simulation, and the other is to learn the deformation of static 3D objects with the distillation of video generative models. The former one requires assigning precise physical properties to the target object, otherwise the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Technical report. Codes are released at: https://github.com/tyhuang0428/DreamPhysics

  3. arXiv:2405.17725  [pdf, other

    cs.CV

    Color Shift Estimation-and-Correction for Image Enhancement

    Authors: Yiyu Li, Ke Xu, Gerhard Petrus Hancke, Rynson W. H. Lau

    Abstract: Images captured under sub-optimal illumination conditions may contain both over- and under-exposures. Current approaches mainly focus on adjusting image brightness, which may exacerbate the color tone distortion in under-exposed areas and fail to restore accurate colors in over-exposed regions. We observe that over- and under-exposed regions display opposite color tone distribution shifts with res… ▽ More

    Submitted 29 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: CVPR2024 accepted paper

  4. arXiv:2404.13551  [pdf, other

    cs.SD eess.AS

    AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition

    Authors: Kin Wai Lau, Yasar Abbas Ur Rehman, Lai-Man Po

    Abstract: Recent research has successfully adapted vision-based convolutional neural network (CNN) architectures for audio recognition tasks using Mel-Spectrograms. However, these CNNs have high computational costs and memory requirements, limiting their deployment on low-end edge devices. Motivated by the success of efficient vision models like InceptionNeXt and ConvNeXt, we propose AudioRepInceptionNeXt,… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  5. arXiv:2403.18545  [pdf, other

    cs.DC

    Optimal Resource Efficiency with Fairness in Heterogeneous GPU Clusters

    Authors: Zizhao Mo, Huanle Xu, Wing Cheong Lau

    Abstract: Ensuring the highest training throughput to maximize resource efficiency, while maintaining fairness among users, is critical for deep learning (DL) training in heterogeneous GPU clusters. However, current DL schedulers provide only limited fairness properties and suboptimal training throughput, impeding tenants from effectively leveraging heterogeneous resources. The underlying design challenge s… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  6. arXiv:2403.16224  [pdf, other

    cs.CV

    Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

    Authors: Haoyuan Wang, Wenbo Hu, Lei Zhu, Rynson W. H. Lau

    Abstract: Inverse rendering aims at recovering both geometry and materials of objects. It provides a more compatible reconstruction for conventional rendering engines, compared with the neural radiance fields (NeRFs). On the other hand, existing NeRF-based inverse rendering methods cannot handle glossy objects with local light interactions well, as they typically oversimplify the illumination as a 2D enviro… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 paper. Project webpage https://whyy.site/paper/nep

  7. arXiv:2403.15383  [pdf, other

    cs.CV

    ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars

    Authors: Zhenwei Wang, Tengfei Wang, Gerhard Hancke, Ziwei Liu, Rynson W. H. Lau

    Abstract: Real-world applications often require a large gallery of 3D assets that share a consistent theme. While remarkable advances have been made in general 3D content creation from text or image, synthesizing customized 3D assets following the shared theme of input 3D exemplars remains an open and challenging problem. In this work, we present ThemeStation, a novel approach for theme-aware 3D-to-3D gener… ▽ More

    Submitted 15 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted to SIGGRAPH 2024. Project page: https://3dthemestation.github.io/

  8. arXiv:2403.00644  [pdf, other

    cs.CV

    Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

    Authors: Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau

    Abstract: Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with handling diverse low-level tasks that require details preservation. To overcome this limitation, we present a new Diff-Plugin framework to enable a single pre-trained diffusion model to generate high-fidelity result… ▽ More

    Submitted 28 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR2024. Replaced some celebrity images to avoid copyright disputes

  9. arXiv:2402.14808  [pdf, other

    cs.CL

    RelayAttention for Efficient Large Language Model Serving with Long System Prompts

    Authors: Lei Zhu, Xinjiang Wang, Wayne Zhang, Rynson W. H. Lau

    Abstract: A practical large language model (LLM) service may involve a long system prompt, which specifies the instructions, examples, and knowledge documents of the task and is reused across requests. However, the long system prompt causes throughput/latency bottlenecks as the cost of generating the next token grows w.r.t. the sequence length. This paper aims to improve the efficiency of LLM services that… ▽ More

    Submitted 30 May, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: accepted by the ACL 2024 main conference

  10. arXiv:2402.13631  [pdf, other

    cs.CV

    Delving into Dark Regions for Robust Shadow Detection

    Authors: Huankang Guan, Ke Xu, Rynson W. H. Lau

    Abstract: Shadow detection is a challenging task as it requires a comprehensive understanding of shadow characteristics and global/local illumination conditions. We observe from our experiment that state-of-the-art deep methods tend to have higher error rates in differentiating shadow pixels from non-shadow pixels in dark regions (ie, regions with low-intensity values). Our key insight to this problem is th… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  11. arXiv:2402.02889  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

    Authors: Yasar Abbas Ur Rehman, Kin Wai Lau, Yuyang Xie, Lan Ma, Jiajun Shen

    Abstract: The integration of Federated Learning (FL) and Self-supervised Learning (SSL) offers a unique and synergetic combination to exploit the audio data for general-purpose audio understanding, without compromising user data privacy. However, rare efforts have been made to investigate the SSL models in the FL regime for general-purpose audio understanding, especially when the training data is generated… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  12. arXiv:2402.00341  [pdf, other

    cs.CV

    Recasting Regional Lighting for Shadow Removal

    Authors: Yuhao Liu, Zhanghan Ke, Ke Xu, Fang Liu, Zhenwei Wang, Rynson W. H. Lau

    Abstract: Removing shadows requires an understanding of both lighting conditions and object textures in a scene. Existing methods typically learn pixel-level color map**s between shadow and non-shadow images, in which the joint modeling of lighting and object textures is implicit and inadequate. We observe that in a shadow region, the degradation degree of object textures depends on the local illumination… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 (Oral)

  13. arXiv:2312.11017  [pdf, ps, other

    cs.IT math.CO math.NT

    Information Inequalities via Ideas from Additive Combinatorics

    Authors: Chin Wa Lau, Chandra Nair

    Abstract: Ruzsa's equivalence theorem provided a framework for converting certain families of inequalities in additive combinatorics to entropic inequalities (which sometimes did not possess stand-alone entropic proofs). In this work, we first establish formal equivalences between some families (different from Ruzsa) of inequalities in additive combinatorics and entropic ones. As a first step to further the… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 15 pages, The authors were made aware that some of the results had been obtained earlier. The revised version acknowledges and references this work. A conference version of this was published in the proceeding of IEEE ISIT 2023. s

  14. arXiv:2312.06439  [pdf, other

    cs.CV

    DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior

    Authors: Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo

    Abstract: 3D generation has raised great attention in recent years. With the success of text-to-image diffusion models, the 2D-lifting technique becomes a promising route to controllable 3D generation. However, these methods tend to present inconsistent geometry, which is also known as the Janus problem. We observe that the problem is caused mainly by two aspects, i.e., viewpoint bias in 2D diffusion models… ▽ More

    Submitted 12 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR 2024

  15. arXiv:2312.05429  [pdf

    cs.LG cs.CY stat.AP

    Mitigating Nonlinear Algorithmic Bias in Binary Classification

    Authors: Wendy Hui, Wai Kwong Lau

    Abstract: This paper proposes the use of causal modeling to detect and mitigate algorithmic bias that is nonlinear in the protected attribute. We provide a general overview of our approach. We use the German Credit data set, which is available for download from the UC Irvine Machine Learning Repository, to develop (1) a prediction model, which is treated as a black box, and (2) a causal model for bias mitig… ▽ More

    Submitted 7 May, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 12 tables. arXiv admin note: text overlap with arXiv:2310.12421

  16. arXiv:2312.03917  [pdf, other

    astro-ph.GA

    [OIII] 5007 emissions in extremely red quasars (ERQs) are compact

    Authors: Marie Wingyee Lau, Serena Perrotta, Fred Hamann, Jarred Gillette, David S. N. Rupke, Andrey Vayner, Nadia L. Zakamska, Dominika Wylezalek

    Abstract: ``Extremely red quasars'' (ERQs) are a non-radio-selected, intrinsically luminous population of quasars at cosmic noon selected by their extremely red colour from rest-frame UV to mid-IR. ERQs are uniquely associated with exceptionally broad and blueshifted [OIII] 5007 emission reaching speeds >6000 km s^-1. We obtained adaptive optics integral-field spectroscopic observations using Keck/OSIRIS an… ▽ More

    Submitted 27 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted by MNRAS

  17. arXiv:2310.16587  [pdf, other

    cs.LG cs.AI cs.CV

    Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations

    Authors: Tsai Hor Chan, Kin Wai Lau, Jiajun Shen, Guosheng Yin, Lequan Yu

    Abstract: Uncertainty estimation aims to evaluate the confidence of a trained deep neural network. However, existing uncertainty estimation approaches rely on low-dimensional distributional assumptions and thus suffer from the high dimensionality of latent features. Existing approaches tend to focus on uncertainty on discrete classification probabilities, which leads to poor generalizability to uncertainty… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  18. arXiv:2310.12421  [pdf

    cs.LG cs.CY

    Detecting and Mitigating Algorithmic Bias in Binary Classification using Causal Modeling

    Authors: Wendy Hui, Wai Kwong Lau

    Abstract: This paper proposes the use of causal modeling to detect and mitigate algorithmic bias. We provide a brief description of causal modeling and a general overview of our approach. We then use the Adult dataset, which is available for download from the UC Irvine Machine Learning Repository, to develop (1) a prediction model, which is treated as a black box, and (2) a causal model for bias mitigation.… ▽ More

    Submitted 8 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 11 pages, 2 figures, 6 tables, R-script in appendix

  19. arXiv:2309.17175  [pdf, other

    cs.CV

    TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

    Authors: Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo

    Abstract: Recent works learn 3D representation explicitly under text-3D guidance. However, limited text-3D data restricts the vocabulary scale and text control of generations. Generators may easily fall into a stereotype concept for certain text prompts, thus losing open-vocabulary generation ability. To tackle this issue, we introduce a conditional 3D generative model, namely TextField3D. Specifically, rat… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by ICLR 2024

  20. arXiv:2309.15163  [pdf, other

    astro-ph.IM physics.ed-ph

    WTH! Wok the Hydrogen: Measurement of Galactic Neutral Hydrogen in Noisy Urban Environment Using Kitchenware

    Authors: Leo W. H. Fung, Albert Wai Kit Lau, Ka Hung Chan, Ming Tony Shing

    Abstract: Astronomy observation is difficult in urban environments due to the background noise generated by human activities. Consequently, promoting astronomy in metropolitan areas is challenging. In this work, we propose a low-cost, educational experiment called Wok the Hydrogen (WTH) that offers opportunities for scientific observation in urban environments, specifically the observation of the $21$ cm (… ▽ More

    Submitted 28 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Updated: minor corrections to typos

  21. arXiv:2309.01439  [pdf, other

    cs.CV

    Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN

    Authors: Kin Wai Lau, Lai-Man Po, Yasar Abbas Ur Rehman

    Abstract: Visual Attention Networks (VAN) with Large Kernel Attention (LKA) modules have been shown to provide remarkable performance, that surpasses Vision Transformers (ViTs), on a range of vision-based tasks. However, the depth-wise convolutional layer in these LKA modules incurs a quadratic increase in the computational and memory footprints with increasing convolutional kernel size. To mitigate these p… ▽ More

    Submitted 19 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  22. arXiv:2308.03059  [pdf, other

    cs.CV cs.AI cs.GR

    Language-based Photo Color Adjustment for Graphic Designs

    Authors: Zhenwei Wang, Nanxuan Zhao, Gerhard Hancke, Rynson W. H. Lau

    Abstract: Adjusting the photo color to associate with some design elements is an essential way for a graphic design to effectively deliver its message and make it aesthetically pleasing. However, existing tools and previous works face a dilemma between the ease of use and level of expressiveness. To this end, we introduce an interactive language-based approach for photo recoloring, which provides an intuiti… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 15 pages, 19 figures. Accepted by SIGGRAPH 2023. Project page: https://zhenwwang.github.io/langrecol

  23. arXiv:2307.10664  [pdf, other

    cs.CV cs.GR

    Lighting up NeRF via Unsupervised Decomposition and Enhancement

    Authors: Haoyuan Wang, Xiaogang Xu, Ke Xu, Rynson WH. Lau

    Abstract: Neural Radiance Field (NeRF) is a promising approach for synthesizing novel views, given a set of images and the corresponding camera poses of a scene. However, images photographed from a low-light scene can hardly be used to train a NeRF model to produce high-quality results, due to their low pixel intensities, heavy noise, and color distortion. Combining existing low-light image enhancement meth… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: ICCV 2023. Project website: https://whyy.site/paper/llnerf

  24. arXiv:2307.07265  [pdf, other

    cs.SD cs.AI eess.AS

    AudioInceptionNeXt: TCL AI LAB Submission to EPIC-SOUND Audio-Based-Interaction-Recognition Challenge 2023

    Authors: Kin Wai Lau, Yasar Abbas Ur Rehman, Yuyang Xie, Lan Ma

    Abstract: This report presents the technical details of our submission to the 2023 Epic-Kitchen EPIC-SOUNDS Audio-Based Interaction Recognition Challenge. The task is to learn the map** from audio samples to their corresponding action labels. To achieve this goal, we propose a simple yet effective single-stream CNN-based architecture called AudioInceptionNeXt that operates on the time-frequency log-mel-sp… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  25. arXiv:2305.11223  [pdf, ps, other

    astro-ph.GA

    Accurate Systemic Redshifts and Outflow Speeds for Extremely Red Quasars (ERQs)

    Authors: Jarred Gillette, Fred Hamann, Marie Wingyee Lau, Serena Perrotta

    Abstract: Extremely Red Quasars (ERQs) are thought to represent a brief episode of young quasar and galactic evolution characterized by rapid outflows and obscured growth due to dusty environments. We use new redshift measurements from CO and Ly$α$ emission-lines to better constrain outflow velocities from previous line measurements. We present sample of 82 ERQs, and the analysis confirms that ERQs have a h… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 8 pages, 4 figures, submitted to Monthly Notices of the Royal Astronomical Society

  26. arXiv:2305.05197  [pdf, other

    astro-ph.IM

    Initial On-Sky Performance testing of the Single-Photon Imager for Nanosecond Astrophysics (SPINA) system

    Authors: Albert Wai Kit Lau, Nurzhan Shaimoldin, Zhanat Maksut, Yan Yan Chan, Mehdi Shafiee, Bruce Grossan, George F. Smoot

    Abstract: This work presents an initial on-sky performance measurement of the Single-Photon Imager for Nanosecond Astrophysics (SPINA) system, part of our Ultra-Fast Astronomy (UFA) program. We developed the SPINA system based on the position-sensitive silicon photomultiplier (PS-SiPM) detector to record both photoelectron (P.E.) temporal and spatial information. The initial on-sky testing of the SPINA syst… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  27. arXiv:2303.13511  [pdf, other

    cs.CV cs.AI cs.LG

    Neural Preset for Color Style Transfer

    Authors: Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W. H. Lau

    Abstract: In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed. Our method is based on two core designs. First, we propose Deterministic Neural Color Map** (DNCM) to consistently operate on each pixel via an image-adaptive color map** matrix, avoiding ar… ▽ More

    Submitted 24 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Project page with demos: https://zhkkke.github.io/NeuralPreset . Artifact-free real-time 4K color style transfer via AI-generated presets. CVPR 2023

  28. arXiv:2303.12835  [pdf, ps, other

    astro-ph.GA

    Compact and Quiescent Circumgalactic Medium and Ly$α$ Halos around Extremely Red Quasars (ERQs)

    Authors: Jarred Gillette, Marie Wingyee Lau, Fred Hamann, Serena Perrotta, David S. N. Rupke, Dominika Wylezalek, Nadia L. Zakamska, Andrey Vayner

    Abstract: Red quasars may represent a young stage of galaxy evolution that provide important feedback to their host galaxies. We are studying a population of extremely red quasars (ERQs) with exceptionally fast and powerful outflows, at median redshift $z$ = 2.6. We present Keck/KCWI integral field spectra of 11 ERQs, which have a median color $i-W3$ = 5.9~mag, median… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 19 pages, 9 figures, submitted to Monthly Notices of the Royal Astronomical Society

  29. arXiv:2301.03182  [pdf, other

    cs.CV

    Structure-Informed Shadow Removal Networks

    Authors: Yuhao Liu, Qing Guo, Lan Fu, Zhanghan Ke, Ke Xu, Wei Feng, Ivor W. Tsang, Rynson W. H. Lau

    Abstract: Existing deep learning-based shadow removal methods still produce images with shadow remnants. These shadow remnants typically exist in homogeneous regions with low-intensity values, making them untraceable in the existing image-to-image map** paradigm. We observe that shadows mainly degrade images at the image-structure level (in which humans perceive object shapes and continuous colors). Hence… ▽ More

    Submitted 1 February, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: IEEE TIP

  30. arXiv:2211.15644  [pdf, other

    cs.CV

    Efficient Mirror Detection via Multi-level Heterogeneous Learning

    Authors: Ruozhen He, Jiaying Lin, Rynson W. H. Lau

    Abstract: We present HetNet (Multi-level \textbf{Het}erogeneous \textbf{Net}work), a highly efficient mirror detection network. Current mirror detection methods focus more on performance than efficiency, limiting the real-time applications (such as drones). Their lack of efficiency is aroused by the common design of adopting homogeneous modules at different levels, which ignores the difference between diffe… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023. The code is available at https://github.com/Catherine-R-He/HetNet

  31. arXiv:2211.11223  [pdf, ps, other

    math.PR

    Inverse clustering of Gibbs Partitions via independent fragmentation and dual dependent coagulation operators

    Authors: Man Wai Ho, Lancelot F. James, John W. Lau

    Abstract: Gibbs partitions of the integers generated by stable subordinators of index $α\in(0,1)$ form remarkable classes of random partitions where in principle much is known about their properties, including practically effortless obtainment of otherwise complex asymptotic results potentially relevant to applications in general combinatorial stochastic processes, random tree/graph growth models and Bayesi… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    MSC Class: Primary 60C05; 60G09 Secondary 60G57; 60E99

  32. arXiv:2210.01055  [pdf, other

    cs.CV

    CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training

    Authors: Tianyu Huang, Bowen Dong, Yunhan Yang, Xiaoshui Huang, Rynson W. H. Lau, Wanli Ouyang, Wangmeng Zuo

    Abstract: Pre-training across 3D vision and language remains under development because of limited training data. Recent works attempt to transfer vision-language pre-training models to 3D vision. PointCLIP converts point cloud data to multi-view depth maps, adopting CLIP for shape classification. However, its performance is restricted by the domain gap between rendered depth maps and images, as well as the… ▽ More

    Submitted 22 August, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted by ICCV2023

  33. arXiv:2209.05245  [pdf, other

    cs.NE cs.LG eess.IV

    Continual learning benefits from multiple sleep mechanisms: NREM, REM, and Synaptic Downscaling

    Authors: Brian S. Robinson, Clare W. Lau, Alexander New, Shane M. Nichols, Erik C. Johnson, Michael Wolmetz, William G. Coon

    Abstract: Learning new tasks and skills in succession without losing prior learning (i.e., catastrophic forgetting) is a computational challenge for both artificial and biological neural networks, yet artificial systems struggle to achieve parity with their biological analogues. Mammalian brains employ numerous neural operations in support of continual learning during sleep. These are ripe for artificial ad… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 9 pages, 12 figures, code available upon reasonable request. Corresponding author: William G. Coon ([email protected])

  34. Large-Field Contextual Feature Learning for Glass Detection

    Authors: Haiyang Mei, Xin Yang, Letian Yu, Qiang Zhang, Xiaopeng Wei, Rynson W. H. Lau

    Abstract: Glass is very common in our daily life. Existing computer vision systems neglect it and thus may have severe consequences, e.g., a robot may crash into a glass wall. However, sensing the presence of glass is not straightforward. The key challenge is that arbitrary objects/scenes can appear behind the glass. In this paper, we propose an important problem of detecting glass surfaces from a single RG… ▽ More

    Submitted 10 September, 2022; originally announced September 2022.

  35. Rain Removal from Light Field Images with 4D Convolution and Multi-scale Gaussian Process

    Authors: Tao Yan, Mingyue Li, Bin Li, Yang Yang, Rynson W. H. Lau

    Abstract: Existing deraining methods focus mainly on a single input image. However, with just a single input image, it is extremely difficult to accurately detect and remove rain streaks, in order to restore a rain-free image. In contrast, a light field image (LFI) embeds abundant 3D structure and texture information of the target scene by recording the direction and position of each incident ray via a plen… ▽ More

    Submitted 27 January, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: This paper has been published on IEEE Transactions on Image Processing

    Journal ref: IEEE Transactions on Image Processing (2023), v32, pages 921-936

  36. arXiv:2208.01104  [pdf

    physics.flu-dyn cond-mat.mes-hall

    Theory of the force of Friction Acting on Water Chains Flowing through Carbon Nanotubes

    Authors: J. B. Sokoloff, A. W. C. Lau

    Abstract: A simple model for the friction experienced by the one dimensional water chains that flow through subnanometer diameter carbon nanotubes is studied. The model is based on a lowest order perturbation theory treatment of the friction experienced by the water chains due to the excitation of phonon and electron excitations in both the nanotube and the water chain, as a result of the motion of the chai… ▽ More

    Submitted 24 January, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures

  37. arXiv:2207.14083  [pdf, other

    cs.CV

    Weakly-Supervised Camouflaged Object Detection with Scribble Annotations

    Authors: Ruozhen He, Qihua Dong, Jiaying Lin, Rynson W. H. Lau

    Abstract: Existing camouflaged object detection (COD) methods rely heavily on large-scale datasets with pixel-wise annotations. However, due to the ambiguous boundary, annotating camouflage objects pixel-wisely is very time-consuming and labor-intensive, taking ~60mins to label one image. In this paper, we propose the first weakly-supervised COD method, using scribble annotations as supervision. To achieve… ▽ More

    Submitted 28 November, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: Accepted to AAAI 2023. The code and dataset are available at https://github.com/dddraxxx/Weakly-Supervised-Camouflaged-Object-Detection-with-Scribble-Annotations

  38. arXiv:2207.06332  [pdf, other

    cs.CV

    Symmetry-Aware Transformer-based Mirror Detection

    Authors: Tianyu Huang, Bowen Dong, Jiaying Lin, Xiaohui Liu, Rynson W. H. Lau, Wangmeng Zuo

    Abstract: Mirror detection aims to identify the mirror regions in the given input image. Existing works mainly focus on integrating the semantic features and structural features to mine specific relations between mirror and non-mirror regions, or introducing mirror properties like depth or chirality to help analyze the existence of mirrors. In this work, we observe that a real object typically forms a loose… ▽ More

    Submitted 4 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  39. arXiv:2207.01322  [pdf, other

    cs.CV

    Harmonizer: Learning to Perform White-Box Image and Video Harmonization

    Authors: Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W. H. Lau

    Abstract: Recent works on image harmonization solve the problem as a pixel-wise image translation task via large autoencoders. They have unsatisfactory performances and slow inference speeds when dealing with high-resolution images. In this work, we observe that adjusting the input arguments of basic image filters, e.g., brightness and contrast, is sufficient for humans to produce realistic images from the… ▽ More

    Submitted 20 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  40. arXiv:2206.11250  [pdf, other

    cs.CV

    Depth-aware Glass Surface Detection with Cross-modal Context Mining

    Authors: Jiaying Lin, Yuen Hei Yeung, Rynson W. H. Lau

    Abstract: Glass surfaces are becoming increasingly ubiquitous as modern buildings tend to use a lot of glass panels. This however poses substantial challenges on the operations of autonomous systems such as robots, self-driving cars and drones, as the glass panels can become transparent obstacles to the navigation.Existing works attempt to exploit various cues, including glass boundary context or reflection… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  41. arXiv:2205.01636  [pdf, other

    cond-mat.quant-gas cond-mat.mes-hall quant-ph

    Atomtronic multi-terminal Aharonov-Bohm interferometer

    Authors: Jonathan Wei Zhong Lau, Koon Siang Gan, Rainer Dumke, Luigi Amico, Leong-Chuan Kwek, Tobias Haug

    Abstract: We study a multi-functional device for cold atoms consisting of a three-terminal ring circuit pierced by a synthetic magnetic flux, where the ring can be continuous or discretized. The flux controls the atomic current through the ring via the Aharonov-Bohm effect. Our device shows a flux-induced transition of reflections from an Andreev-like negative density to positive density. Further, the flux… ▽ More

    Submitted 15 May, 2023; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 17 pages, 11 figures

  42. Convex Optimization for Nonequilibrium Steady States on a Hybrid Quantum Processor

    Authors: Jonathan Wei Zhong Lau, Kian Hwee Lim, Kishor Bharti, Leong-Chuan Kwek, Sai Vinjanampathy

    Abstract: Finding the transient and steady state properties of open quantum systems is a central problem in various fields of quantum technologies. Here, we present a quantum-assisted algorithm to determine the steady states of open system dynamics. By reformulating the problem of finding the fixed point of Lindblad dynamics as a feasibility semidefinite program, we bypass several well-known issues with var… ▽ More

    Submitted 7 July, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: 14 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 130, 240601 (2023)

  43. arXiv:2203.17257  [pdf, other

    cs.CV

    Rethinking Video Salient Object Ranking

    Authors: Jiaying Lin, Huankang Guan, Rynson W. H. Lau

    Abstract: Salient Object Ranking (SOR) involves ranking the degree of saliency of multiple salient objects in an input image. Most recently, a method is proposed for ranking salient objects in an input video based on a predicted fixation map. It relies solely on the density of the fixations within the salient objects to infer their saliency ranks, which is incompatible with human perception of saliency rank… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  44. arXiv:2203.12108  [pdf, other

    quant-ph

    Efficient, ever-ready quantum memory at room temperature for single photons

    Authors: Anthony C. Leung, W. Y. Sarah Lau, Aaron D. Tranter, Karun V. Paul, Markus Rambach, Ben C. Buchler, ** Koy Lam, Andrew G. White, Till J. Weinhold

    Abstract: Efficient quantum memories will be an essential building block of large scale networked quantum systems and provide a link between flying photonic qubits and atomic or quasi-atomic local quantum processors. To provide a path to scalability avoidance of bulky, difficult to maintain systems such as high vacuum and low temperature cryogenics is imperative. Memory efficiencies above 50% are required t… ▽ More

    Submitted 29 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 11 pages, 10 figures

  45. arXiv:2203.09416  [pdf, other

    cs.CV

    Bi-directional Object-context Prioritization Learning for Saliency Ranking

    Authors: Xin Tian, Ke Xu, Xin Yang, Lin Du, Baocai Yin, Rynson W. H. Lau

    Abstract: The saliency ranking task is recently proposed to study the visual behavior that humans would typically shift their attention over different objects of a scene based on their degrees of saliency. Existing approaches focus on learning either object-object or object-scene relations. Such a strategy follows the idea of object-based attention in Psychology, but it tends to favor those objects with str… ▽ More

    Submitted 22 March, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  46. Probing the Inner Circumgalactic Medium and Quasar Illumination around the Reddest `Extremely Red Quasar' (ERQ)

    Authors: Marie Wingyee Lau, Fred Hamann, Jarred Gillette, Serena Perrotta, David S. N. Rupke, Dominika Wylezalek, Nadia L. Zakamska

    Abstract: Dusty quasars might be in a young stage of galaxy evolution with prominent quasar feedback. A recently discovered population of luminous, extremely red quasars at $z\sim$~2--4 has extreme spectral properties related to exceptionally powerful quasar-driven outflows. We present Keck/KCWI observations of the reddest known ERQ, at $z=$\,2.3184, with extremely fast [\ion{O}{III}]~$λ$5007 outflow at… ▽ More

    Submitted 2 August, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: 20 pages, 18 figures, published in MNRAS

  47. arXiv:2203.02285  [pdf, other

    cs.ET cs.LG physics.optics quant-ph

    A photonic chip-based machine learning approach for the prediction of molecular properties

    Authors: Hui Zhang, Jonathan Wei Zhong Lau, Lingxiao Wan, Liang Shi, Hong Cai, Xianshu Luo, Patrick Lo, Chee-Kong Lee, Leong-Chuan Kwek, Ai Qun Liu

    Abstract: Machine learning methods have revolutionized the discovery process of new molecules and materials. However, the intensive training process of neural networks for molecules with ever-increasing complexity has resulted in exponential growth in computation cost, leading to long simulation time and high energy consumption. Photonic chip technology offers an alternative platform for implementing neural… ▽ More

    Submitted 25 December, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  48. arXiv:2112.13512  [pdf

    cs.CL

    Event-based clinical findings extraction from radiology reports with pre-trained language model

    Authors: Wilson Lau, Kevin Lybarger, Martin L. Gunn, Meliha Yetisgen

    Abstract: Radiology reports contain a diverse and rich set of clinical abnormalities documented by radiologists during their interpretation of the images. Comprehensive semantic representations of radiological findings would enable a wide range of secondary use applications to support diagnosis, triage, outcomes prediction, and clinical research. In this paper, we present a new corpus of radiology reports a… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  49. arXiv:2112.04753  [pdf

    cond-mat.mtrl-sci

    Highly active hydrogen evolution facilitated by topological surface states on a Pd/SnTe metal/topological crystalline insulator heterostructure

    Authors: Qing Qu, Bin Liu, Wing Sum Lau, Ding Pan, Iam Keong Sou

    Abstract: Recently, topological quantum materials have emerged as a promising electrocatalyst for hydrogen evolution reaction (HER). However, most of their performance largely lags behind noble metals such as benchmark platinum (Pt). In this work, a Pd(20nm)/SnTe(70nm) heterostructure, fabricated by molecular beam epitaxy and electron beam evaporation, is found to display much higher electrocatalytic activi… ▽ More

    Submitted 1 September, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 40 pages, 17 figures. arXiv admin note: text overlap with arXiv:2102.08062

  50. arXiv:2112.02082  [pdf, other

    cs.CV

    Geometry-aware Two-scale PIFu Representation for Human Reconstruction

    Authors: Zheng Dong, Ke Xu, Ziheng Duan, Hujun Bao, Weiwei Xu, Rynson W. H. Lau

    Abstract: Although PIFu-based 3D human reconstruction methods are popular, the quality of recovered details is still unsatisfactory. In a sparse (e.g., 3 RGBD sensors) capture setting, the depth noise is typically amplified in the PIFu representation, resulting in flat facial surfaces and geometry-fallible bodies. In this paper, we propose a novel geometry-aware two-scale PIFu for 3D human reconstruction fr… ▽ More

    Submitted 27 September, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted by NeurIPS 2022. 20 pages, 20 figures