Skip to main content

Showing 101–150 of 368 results for author: Ling, H

.
  1. arXiv:2112.09283  [pdf

    physics.optics

    Cavity Induced Extraordinary Optical Transmission and Active Modulation with Graphene

    Authors: Yifei Zhang, Baoqing Zhang, Mingming Feng, Haotian Ling, Xijian Zhang, Yiming Wang, Xiaomu Wang, Qingpu Wang, Aimin Song

    Abstract: Extraordinary optical transmission (EOT) is a phenomenon of exceptional light transmission through a metallic film with hole arrays enhanced by surface plasmon (SP) resonance, which stimulates renewed research hotspots in metamaterials, subwavelength optics, and plasmonics. Below the frequency of the first order SP mode, f_pl0, the metallic film typically shows strong reflection and no EOT. Here,… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 12 pages, 4 figures

  2. arXiv:2112.07120  [pdf, other

    cs.IT

    Simple Coding Techniques for Many-Hop Relaying

    Authors: Yan Hao Ling, Jonathan Scarlett

    Abstract: In this paper, we study the problem of relaying a single bit of information across a series of binary symmetric channels, and the associated trade-off between the number of hops $m$, the transmission time $n$, and the error probability. We introduce a simple, efficient, and deterministic protocol that attains positive information velocity (i.e., a non-vanishing ratio $\frac{m}{n}$ and small error… ▽ More

    Submitted 7 December, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: IEEE Transactions on Information Theory, Volume 68, Issue 11, pp. 7043-7053, Nov. 2022

  3. Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images

    Authors: Gongyang Li, Zhi Liu, Weisi Lin, Haibin Ling

    Abstract: In the computer vision community, great progresses have been achieved in salient object detection from natural scene images (NSI-SOD); by contrast, salient object detection in optical remote sensing images (RSI-SOD) remains to be a challenging emerging topic. The unique characteristics of optical RSIs, such as scales, illuminations and imaging orientations, bring significant differences between NS… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 12 pages, 7 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2021

  4. arXiv:2112.00995  [pdf, other

    cs.CV

    SwinTrack: A Simple and Strong Baseline for Transformer Tracking

    Authors: Liting Lin, Heng Fan, Zhipeng Zhang, Yong Xu, Haibin Ling

    Abstract: Recently Transformer has been largely explored in tracking and shown state-of-the-art (SOTA) performance. However, existing efforts mainly focus on fusing and enhancing features generated by convolutional neural networks (CNNs). The potential of Transformer in representation learning remains under-explored. In this paper, we aim to further unleash the power of Transformer by proposing a simple yet… ▽ More

    Submitted 13 October, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 22 pages, 10 figures

    Journal ref: Advances in Neural Information Processing Systems, 2022

  5. arXiv:2111.14725  [pdf, other

    cs.CV

    Searching the Search Space of Vision Transformer

    Authors: Minghao Chen, Kan Wu, Bolin Ni, Houwen Peng, Bei Liu, Jianlong Fu, Hongyang Chao, Haibin Ling

    Abstract: Vision Transformer has shown great visual representation power in substantial vision tasks such as recognition and detection, and thus been attracting fast-growing efforts on manually designing more effective architectures. In this paper, we propose to use neural architecture search to automate this process, by searching not only the architecture but also the search space. The central idea is to g… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted to NIPS 2021

  6. arXiv:2111.07566  [pdf

    physics.optics

    Swee** Plasma Frequency of Terahertz Surface Plasmon Polaritons with Graphene

    Authors: Mingming Feng, Baoqing Zhang, Haotian Ling, Zihao Zhang, Yiming Wang, Yilin Wang, Xijian Zhang, **rang Hua, Qingpu Wang, Aimin Song, Yifei Zhang

    Abstract: Plasma frequency is the spectral boundary for low-loss propagation and evanescent decay of surface plasmon polariton (SPP) waves, which corresponds to a high cut-off phenomenon and is typically utilized for identifying SPPs. At terahertz (THz) frequencies, a metal line with periodic metallic grooves can mimic the conventional optical SPPs, which is referred to as designer SPPs. Theoretically, the… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 19pages, 6 figures

  7. arXiv:2111.03186  [pdf, other

    cs.CV cs.AI

    EditGAN: High-Precision Semantic Image Editing

    Authors: Huan Ling, Karsten Kreis, Daiqing Li, Seung Wook Kim, Antonio Torralba, Sanja Fidler

    Abstract: Generative adversarial networks (GANs) have recently found applications in image editing. However, most GAN based image editing methods often require large scale datasets with semantic segmentation annotations for training, only provide high level control, or merely interpolate between different images. Here, we propose EditGAN, a novel method for high quality, high precision semantic image editin… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  8. arXiv:2110.09662  [pdf, other

    eess.IV cs.CV

    Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

    Authors: Heng Fan, Jiaxiang Ren, Jie Yang, Yi-Xian Qin, Haibin Ling

    Abstract: Objectives. The aim of this study was to investigate whether a deep convolutional neural network (CNN) with an attention module can detect osteoporosis on panoramic radiographs. Study Design. A dataset of 70 panoramic radiographs (PRs) from 70 different subjects of age between 49 to 60 was used, including 49 subjects with osteoporosis and 21 normal subjects. We utilized the leave-one-out cross-v… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 9 pages

  9. arXiv:2110.09057  [pdf, other

    cs.LG math.OC

    Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization

    Authors: Tao Sun, Huaming Ling, Zuoqiang Shi, Dongsheng Li, Bao Wang

    Abstract: Heavy ball momentum is crucial in accelerating (stochastic) gradient-based optimization algorithms for machine learning. Existing heavy ball momentum is usually weighted by a uniform hyperparameter, which relies on excessive tuning. Moreover, the calibrated fixed hyperparameter may not lead to optimal performance. In this paper, to eliminate the effort for tuning the momentum-related hyperparamete… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  10. arXiv:2110.03139  [pdf, other

    cond-mat.quant-gas cond-mat.mes-hall quant-ph

    Topological study of a Bogoliubov-de Gennes system of pseudo spin-$1/2$ bosons with conserved magnetization in a honeycomb lattice

    Authors: Hong Y. Ling, Ben Kain

    Abstract: We consider a Bogolibov-de Geenes (BdG) Hamiltonian, which is a non-Hermitian Hamiltonian with pseudo-Hermiticity, for a system of (pseudo) spin-$1/2$ bosons in a honeycomb lattice under the condition that the population difference between the two spin components, i.e., magnetization, is a constant. Such a system is capable of acting as a topological amplifier, under time-reversal symmetry, with s… ▽ More

    Submitted 8 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 20 pages, 7 figures

    Journal ref: Phys. Rev. A 105, 023319 (2022)

  11. arXiv:2110.01676  [pdf, other

    cs.CV

    Deep Learning Approach Protecting Privacy in Camera-Based Critical Applications

    Authors: Gautham Ramajayam, Tao Sun, Chiu C. Tan, Lannan Luo, Haibin Ling

    Abstract: Many critical applications rely on cameras to capture video footage for analytical purposes. This has led to concerns about these cameras accidentally capturing more information than is necessary. In this paper, we propose a deep learning approach towards protecting privacy in camera-based systems. Instead of specifying specific objects (e.g. faces) are privacy sensitive, our technique distinguish… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  12. arXiv:2109.00240  [pdf, other

    cs.CV

    Joint Graph Learning and Matching for Semantic Feature Correspondence

    Authors: He Liu, Tao Wang, Yidong Li, Congyan Lang, Yi **, Haibin Ling

    Abstract: In recent years, powered by the learned discriminative representation via graph neural network (GNN) models, deep graph matching methods have made great progresses in the task of matching semantic features. However, these methods usually rely on heuristically generated graph patterns, which may introduce unreliable relationships to hurt the matching performance. In this paper, we propose a joint \… ▽ More

    Submitted 17 November, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

  13. arXiv:2108.06017  [pdf, other

    cs.CV

    AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning

    Authors: Hong Wang, Yuefan Deng, Shinjae Yoo, Haibin Ling, Yuewei Lin

    Abstract: While deep neural networks have shown impressive performance in many tasks, they are fragile to carefully designed adversarial attacks. We propose a novel adversarial training-based model by Attention Guided Knowledge Distillation and Bi-directional Metric Learning (AGKD-BML). The attention knowledge is obtained from a weight-fixed model trained on a clean dataset, referred to as a teacher model,… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 paper

  14. arXiv:2108.00616  [pdf, other

    cs.CV

    RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth

    Authors: Mengyang Pu, Ya** Huang, Qingji Guan, Haibin Ling

    Abstract: As a fundamental building block in computer vision, edges can be categorised into four types according to the discontinuity in surface-Reflectance, Illumination, surface-Normal or Depth. While great progress has been made in detecting generic or individual types of edges, it remains under-explored to comprehensively study all four edge types together. In this paper, we propose a novel neural netwo… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV2021

  15. From Monopoly to Competition: Optimal Contests Prevail

    Authors: Xiaotie Deng, Yotam Gafni, Ron Lavi, Tao Lin, Hongyi Ling

    Abstract: We study competition among contests in a general model that allows for an arbitrary and heterogeneous space of contest design, where the goal of the contest designers is to maximize the contestants' sum of efforts. Our main result shows that optimal contests in the monopolistic setting (i.e., those that maximize the sum of efforts in a model with a single contest) form an equilibrium in the model… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  16. arXiv:2107.08766  [pdf, other

    cs.CV

    VisDrone-CC2020: The Vision Meets Drone Crowd Counting Challenge Results

    Authors: Dawei Du, Longyin Wen, Pengfei Zhu, Heng Fan, Qinghua Hu, Haibin Ling, Mubarak Shah, Junwen Pan, Ali Al-Ali, Amr Mohamed, Bakour Imene, Bin Dong, Binyu Zhang, Bouchali Hadia Nesma, Chenfeng Xu, Chenzhen Duan, Ciro Castiello, Corrado Mencar, Dingkang Liang, Florian Krüger, Gennaro Vessio, Giovanna Castellano, Jieru Wang, Junyu Gao, Khalid Abualsaud , et al. (30 additional authors not shown)

    Abstract: Crowd counting on the drone platform is an interesting topic in computer vision, which brings new challenges such as small object inference, background clutter and wide viewpoint. However, there are few algorithms focusing on crowd counting on the drone-captured data due to the lack of comprehensive datasets. To this end, we collect a large-scale dataset and organize the Vision Meets Drone Crowd C… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: The method description of A7 Mutil-Scale Aware based SFANet (M-SFANet) is updated and missing references are added

    Journal ref: European Conference on Computer Vision. Springer, Cham, 2020: 675-691

  17. arXiv:2107.00651  [pdf, other

    cs.CV

    AutoFormer: Searching Transformers for Visual Recognition

    Authors: Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling

    Abstract: Recently, pure transformer-based models have shown great potentials for vision tasks such as image classification and detection. However, the design of transformer networks is challenging. It has been observed that the depth, embedding dimension, and number of heads can largely affect the performance of vision transformers. Previous models configure these dimensions based upon manual crafting. In… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Github: https://github.com/microsoft/AutoML

  18. CBNet: A Composite Backbone Network Architecture for Object Detection

    Authors: Tingting Liang, Xiaojie Chu, Yudong Liu, Yongtao Wang, Zhi Tang, Wei Chu, **gdong Chen, Haibin Ling

    Abstract: Modern top-performing object detectors depend heavily on backbone networks, whose advances bring consistent performance gains through exploring more effective network structures. In this paper, we propose a novel and flexible backbone framework, namely CBNetV2, to construct high-performance detectors using existing open-sourced pre-trained backbones under the pre-training fine-tuning paradigm. In… ▽ More

    Submitted 18 October, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: IEEE Transactions on Image Processing (TIP) camera ready

  19. arXiv:2106.06744  [pdf, other

    eess.IV cs.CV

    DeepMMSA: A Novel Multimodal Deep Learning Method for Non-small Cell Lung Cancer Survival Analysis

    Authors: Yujiao Wu, Jie Ma, Xiaoshui Huang, Sai Ho Ling, Steven Weidong Su

    Abstract: Lung cancer is the leading cause of cancer death worldwide. The critical reason for the deaths is delayed diagnosis and poor prognosis. With the accelerated development of deep learning techniques, it has been successfully applied extensively in many real-world applications, including health sectors such as medical image interpretation and disease diagnosis. By combining more modalities that being… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: 7 Submitted to IEEE TBME

  20. arXiv:2106.03432  [pdf, other

    cs.CV

    Channel DropBlock: An Improved Regularization Method for Fine-Grained Visual Classification

    Authors: Yifeng Ding, Shuwei Dong, Yujun Tong, Zhanyu Ma, Bo Xiao, Haibin Ling

    Abstract: Classifying the sub-categories of an object from the same super-category (e.g., bird) in a fine-grained visual classification (FGVC) task highly relies on mining multiple discriminative features. Existing approaches mainly tackle this problem by introducing attention mechanisms to locate the discriminative parts or feature encoding approaches to extract the highly parameterized features in a weakl… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  21. DFGC 2021: A DeepFake Game Competition

    Authors: Bo Peng, Hongxing Fan, Wei Wang, **g Dong, Yuezun Li, Siwei Lyu, Qi Li, Zhenan Sun, Han Chen, Baoying Chen, Yanjie Hu, Shenghai Luo, Junrui Huang, Yutong Yao, Boyuan Liu, Hefei Ling, Guosheng Zhang, Zhiliang Xu, Changtao Miao, Changlei Lu, Shan He, Xiaoyan Wu, Wanyi Zhuang

    Abstract: This paper presents a summary of the DFGC 2021 competition. DeepFake technology is develo** fast, and realistic face-swaps are increasingly deceiving and hard to detect. At the same time, DeepFake detection methods are also improving. There is a two-party game between DeepFake creators and detectors. This competition provides a common platform for benchmarking the adversarial game between curren… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Journal ref: 2021 IEEE International Joint Conference on Biometrics (IJCB), 2021, pp. 1-8

  22. arXiv:2105.14065  [pdf, other

    cs.CV

    TransCamP: Graph Transformer for 6-DoF Camera Pose Estimation

    Authors: Xinyi Li, Haibin Ling

    Abstract: Camera pose estimation or camera relocalization is the centerpiece in numerous computer vision tasks such as visual odometry, structure from motion (SfM) and SLAM. In this paper we propose a neural network approach with a graph transformer backbone, namely TransCamP, to address the camera relocalization problem. In contrast with prior work where the pose regression is mainly guided by photometric… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  23. arXiv:2104.06565  [pdf, other

    cs.IT math.PR

    Optimal Rates of Teaching and Learning Under Uncertainty

    Authors: Yan Hao Ling, Jonathan Scarlett

    Abstract: In this paper, we consider a recently-proposed model of teaching and learning under uncertainty, in which a teacher receives independent observations of a single bit corrupted by binary symmetric noise, and sequentially transmits to a student through another binary symmetric channel based on the bits observed so far. After a given number $n$ of transmissions, the student outputs an estimate of the… ▽ More

    Submitted 7 December, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: IEEE Transactions on Information Theory, Volume 67, Issue 11, pp. 7067-7080, Nov. 2021. This version slightly modifies/expands the 'Existing Results' section

  24. arXiv:2104.06490  [pdf, other

    cs.CV

    DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

    Authors: Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler

    Abstract: We introduce DatasetGAN: an automatic procedure to generate massive datasets of high-quality semantically segmented images requiring minimal human effort. Current deep networks are extremely data-hungry, benefiting from training on large-scale datasets, which are time consuming to annotate. Our method relies on the power of recent GANs to generate realistic images. We show how the GAN latent code… ▽ More

    Submitted 19 April, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021 as an Oral paper. Webpage: https://nv-tlabs.github.io/datasetGAN/

  25. arXiv:2104.00597  [pdf, other

    cs.CV

    One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

    Authors: Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling

    Abstract: Despite remarkable progress achieved, most neural architecture search (NAS) methods focus on searching for one single accurate and robust architecture. To further build models with better generalization capability and performance, model ensemble is usually adopted and performs better than stand-alone models. Inspired by the merits of model ensemble, we propose to search for multiple diverse models… ▽ More

    Submitted 16 July, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  26. arXiv:2104.00194  [pdf, other

    cs.CV

    TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

    Authors: Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu

    Abstract: Tracking multiple objects in videos relies on modeling the spatial-temporal interactions of the objects. In this paper, we propose a solution named TransMOT, which leverages powerful graph transformers to efficiently model the spatial and temporal interactions among the objects. TransMOT effectively models the interactions of a large number of objects by arranging the trajectories of the tracked o… ▽ More

    Submitted 3 April, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

  27. Magnetic skyrmion crystal at a topological insulator surface

    Authors: Stefan Divic, Henry Ling, T. Pereg-Barnea, Arun Paramekanti

    Abstract: We consider a magnetic skyrmion crystal formed at the surface of a topological insulator. Incorporating the exchange interaction between the helical Dirac surface states and the periodic Néel or Bloch skyrmion texture, we obtain the resulting electronic band structure and discuss the constraints that symmetries impose on the energies and Berry curvature. We find substantive qualitative differences… ▽ More

    Submitted 13 February, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: 21 pages, 7 figures

    Journal ref: Phys. Rev. B 105, 035156 (2022)

  28. arXiv:2103.14337  [pdf, other

    cs.CV cs.AI

    Hands-on Guidance for Distilling Object Detectors

    Authors: Yangyang Qin, Hefei Ling, Zhenghai He, Yuxuan Shi, Lei Wu

    Abstract: Knowledge distillation can lead to deploy-friendly networks against the plagued computational complexity problem, but previous methods neglect the feature hierarchy in detectors. Motivated by this, we propose a general framework for detection distillation. Our method, called Hands-on Guidance Distillation, distills the latent knowledge of all stage features for imposing more comprehensive supervis… ▽ More

    Submitted 12 May, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at ICME2021

  29. Character Controllers Using Motion VAEs

    Authors: Hung Yu Ling, Fabio Zinno, George Cheng, Michiel van de Panne

    Abstract: A fundamental problem in computer animation is that of realizing purposeful and realistic human movement given a sufficiently-rich set of motion capture clips. We learn data-driven generative models of human movement using autoregressive conditional variational autoencoders, or Motion VAEs. The latent variables of the learned autoencoder define the action space for the movement and thereby govern… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Project page: https://www.cs.ubc.ca/~hyuling/projects/mvae/ ; Code: https://github.com/electronicarts/character-motion-vaes

  30. arXiv:2103.14028  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph physics.optics

    Light-Matter Coupling in Scalable Van der Waals Superlattices

    Authors: Pawan Kumar, Jason Lynch, Baokun Song, Haonan Ling, Francisco Barrera, Huiqin Zhang, Surendra B. Anantharaman, Jagrit Digani, Haoyue Zhu, Tanushree H. Choudhury, Clifford McAleese, Xiaochen Wang, Ben R. Conran, Oliver Whear, Michael J. Motala, Michael Snure, Christopher Muratore, Joan M. Redwing, Nicholas R. Glavin, Eric A. Stach, Artur R. Davoyan, Deep Jariwala

    Abstract: Two-dimensional (2D) crystals have renewed opportunities in design and assembly of artificial lattices without the constraints of epitaxy. However, the lack of thickness control in exfoliated van der Waals (vdW) layers prevents realization of repeat units with high fidelity. Recent availability of uniform, wafer-scale samples permits engineering of both electronic and optical dispersions in stacks… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 4 figures + supporting

  31. arXiv:2103.04507  [pdf, other

    cs.CV

    OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection

    Authors: Tingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling

    Abstract: Recently, neural architecture search (NAS) has been exploited to design feature pyramid networks (FPNs) and achieved promising results for visual object detection. Encouraged by the success, we propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy. Specifically, we first introduce six… ▽ More

    Submitted 11 March, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: To appear in CVPR 2021

  32. arXiv:2102.05454  [pdf, other

    cs.CV

    On the Robustness of Multi-View Rotation Averaging

    Authors: Xinyi Li, Haibin Ling

    Abstract: Rotation averaging is a synchronization process on single or multiple rotation groups, and is a fundamental problem in many computer vision tasks such as multi-view structure from motion (SfM). Specifically, rotation averaging involves the recovery of an underlying pose-graph consistency from pairwise relative camera poses. Specifically, given pairwise motion in rotation groups, especially 3-dimen… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

  33. Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

    Authors: Gongyang Li, Zhi Liu, Ran Shi, Zheng Hu, Weijie Wei, Yong Wu, Mengke Huang, Haibin Ling

    Abstract: As a natural way for human-computer interaction, fixation provides a promising solution for interactive image segmentation. In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction. In particular, we first construct a new PFOS dataset by carefully co… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted by IEEE TIP. Code: https://github.com/MathLee/OLBPNet4PFOS

  34. arXiv:2012.11803  [pdf, other

    cs.CV cs.CR eess.IV

    Modeling Deep Learning Based Privacy Attacks on Physical Mail

    Authors: Bingyao Huang, Ruyi Lian, Dimitris Samaras, Haibin Ling

    Abstract: Mail privacy protection aims to prevent unauthorized access to hidden content within an envelope since normal paper envelopes are not as safe as we think. In this paper, for the first time, we show that with a well designed deep learning model, the hidden content may be largely recovered without opening the envelope. We start by modeling deep learning-based privacy attacks on physical mail content… ▽ More

    Submitted 25 March, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Source code: https://github.com/BingyaoHuang/Neural-STE

  35. arXiv:2012.10728  [pdf, other

    cs.CV

    Political Posters Identification with Appearance-Text Fusion

    Authors: Xuan Qin, Meizhu Liu, Yifan Hu, Christina Moo, Christian M. Riblet, Changwei Hu, Kevin Yen, Haibin Ling

    Abstract: In this paper, we propose a method that efficiently utilizes appearance features and text vectors to accurately classify political posters from other similar political images. The majority of this work focuses on political posters that are designed to serve as a promotion of a certain political event, and the automated identification of which can lead to the generation of detailed statistics and m… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

  36. SPAA: Stealthy Projector-based Adversarial Attacks on Deep Image Classifiers

    Authors: Bingyao Huang, Haibin Ling

    Abstract: Light-based adversarial attacks use spatial augmented reality (SAR) techniques to fool image classifiers by altering the physical light condition with a controllable light source, e.g., a projector. Compared with physical attacks that place hand-crafted adversarial objects, projector-based ones obviate modifying the physical entities, and can be performed transiently and dynamically by altering th… ▽ More

    Submitted 17 March, 2022; v1 submitted 10 December, 2020; originally announced December 2020.

  37. arXiv:2011.14935  [pdf, ps, other

    cond-mat.quant-gas cond-mat.mes-hall quant-ph

    Selection Rule for Topological Amplifiers in Bogoliubov de Gennes Systems

    Authors: Hong Y. Ling, Ben Kain

    Abstract: Dynamical instability is an inherent feature of bosonic systems described by the Bogoliubov de Geenes (BdG) Hamiltonian. Since it causes the BdG system to collapse, it is generally thought that it should be avoided. Recently, there has been much effort to harness this instability for the benefit of creating a topological amplifier with stable bulk bands but unstable edge modes which can be populat… ▽ More

    Submitted 20 August, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: 12 pages and 3 figures. q = 0.2t_1 in the caption of fig. 1 in our published paper [Phys. Rev. A 104, 013305 (2021)] has been changed to the correct one, q = 0.4t_1

    Journal ref: Phys. Rev. A 104, 013305 (2021)

  38. arXiv:2011.12483  [pdf, other

    cs.CV

    CRACT: Cascaded Regression-Align-Classification for Robust Visual Tracking

    Authors: Heng Fan, Haibin Ling

    Abstract: High quality object proposals are crucial in visual tracking algorithms that utilize region proposal network (RPN). Refinement of these proposals, typically by box regression and classification in parallel, has been popularly adopted to boost tracking performance. However, it still meets problems when dealing with complex and dynamic background. Thus motivated, in this paper we introduce an improv… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: tech. report

  39. arXiv:2011.11858  [pdf, other

    cs.CV cs.AI cs.LG

    GMOT-40: A Benchmark for Generic Multiple Object Tracking

    Authors: Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling

    Abstract: Multiple Object Tracking (MOT) has witnessed remarkable advances in recent years. However, existing studies dominantly request prior knowledge of the tracking target, and hence may not generalize well to unseen categories. In contrast, Generic Multiple Object Tracking (GMOT), which requires little prior information about the target, is largely under-explored. In this paper, we make contributions t… ▽ More

    Submitted 7 April, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

  40. arXiv:2011.10875  [pdf, other

    cs.CV

    Transparent Object Tracking Benchmark

    Authors: Heng Fan, Halady Akhilesha Miththanthaya, Harshit, Siranjiv Ramana Rajan, Xiaoqiong Liu, Zhilin Zou, Yuewei Lin, Haibin Ling

    Abstract: Visual tracking has achieved considerable progress in recent years. However, current research in the field mainly focuses on tracking of opaque objects, while little attention is paid to transparent object tracking. In this paper, we make the first attempt in exploring this problem by proposing a Transparent Object Tracking Benchmark (TOTB). Specifically, TOTB consists of 225 videos (86K frames) f… ▽ More

    Submitted 1 August, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

    Comments: Tech. Report

  41. arXiv:2011.01163  [pdf, other

    cs.CV cs.RO

    Pushing the Envelope of Rotation Averaging for Visual SLAM

    Authors: Xinyi Li, Lin Yuan, Longin Jan Latecki, Haibin Ling

    Abstract: As an essential part of structure from motion (SfM) and Simultaneous Localization and Map** (SLAM) systems, motion averaging has been extensively studied in the past years and continues to attract surging research attention. While canonical approaches such as bundle adjustment are predominantly inherited in most of state-of-the-art SLAM systems to estimate and update the trajectory in the robot… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  42. arXiv:2011.00372  [pdf, other

    cs.RO cs.CV

    Pose Estimation of Specular and Symmetrical Objects

    Authors: Jiaming Hu, Hongyi Ling, Priyam Parashar, Aayush Naik, Henrik Christensen

    Abstract: In the robotic industry, specular and textureless metallic components are ubiquitous. The 6D pose estimation of such objects with only a monocular RGB camera is difficult because of the absence of rich texture features. Furthermore, the appearance of specularity heavily depends on the camera viewpoint and environmental light conditions making traditional methods, like template matching, fail. In t… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: submitted to ICRA 2021

  43. arXiv:2010.11671  [pdf, other

    cs.HC cs.CV cs.RO

    Motion Planning Combines Psychological Safety and Motion Prediction for a Sense Motive Robot

    Authors: He**g Ling, Guoliang Liu, Guohui Tian

    Abstract: Human safety is the most important demand for human robot interaction and collaboration (HRIC), which not only refers to physical safety, but also includes psychological safety. Although many robots with different configurations have entered our living and working environments, the human safety problem is still an ongoing research problem in human-robot coexistence scenarios. This paper addresses… ▽ More

    Submitted 23 October, 2020; v1 submitted 29 September, 2020; originally announced October 2020.

    Comments: submitted to RAL/ICRA2021

  44. arXiv:2010.09125  [pdf, other

    cs.CV cs.LG

    Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering

    Authors: Yuxuan Zhang, Wenzheng Chen, Huan Ling, Jun Gao, Yinan Zhang, Antonio Torralba, Sanja Fidler

    Abstract: Differentiable rendering has paved the way to training neural networks to perform "inverse graphics" tasks such as predicting 3D geometry from monocular photographs. To train high performing models, most of the current approaches rely on multi-view imagery which are not readily available in practice. Recent Generative Adversarial Networks (GANs) that synthesize images, in contrast, seem to acquire… ▽ More

    Submitted 20 April, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted to ICLR 2021 as an Oral paper

  45. arXiv:2010.03740  [pdf, other

    eess.IV cs.CV

    Bone Feature Segmentation in Ultrasound Spine Image with Robustness to Speckle and Regular Occlusion Noise

    Authors: Zixun Huang, Li-Wen Wang, Frank H. F. Leung, Sunetra Banerjee, De Yang, Timothy Lee, Juan Lyu, Sai Ho Ling, Yong-** Zheng

    Abstract: 3D ultrasound imaging shows great promise for scoliosis diagnosis thanks to its low-costing, radiation-free and real-time characteristics. The key to accessing scoliosis by ultrasound imaging is to accurately segment the bone area and measure the scoliosis degree based on the symmetry of the bone features. The ultrasound images tend to contain many speckles and regular occlusion noise which is dif… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: SMC2020

  46. arXiv:2009.08105  [pdf

    physics.optics physics.app-ph

    All van der Waals Integrated Nanophotonics

    Authors: Haonan Ling, Renjie Li, Artur R. Davoyan

    Abstract: Integrated optics is at the heart of a wide range of systems from remote sensing and communications to computing and quantum information processing. Demand for smaller and more energy efficient structures stimulates search for more advanced material platforms. Here, we propose a concept of an all van der Waals photonics, where we show that electronically bulk transition metal dichalcogenide (TMDC)… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: 12 pages, 5 figures

  47. arXiv:2009.03465  [pdf, other

    cs.CV

    LaSOT: A High-quality Large-scale Single Object Tracking Benchmark

    Authors: Heng Fan, Hexin Bai, Liting Lin, Fan Yang, Peng Chu, Ge Deng, Sijia Yu, Harshit, Mingzhen Huang, Juehuan Liu, Yong Xu, Chunyuan Liao, Lin Yuan, Haibin Ling

    Abstract: Despite great recent advances in visual tracking, its further development, including both algorithm design and evaluation, is limited due to lack of dedicated large-scale benchmarks. To address this problem, we present LaSOT, a high-quality Large-scale Single Object Tracking benchmark. LaSOT contains a diverse selection of 85 object classes, and offers 1,550 totaling more than 3.87 million frames.… ▽ More

    Submitted 11 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: Tech Report. Update project website

  48. arXiv:2008.09721  [pdf, other

    cs.CV cs.AI

    ScribbleBox: Interactive Annotation Framework for Video Object Segmentation

    Authors: Bowen Chen, Huan Ling, Xiaohui Zeng, Gao Jun, Ziyue Xu, Sanja Fidler

    Abstract: Manually labeling video datasets for segmentation tasks is extremely time consuming. In this paper, we introduce ScribbleBox, a novel interactive framework for annotating object instances with masks in videos. In particular, we split annotation into two steps: annotating objects with tracked boxes, and labeling masks inside these tracks. We introduce automation and interaction in both steps. Box t… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  49. arXiv:2008.03673  [pdf, other

    cs.CV

    Feature Space Augmentation for Long-Tailed Data

    Authors: Peng Chu, Xiao Bian, Shaopeng Liu, Haibin Ling

    Abstract: Real-world data often follow a long-tailed distribution as the frequency of each class is typically different. For example, a dataset can have a large number of under-represented classes and a few classes with more than sufficient data. However, a model to represent the dataset is usually expected to have reasonably homogeneous performances across classes. Introducing class-balanced loss and advan… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: To be appeared in ECCV 2020

  50. End-to-end Full Projector Compensation

    Authors: Bingyao Huang, Tao Sun, Haibin Ling

    Abstract: Full projector compensation aims to modify a projector input image to compensate for both geometric and photometric disturbance of the projection surface. Traditional methods usually solve the two parts separately and may suffer from suboptimal solutions. In this paper, we propose the first end-to-end differentiable solution, named CompenNeSt++, to solve the two problems jointly. First, we propose… ▽ More

    Submitted 7 January, 2021; v1 submitted 30 July, 2020; originally announced August 2020.

    Comments: Source code: https://github.com/BingyaoHuang/CompenNeSt-plusplus. arXiv admin note: text overlap with arXiv:1908.06246, arXiv:1904.04335