Skip to main content

Showing 1–48 of 48 results for author: Yue, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16529  [pdf, other

    cs.CL

    Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing

    Authors: Hao Yue, Shaopeng Lai, Chengyi Yang, Liang Zhang, Junfeng Yao, **song Su

    Abstract: Cross-document Relation Extraction aims to predict the relation between target entities located in different documents. In this regard, the dominant models commonly retain useful information for relation prediction via bridge entities, which allows the model to elaborately capture the intrinsic interdependence between target entities. However, these studies ignore the non-bridge entities, each of… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  2. arXiv:2406.13007  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Night Photography Rendering

    Authors: Egor Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banić, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy Terekhin, Shuwei Yue, Yuyang Liu, Minchen Wei, Lu Xu, Chao Zhang, Yasi Wang, Furkan Kınlı, Doğa Yılmaz, Barış Özcan, Furkan Kıraç, Shuai Liu, **gyuan Xiao , et al. (25 additional authors not shown)

    Abstract: This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce a photo-quality output images in the standard RGB (sRGB) space. Unlike the previous year's competition, the challenge images were collected with a mobile phone and the speed of algo… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 10 figures

  3. arXiv:2406.08771  [pdf, other

    cs.SD cs.AI eess.AS

    MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection

    Authors: Da Mu, Zhicheng Zhang, Haobo Yue

    Abstract: Sound Event Localization and Detection (SELD) involves detecting and localizing sound events using multichannel sound recordings. Previously proposed Event-Independent Network V2 (EINV2) has achieved outstanding performance on SELD. However, it still faces challenges in effectively extracting features across spectral, spatial, and temporal domains. This paper proposes a three-stage network structu… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  4. arXiv:2405.17490  [pdf, other

    cs.LG stat.ML

    Revisit, Extend, and Enhance Hessian-Free Influence Functions

    Authors: Ziao Yang, Han Yue, Jian Chen, Hongfu Liu

    Abstract: Influence functions serve as crucial tools for assessing sample influence in model interpretation, subset training set selection, noisy label detection, and more. By employing the first-order Taylor extension, influence functions can estimate sample influence without the need for expensive model retraining. However, applying influence functions directly to deep models presents challenges, primaril… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.17489  [pdf, other

    cs.LG cs.AI

    On the Inflation of KNN-Shapley Value

    Authors: Ziao Yang, Han Yue, Jian Chen, Hongfu Liu

    Abstract: Shapley value-based data valuation methods, originating from cooperative game theory, quantify the usefulness of each individual sample by considering its contribution to all possible training subsets. Despite their extensive applications, these methods encounter the challenge of value inflation - while samples with negative Shapley values are detrimental, some with positive values can also be har… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2404.19534  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu **, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huan**g Yue, **gyu Yang , et al. (38 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  7. arXiv:2404.00661  [pdf, other

    cs.CV

    DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion

    Authors: Chunyang Bi, Xin Luo, Sheng Shen, Mengxi Zhang, Huan**g Yue, **gyu Yang

    Abstract: Diffusion models, known for their powerful generative capabilities, play a crucial role in addressing real-world super-resolution challenges. However, these models often focus on improving local textures while neglecting the impacts of global degradation, which can significantly reduce semantic fidelity and lead to inaccurate reconstructions and suboptimal super-resolution performance. To address… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  8. arXiv:2403.04697  [pdf, other

    cs.CV cs.AI

    AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors

    Authors: Kaishen Yuan, Zitong Yu, Xin Liu, Weicheng Xie, Huan**g Yue, **gyu Yang

    Abstract: Facial Action Units (AU) is a vital concept in the realm of affective computing, and AU detection has always been a hot research topic. Existing methods suffer from overfitting issues due to the utilization of a large number of learnable parameters on scarce AU-annotated datasets or heavy reliance on substantial additional relevant data. Parameter-Efficient Transfer Learning (PETL) provides a prom… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 19 pages, 6 figures

  9. arXiv:2402.18922  [pdf, other

    cs.CV

    A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection

    Authors: Chao Hao, Zitong Yu, Xin Liu, Jun Xu, Huan**g Yue, **gyu Yang

    Abstract: Camouflaged object detection (COD) and salient object detection (SOD) are two distinct yet closely-related computer vision tasks widely studied during the past decades. Though sharing the same purpose of segmenting an image into binary foreground and background regions, their distinction lies in the fact that COD focuses on concealed objects hidden in the image, while SOD concentrates on the most… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: submitted to IEEE TIP

  10. arXiv:2401.07629  [pdf, other

    cs.CV

    Fine-Grained Prototypes Distillation for Few-Shot Object Detection

    Authors: Zichen Wang, Bo Yang, Haonan Yue, Zhenghao Ma

    Abstract: Few-shot object detection (FSOD) aims at extending a generic detector for novel object detection with only a few training examples. It attracts great concerns recently due to the practical meanings. Meta-learning has been demonstrated to be an effective paradigm for this task. In general, methods based on meta-learning employ an additional support branch to encode novel examples (a.k.a. support im… ▽ More

    Submitted 12 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  11. arXiv:2401.04976  [pdf, other

    eess.AS cs.SD

    Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection

    Authors: Haobo Yue, Zhicheng Zhang, Da Mu, Yonghao Dang, Jianqin Yin, ** Tang

    Abstract: Recently, 2D convolution has been found unqualified in sound event detection (SED). It enforces translation equivariance on sound events along frequency axis, which is not a shift-invariant dimension. To address this issue, dynamic convolution is used to model the frequency dependency of sound events. In this paper, we proposed the first full-dynamic method named \emph{full-frequency dynamic convo… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures, submitted to ICME2024

  12. arXiv:2312.17050  [pdf, other

    cs.CV

    KeDuSR: Real-World Dual-Lens Super-Resolution via Kernel-Free Matching

    Authors: Huan**g Yue, Zifan Cui, Kun Li, **gyu Yang

    Abstract: Dual-lens super-resolution (SR) is a practical scenario for reference (Ref) based SR by utilizing the telephoto image (Ref) to assist the super-resolution of the low-resolution wide-angle image (LR input). Different from general RefSR, the Ref in dual-lens SR only covers the overlapped field of view (FoV) area. However, current dual-lens SR methods rarely utilize these specific characteristics and… ▽ More

    Submitted 2 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 14 pages, 10 figures. Accepted by AAAI-2024

  13. arXiv:2312.06723  [pdf, other

    cs.CV

    Learning to See Low-Light Images via Feature Domain Adaptation

    Authors: Qirui Yang, Qihua Cheng, Huan**g Yue, Le Zhang, Yihao Liu, **gyu Yang

    Abstract: Raw low light image enhancement (LLIE) has achieved much better performance than the sRGB domain enhancement methods due to the merits of raw data. However, the ambiguity between noisy to clean and raw to sRGB map**s may mislead the single-stage enhancement networks. The two-stage networks avoid ambiguity by decoupling the two map**s but usually have large computing complexity. To solve this p… ▽ More

    Submitted 19 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  14. arXiv:2311.18814  [pdf, other

    cs.CV

    Is Underwater Image Enhancement All Object Detectors Need?

    Authors: Yudong Wang, Jichang Guo, Wanru He, Huan Gao, Huihui Yue, Zenan Zhang, Chongyi Li

    Abstract: Underwater object detection is a crucial and challenging problem in marine engineering and aquatic robot. The difficulty is partly because of the degradation of underwater images caused by light selective absorption and scattering. Intuitively, enhancing underwater images can benefit high-level applications like underwater object detection. However, it is still unclear whether all object detectors… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 17 pages, 9 figures

  15. arXiv:2311.15727  [pdf, other

    cs.CV

    RISAM: Referring Image Segmentation via Mutual-Aware Attention Features

    Authors: Mengxi Zhang, Yiming Liu, Xiangjun Yin, Huan**g Yue, **gyu Yang

    Abstract: Referring image segmentation (RIS) aims to segment a particular region based on a language expression prompt. Existing methods incorporate linguistic features into visual features and obtain multi-modal features for mask decoding. However, these methods may segment the visually salient entity instead of the correct referring region, as the multi-modal features are dominated by the abundant visual… ▽ More

    Submitted 21 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  16. arXiv:2310.20332  [pdf, other

    cs.CV

    Recaptured Raw Screen Image and Video Demoiréing via Channel and Spatial Modulations

    Authors: Huan**g Yue, Yijia Cheng, Xin Liu, **gyu Yang

    Abstract: Capturing screen contents by smartphone cameras has become a common way for information sharing. However, these images and videos are often degraded by moiré patterns, which are caused by frequency aliasing between the camera filter array and digital display grids. We observe that the moiré patterns in raw domain is simpler than those in sRGB domain, and the moiré patterns in raw color channels ha… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  17. Unbiased and Robust: External Attention-enhanced Graph Contrastive Learning for Cross-domain Sequential Recommendation

    Authors: Xinhua Wang, Hou** Yue, Zizheng Wang, Liancheng Xu, **yu Zhang

    Abstract: Cross-domain sequential recommenders (CSRs) are gaining considerable research attention as they can capture user sequential preference by leveraging side information from multiple domains. However, these works typically follow an ideal setup, i.e., different domains obey similar data distribution, which ignores the bias brought by asymmetric interaction densities (a.k.a. the inter-domain density b… ▽ More

    Submitted 17 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 9 pages, 4 figures, accepted by ICDM 2023 (workshop-GML4Rec)

  18. arXiv:2308.07770  [pdf, other

    cs.CV

    Multi-scale Promoted Self-adjusting Correlation Learning for Facial Action Unit Detection

    Authors: Xin Liu, Kaishen Yuan, Xuesong Niu, **gang Shi, Zitong Yu, Huan**g Yue, **gyu Yang

    Abstract: Facial Action Unit (AU) detection is a crucial task in affective computing and social robotics as it helps to identify emotions expressed through facial expressions. Anatomically, there are innumerable correlations between AUs, which contain rich information and are vital for AU detection. Previous methods used fixed AU correlations based on expert experience or statistical rules on specific bench… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 13pages, 7 figures

  19. arXiv:2308.06709  [pdf, other

    math.OC cs.LG

    The Hard-Constraint PINNs for Interface Optimal Control Problems

    Authors: Ming-Chih Lai, Yongcun Song, Xiaoming Yuan, Hangrui Yue, Tianyou Zeng

    Abstract: We show that the physics-informed neural networks (PINNs), in combination with some recently developed discontinuity capturing neural networks, can be applied to solve optimal control problems subject to partial differential equations (PDEs) with interfaces and some control constraints. The resulting algorithm is mesh-free and scalable to different PDEs, and it ensures the control constraints rigo… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  20. arXiv:2307.09248  [pdf, other

    cs.LG eess.SP

    Application of BERT in Wind Power Forecasting-Teletraan's Solution in Baidu KDD Cup 2022

    Authors: Longxing Tan, Hongying Yue

    Abstract: Nowadays, wind energy has drawn increasing attention as its important role in carbon neutrality and sustainable development. When wind power is integrated into the power grid, precise forecasting is necessary for the sustainability and security of the system. However, the unpredictable nature and long sequence prediction make it especially challenging. In this technical report, we introduce the BE… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  21. arXiv:2307.00296  [pdf, other

    math.OC cs.LG math.NA

    Accelerated primal-dual methods with enlarged step sizes and operator learning for nonsmooth optimal control problems

    Authors: Yongcun Song, Xiaoming Yuan, Hangrui Yue

    Abstract: We consider a general class of nonsmooth optimal control problems with partial differential equation (PDE) constraints, which are very challenging due to its nonsmooth objective functionals and the resulting high-dimensional and ill-conditioned systems after discretization. We focus on the application of a primal-dual method, with which different types of variables can be treated individually and… ▽ More

    Submitted 25 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

  22. arXiv:2306.10311  [pdf, other

    eess.IV cs.CV

    Efficient HDR Reconstruction from Real-World Raw Images

    Authors: Qirui Yang, Yihao Liu, Qihua Chen, Huan**g Yue, Kun Li, **gyu Yang

    Abstract: The widespread usage of high-definition screens on edge devices stimulates a strong demand for efficient high dynamic range (HDR) algorithms. However, many existing HDR methods either deliver unsatisfactory results or consume too much computational and memory resources, hindering their application to high-resolution images (usually with more than 12 megapixels) in practice. In addition, existing H… ▽ More

    Submitted 5 June, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

  23. arXiv:2306.02301  [pdf, other

    cs.CV

    rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological Measurement

    Authors: Xin Liu, Yuting Zhang, Zitong Yu, Hao Lu, Huan**g Yue, **gyu Yang

    Abstract: Remote photoplethysmography (rPPG) is an important technique for perceiving human vital signs, which has received extensive attention. For a long time, researchers have focused on supervised methods that rely on large amounts of labeled data. These methods are limited by the requirement for large amounts of data and the difficulty of acquiring ground truth physiological signals. To address these i… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

  24. arXiv:2305.00767  [pdf, other

    cs.CV cs.LG

    RViDeformer: Efficient Raw Video Denoising Transformer with a Larger Benchmark Dataset

    Authors: Huan**g Yue, Cong Cao, Lei Liao, **gyu Yang

    Abstract: In recent years, raw video denoising has garnered increased attention due to the consistency with the imaging process and well-studied noise modeling in the raw domain. However, two problems still hinder the denoising performance. Firstly, there is no large dataset with realistic motions for supervised raw video denoising, as capturing noisy and clean frames for real dynamic scenes is difficult. T… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 16 pages,15 figures

  25. arXiv:2304.04773  [pdf, other

    eess.IV cs.CV

    HDR Video Reconstruction with a Large Dynamic Dataset in Raw and sRGB Domains

    Authors: Huan**g Yue, Yubo Peng, Biting Yu, Xuanwu Yin, Zhenyu Zhou, **gyu Yang

    Abstract: High dynamic range (HDR) video reconstruction is attracting more and more attention due to the superior visual quality compared with those of low dynamic range (LDR) videos. The availability of LDR-HDR training pairs is essential for the HDR reconstruction quality. However, there are still no real LDR-HDR pairs for dynamic scenes due to the difficulty in capturing LDR-HDR frames simultaneously. In… ▽ More

    Submitted 12 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  26. arXiv:2303.07327  [pdf, other

    cs.CV eess.IV

    Unsupervised HDR Image and Video Tone Map** via Contrastive Learning

    Authors: Cong Cao, Huan**g Yue, Xin Liu, **gyu Yang

    Abstract: Capturing high dynamic range (HDR) images (videos) is attractive because it can reveal the details in both dark and bright regions. Since the mainstream screens only support low dynamic range (LDR) content, tone map** algorithm is required to compress the dynamic range of HDR images (videos). Although image tone map** has been widely explored, video tone map** is lagging behind, especially f… ▽ More

    Submitted 26 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  27. arXiv:2302.08309  [pdf, other

    math.OC cs.LG

    The ADMM-PINNs Algorithmic Framework for Nonsmooth PDE-Constrained Optimization: A Deep Learning Approach

    Authors: Yongcun Song, Xiaoming Yuan, Hangrui Yue

    Abstract: We study the combination of the alternating direction method of multipliers (ADMM) with physics-informed neural networks (PINNs) for a general class of nonsmooth partial differential equation (PDE)-constrained optimization problems, where additional regularization can be employed for constraints on the control or design variables. The resulting ADMM-PINNs algorithmic framework substantially enlarg… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  28. arXiv:2212.03651  [pdf, other

    cs.CV

    Cyclically Disentangled Feature Translation for Face Anti-spoofing

    Authors: Haixiao Yue, Keyao Wang, Guosheng Zhang, Haocheng Feng, Junyu Han, Errui Ding, **gdong Wang

    Abstract: Current domain adaptation methods for face anti-spoofing leverage labeled source domain data and unlabeled target domain data to obtain a promising generalizable decision boundary. However, it is usually difficult for these methods to achieve a perfect domain-invariant liveness feature disentanglement, which may degrade the final classification performance by domain differences in illumination, fa… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI2023

  29. arXiv:2211.10981  [pdf, other

    cs.CV

    Real-time Local Feature with Global Visual Information Enhancement

    Authors: **yu Miao, Haosong Yue, Zhong Liu, Xingming Wu, Zaojun Fang, Guilin Yang

    Abstract: Local feature provides compact and invariant image representation for various visual tasks. Current deep learning-based local feature algorithms always utilize convolution neural network (CNN) architecture with limited receptive field. Besides, even with high-performance GPU devices, the computational efficiency of local features cannot be satisfactory. In this paper, we tackle such problems by pr… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: 6 pages, 5 figures, 2 tables. Accepted by ICIEA 2022

  30. arXiv:2210.08812  [pdf, other

    cs.CV cs.MM

    ITSRN++: Stronger and Better Implicit Transformer Network for Continuous Screen Content Image Super-Resolution

    Authors: Sheng Shen, Huan**g Yue, **gyu Yang, Kun Li

    Abstract: Nowadays, online screen sharing and remote cooperation are becoming ubiquitous. However, the screen content may be downsampled and compressed during transmission, while it may be displayed on large screens or the users would zoom in for detail observation at the receiver side. Therefore, develo** a strong and effective screen content image (SCI) super-resolution (SR) method is demanded. We obser… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 14pages,10 figures

  31. arXiv:2209.12475  [pdf, other

    cs.CV eess.IV

    Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset

    Authors: Huan**g Yue, Zhiming Zhang, **gyu Yang

    Abstract: In recent years, real image super-resolution (SR) has achieved promising results due to the development of SR datasets and corresponding real SR methods. In contrast, the field of real video SR is lagging behind, especially for real raw videos. Considering the superiority of raw image SR over sRGB image SR, we construct a real-world raw video SR (Real-RawVSR) dataset and propose a corresponding SR… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted by ECCV2022

  32. Multi-task Envisioning Transformer-based Autoencoder for Corporate Credit Rating Migration Early Prediction

    Authors: Han Yue, Steve Xia, Hongfu Liu

    Abstract: Corporate credit ratings issued by third-party rating agencies are quantified assessments of a company's creditworthiness. Credit Ratings highly correlate to the likelihood of a company defaulting on its debt obligations. These ratings play critical roles in investment decision-making as one of the key risk factors. They are also central to the regulatory framework such as BASEL II in calculating… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: SIGKDD Conference on Knowledge Discovery and Data Mining, 2022

  33. arXiv:2205.09802  [pdf, other

    cs.CV cs.AI cs.LG

    Label-invariant Augmentation for Semi-Supervised Graph Classification

    Authors: Han Yue, Chunhui Zhang, Chuxu Zhang, Hongfu Liu

    Abstract: Recently, contrastiveness-based augmentation surges a new climax in the computer vision domain, where some operations, including rotation, crop, and flip, combined with dedicated algorithms, dramatically increase the model generalization and robustness. Following this trend, some pioneering attempts employ the similar idea to graph data. Nevertheless, unlike images, it is much more difficult to de… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  34. arXiv:2205.01048  [pdf, other

    cs.RO

    Center-of-Mass-based Robust Grasp Pose Adaptation Using RGBD Camera and Force/Torque Sensing

    Authors: Shang Liu, Xiaobao Wei, Lulu Wang, **g Zhang, Boyu Li, Haosong Yue

    Abstract: Object drop** may occur when the robotic arm grasps objects with uneven mass distribution due to additional moments generated by objects' gravity. To solve this problem, we present a novel work that does not require extra wrist and tactile sensors and large amounts of experiments for learning. First, we obtain the center-of-mass position of the rod object using the widely fixed joint torque sens… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  35. arXiv:2203.02792  [pdf, other

    cs.CV

    Adversarial Dual-Student with Differentiable Spatial War** for Semi-Supervised Semantic Segmentation

    Authors: Cong Cao, Tianwei Lin, Dongliang He, Fu Li, Huan**g Yue, **gyu Yang, Errui Ding

    Abstract: A common challenge posed to robust semantic segmentation is the expensive data annotation cost. Existing semi-supervised solutions show great potential for solving this problem. Their key idea is constructing consistency regularization with unsupervised data augmentation from unlabeled data for model training. The perturbations for unlabeled data enable the consistency training loss, which benefit… ▽ More

    Submitted 27 September, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  36. arXiv:2201.10315  [pdf, other

    cs.AI

    Comparison research on binary relations based on transitive degrees and cluster degrees

    Authors: Zhaohao Wang, Huifang Yue

    Abstract: Interval-valued information systems are generalized models of single-valued information systems. By rough set approach, interval-valued information systems have been extensively studied. Authors could establish many binary relations from the same interval-valued information system. In this paper, we do some researches on comparing these binary relations so as to provide numerical scales for choosi… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  37. arXiv:2112.08325  [pdf, other

    cs.CV

    ForgeryNet -- Face Forgery Analysis Challenge 2021: Methods and Results

    Authors: Yinan He, Lu Sheng, **g Shao, Ziwei Liu, Zhaofan Zou, Zhizhi Guo, Shan Jiang, Curitis Sun, Guosheng Zhang, Keyao Wang, Haixiao Yue, Zhibin Hong, Wanguo Wang, Zhenyu Li, Qi Wang, Zhenli Wang, Ronghao Xu, Mingwen Zhang, Zhiheng Wang, Zhenhang Huang, Tianming Zhang, Ningning Zhao

    Abstract: The rapid progress of photorealistic synthesis techniques has reached a critical point where the boundary between real and manipulated images starts to blur. Recently, a mega-scale deep face forgery dataset, ForgeryNet which comprised of 2.9 million images and 221,247 videos has been released. It is by far the largest publicly available in terms of data-scale, manipulations (7 image-level approach… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Technical report. Challenge website: https://competitions.codalab.org/competitions/33386

  38. arXiv:2112.06174  [pdf, other

    cs.CV cs.MM

    Implicit Transformer Network for Screen Content Image Continuous Super-Resolution

    Authors: **gyu Yang, Sheng Shen, Huan**g Yue, Kun Li

    Abstract: Nowadays, there is an explosive growth of screen contents due to the wide application of screen sharing, remote cooperation, and online education. To match the limited terminal bandwidth, high-resolution (HR) screen contents may be downsampled and compressed. At the receiver side, the super-resolution (SR) of low-resolution (LR) screen content images (SCIs) is highly demanded by the HR display or… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: 24 pages with 3 figures, NeurIPS 2021

  39. arXiv:2107.14611  [pdf, other

    cs.CV cs.RO

    Automatic Vocabulary and Graph Verification for Accurate Loop Closure Detection

    Authors: Haosong Yue, **yu Miao, Weihai Chen, Wei Wang, Fanghong Guo, Zhengguo Li

    Abstract: Localizing pre-visited places during long-term simultaneous localization and map**, i.e. loop closure detection (LCD), is a crucial technique to correct accumulated inconsistencies. As one of the most effective and efficient solutions, Bag-of-Words (BoW) builds a visual vocabulary to associate features and then detect loops. Most existing approaches that build vocabularies off-line determine sca… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: 11 pages, 9 figures

  40. arXiv:2104.11548  [pdf, other

    cs.CV

    Fine-Grained Texture Identification for Reliable Product Traceability

    Authors: Junsong Wang, Yubo Li, Zhiyong Chang, Haitao Yue, Yonghua Lin

    Abstract: Texture exists in lots of the products, such as wood, beef and compression tea. These abundant and stochastic texture patterns are significantly different between any two products. Unlike the traditional digital ID tracking, in this paper, we propose a novel approach for product traceability, which directly uses the natural texture of the product itself as the unique identifier. A texture identifi… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  41. arXiv:2103.10023  [pdf, other

    cs.CV cs.RO

    Discriminative and Semantic Feature Selection for Place Recognition towards Dynamic Environments

    Authors: Yuxin Tian, **yu MIao, Xingming Wu, Haosong Yue, Zhong Liu, Weihai Chen

    Abstract: Features play an important role in various visual tasks, especially in visual place recognition applied in perceptual changing environments. In this paper, we address the challenges of place recognition due to dynamics and confusable patterns by proposing a discriminative and semantic feature selection network, dubbed as DSFeat. Supervised by both semantic information and attention mechanism, we c… ▽ More

    Submitted 20 March, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: The paper is under consideration at Pattern Recognition Letters

  42. arXiv:2103.07138  [pdf, other

    cs.CV

    UIEC^2-Net: CNN-based Underwater Image Enhancement Using Two Color Space

    Authors: Yudong Wang, Jichang Guo, Huan Gao, Huihui Yue

    Abstract: Underwater image enhancement has attracted much attention due to the rise of marine resource development in recent years. Benefit from the powerful representation capabilities of Convolution Neural Networks(CNNs), multiple underwater image enhancement algorithms based on CNNs have been proposed in the last few years. However, almost all of these algorithms employ RGB color space setting, which is… ▽ More

    Submitted 13 April, 2021; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: 11 pages, 11 figures

  43. arXiv:2012.00234  [pdf, other

    cs.CV cs.RO

    RaP-Net: A Region-wise and Point-wise Weighting Network to Extract Robust Features for Indoor Localization

    Authors: Dongjiang Li, **yu Miao, Xuesong Shi, Yuxin Tian, Qiwei Long, Tianyu Cai, ** Guo, Hongfei Yu, Wei Yang, Haosong Yue, Qi Wei, Fei Qiao

    Abstract: Feature extraction plays an important role in visual localization. Unreliable features on dynamic objects or repetitive regions will interfere with feature matching and challenge indoor localization greatly. To address the problem, we propose a novel network, RaP-Net, to simultaneously predict region-wise invariability and point-wise reliability, and then extract features by considering both of th… ▽ More

    Submitted 22 August, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: IROS 2021

  44. arXiv:2009.05722  [pdf, other

    cs.CV

    Generator Versus Segmentor: Pseudo-healthy Synthesis

    Authors: Zhang Yunlong, Li Chenxin, Lin Xin, Sun Liyan, Zhuang Yihong, Huang Yue, Ding Xinghao, Liu Xiaoqing, Yu Yizhou

    Abstract: This paper investigates the problem of pseudo-healthy synthesis that is defined as synthesizing a subject-specific pathology-free image from a pathological one. Recent approaches based on Generative Adversarial Network (GAN) have been developed for this task. However, these methods will inevitably fall into the trade-off between preserving the subject-specific identity and generating healthy-like… ▽ More

    Submitted 15 July, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Accepted by MICCAI2021

  45. arXiv:2005.03922  [pdf, other

    cs.CV

    Learning Generalized Spoof Cues for Face Anti-spoofing

    Authors: Haocheng Feng, Zhibin Hong, Haixiao Yue, Yang Chen, Keyao Wang, Junyu Han, **gtuo Liu, Errui Ding

    Abstract: Many existing face anti-spoofing (FAS) methods focus on modeling the decision boundaries for some predefined spoof types. However, the diversity of the spoof samples including the unknown ones hinders the effective decision boundary modeling and leads to weak generalization capability. In this paper, we reformulate FAS in an anomaly detection perspective and propose a residual-learning framework t… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 16 pages

  46. arXiv:2003.14013  [pdf, other

    eess.IV cs.CV cs.LG

    Supervised Raw Video Denoising with a Benchmark Dataset on Dynamic Scenes

    Authors: Huan**g Yue, Cong Cao, Lei Liao, Ronghe Chu, **gyu Yang

    Abstract: In recent years, the supervised learning strategy for real noisy image denoising has been emerging and has achieved promising results. In contrast, realistic noise removal for raw noisy videos is rarely studied due to the lack of noisy-clean pairs for dynamic scenes. Clean video frames for dynamic scenes cannot be captured with a long-exposure shutter or averaging multi-shots as was done for stati… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: CVPR2020 accepted paper

  47. arXiv:1906.07892  [pdf, other

    cs.CV

    Learning to Reconstruct and Understand Indoor Scenes from Sparse Views

    Authors: **gyu Yang, Ji Xu, Kun Li, Yu-Kun Lai, Huan**g Yue, Jianzhi Lu, Hao Wu, Yebin Liu

    Abstract: This paper proposes a new method for simultaneous 3D reconstruction and semantic segmentation of indoor scenes. Unlike existing methods that require recording a video using a color camera and/or a depth camera, our method only needs a small number of (e.g., 3-5) color images from uncalibrated sparse views as input, which greatly simplifies data acquisition and extends applicable scenarios. Since d… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

  48. Performance Optimization and Parallelization of a Parabolic Equation Solver in Computational Ocean Acoustics on Modern Many-core Computer

    Authors: Min Xu, Yongxian Wang, Anthony Theodore Chronopoulos, Hao Yue

    Abstract: As one of open-source codes widely used in computational ocean acoustics, FOR3D can provide a very good estimate for underwater acoustic propagation. In this paper, we propose a performance optimization and parallelization to speed up the running of FOR3D. We utilized a variety of methods to enhance the entire performance, such as using a multi-threaded programming model to exploit the potential c… ▽ More

    Submitted 11 November, 2017; v1 submitted 31 October, 2017; originally announced November 2017.

    Comments: 9 pages, 8 figures, 3 tables. preprint for the International Conference on Computer Science and Application Engineering (CSAE2017). 2017.10.21-23, Shanghai, China