Skip to main content

Showing 1–50 of 78 results for author: Fan, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15716  [pdf, other

    eess.IV cs.CV

    Predicting fluorescent labels in label-free microscopy images with pix2pix and adaptive loss in Light My Cells challenge

    Authors: Han Liu, Hao Li, Jiacheng Wang, Yubo Fan, Zhoubing Xu, Ipek Oguz

    Abstract: Fluorescence labeling is the standard approach to reveal cellular structures and other subcellular constituents for microscopy images. However, this invasive procedure may perturb or even kill the cells and the procedure itself is highly time-consuming and complex. Recently, in silico labeling has emerged as a promising alternative, aiming to use machine learning models to directly predict the flu… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.20073  [pdf, other

    cs.IT eess.SP

    Power Allocation for Cell-Free Massive MIMO ISAC Systems with OTFS Signal

    Authors: Yifei Fan, Shaochuan Wu, Xixi Bi, Guoyu Li

    Abstract: Applying integrated sensing and communication (ISAC) to a cell-free massive multiple-input multiple-output (CF mMIMO) architecture has attracted increasing attention. This approach equips CF mMIMO networks with sensing capabilities and resolves the problem of unreliable service at cell edges in conventional cellular networks. However, existing studies on CF-ISAC systems have focused on the applica… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: This work is submitted to IEEE for possible publication

  3. arXiv:2405.16197  [pdf, other

    cs.CV eess.IV

    A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior

    Authors: Fuheng Zhou, Dikai Wei, Ye Fan, Yulong Huang, Yonggang Zhang

    Abstract: Although deep learning based models for underwater image enhancement have achieved good performance, they face limitations in both lightweight and effectiveness, which prevents their deployment and application on resource-constrained platforms. Moreover, most existing deep learning based models use data compression to get high-level semantic information in latent space instead of using the origina… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 10 pages

  4. arXiv:2403.13909  [pdf, other

    cs.LG eess.SY

    Sequential Modeling of Complex Marine Navigation: Case Study on a Passenger Vessel (Student Abstract)

    Authors: Yimeng Fan, Pedram Agand, Mo Chen, Edward J. Park, Allison Kennedy, Chanwoo Bae

    Abstract: The maritime industry's continuous commitment to sustainability has led to a dedicated exploration of methods to reduce vessel fuel consumption. This paper undertakes this challenge through a machine learning approach, leveraging a real-world dataset spanning two years of a ferry in west coast Canada. Our focus centers on the creation of a time series forecasting model given the dynamic and static… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures, AAAI 2024 student abstract

  5. arXiv:2401.10345  [pdf, other

    eess.IV

    Attack and Defense Analysis of Learned Image Compression

    Authors: Tianyu Zhu, Heming Sun, Xiankui Xiong, Xuanpeng Zhu, Yong Gong, Minge **g, Yibo Fan

    Abstract: Learned image compression (LIC) is becoming more and more popular these years with its high efficiency and outstanding compression quality. Still, the practicality against modified inputs added with specific noise could not be ignored. White-box attacks such as FGSM and PGD use only gradient to compute adversarial images that mislead LIC models to output unexpected results. Our experiments compare… ▽ More

    Submitted 27 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  6. arXiv:2312.14239  [pdf, other

    cs.CV eess.IV

    PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

    Authors: Tzofi Klinghoffer, Xiaoyu Xiang, Siddharth Somasundaram, Yuchen Fan, Christian Richardt, Ramesh Raskar, Rakesh Ranjan

    Abstract: 3D reconstruction from a single-view is challenging because of the ambiguity from monocular cues and lack of information about occluded regions. Neural radiance fields (NeRF), while popular for view synthesis and 3D reconstruction, are typically reliant on multi-view images. Existing methods for single-view 3D reconstruction with NeRF rely on either data priors to hallucinate views of occluded reg… ▽ More

    Submitted 5 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project Page: https://platonerf.github.io/

  7. arXiv:2312.03640  [pdf, other

    eess.IV cs.CV

    Training Neural Networks on RAW and HDR Images for Restoration Tasks

    Authors: Lei Luo, Alexandre Chapiro, Xiaoyu Xiang, Yuchen Fan, Rakesh Ranjan, Rafal Mantiuk

    Abstract: The vast majority of standard image and video content available online is represented in display-encoded color spaces, in which pixel values are conveniently scaled to a limited range (0-1) and the color distribution is approximately perceptually uniform. In contrast, both camera RAW and high dynamic range (HDR) images are often represented in linear color spaces, in which color values are linearl… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  8. arXiv:2311.11325  [pdf, other

    cs.CV eess.IV

    MoVideo: Motion-Aware Video Generation with Diffusion Models

    Authors: **gyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan

    Abstract: While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos and images, i.e., motion. In this paper, we propose a novel motion-aware video generation (MoVideo) framework that takes motion into consideration from two aspe… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: project homepage: https://**gyunliang.github.io/MoVideo

  9. arXiv:2311.05477  [pdf, other

    eess.IV cs.CV cs.LG

    Using ResNet to Utilize 4-class T2-FLAIR Slice Classification Based on the Cholinergic Pathways Hyperintensities Scale for Pathological Aging

    Authors: Wei-Chun Kevin Tsai, Yi-Chien Liu, Ming-Chun Yu, Chia-Ju Chou, Sui-Hing Yan, Yang-Teng Fan, Yan-Hsiang Huang, Yen-Ling Chiu, Yi-Fang Chuang, Ran-Zan Wang, Yao-Chia Shih

    Abstract: The Cholinergic Pathways Hyperintensities Scale (CHIPS) is a visual rating scale used to assess the extent of cholinergic white matter hyperintensities in T2-FLAIR images, serving as an indicator of dementia severity. However, the manual selection of four specific slices for rating throughout the entire brain is a time-consuming process. Our goal was to develop a deep learning-based model capable… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, 2 tables

  10. arXiv:2311.01702  [pdf

    eess.IV cs.CV

    Medical Image Segmentation with Domain Adaptation: A Survey

    Authors: Yuemeng Li, Yong Fan

    Abstract: Deep learning (DL) has shown remarkable success in various medical imaging data analysis applications. However, it remains challenging for DL models to achieve good generalization, especially when the training and testing datasets are collected at sites with different scanners, due to domain shift caused by differences in data distributions. Domain adaptation has emerged as an effective means to a… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Survey

  11. arXiv:2310.14515  [pdf

    physics.optics eess.IV

    First realization of macroscopic Fourier ptychography for hundred-meter distance sub-diffraction imaging

    Authors: Qi Zhang, Yuran Lu, Yinghui Guo, Yingjie Shang, Mingbo Pu, Yulong Fan, Rui Zhou, Xiaoyin Li, Fei Zhang, Mingfeng Xu, Xiangang Luo

    Abstract: Fourier ptychography (FP) imaging, drawing on the idea of synthetic aperture, has been demonstrated as a potential approach for remote sub-diffraction-limited imaging. Nevertheless, the farthest imaging distance is still limited around 10 m even though there has been a significant improvement in macroscopic FP. The most severely issue in increasing the imaging distance is FoV limitation caused by… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  12. arXiv:2309.08323  [pdf

    cs.RO eess.SY

    MLP Based Continuous Gait Recognition of a Powered Ankle Prosthesis with Serial Elastic Actuator

    Authors: Yanze Li, Feixing Chen, **gqi Cao, Ruoqi Zhao, Xuan Yang, Xingbang Yang, Yubo Fan

    Abstract: Powered ankle prostheses effectively assist people with lower limb amputation to perform daily activities. High performance prostheses with adjustable compliance and capability to predict and implement amputee's intent are crucial for them to be comparable to or better than a real limb. However, current designs fail to provide simple yet effective compliance of the joint with full potential of mod… ▽ More

    Submitted 30 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Submitted to IROS 2024

  13. arXiv:2309.04154  [pdf, other

    cs.RO eess.SY

    A novel model for layer jamming-based continuum robots

    Authors: Bowen Yi, Yeman Fan, Dikai Liu

    Abstract: Continuum robots with variable stiffness have gained wide popularity in the last decade. Layer jamming (LJ) has emerged as a simple and efficient technique to achieve tunable stiffness for continuum robots. Despite its merits, the development of a control-oriented dynamical model tailored for this specific class of robots remains an open problem in the literature. This paper aims to present the fi… ▽ More

    Submitted 11 September, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  14. arXiv:2308.16551  [pdf

    eess.IV cs.CV

    Object Detection for Caries or Pit and Fissure Sealing Requirement in Children's First Permanent Molars

    Authors: Chenyao Jiang, Shiyao Zhai, Hengrui Song, Yuqing Ma, Yachen Fan, Yancheng Fang, Dongmei Yu, Canyang Zhang, Sanyang Han, Runming Wang, Yong Liu, Jianbo Li, Peiwu Qin

    Abstract: Dental caries is one of the most common oral diseases that, if left untreated, can lead to a variety of oral problems. It mainly occurs inside the pits and fissures on the occlusal/buccal/palatal surfaces of molars and children are a high-risk group for pit and fissure caries in permanent molars. Pit and fissure sealing is one of the most effective methods that is widely used in prevention of pit… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  15. arXiv:2308.12440  [pdf

    eess.IV cs.CV

    HNAS-reg: hierarchical neural architecture search for deformable medical image registration

    Authors: Jiong Wu, Yong Fan

    Abstract: Convolutional neural networks (CNNs) have been widely used to build deep learning models for medical image registration, but manually designed network architectures are not necessarily optimal. This paper presents a hierarchical NAS framework (HNAS-Reg), consisting of both convolutional operation search and network topology search, to identify the optimal network architecture for deformable medica… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  16. arXiv:2307.05249  [pdf, other

    eess.IV cs.CV cs.LG

    DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

    Authors: Zhiwen Yang, Yang Zhou, Hui Zhang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Multi-center positron emission tomography (PET) image synthesis aims at recovering low-dose PET images from multiple different centers. The generalizability of existing methods can still be suboptimal for a multi-center study due to domain shifts, which result from non-identical data distribution among centers with different imaging systems/protocols. While some approaches address domain shifts by… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: This article has been early accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication

  17. arXiv:2306.13101  [pdf, other

    eess.SP cs.AI cs.LG

    BrainNet: Epileptic Wave Detection from SEEG with Hierarchical Graph Diffusion Learning

    Authors: Junru Chen, Yang Yang, Tao Yu, Yingying Fan, Xiaolong Mo, Carl Yang

    Abstract: Epilepsy is one of the most serious neurological diseases, affecting 1-2% of the world's population. The diagnosis of epilepsy depends heavily on the recognition of epileptic waves, i.e., disordered electrical brainwave activity in the patient's brain. Existing works have begun to employ machine learning models to detect epileptic waves via cortical electroencephalogram (EEG). However, the recentl… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  18. arXiv:2306.03865  [pdf, other

    cs.RO eess.SY

    Simultaneous Position-and-Stiffness Control of Underactuated Antagonistic Tendon-Driven Continuum Robots

    Authors: Bowen Yi, Yeman Fan, Dikai Liu, Jose Guadalupe Romero

    Abstract: Continuum robots have gained widespread popularity due to their inherent compliance and flexibility, particularly their adjustable levels of stiffness for various application scenarios. Despite efforts to dynamic modeling and control synthesis over the past decade, few studies have incorporated stiffness regulation into their feedback control design; however, this is one of the initial motivations… ▽ More

    Submitted 13 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  19. Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

    Authors: Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is under… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI https://doi.org/10.1109/TMI.2023.3275609, IEEE Transactions on Medical Imaging. Code: https://github.com/wuyongjianCODE/Cyclic

  20. arXiv:2306.02132  [pdf, ps, other

    math.OC eess.SY

    Formation Control with Unknown Directions and General Coupling Coefficients

    Authors: Zhen Li, Yang Tang, Yongqing Fan, Tingwen Huang

    Abstract: Generally, the normal displacement-based formation control has a sensing mode that requires the agent not only to have certain knowledge of its direction, but also to gather its local information characterized by nonnegative coupling coefficients. However, the direction may be unknown in the sensing processes, and the coupling coefficients may also involve negative ones due to some circumstances.… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  21. arXiv:2304.04428  [pdf, other

    eess.SP

    SPHR-SAR-Net: Superpixel High-resolution SAR Imaging Network Based on Nonlocal Total Variation

    Authors: Guoru Zhou, Zhongqiu Xu, Yizhe Fan, Zhe Zhang, Xiaolan Qiu, Bingchen Zhang, Kun Fu, Yirong Wu

    Abstract: High-resolution is a key trend in the development of synthetic aperture radar (SAR), which enables the capture of fine details and accurate representation of backscattering properties. However, traditional high-resolution SAR imaging algorithms face several challenges. Firstly, these algorithms tend to focus on local information, neglecting non-local information between different pixel patches. Se… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  22. arXiv:2304.03076  [pdf, other

    eess.IV cs.MM

    Fast QTMT Partition for VVC Intra Coding Using U-Net Framework

    Authors: Zhao Zan, Leilei Huang, ShuShi Chen, Xiantao Zhang, Zhenghui Zhao, Haibing Yin, Yibo Fan

    Abstract: Versatile Video Coding (VVC) has significantly increased encoding efficiency at the expense of numerous complex coding tools, particularly the flexible Quad-Tree plus Multi-type Tree (QTMT) block partition. This paper proposes a deep learning-based algorithm applied in fast QTMT partition for VVC intra coding. Our solution greatly reduces encoding time by early termination of less-likely intra pre… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  23. arXiv:2304.00658  [pdf, other

    eess.AS

    Improving Meeting Inclusiveness using Speech Interruption Analysis

    Authors: Szu-Wei Fu, Yaran Fan, Yasaman Hosseinkashi, Jayant Gupchup, Ross Cutler

    Abstract: Meetings are a pervasive method of communication within all types of companies and organizations, and using remote collaboration systems to conduct meetings has increased dramatically since the COVID-19 pandemic. However, not all meetings are inclusive, especially in terms of the participation rates among attendees. In a recent large-scale survey conducted at Microsoft, the top suggestion given by… ▽ More

    Submitted 4 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

  24. arXiv:2303.12270  [pdf, other

    cs.CV eess.IV

    EBSR: Enhanced Binary Neural Network for Image Super-Resolution

    Authors: Renjie Wei, Shuwen Zhang, Zechun Liu, Meng Li, Yuchen Fan, Runsheng Wang, Ru Huang

    Abstract: While the performance of deep convolutional neural networks for image super-resolution (SR) has improved significantly, the rapid increase of memory and computation requirements hinders their deployment on resource-constrained devices. Quantized networks, especially binary neural networks (BNN) for SR have been proposed to significantly improve the model inference efficiency but suffer from large… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  25. arXiv:2303.02922  [pdf, other

    eess.IV cs.CV

    SurfNN: Joint Reconstruction of Multiple Cortical Surfaces from Magnetic Resonance Images

    Authors: Hao Zheng, Hongming Li, Yong Fan

    Abstract: To achieve fast, robust, and accurate reconstruction of the human cortical surfaces from 3D magnetic resonance images (MRIs), we develop a novel deep learning-based framework, referred to as SurfNN, to reconstruct simultaneously both inner (between white matter and gray matter) and outer (pial) surfaces from MRIs. Different from existing deep learning-based cortical surface reconstruction methods… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: ISBI 2023

  26. arXiv:2302.06167  [pdf

    eess.IV

    An Error-Surface-Based Fractional Motion Estimation Algorithm and Hardware Implementation for VVC

    Authors: Shushi Chen, Leilei Huang, Jiahao Liu, Chao Liu, Yibo Fan

    Abstract: Versatile Video Coding (VVC) introduces more coding tools to improve compression efficiency compared to its predecessor High Efficiency Video Coding (HEVC). For inter-frame coding, Fractional Motion Estimation (FME) still has a high computational effort, which limits the real-time processing capability of the video encoder. In this context, this paper proposes an error-surface-based FME algorithm… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  27. arXiv:2302.04948  [pdf

    eess.SP

    NR Conformance Testing of Analog Radio-over-LWIR FSO Fronthaul link for 6G Distributed MIMO Networks

    Authors: Rafael Puerta, Mengyao Han, Mahdieh Joharifar, Richard Schatz, Yan-Ting Sun, Yuchuan Fan, Anders Djupsjöbacka, Grégory Maisons, Johan Abautret, Roland Teissier, Lu Zhang, Sandis Spolitis, Muguang Wang, Vjaceslavs Bobrovs, Sebastian Lourdudoss, Xianbin Yu, Sergei Popov, Oskars Ozolins, Xiaodan Pang

    Abstract: We experimentally test the compliance with 5G/NR 3GPP technical specifications of an analog radio-over-FSO link at 9 μm. The ACLR and EVM transmitter requirements are fulfilled validating the suitability of LWIR FSO for 6G fronthaul.

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted in Optical Fiber Communication Conference (OFC) 2023, 3 pages, 2 figures

  28. Decentralized Eigendecomposition for Online Learning over Graphs with Applications

    Authors: Yufan Fan, Minh Trinh-Hoang, Cemil Emre Ardic, Marius Pesavento

    Abstract: In this paper, the problem of decentralized eigenvalue decomposition of a general symmetric matrix that is important, e.g., in Principal Component Analysis, is studied, and a decentralized online learning algorithm is proposed. Instead of collecting all information in a fusion center, the proposed algorithm involves only local interactions among adjacent agents. It benefits from the representation… ▽ More

    Submitted 11 August, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

  29. arXiv:2207.02399  [pdf

    eess.IV cs.CV

    Learning Apparent Diffusion Coefficient Maps from Accelerated Radial k-Space Diffusion-Weighted MRI in Mice using a Deep CNN-Transformer Model

    Authors: Yuemeng Li, Miguel Romanello Joaquim, Stephen Pickup, Hee Kwon Song, Rong Zhou, Yong Fan

    Abstract: Purpose: To accelerate radially sampled diffusion weighted spin-echo (Rad-DW-SE) acquisition method for generating high quality apparent diffusion coefficient (ADC) maps. Methods: A deep learning method was developed to generate accurate ADC maps from accelerated DWI data acquired with the Rad-DW-SE method. The deep learning method integrates convolutional neural networks (CNNs) with vision transf… ▽ More

    Submitted 1 August, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by Magnetic Resonance in Medicine

    Journal ref: Magn Reson Med 2023

  30. arXiv:2206.10385  [pdf, other

    eess.IV cs.AI cs.LG

    Approximate Equivariance SO(3) Needlet Convolution

    Authors: Kai Yi, Jialin Chen, Yu Guang Wang, Bingxin Zhou, Pietro Liò, Yanan Fan, Jan Hamann

    Abstract: This paper develops a rotation-invariant needlet convolution for rotation group SO(3) to distill multiscale information of spherical signals. The spherical needlet transform is generalized from $\mathbb{S}^2$ onto the SO(3) group, which decomposes a spherical signal to approximate and detailed spectral coefficients by a set of tight framelet operators. The spherical signal during the decomposition… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  31. arXiv:2206.05054  [pdf, other

    eess.IV cs.CV

    A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

    Authors: Yu Fan, Zicheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

    Abstract: Point cloud is one of the most widely used digital formats of 3D models, the visual quality of which is quite sensitive to distortions such as downsampling, noise, and compression. To tackle the challenge of point cloud quality assessment (PCQA) in scenarios where reference is not available, we propose a no-reference quality assessment metric for colored point cloud based on captured video sequenc… ▽ More

    Submitted 20 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022

  32. arXiv:2206.02146  [pdf, other

    cs.CV eess.IV

    Recurrent Video Restoration Transformer with Guided Deformable Attention

    Authors: **gyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van Gool

    Abstract: Video restoration aims at restoring multiple high-quality frames from multiple low-quality frames. Existing video restoration methods generally fall into two extreme cases, i.e., they either restore all frames in parallel or restore the video frame by frame in a recurrent way, which would result in different merits and drawbacks. Typically, the former has the advantage of temporal information fusi… ▽ More

    Submitted 12 November, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted by NeurIPS 2022. Code: https://github.com/**gyunLiang/RVRT

  33. 3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis

    Authors: Yang Zhou, Zhiwen Yang, Hui Zhang, Eric I-Chao Chang, Yubo Fan, Yan Xu

    Abstract: Potential radioactive hazards in full-dose positron emission tomography (PET) imaging remain a concern, whereas the quality of low-dose images is never desirable for clinical use. So it is of great interest to translate low-dose PET images into full-dose. Previous studies based on deep learning methods usually directly extract hierarchical features for reconstruction. We notice that the importance… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TMI.2022.3156614, IEEE Transactions on Medical Imaging

    Journal ref: IEEE Transactions on Medical Imaging, 2022, 41(8): 2092-2104

  34. arXiv:2203.04959  [pdf, other

    eess.IV cs.CV

    ModDrop++: A Dynamic Filter Network with Intra-subject Co-training for Multiple Sclerosis Lesion Segmentation with Missing Modalities

    Authors: Han Liu, Yubo Fan, Hao Li, Jiacheng Wang, Dewei Hu, Can Cui, Ho Hin Lee, Huahong Zhang, Ipek Oguz

    Abstract: Multiple Sclerosis (MS) is a chronic neuroinflammatory disease and multi-modality MRIs are routinely used to monitor MS lesions. Many automatic MS lesion segmentation models have been developed and have reached human-level performance. However, most established methods assume the MRI modalities used during training are also available during testing, which is not guaranteed in clinical practice. Pr… ▽ More

    Submitted 1 July, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: MICCAI 2022

  35. arXiv:2201.12288  [pdf, other

    cs.CV eess.IV

    VRT: A Video Restoration Transformer

    Authors: **gyun Liang, Jiezhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc Van Gool

    Abstract: Video restoration (e.g., video super-resolution) aims to restore high-quality frames from low-quality frames. Different from single image restoration, video restoration generally requires to utilize temporal information from multiple adjacent but usually misaligned video frames. Existing deep methods generally tackle with this by exploiting a sliding window strategy or a recurrent architecture, wh… ▽ More

    Submitted 15 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: add results on VFI and STVSR; SOTA results (+up to 2.16dB) on video SR, video deblurring, video denoising, video frame interpolation and space-time video super-resolution. Code: https://github.com/**gyunLiang/VRT

  36. arXiv:2201.08221  [pdf

    physics.ins-det eess.SP

    A 1.5GS/s 8b Pipelined-SAR ADC with Output Level Shifting Settling Technique in 14nm CMOS

    Authors: Yuanming Zhu, Shengchang Cai, Shiva Kiran, Yang-Hang Fan, Po-Hsuan Chang, Sebastian Hoyos, Samuel Palermo

    Abstract: A single channel 1.5GS/s 8-bit pipelined-SAR ADC utilizes a novel output level shifting (OLS) settling technique to reduce the power and enable low-voltage operation of the dynamic residue amplifier. The ADC consists of a 4-bit first stage and a 5-bit second stage, with 1-bit redundancy to relax the offset, gain, and settling requirements of the first stage. Employing the OLS technique allows for… ▽ More

    Submitted 20 August, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: it is a 4 page and 9 figure IEEE Custom Integrated Circuit Conference paper

    Journal ref: IEEE Custom Integrated Circuit Conference 2020

  37. CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwannoma and Cochlea Segmentation

    Authors: Reuben Dorent, Aaron Kujawa, Marina Ivory, Spyridon Bakas, Nicola Rieke, Samuel Joutard, Ben Glocker, Jorge Cardoso, Marc Modat, Kayhan Batmanghelich, Arseniy Belkov, Maria Baldeon Calisto, Jae Won Choi, Benoit M. Dawant, Hexin Dong, Sergio Escalera, Yubo Fan, Lasse Hansen, Mattias P. Heinrich, Smriti Joshi, Victoriya Kashtanova, Hyeon Gyu Kim, Satoshi Kondo, Christian N. Kruse, Susana K. Lai-Yuen , et al. (15 additional authors not shown)

    Abstract: Domain Adaptation (DA) has recently raised strong interests in the medical imaging community. While a large variety of DA techniques has been proposed for image segmentation, most of these techniques have been validated either on private datasets or on small publicly available datasets. Moreover, these datasets mostly addressed single-class problems. To tackle these limitations, the Cross-Modality… ▽ More

    Submitted 14 December, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: In Medical Image Analysis

  38. arXiv:2201.01492  [pdf, other

    eess.IV cs.CV

    FAVER: Blind Quality Prediction of Variable Frame Rate Videos

    Authors: Qi Zheng, Zhengzhong Tu, Pavan C. Madhusudana, Xiaoyang Zeng, Alan C. Bovik, Yibo Fan

    Abstract: Video quality assessment (VQA) remains an important and challenging problem that affects many applications at the widest scales. Recent advances in mobile devices and cloud computing techniques have made it possible to capture, process, and share high resolution, high frame rate (HFR) videos across the Internet nearly instantaneously. Being able to monitor and control the quality of these streamed… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures

  39. arXiv:2112.04914  [pdf, other

    eess.AS cs.LG cs.SD

    End-to-end Alexa Device Arbitration

    Authors: Jarred Barber, Yifeng Fan, Tao Zhang

    Abstract: We introduce a variant of the speaker localization problem, which we call device arbitration. In the device arbitration problem, a user utters a keyword that is detected by multiple distributed microphone arrays (smart home devices), and we want to determine which device was closest to the user. Rather than solving the full localization problem, we propose an end-to-end machine learning system. Th… ▽ More

    Submitted 16 February, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted for ICASSP 2022

  40. arXiv:2111.02283  [pdf, other

    cs.RO eess.SY

    A Self-adaptive LSAC-PID Approach based on Lyapunov Reward Sha** for Mobile Robots

    Authors: Xinyi Yu, Siyu Xu, Yuehai Fan, Linlin Ou

    Abstract: To solve the coupling problem of control loops and the adaptive parameter tuning problem in the multi-input multi-output (MIMO) PID control system, a self-adaptive LSAC-PID algorithm is proposed based on deep reinforcement learning (RL) and Lyapunov-based reward sha** in this paper. For complex and unknown mobile robot control environment, an RL-based MIMO PID hybrid control strategy is firstly… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 11 pages, 13 figures

  41. arXiv:2111.00485  [pdf, other

    cs.CV eess.IV

    Learned Image Compression with Separate Hyperprior Decoders

    Authors: Zhao Zan, Chao Liu, Heming Sun, Xiaoyang Zeng, Yibo Fan

    Abstract: Learned image compression techniques have achieved considerable development in recent years. In this paper, we find that the performance bottleneck lies in the use of a single hyperprior decoder, in which case the ternary Gaussian model collapses to a binary one. To solve this, we propose to use three hyperprior decoders to separate the decoding process of the mixed parameters in discrete Gaussian… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: This paper has been accepted by IEEE Open Journal of Circuits and Systems

  42. arXiv:2109.06274  [pdf, other

    eess.IV cs.CV

    Cross-Modality Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation

    Authors: Han Liu, Yubo Fan, Can Cui, Dingjie Su, Andrew McNeil, Benoit M. Dawant

    Abstract: Automatic methods to segment the vestibular schwannoma (VS) tumors and the cochlea from magnetic resonance imaging (MRI) are critical to VS treatment planning. Although supervised methods have achieved satisfactory performance in VS segmentation, they require full annotations by experts, which is laborious and time-consuming. In this work, we aim to tackle the VS and cochlea segmentation problem i… ▽ More

    Submitted 8 November, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

  43. arXiv:2108.08551  [pdf, other

    eess.IV cs.CV cs.MM

    Learned Video Compression with Residual Prediction and Loop Filter

    Authors: Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan

    Abstract: In this paper, we propose a learned video codec with a residual prediction network (RP-Net) and a feature-aided loop filter (LF-Net). For the RP-Net, we exploit the residual of previous multiple frames to further eliminate the redundancy of the current frame residual. For the LF-Net, the features from residual decoding network and the motion compensation network are used to aid the reconstruction… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  44. arXiv:2108.01522  [pdf, other

    eess.IV

    CSMCNet: Scalable Video Compressive Sensing Reconstruction with Interpretable Motion Estimation

    Authors: Bowen Huang, Xiao Yan, **jia Zhou, Yibo Fan

    Abstract: Most deep network methods for compressive sensing reconstruction suffer from the black-box characteristic of DNN. In this paper, a deep neural network with interpretable motion estimation named CSMCNet is proposed. The network is able to realize high-quality reconstruction of video compressive sensing by unfolding the iterative steps of optimization based algorithms. A DNN based, multi-hypothesis… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: 12 pages, 10 pages, 5 tables

  45. arXiv:2107.03987  [pdf

    eess.IV cs.CV

    Atlas-Based Segmentation of Intracochlear Anatomy in Metal Artifact Affected CT Images of the Ear with Co-trained Deep Neural Networks

    Authors: Jianing Wang, Dingjie Su, Yubo Fan, Srijata Chakravorti, Jack H. Noble, Benoit M. Dawant

    Abstract: We propose an atlas-based method to segment the intracochlear anatomy (ICA) in the post-implantation CT (Post-CT) images of cochlear implant (CI) recipients that preserves the point-to-point correspondence between the meshes in the atlas and the segmented volumes. To solve this problem, which is challenging because of the strong artifacts produced by the implant, we use a pair of co-trained deep n… ▽ More

    Submitted 9 July, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 10 pages, 5 figures

  46. arXiv:2106.06011  [pdf, other

    cs.CV cs.LG eess.IV

    A self-adapting super-resolution structures framework for automatic design of GAN

    Authors: Yibo Guo, Haidi Wang, Yiming Fan, Shunyao Li, Mingliang Xu

    Abstract: With the development of deep learning, the single super-resolution image reconstruction network models are becoming more and more complex. Small changes in hyperparameters of the models have a greater impact on model performance. In the existing works, experts have gradually explored a set of optimal model parameters based on empirical values or performing brute-force search. In this paper, we int… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 9 pages, 6 figures

  47. arXiv:2106.05545  [pdf, other

    eess.IV cs.CV cs.LG

    Super-Resolution Image Reconstruction Based on Self-Calibrated Convolutional GAN

    Authors: Yibo Guo, Haidi Wang, Yiming Fan, Shunyao Li, Mingliang Xu

    Abstract: With the effective application of deep learning in computer vision, breakthroughs have been made in the research of super-resolution images reconstruction. However, many researches have pointed out that the insufficiency of the neural network extraction on image features may bring the deteriorating of newly reconstructed image. On the other hand, the generated pictures are sometimes too artificial… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 8 pages, 3 figures

  48. arXiv:2105.15077  [pdf, other

    cs.CV cs.LG eess.IV

    SDNet: mutil-branch for single image deraining using swin

    Authors: Fuxiang Tan, YuTing Kong, Yingying Fan, Feng Liu, Daxin Zhou, Hao zhang, Long Chen, Liang Gao, Yurong Qian

    Abstract: Rain streaks degrade the image quality and seriously affect the performance of subsequent computer vision tasks, such as autonomous driving, social security, etc. Therefore, removing rain streaks from a given rainy images is of great significance. Convolutional neural networks(CNN) have been widely used in image deraining tasks, however, the local computational characteristics of convolutional ope… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

  49. DSR: Direct Simultaneous Registration for Multiple 3D Images

    Authors: Zhehua Mao, Liang Zhao, Shoudong Huang, Yiting Fan, Alex Pui-Wai Lee

    Abstract: This paper presents a novel algorithm named Direct Simultaneous Registration (DSR) that registers a collection of 3D images in a simultaneous fashion without specifying any reference image, feature extraction and matching, or information loss or reuse. The algorithm optimizes the global poses of local image frames by maximizing the similarity between a predefined panoramic image and local images.… ▽ More

    Submitted 15 August, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 10 pages, 3 figures, The 25th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2022

    Journal ref: Medical Image Computing and Computer Assisted Intervention (2022)

  50. arXiv:2101.09642  [pdf

    eess.IV cs.CV cs.MM

    Image Compression with Encoder-Decoder Matched Semantic Segmentation

    Authors: Trinh Man Hoang, **jia Zhou, Yibo Fan

    Abstract: In recent years, layered image compression is demonstrated to be a promising direction, which encodes a compact representation of the input image and apply an up-sampling network to reconstruct the image. To further improve the quality of the reconstructed image, some works transmit the semantic segment together with the compressed image data. Consequently, the compression ratio is also decreased… ▽ More

    Submitted 30 January, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

    Journal ref: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 2020, pp. 619-623