Skip to main content

Showing 1–50 of 51 results for author: Fan, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14977  [pdf, other

    cs.AI eess.IV

    Trustworthy Enhanced Multi-view Multi-modal Alzheimer's Disease Prediction with Brain-wide Imaging Transcriptomics Data

    Authors: Shan Cong, Zhoujie Fan, Hongwei Liu, Yinghan Zhang, Xin Wang, Haoran Luo, Xiaohui Yao

    Abstract: Brain transcriptomics provides insights into the molecular mechanisms by which the brain coordinates its functions and processes. However, existing multimodal methods for predicting Alzheimer's disease (AD) primarily rely on imaging and sometimes genetic data, often neglecting the transcriptomic basis of brain. Furthermore, while striving to integrate complementary information between modalities,… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.08423  [pdf, other

    eess.IV cs.CV

    NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution

    Authors: Yihong Chen, Zhen Fan, Shuai Dong, Zhiwei Chen, Wenjie Li, Minghui Qin, Min Zeng, Xubing Lu, Guofu Zhou, Xingsen Gao, Jun-Ming Liu

    Abstract: Stereo image super-resolution (SR) refers to the reconstruction of a high-resolution (HR) image from a pair of low-resolution (LR) images as typically captured by a dual-camera device. To enhance the quality of SR images, most previous studies focused on increasing the number and size of feature maps and introducing complex and computationally intensive structures, resulting in models with high co… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  3. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  4. arXiv:2403.12028  [pdf, other

    cs.CV cs.AI eess.IV

    Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail

    Authors: Ming** Chen, Junhao Chen, Xiaojun Ye, Huan-ang Gao, Xiaoxue Chen, Zhaoxin Fan, Hao Zhao

    Abstract: 3D human body reconstruction has been a challenge in the field of computer vision. Previous methods are often time-consuming and difficult to capture the detailed appearance of the human body. In this paper, we propose a new method called \emph{Ultraman} for fast reconstruction of textured 3D human models from a single image. Compared to existing techniques, \emph{Ultraman} greatly improves the re… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project Page: https://air-discover.github.io/Ultraman/

  5. arXiv:2403.02566  [pdf, other

    eess.IV cs.CV

    Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

    Authors: Zhaoxin Fan, Runmin Jiang, Junhao Wu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu

    Abstract: 3D medical image segmentation is a challenging task with crucial implications for disease diagnosis and treatment planning. Recent advances in deep learning have significantly enhanced fully supervised medical image segmentation. However, this approach heavily relies on labor-intensive and time-consuming fully annotated ground-truth labels, particularly for 3D volumes. To overcome this limitation,… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2403.02010  [pdf, other

    cs.SD eess.AS

    SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR

    Authors: Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma

    Abstract: Multi-talker automatic speech recognition plays a crucial role in scenarios involving multi-party interactions, such as meetings and conversations. Due to its inherent complexity, this task has been receiving increasing attention. Notably, the serialized output training (SOT) stands out among various approaches because of its simplistic architecture and exceptional performance. However, the freque… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  7. arXiv:2402.13629  [pdf, other

    eess.IV cs.CV

    Adversarial Purification and Fine-tuning for Robust UDC Image Restoration

    Authors: Zhenbo Song, Zhenyuan Zhang, Kaihao Zhang, Wenhan Luo, Zhaoxin Fan, Jianfeng Lu

    Abstract: This study delves into the enhancement of Under-Display Camera (UDC) image restoration models, focusing on their robustness against adversarial attacks. Despite its innovative approach to seamless display integration, UDC technology faces unique image degradation challenges exacerbated by the susceptibility to adversarial perturbations. Our research initially conducts an in-depth robustness evalua… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  8. arXiv:2402.11274  [pdf, other

    eess.IV cs.CV cs.LG

    TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method

    Authors: Chenyan Zhang, Yifei Chen, Zhenxiong Fan, Yiyu Huang, Wenchao Weng, Ruiquan Ge, Dong Zeng, Changmiao Wang

    Abstract: Recently, diffusion models have gained significant attention as a novel set of deep learning-based generative methods. These models attempt to sample data from a Gaussian distribution that adheres to a target distribution, and have been successfully adapted to the reconstruction of MRI data. However, as an unconditional generative model, the diffusion model typically disrupts image coordination be… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 5 pages, 2 figures, accept ISBI2024

    Journal ref: ISBI 2024

  9. arXiv:2311.11997  [pdf, other

    eess.SY

    Smart Energy Network Digital Twins: Findings from a UK-Based Demonstrator Project

    Authors: Matthew Deakin, Marta Vanin, Zhong Fan, Dirk Van Hertem

    Abstract: Digital Twins promise to deliver a step-change in distribution system operations and planning, but there are few real-world examples that explore the challenges of combining imperfect model and measurement data, and then use these as the basis for subsequent analysis. In this work we propose a Digital Twin framework for electrical distribution systems and implement that framework on the Smart Ener… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  10. CrackCLF: Automatic Pavement Crack Detection based on Closed-Loop Feedback

    Authors: Chong Li, Zhun Fan, Ying Chen, Huibiao Lin, Laura Moretti, Giuseppe Loprencipe, Weihua Sheng, Kelvin C. P. Wang

    Abstract: Automatic pavement crack detection is an important task to ensure the functional performances of pavements during their service life. Inspired by deep learning (DL), the encoder-decoder framework is a powerful tool for crack detection. However, these models are usually open-loop (OL) systems that tend to treat thin cracks as the background. Meanwhile, these models can not automatically correct err… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Journal ref: IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS,2023

  11. arXiv:2311.05479  [pdf, other

    eess.IV cs.CV physics.med-ph

    Retinal OCT Synthesis with Denoising Diffusion Probabilistic Models for Layer Segmentation

    Authors: Yuli Wu, Weidong He, Dennis Eschweiler, Ningxin Dou, Zixin Fan, Shengli Mi, Peter Walter, Johannes Stegmaier

    Abstract: Modern biomedical image analysis using deep learning often encounters the challenge of limited annotated data. To overcome this issue, deep generative models can be employed to synthesize realistic biomedical images. In this regard, we propose an image synthesis method that utilizes denoising diffusion probabilistic models (DDPMs) to automatically generate retinal optical coherence tomography (OCT… ▽ More

    Submitted 6 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: ISBI 2024

  12. arXiv:2310.10300  [pdf, other

    cs.SD cs.IR eess.AS

    BeatDance: A Beat-Based Model-Agnostic Contrastive Learning Framework for Music-Dance Retrieval

    Authors: Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan

    Abstract: Dance and music are closely related forms of expression, with mutual retrieval between dance videos and music being a fundamental task in various fields like education, art, and sports. However, existing methods often suffer from unnatural generation effects or fail to fully explore the correlation between music and dance. To overcome these challenges, we propose BeatDance, a novel beat-based mode… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  13. arXiv:2309.12191  [pdf, other

    eess.SP physics.ins-det

    Exploring the Correlation Between Ultrasound Speed and the State of Health of LiFePO$_4$ Prismatic Cells

    Authors: Shengyuan Zhang, Peng Zuo, Xuesong Yin, Zheng Fan

    Abstract: Electric vehicles (EVs) have become a popular mode of transportation, with their performance depending on the ageing of the Li-ion batteries used to power them. However, it can be challenging and time-consuming to determine the capacity retention of a battery in service. A rapid and reliable testing method for state of health (SoH) determination is desired. Ultrasonic testing techniques are promis… ▽ More

    Submitted 24 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

  14. arXiv:2306.00544  [pdf, other

    cs.IT eess.SP

    Codebook Configuration for RIS-aided Systems via Implicit Neural Representations

    Authors: Huiying Yang, Ru**g Xiong, Yao Xiao, Zhijie Fan, Tiebin Mi, Robert Caiming Qiu, Zenan Ling

    Abstract: Reconfigurable Intelligent Surface (RIS) is envisioned to be an enabling technique in 6G wireless communications. By configuring the reflection beamforming codebook, RIS focuses signals on target receivers to enhance signal strength. In this paper, we investigate the codebook configuration for RIS-aided communication systems. We formulate an implicit relationship between user's coordinates informa… ▽ More

    Submitted 28 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  15. arXiv:2303.11089  [pdf, other

    cs.CV cs.SD eess.AS

    EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

    Authors: Ziqiao Peng, Haoyu Wu, Zhenbo Song, Hao Xu, Xiangyu Zhu, Jun He, Hongyan Liu, Zhaoxin Fan

    Abstract: Speech-driven 3D face animation aims to generate realistic facial expressions that match the speech content and emotion. However, existing methods often neglect emotional facial expressions or fail to disentangle them from speech content. To address this issue, this paper proposes an end-to-end neural network to disentangle different emotions in speech so as to generate rich 3D facial expressions.… ▽ More

    Submitted 25 August, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted by ICCV 2023

  16. arXiv:2303.09663  [pdf, other

    cs.CV eess.IV

    Efficient Computation Sharing for Multi-Task Visual Scene Understanding

    Authors: Sara Shoouri, Mingyu Yang, Zichen Fan, Hun-Seok Kim

    Abstract: Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks. Despite the benefits of multi-task learning, such techniques can struggle with balancing the loss for each task, leading to potential performance degradation. We present a novel computation- and parameter-sharing framework th… ▽ More

    Submitted 14 August, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Camera-Ready version. Accepted to ICCV 2023

  17. arXiv:2212.13021  [pdf, other

    eess.SP

    Diameter Estimation of Cylindrical Metal Bar Using Wideband Dual-Polarized Ground-Penetrating Radar

    Authors: Hai-Han Sun, Weixia Cheng, Zheng Fan

    Abstract: Ground-penetrating radar (GPR) has been an effective technology for locating metal bars in civil engineering structures. However, the accurate sizing of subsurface metal bars of small diameters remains a challenging problem for the existing reflection pattern-based method due to the limited resolution of GPR. To address the issue, we propose a reflection power-based method by exploring the relatio… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: 14 pages, 15 figures, will be published at IEEE Transactions on Instrumentation and Measurement

  18. arXiv:2212.00532  [pdf, other

    eess.IV cs.CV

    EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

    Authors: Liyu Shi, Xiaoyan Li, Weiming Hu, Haoyuan Chen, **g Chen, Zizhen Fan, Minghe Gao, Yujie **g, Guotao Lu, Deguo Ma, Zhiyu Ma, Qingtao Meng, Dechao Tang, Hongzan Sun, Marcin Grzegorzek, Shouliang Qi, Yueyang Teng, Chen Li

    Abstract: Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when comp… ▽ More

    Submitted 6 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  19. arXiv:2211.09381  [pdf, other

    cs.SD eess.AS

    Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

    Authors: Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

    Abstract: In multi-talker scenarios such as meetings and conversations, speech processing systems are usually required to segment the audio and then transcribe each segmentation. These two stages are addressed separately by speaker change detection (SCD) and automatic speech recognition (ASR). Most previous SCD systems rely solely on speaker information and ignore the importance of speech content. In this p… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  20. arXiv:2211.03885  [pdf, other

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li , et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  21. arXiv:2210.05762  [pdf, other

    eess.IV cs.CV

    Joint localization and classification of breast tumors on ultrasound images using a novel auxiliary attention-based framework

    Authors: Zong Fan, ** Gong, Shanshan Tang, Christine U. Lee, Xiaohui Zhang, Pengfei Song, Shigao Chen, Hua Li

    Abstract: Automatic breast lesion detection and classification is an important task in computer-aided diagnosis, in which breast ultrasound (BUS) imaging is a common and frequently used screening tool. Recently, a number of deep learning-based methods have been proposed for joint localization and classification of breast lesions using BUS images. In these methods, features extracted by a shared network trun… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  22. arXiv:2208.12779  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    Battery and Hydrogen Energy Storage Control in a Smart Energy Network with Flexible Energy Demand using Deep Reinforcement Learning

    Authors: Cephas Samende, Zhong Fan, Jun Cao

    Abstract: Smart energy networks provide for an effective means to accommodate high penetrations of variable renewable energy sources like solar and wind, which are key for deep decarbonisation of energy production. However, given the variability of the renewables as well as the energy demand, it is imperative to develop effective control and energy storage schemes to manage the variable energy generation an… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 13 pages, 10 figures

  23. arXiv:2207.14166  [pdf, ps, other

    cs.CV cs.LG eess.IV

    RHA-Net: An Encoder-Decoder Network with Residual Blocks and Hybrid Attention Mechanisms for Pavement Crack Segmentation

    Authors: Guijie Zhu, Zhun Fan, Jiacheng Liu, Duan Yuan, Peili Ma, Meihua Wang, Weihua Sheng, Kelvin C. P. Wang

    Abstract: The acquisition and evaluation of pavement surface data play an essential role in pavement condition evaluation. In this paper, an efficient and effective end-to-end network for automatic pavement crack segmentation, called RHA-Net, is proposed to improve the pavement crack segmentation accuracy. The RHA-Net is built by integrating residual blocks (ResBlocks) and hybrid attention blocks into the e… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  24. Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire

    Authors: Zhiyun Fan, Linhao Dong, Meng Cai, Zejun Ma, Bo Xu

    Abstract: Speaker change detection is an important task in multi-party interactions such as meetings and conversations. In this paper, we address the speaker change detection task from the perspective of sequence transduction. Specifically, we propose a novel encoder-decoder framework that directly converts the input feature sequence to the speaker identity sequence. The difference-based continuous integrat… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Signal Processing Letters 2022

  25. arXiv:2206.11629  [pdf, other

    cs.CV eess.IV

    Global Sensing and Measurements Reuse for Image Compressed Sensing

    Authors: Zi-En Fan, Feng Lian, Jia-Ni Quan

    Abstract: Recently, deep network-based image compressed sensing methods achieved high reconstruction quality and reduced computational overhead compared with traditional methods. However, existing methods obtain measurements only from partial features in the network and use them only once for image reconstruction. They ignore there are low, mid, and high-level features in the network\cite{zeiler2014visualiz… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  26. arXiv:2206.11501  [pdf, other

    eess.IV cs.CV

    A novel adversarial learning strategy for medical image classification

    Authors: Zong Fan, Xiaohui Zhang, Jacob A. Gasienica, Jennifer Potts, Su Ruan, Wade Thorstad, Hiram Gay, Pengfei Song, Xiaowei Wang, Hua Li

    Abstract: Deep learning (DL) techniques have been extensively utilized for medical image classification. Most DL-based classification networks are generally structured hierarchically and optimized through the minimization of a single loss function measured at the end of the networks. However, such a single loss design could potentially lead to optimization of one specific value of interest but fail to lever… ▽ More

    Submitted 7 July, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  27. Learning to Remove Clutter in Real-World GPR Images Using Hybrid Data

    Authors: Hai-Han Sun, Weixia Cheng, Zheng Fan

    Abstract: The clutter in the ground-penetrating radar (GPR) radargram disguises or distorts subsurface target responses, which severely affects the accuracy of target detection and identification. Existing clutter removal methods either leave residual clutter or deform target responses when facing complex and irregular clutter in the real-world radargram. To tackle the challenge of clutter removal in real s… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  28. arXiv:2204.05184  [pdf, other

    cs.NI cs.LG eess.SP

    Domain Adversarial Graph Convolutional Network Based on RSSI and Crowdsensing for Indoor Localization

    Authors: Mingxin Zhang, Zipei Fan, Ryosuke Shibasaki, Xuan Song

    Abstract: In recent years, the use of WiFi fingerprints for indoor positioning has grown in popularity, largely due to the widespread availability of WiFi and the proliferation of mobile communication devices. However, many existing methods for constructing fingerprint datasets rely on labor-intensive and time-consuming processes of collecting large amounts of data. Additionally, these methods often focus o… ▽ More

    Submitted 31 March, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: IEEE Internet of Things Journal

    Journal ref: IEEE Internet of Things Journal, vol. 10, no. 15, pp. 13662-13672, 2023

  29. arXiv:2204.00293  [pdf

    eess.SY

    The role of living laboratories in unlocking the potential of low-carbon energy technologies on the journey to net-zero

    Authors: Zhong Fan, Jun Cao, Taskin Jamal, Chris Fogwill, Cephas Samende, Zoe Robinson, Fiona Polack, Mark Ormerod, Sharon George, Adam Peacock, David Healey

    Abstract: We demonstrate the potential role of one of the largest at scale multi-vector Smart Energy Network Demonstrator (SEND).

    Submitted 1 April, 2022; originally announced April 2022.

  30. arXiv:2203.07677  [pdf, other

    eess.IV cs.CV

    Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning

    Authors: Xiang Chen, Zhentao Fan, Pengpeng Li, Longgang Dai, Caihua Kong, Zhuoran Zheng, Yufeng Huang, Yufeng Li

    Abstract: We offer a practical unpaired learning based image dehazing network from an unpaired set of clear and hazy images. This paper provides a new perspective to treat image dehazing as a two-class separated factor disentanglement task, i.e, the task-relevant factor of clear image reconstruction and the task-irrelevant factor of haze-relevant distribution. To achieve the disentanglement of these two-cla… ▽ More

    Submitted 12 July, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  31. arXiv:2111.10898  [pdf, other

    cs.MA cs.AI cs.LG eess.SY

    Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning

    Authors: Daniel J. B. Harrold, Jun Cao, Zhong Fan

    Abstract: In this paper, multi-agent reinforcement learning is used to control a hybrid energy storage system working collaboratively to reduce the energy costs of a microgrid through maximising the value of renewable energy and trading. The agents must learn to control three different types of energy storage system suited for short, medium, and long-term storage under fluctuating demand, dynamic wholesale… ▽ More

    Submitted 5 December, 2021; v1 submitted 21 November, 2021; originally announced November 2021.

  32. arXiv:2108.09053  [pdf, ps, other

    eess.SY

    Multi-Agent Deep Deterministic Policy Gradient Algorithm for Peer-to-Peer Energy Trading Considering Distribution Network Constraints

    Authors: Cephas Samende, Jun Cao, Zhong Fan

    Abstract: In this paper, we investigate an energy cost minimization problem for prosumers participating in peer-to-peer energy trading. Due to (i) uncertainties caused by renewable energy generation and consumption, (ii) difficulties in develo** an accurate and efficient energy trading model, and (iii) the need to satisfy distribution network constraints, it is challenging for prosumers to obtain optimal… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: 15 pages, 19 figures

  33. The energy revolution: cyber physical advances and opportunities for smart local energy systems

    Authors: Nandor Verba, Elena Gaura, Stephen McArthur, George Konstantopoulos, Jianzhoug Wu, Zhong Fan, Dimitrios Athanasiadis, Pablo Rodolfo Baldivieso Monasterios, Euan Morris, Jeffrey Hardy

    Abstract: We have designed a two-stage, 10-step process to give organisations a method to analyse small local energy systems (SLES) projects based on their Cyber Physical System components in order to develop future-proof energy systems. SLES are often developed for a specific range of use cases and functions, and these match the specific requirements and needs of the community, location or site under con… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: White Paper on Cyber Physical Advances relevant to Smart Local Energy Systems

    Report number: June, 2020, EnergyREV, University of Strathclyde Publishing: Glasgow, UK, ISBN 978-1-909522-58-9

  34. arXiv:2104.11590  [pdf, other

    cs.RO eess.SY

    A Prioritized Trajectory Planning Algorithm for Connected and Automated Vehicle Mandatory Lane Changes

    Authors: Nachuan Li, Austen Z. Fan, Riley Fischer, Wissam Kontar, Bin Ran

    Abstract: We introduce a prioritized system-optimal algorithm for mandatory lane change (MLC) behavior of connected and automated vehicles (CAV) from a dedicated lane. Our approach applies a cooperative lane change that prioritizes the decisions of lane changing vehicles which are closer to the end of the diverging zone (DZ), and optimizes the predicted total system travel time. Our experiments on synthetic… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

  35. arXiv:2012.06185  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Exploring wav2vec 2.0 on speaker verification and language identification

    Authors: Zhiyun Fan, Meng Li, Shiyu Zhou, Bo Xu

    Abstract: Wav2vec 2.0 is a recently proposed self-supervised framework for speech representation learning. It follows a two-stage training process of pre-training and fine-tuning, and performs well in speech recognition tasks especially ultra-low resource cases. In this work, we attempt to extend self-supervised framework to speaker verification and language identification. First, we use some preliminary ex… ▽ More

    Submitted 14 January, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: Self-supervised, speaker verification, language identification, multi-task learning, wav2vec 2.0

  36. arXiv:2010.15560  [pdf, other

    eess.IV cs.CV

    Genetic U-Net: Automatically Designed Deep Networks for Retinal Vessel Segmentation Using a Genetic Algorithm

    Authors: Jiahong Wei, Zhun Fan

    Abstract: Recently, many methods based on hand-designed convolutional neural networks (CNNs) have achieved promising results in automatic retinal vessel segmentation. However, these CNNs remain constrained in capturing retinal vessels in complex fundus images. To improve their segmentation performance, these CNNs tend to have many parameters, which may lead to overfitting and high computational complexity.… ▽ More

    Submitted 11 June, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  37. arXiv:2010.10691  [pdf, other

    cs.SD cs.LG eess.AS

    Prediction of Object Geometry from Acoustic Scattering Using Convolutional Neural Networks

    Authors: Ziqi Fan, Vibhav Vineet, Chenshen Lu, T. W. Wu, Kyla McMullen

    Abstract: Acoustic scattering is strongly influenced by boundary geometry of objects over which sound scatters. The present work proposes a method to infer object geometry from scattering features by training convolutional neural networks. The training data is generated from a fast numerical solver developed on CUDA. The complete set of simulations is sampled to generate multiple datasets containing differe… ▽ More

    Submitted 10 February, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted by ICASSP 2021

  38. arXiv:2010.08682  [pdf, other

    cs.CV cs.LG eess.IV

    MeshMVS: Multi-View Stereo Guided Mesh Reconstruction

    Authors: Rakesh Shrestha, Zhiwen Fan, Qingkun Su, Zuozhuo Dai, Siyu Zhu, ** Tan

    Abstract: Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. These color image semantics only implicitly encode 3D information, potentially limiting the accuracy of the generated shapes. In this paper we propose a multi-view mesh generation method which incorporates geometry… ▽ More

    Submitted 11 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  39. arXiv:2007.00477  [pdf

    cs.CV cs.LG eess.IV

    Automatic Crack Detection on Road Pavements Using Encoder Decoder Architecture

    Authors: Zhun Fan, Chong Li, Ying Chen, Jiahong Wei, Giuseppe Loprencipe, Xiaopeng Chen, Paola Di Mascio

    Abstract: Inspired by the development of deep learning in computer vision and object detection, the proposed algorithm considers an encoder-decoder architecture with hierarchical feature learning and dilated convolution, named U-Hierarchical Dilated Network (U-HDN), to perform crack detection in an end-to-end method. Crack characteristics with multiple context information are automatically able to learn and… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  40. arXiv:2002.06817  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Addressing the confounds of accompaniments in singer identification

    Authors: Tsung-Han Hsieh, Kai-Hsiang Cheng, Zhe-Cheng Fan, Yu-Ching Yang, Yi-Hsuan Yang

    Abstract: Identifying singers is an important task with many applications. However, the task remains challenging due to many issues. One major issue is related to the confounding factors from the background instrumental music that is mixed with the vocals in music production. A singer identification model may learn to extract non-vocal related features from the instrumental part of the songs, if a singer on… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

  41. arXiv:2002.03241  [pdf

    cs.CV cs.LG eess.IV

    Ensemble of Deep Convolutional Neural Networks for Automatic Pavement Crack Detection and Measurement

    Authors: Zhun Fan, Chong Li, Ying Chen, Paola Di Mascio, Xiaopeng Chen, Guijie Zhu, Giuseppe Loprencipe

    Abstract: Automated pavement crack detection and measurement are important road issues. Agencies have to guarantee the improvement of road safety. Conventional crack detection and measurement algorithms can be extremely time-consuming and low efficiency. Therefore, recently, innovative algorithms have received increased attention from researchers. In this paper, we propose an ensemble of convolutional neura… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

  42. arXiv:2001.06678  [pdf

    eess.IV cs.CV

    Evolutionary Neural Architecture Search for Retinal Vessel Segmentation

    Authors: Zhun Fan, Jiahong Wei, Guijie Zhu, Jiajie Mo, Wenji Li

    Abstract: The accurate retinal vessel segmentation (RVS) is of great significance to assist doctors in the diagnosis of ophthalmology diseases and other systemic diseases. Manually designing a valid neural network architecture for retinal vessel segmentation requires high expertise and a large workload. In order to improve the performance of vessel segmentation and reduce the workload of manually designing… ▽ More

    Submitted 18 March, 2020; v1 submitted 18 January, 2020; originally announced January 2020.

  43. arXiv:2001.01557  [pdf, other

    cs.CL cs.SD eess.AS

    Speaker-aware speech-transformer

    Authors: Zhiyun Fan, Jie Li, Shiyu Zhou, Bo Xu

    Abstract: Recently, end-to-end (E2E) models become a competitive alternative to the conventional hybrid automatic speech recognition (ASR) systems. However, they still suffer from speaker mismatch in training and testing condition. In this paper, we use Speech-Transformer (ST) as the study platform to investigate speaker aware training of E2E models. We propose a model called Speaker-Aware Speech-Transforme… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

  44. arXiv:1912.11000  [pdf

    eess.IV cs.CV

    Fully Automated Multi-Organ Segmentation in Abdominal Magnetic Resonance Imaging with Deep Neural Networks

    Authors: Yuhua Chen, Dan Ruan, Jiayu Xiao, Lixia Wang, Bin Sun, Rola Saouaf, Wensha Yang, Debiao Li, Zhaoyang Fan

    Abstract: Segmentation of multiple organs-at-risk (OARs) is essential for radiation therapy treatment planning and other clinical applications. We developed an Automated deep Learning-based Abdominal Multi-Organ segmentation (ALAMO) framework based on 2D U-net and a densely connected network structure with tailored design in data augmentation and training procedures such as deep connection, auxiliary superv… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: 21 pages, 4 figures, submitted to the journal Medical Physics

  45. arXiv:1912.03696  [pdf, other

    cs.OH eess.IV

    High-Freedom Inverse Design with Deep Neural Network for Metasurface Filter in the Visible

    Authors: Xiao Han, Ziyang Fan, Chao Li, Zeyang Liu, L. Jay Guo

    Abstract: In order to obtain a metasurface structure capable of filtering the light of a specific wavelength in the visible band, traditional method usually traverses the space consisting of possible designs, searching for a potentially satisfying device by performing iterative calculations to solve Maxwell's equations. In this paper, we propose a neural network that can complete an inverse design process t… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  46. arXiv:1911.01802  [pdf, other

    eess.AS cs.LG cs.SD eess.IV eess.SP

    Fast acoustic scattering using convolutional neural networks

    Authors: Ziqi Fan, Vibhav Vineet, Hannes Gamper, Nikunj Raghuvanshi

    Abstract: Diffracted scattering and occlusion are important acoustic effects in interactive auralization and noise control applications, typically requiring expensive numerical simulation. We propose training a convolutional neural network to map from a convex scatterer's cross-section to a 2D slice of the resulting spatial loudness distribution. We show that employing a full-resolution residual network for… ▽ More

    Submitted 15 February, 2020; v1 submitted 30 October, 2019; originally announced November 2019.

    Comments: Accepted by ICASSP 2020

  47. arXiv:1910.12418  [pdf, other

    cs.SD cs.CL eess.AS

    Unsupervised pre-training for sequence to sequence speech recognition

    Authors: Zhiyun Fan, Shiyu Zhou, Bo Xu

    Abstract: This paper proposes a novel approach to pre-train encoder-decoder sequence-to-sequence (seq2seq) model with unpaired speech and transcripts respectively. Our pre-training method is divided into two stages, named acoustic pre-trianing and linguistic pre-training. In the acoustic pre-training stage, we use a large amount of speech to pre-train the encoder by predicting masked speech feature chunks w… ▽ More

    Submitted 1 January, 2020; v1 submitted 27 October, 2019; originally announced October 2019.

  48. arXiv:1906.12193  [pdf, other

    eess.IV cs.CV

    Accurate Retinal Vessel Segmentation via Octave Convolution Neural Network

    Authors: Zhun Fan, Jiajie Mo, Benzhang Qiu, Wenji Li, Guijie Zhu, Chong Li, Jianye Hu, Yibiao Rong, Xinjian Chen

    Abstract: Retinal vessel segmentation is a crucial step in diagnosing and screening various diseases, including diabetes, ophthalmologic diseases, and cardiovascular diseases. In this paper, we propose an effective and efficient method for vessel segmentation in color fundus images using encoder-decoder based octave convolution networks. Compared with other convolution networks utilizing standard convolutio… ▽ More

    Submitted 22 September, 2020; v1 submitted 28 June, 2019; originally announced June 2019.

  49. arXiv:1906.00891  [pdf, other

    cs.CV eess.SY

    Automated Steel Bar Counting and Center Localization with Convolutional Neural Networks

    Authors: Zhun Fan, Jiewei Lu, Benzhang Qiu, Tao Jiang, Kang An, Alex Noel Josephraj, Chuliang Wei

    Abstract: Automated steel bar counting and center localization plays an important role in the factory automation of steel bars. Traditional methods only focus on steel bar counting and their performances are often limited by complex industrial environments. Convolutional neural network (CNN), which has great capability to deal with complex tasks in challenging environments, is applied in this work. A framew… ▽ More

    Submitted 19 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Ready to submit IEEE Transactions on Industrial Informatics

  50. Guaranteed-cost consensus for multiagent networks with Lipschitz nonlinear dynamics and switching topologies

    Authors: Jianxiang Xi, Zhiliang Fan, Hao Liu, Tang Zheng

    Abstract: Guaranteed-cost consensus for high-order nonlinear multi-agent networks with switching topologies is investigated. By constructing a time-varying nonsingular matrix with a specific structure, the whole dynamics of multi-agent networks is decomposed into the consensus and disagreement parts with nonlinear terms, which is the key challenge to be dealt with. An explicit expression of the consensus dy… ▽ More

    Submitted 22 February, 2018; originally announced February 2018.

    Comments: 16 pages