Skip to main content

Showing 1–50 of 56 results for author: Zhu, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.07536  [pdf, other

    cs.RO eess.SY

    Multi-AUV Kinematic Task Assignment based on Self-organizing Map Neural Network and Dubins Path Generator

    Authors: Xin Li, Wenyang Gan, Pang Wen, Daqi Zhu

    Abstract: To deal with the task assignment problem of multi-AUV systems under kinematic constraints, which means steering capability constraints for underactuated AUVs or other vehicles likely, an improved task assignment algorithm is proposed combining the Dubins Path algorithm with improved SOM neural network algorithm. At first, the aimed tasks are assigned to the AUVs by improved SOM neural network meth… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2404.00327  [pdf, other

    eess.IV cs.CV cs.LG

    YNetr: Dual-Encoder architecture on Plain Scan Liver Tumors (PSLT)

    Authors: Wen Sheng, Zhong Zheng, Jiajun Liu, Han Lu, Hanyuan Zhang, Zhengyong Jiang, Zhihong Zhang, Dao** Zhu

    Abstract: Background: Liver tumors are abnormal growths in the liver that can be either benign or malignant, with liver cancer being a significant health concern worldwide. However, there is no dataset for plain scan segmentation of liver tumors, nor any related algorithms. To fill this gap, we propose Plain Scan Liver Tumors(PSLT) and YNetr. Methods: A collection of 40 liver tumor plain scan segmentation d… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 15 pages

  3. arXiv:2402.06841  [pdf

    eess.IV cs.CV

    Point cloud-based registration and image fusion between cardiac SPECT MPI and CTA

    Authors: Shaojie Tang, Penpen Miao, Xingyu Gao, Yu Zhong, Dantong Zhu, Haixing Wen, Zhihui Xu, Qiuyue Wei, Hong** Yao, Xin Huang, Rui Gao, Chen Zhao, Weihua Zhou

    Abstract: A method was proposed for the point cloud-based registration and image fusion between cardiac single photon emission computed tomography (SPECT) myocardial perfusion images (MPI) and cardiac computed tomography angiograms (CTA). Firstly, the left ventricle (LV) epicardial regions (LVERs) in SPECT and CTA images were segmented by using different U-Net neural networks trained to generate the point c… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  4. arXiv:2401.05521  [pdf, other

    cs.RO cs.AI eess.SY

    Current Effect-eliminated Optimal Target Assignment and Motion Planning for a Multi-UUV System

    Authors: Danjie Zhu, Simon X. Yang

    Abstract: The paper presents an innovative approach (CBNNTAP) that addresses the complexities and challenges introduced by ocean currents when optimizing target assignment and motion planning for a multi-unmanned underwater vehicle (UUV) system. The core of the proposed algorithm involves the integration of several key components. Firstly, it incorporates a bio-inspired neural network-based (BINN) approach… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: This paper was accepted by IEEE Transactions on Intelligent Transportation Systems

  5. Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters

    Authors: Dafeng Zhu, Bo Yang, Yu Wu, Haoran Deng, Zhaoyang Dong, Kai Ma, ** Guan

    Abstract: This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Smart Grid, 2023

  6. arXiv:2312.05256  [pdf, other

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, **gyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang , et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor… ▽ More

    Submitted 10 November, 2023; originally announced December 2023.

  7. arXiv:2310.06162  [pdf

    eess.IV

    Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation

    Authors: Mohammad Peivandi, Jason Zhang, Michael Lu, Dongxiao Zhu, Zhifeng Kou

    Abstract: Brain tumor segmentation presents a formidable challenge in the field of Medical Image Segmentation. While deep-learning models have been useful, human expert segmentation remains the most accurate method. The recently released Segment Anything Model (SAM) has opened up the opportunity to apply foundation models to this difficult task. However, SAM was primarily trained on diverse natural images.… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  8. arXiv:2308.08449  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Improving CTC-AED model with integrated-CTC and auxiliary loss regularization

    Authors: Daobin Zhu, Xiangdong Su, Hongbin Zhang

    Abstract: Connectionist temporal classification (CTC) and attention-based encoder decoder (AED) joint training has been widely applied in automatic speech recognition (ASR). Unlike most hybrid models that separately calculate the CTC and AED losses, our proposed integrated-CTC utilizes the attention mechanism of AED to guide the output of CTC. In this paper, we employ two fusion methods, namely direct addit… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  9. arXiv:2308.01138  [pdf, other

    cs.LG cs.AI eess.SP

    Can We Transfer Noise Patterns? A Multi-environment Spectrum Analysis Model Using Generated Cases

    Authors: Haiwen Du, Zheng Ju, Yu An, Honghui Du, Dongjie Zhu, Zhaoshuo Tian, Aonghus Lawlor, Ruihai Dong

    Abstract: Spectrum analysis systems in online water quality testing are designed to detect types and concentrations of pollutants and enable regulatory agencies to respond promptly to pollution incidents. However, spectral data-based testing devices suffer from complex noise patterns when deployed in non-laboratory environments. To make the analysis model applicable to more environments, we propose a noise… ▽ More

    Submitted 14 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  10. arXiv:2307.07807  [pdf, other

    eess.IV cs.CV

    MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis

    Authors: Junyu Li, Han Huang, Dong Ni, Wufeng Xue, Dongmei Zhu, Jun Cheng

    Abstract: Early diagnosis of renal cancer can greatly improve the survival rate of patients. Contrast-enhanced ultrasound (CEUS) is a cost-effective and non-invasive imaging technique and has become more and more frequently used for renal tumor diagnosis. However, the classification of benign and malignant renal tumors can still be very challenging due to the highly heterogeneous appearance of cancer and im… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: MICCAI 2023

  11. arXiv:2307.02514  [pdf, other

    eess.AS cs.AI cs.SD

    Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

    Authors: Hongmin Cai, Xiaoke Huang, Zhengliang Liu, Wenxiong Liao, Haixing Dai, Zihao Wu, Dajiang Zhu, Hui Ren, Quanzheng Li, Tianming Liu, Xiang Li

    Abstract: Alzheimer's disease (AD) is a common form of dementia that severely impacts patient health. As AD impairs the patient's language understanding and expression ability, the speech of AD patients can serve as an indicator of this disease. This study investigates various methods for detecting AD using patients' speech and transcripts data from the DementiaBank Pitt database. The proposed approach invo… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  12. arXiv:2306.11730  [pdf, other

    eess.IV cs.CV cs.LG

    Segment Anything Model (SAM) for Radiation Oncology

    Authors: Lian Zhang, Zhengliang Liu, Lu Zhang, Zihao Wu, Xiaowei Yu, Jason Holmes, Hongying Feng, Haixing Dai, Xiang Li, Quanzheng Li, Dajiang Zhu, Tianming Liu, Wei Liu

    Abstract: In this study, we evaluate the performance of the Segment Anything Model (SAM) in clinical radiotherapy. Our results indicate that SAM's 'segment anything' mode can achieve clinically acceptable segmentation results in most organs-at-risk (OARs) with Dice scores higher than 0.7. SAM's 'box prompt' mode further improves the Dice scores by 0.1 to 0.5. Considering the size of the organ and the clarit… ▽ More

    Submitted 4 July, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  13. arXiv:2301.01827  [pdf, other

    cs.RO cs.AI eess.SY

    A GOA-Based Fault-Tolerant Trajectory Tracking Control for an Underwater Vehicle of Multi-Thruster System without Actuator Saturation

    Authors: Danjie Zhu, Lei Wang, Hua Zhang, Simon X. Yang

    Abstract: This paper proposes an intelligent fault-tolerant control (FTC) strategy to tackle the trajectory tracking problem of an underwater vehicle (UV) under thruster damage (power loss) cases and meanwhile resolve the actuator saturation brought by the vehicle's physical constraints. In the proposed control strategy, the trajectory tracking component is formed by a refined backstep** algorithm that co… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.01706

  14. arXiv:2212.02084  [pdf, other

    cs.SD eess.AS

    End-to-end Recording Device Identification Based on Deep Representation Learning

    Authors: Chunyan Zeng, Dongliang Zhu, Zhifeng Wang, Minghu Wu, Wei Xiong, Nan Zhao

    Abstract: Deep learning techniques have achieved specific results in recording device source identification. The recording device source features include spatial information and certain temporal information. However, most recording device source identification methods based on deep learning only use spatial representation learning from recording device source features, which cannot make full use of recordin… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: 20 pages, 5 figures, recording device identification

  15. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, **gang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, **woo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  16. arXiv:2211.05256  [pdf, other

    eess.IV cs.CV

    Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang , et al. (29 additional authors not shown)

    Abstract: Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this prob… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.08826, arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.03885

  17. arXiv:2210.08218  [pdf

    cs.IT eess.SP

    Massive MIMO Evolution Towards 3GPP Release 18

    Authors: Huang** **, Kunpeng Liu, Gilwon Lee, Emad J. Farag, Min Zhang, Dalin Zhu, Leiming Zhang, Eko Onggosanusi, Mansoor Shafi, Harsh Tataria

    Abstract: Since the introduction of fifth-generation new radio (5G-NR) in Third Generation Partnership Project (3GPP) Release 15, swift progress has been made to evolve 5G with 3GPP Release 18 emerging. A critical aspect is the design of massive multiple-input multiple-output (MIMO) technology. In this line, this paper makes several important contributions: We provide a comprehensive overview of the evoluti… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: 23 pages, 37 Figures, one fig in the annex

  18. arXiv:2210.03189  [pdf, other

    eess.IV cs.CV

    FocalUNETR: A Focal Transformer for Boundary-aware Segmentation of CT Images

    Authors: Chengyin Li, Yao Qiang, Rafi Ibn Sultan, Hassan Bagher-Ebadian, Prashant Khanduri, Indrin J. Chetty, Dongxiao Zhu

    Abstract: Computed Tomography (CT) based precise prostate segmentation for treatment planning is challenging due to (1) the unclear boundary of the prostate derived from CT's poor soft tissue contrast and (2) the limitation of convolutional neural network-based models in capturing long-range global context. Here we propose a novel focal transformer-based image segmentation architecture to effectively and ef… ▽ More

    Submitted 18 July, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 13 pages, 3 figures, 2 tables

  19. arXiv:2210.01706  [pdf, other

    cs.RO cs.AI eess.SY

    A Fuzzy Logic-based Cascade Control without Actuator Saturation for the Unmanned Underwater Vehicle Trajectory Tracking

    Authors: Danjie Zhu, Simon X. Yang, Mohammad Biglarbegian

    Abstract: An intelligent control strategy is proposed to eliminate the actuator saturation problem that exists in the trajectory tracking process of unmanned underwater vehicles (UUV). The control strategy consists of two parts: for the kinematic modeling part, a fuzzy logic-refined backstep** control is developed to achieve control velocities within acceptable ranges and errors of small fluctuations; on… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  20. arXiv:2209.13647  [pdf

    eess.SP cs.LG

    Deep learning based sferics recognition for AMT data processing in the dead band

    Authors: Enhua Jiang, Rujun Chen, Xinming Wu, Jianxin Liu, Debin Zhu, Weiqiang Liu

    Abstract: In the audio magnetotellurics (AMT) sounding data processing, the absence of sferic signals in some time ranges typically results in a lack of energy in the AMT dead band, which may cause unreliable resistivity estimate. We propose a deep convolutional neural network (CNN) to automatically recognize sferic signals from redundantly recorded data in a long time range and use them to compensate for t… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

  21. arXiv:2209.04326  [pdf, other

    eess.IV cs.CV cs.LG

    Saliency Guided Adversarial Training for Learning Generalizable Features with Applications to Medical Imaging Classification System

    Authors: Xin Li, Yao Qiang, Chengyin Li, Sijia Liu, Dongxiao Zhu

    Abstract: This work tackles a central machine learning problem of performance degradation on out-of-distribution (OOD) test sets. The problem is particularly salient in medical imaging based diagnosis system that appears to be accurate but fails when tested in new hospitals/datasets. Recent studies indicate the system might learn shortcut and non-relevant features instead of generalizable features, so-calle… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 9 pages, 3 figures

    Journal ref: AdvML Frontiers workshop at 39th International Conference on Machine Learning (ICML), Baltimore, Maryland, USA, 2022

  22. Optimization of rule-based energy management strategies for hybrid vehicles using dynamic programming

    Authors: Di Zhu, Ewan Pritchard, Sumanth Reddy Dadam, Vivek Kumar, Yang Xu

    Abstract: Reducing energy consumption is a key focus for hybrid electric vehicle (HEV) development. The popular vehicle dynamic model used in many energy management optimization studies does not capture the vehicle dynamics that the in-vehicle measurement system does. However, feedback from the measurement system is what the vehicle controller actually uses to manage energy consumption. Therefore, the optim… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  23. Motion Planning and Tracking Control of Unmanned Underwater Vehicles: Technologies, Challenges and Prospects

    Authors: Danjie Zhu, Tao Yan, Simon X. Yang

    Abstract: The motion planning and tracking control techniques of unmanned underwater vehicles (UUV) are fundamentally significant for efficient and robust UUV navigation, which is crucial for underwater rescue, facility maintenance, marine resource exploration, aquatic recreation, etc. Studies on UUV motion planning and tracking control have been growing rapidly worldwide, which are usually sorted into the… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  24. arXiv:2206.12420  [pdf, other

    cs.LG eess.SP

    SCAI: A Spectral data Classification framework with Adaptive Inference for the IoT platform

    Authors: Yundong Sun, Dongjie Zhu, Haiwen Du, Yansong Wang, Zhaoshuo Tian

    Abstract: Currently, it is a hot research topic to realize accurate, efficient, and real-time identification of massive spectral data with the help of deep learning and IoT technology. Deep neural networks played a key role in spectral analysis. However, the inference of deeper models is performed in a static manner, and cannot be adjusted according to the device. Not all samples need to allocate all comput… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 14 pages,11 figures

  25. Bio-inspired Neural Network-based Optimal Path Planning for UUVs under the Effect of Ocean Currents

    Authors: Danjie Zhu, Simon X. Yang

    Abstract: To eliminate the effect of ocean currents when addressing the optimal path in the underwater environment, an intelligent algorithm designed for the unmanned underwater vehicle (UUV) is proposed in this paper. The algorithm consists of two parts: a neural network-based algorithm that deducts the shortest path and avoids all possible collisions; and an adjusting component that balances off the devia… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  26. Bio-inspired Intelligence with Applications to Robotics: A Survey

    Authors: Junfei Li, Zhe Xu, Danjie Zhu, Kevin Dong, Tao Yan, Zhu Zeng, Simon X. Yang

    Abstract: In the past decades, considerable attention has been paid to bio-inspired intelligence and its applications to robotics. This paper provides a comprehensive survey of bio-inspired intelligence, with a focus on neurodynamics approaches, to various robotic applications, particularly to path planning and control of autonomous robotic systems. Firstly, the bio-inspired shunting model and its variants… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  27. arXiv:2206.04264  [pdf, other

    eess.SY

    Formation Tracking for a Multi-Auv System Based on an Adaptive Sliding Mode Method in the Water Flow Environment

    Authors: Xin Li, Daqi Zhu, Bing Sun, Qi Chen, Wenyang Gan, Zhigang Li

    Abstract: In this paper, formation tracking for a multi-AUV system (MAS) using an improved adaptive sliding mode control method is studied in the Three Dimensional (3-D) underwater environment. Firstly, the kinematics model and the dynamic model of the AUVs are given as the Six Dimensions of Freedom (6-DOF) considered. Then, control law based on the mathematical model of the AUVs is proposed based on the im… ▽ More

    Submitted 17 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

  28. arXiv:2205.12633  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, ** Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang , et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  29. arXiv:2205.10605  [pdf, other

    q-bio.NC cs.CV eess.IV

    Brain Cortical Functional Gradients Predict Cortical Folding Patterns via Attention Mesh Convolution

    Authors: Li Yang, Zhibin He, Changhe Li, Junwei Han, Dajiang Zhu, Tianming Liu, Tuo Zhang

    Abstract: Since gyri and sulci, two basic anatomical building blocks of cortical folding patterns, were suggested to bear different functional roles, a precise map** from brain function to gyro-sulcal patterns can provide profound insights into both biological and artificial neural networks. However, there lacks a generic theory and effective computational model so far, due to the highly nonlinear relatio… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  30. arXiv:2205.09576  [pdf, other

    cs.CV cs.AI cs.LG eess.IV q-bio.NC

    Discovering Dynamic Functional Brain Networks via Spatial and Channel-wise Attention

    Authors: Yiheng Liu, Enjie Ge, Mengshen He, Zhengliang Liu, Shijie Zhao, Xintao Hu, Dajiang Zhu, Tianming Liu, Bao Ge

    Abstract: Using deep learning models to recognize functional brain networks (FBNs) in functional magnetic resonance imaging (fMRI) has been attracting increasing interest recently. However, most existing work focuses on detecting static FBNs from entire fMRI signals, such as correlation-based functional connectivity. Sliding-window is a widely used strategy to capture the dynamics of FBNs, but it is still l… ▽ More

    Submitted 31 May, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 12 pages,6 figures, submitted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

    ACM Class: I.2.m

  31. arXiv:2204.04088  [pdf, other

    eess.SY

    Stochastic Gradient-based Fast Distributed Multi-Energy Management for an Industrial Park with Temporally-Coupled Constraints

    Authors: Dafeng Zhu, Bo Yang, Chengbin Ma, Zhaojian Wang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Contemporary industrial parks are challenged by the growing concerns about high cost and low efficiency of energy supply. Moreover, in the case of uncertain supply/demand, how to mobilize delay-tolerant elastic loads and compensate real-time inelastic loads to match multi-energy generation/storage and minimize energy cost is a key issue. Since energy management is hardly to be implemented offline… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by Applied Energy

  32. arXiv:2202.11784  [pdf, other

    cs.RO eess.SY

    Design and experimental investigation of a vibro-impact self-propelled capsule robot with orientation control

    Authors: Jiajia Zhang, Jiyuan Tian, Dibin Zhu, Yang Liu, Shyam Prasad

    Abstract: This paper presents a novel design and experimental investigation for a self-propelled capsule robot that can be used for painless colonoscopy during a retrograde progression from the patient's rectum. The steerable robot is driven forward and backward via its internal vibration and impact with orientation control by using an electromagnetic actuator. The actuator contains four sets of coils and a… ▽ More

    Submitted 1 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: ICRA 2022 Conference paper

  33. Energy Management Based on Multi-Agent Deep Reinforcement Learning for A Multi-Energy Industrial Park

    Authors: Dafeng Zhu, Bo Yang, Yuxiang Liu, Zhaojian Wang, Kai Ma, ** Guan

    Abstract: Owing to large industrial energy consumption, industrial production has brought a huge burden to the grid in terms of renewable energy access and power supply. Due to the coupling of multiple energy sources and the uncertainty of renewable energy and demand, centralized methods require large calculation and coordination overhead. Thus, this paper proposes a multi-energy management framework achiev… ▽ More

    Submitted 11 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted by Applied Energy

    Journal ref: Applied Energy 311 (2022) 118636

  34. Data Driven based Dynamic Correction Prediction Model for NOx Emission of Coal Fired Boiler

    Authors: Zhenhao Tang, Deyu Zhu, Yang Li

    Abstract: The real-time prediction of NOx emissions is of great significance for pollutant emission control and unit operation of coal-fired power plants. Aiming at dealing with the large time delay and strong nonlinear characteristics of the combustion process, a dynamic correction prediction model considering the time delay is proposed. First, the maximum information coefficient (MIC) is used to calculate… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: in Chinese language, Accepted by Proceedings of the CSEE

    Journal ref: Proceedings of the CSEE 42 (2022) 5182-5193

  35. arXiv:2110.14209  [pdf, ps, other

    eess.SY

    Fast Distributed Stochastic Scheduling for A Multi-Energy Industrial Park

    Authors: Dafeng Zhu, Bo Yang, Zhaojian Wang, Chengbin Ma, Kai Ma, Shanying Zhu

    Abstract: The multi-energy management framework of industrial parks advocates energy conversion and scheduling, which takes full advantage of the compensation and temporal availability of multiple energy. However, how to exploit elastic loads and compensate inelastic loads to match multiple generators and storage is still a key problem under the uncertainty of demand and supply. To solve the issue, the ener… ▽ More

    Submitted 24 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  36. arXiv:2109.06094  [pdf, other

    cs.CV cs.LG eess.IV

    Single-stream CNN with Learnable Architecture for Multi-source Remote Sensing Data

    Authors: Yi Yang, Daoye Zhu, Tengteng Qu, Qiangyu Wang, Fuhu Ren, Chengqi Cheng

    Abstract: In this paper, we propose an efficient and generalizable framework based on deep convolutional neural network (CNN) for multi-source remote sensing data joint classification. While recent methods are mostly based on multi-stream architectures, we use group convolution to construct equivalent network architectures efficiently within a single-stream network. We further adopt and improve dynamic grou… ▽ More

    Submitted 6 February, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  37. arXiv:2103.09030  [pdf, other

    cs.CV eess.IV

    A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition

    Authors: Jianbang Liu, Yuqi Fang, Delong Zhu, Nachuan Ma, ** Pan, Max Q. -H. Meng

    Abstract: Human activities are hugely restricted by COVID-19, recently. Robots that can conduct inter-floor navigation attract much public attention, since they can substitute human workers to conduct the service work. However, current robots either depend on human assistance or elevator retrofitting, and fully autonomous inter-floor navigation is still not available. As the very first step of inter-floor n… ▽ More

    Submitted 22 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  38. arXiv:2103.02155  [pdf

    cs.CV cs.LG eess.IV

    Sensing population distribution from satellite imagery via deep learning: model selection, neighboring effect, and systematic biases

    Authors: Xiao Huang, Di Zhu, Fan Zhang, Tao Liu, Xiao Li, Lei Zou

    Abstract: The rapid development of remote sensing techniques provides rich, large-coverage, and high-temporal information of the ground, which can be coupled with the emerging deep learning approaches that enable latent features and hidden geographical patterns to be extracted. This study marks the first attempt to cross-compare performances of popular state-of-the-art deep learning models in estimating pop… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 15 pages. 10 figures, 3 tables

  39. arXiv:2102.03215  [pdf, other

    eess.SY cs.CR

    Security Assessment and Impact Analysis of Cyberattacks in Integrated T&D Power Systems

    Authors: Ioannis Zografopoulos, Charalambos Konstantinou, Nektarios Georgios Tsoutsos, Dan Zhu, Robert Broadwater

    Abstract: In this paper, we examine the impact of cyberattacks in an integrated transmission and distribution (T&D) power grid model with distributed energy resource (DER) integration. We adopt the OCTAVE Allegro methodology to identify critical system assets, enumerate potential threats, analyze, and prioritize risks for threat scenarios. Based on the analysis, attack strategies and exploitation scenarios… ▽ More

    Submitted 11 April, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

  40. arXiv:2101.10492  [pdf, other

    cs.CV eess.IV

    Fast Non-line-of-sight Imaging with Two-step Deep Remap**

    Authors: Dayu Zhu, Wenshan Cai

    Abstract: Conventional imaging only records photons directly sent from the object to the detector, while non-line-of-sight (NLOS) imaging takes the indirect light into account. Most NLOS solutions employ a transient scanning process, followed by a physical based algorithm to reconstruct the NLOS scenes. However, the transient detection requires sophisticated apparatus, with long scanning time and low robust… ▽ More

    Submitted 25 March, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

  41. arXiv:2101.00150  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-Grid Back-Projection Networks

    Authors: Pablo Navarrete Michelini, Wenbin Chen, Hanwen Liu, Dan Zhu, Xingqun Jiang

    Abstract: Multi-Grid Back-Projection (MGBP) is a fully-convolutional network architecture that can learn to restore images and videos with upscaling artifacts. Using the same strategy of multi-grid partial differential equation (PDE) solvers this multiscale architecture scales computational complexity efficiently with increasing output resolutions. The basic processing block is inspired in the iterative bac… ▽ More

    Submitted 31 December, 2020; originally announced January 2021.

    Comments: Accepted for publication in IEEE Journal of Selected Topics in Signal Processing (J-STSP). arXiv admin note: text overlap with arXiv:1809.10711

  42. Robust Retinal Vessel Segmentation from a Data Augmentation Perspective

    Authors: Xu Sun, Huihui Fang, Yehui Yang, Dongwei Zhu, Lei Wang, Junwei Liu, Yanwu Xu

    Abstract: Retinal vessel segmentation is a fundamental step in screening, diagnosis, and treatment of various cardiovascular and ophthalmic diseases. Robustness is one of the most critical requirements for practical utilization, since the test images may be captured using different fundus cameras, or be affected by various pathological changes. We investigate this problem from a data augmentation perspectiv… ▽ More

    Submitted 28 September, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

  43. arXiv:2006.13555  [pdf, other

    cs.LG cs.CV eess.IV

    Defending against adversarial attacks on medical imaging AI system, classification or detection?

    Authors: Xin Li, Deng Pan, Dongxiao Zhu

    Abstract: Medical imaging AI systems such as disease classification and segmentation are increasingly inspired and transformed from computer vision based AI systems. Although an array of adversarial training and/or loss function based defense techniques have been developed and proved to be effective in computer vision, defending against adversarial attacks on medical images remains largely an uncharted terr… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  44. arXiv:2006.05669  [pdf, other

    eess.SY cs.LG

    Interpretable Multimodal Learning for Intelligent Regulation in Online Payment Systems

    Authors: Shuoyao Wang, Diwei Zhu

    Abstract: With the explosive growth of transaction activities in online payment systems, effective and realtime regulation becomes a critical problem for payment service providers. Thanks to the rapid development of artificial intelligence (AI), AI-enable regulation emerges as a promising solution. One main challenge of the AI-enabled regulation is how to utilize multimedia information, i.e., multimodal sig… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: Accepted by IJCAI 2020. SOLE copyright holder is IJCAI (international Joint Conferences on Artificial Intelligence)

  45. Energy Trading in Microgrids for Synergies among Electricity, Hydrogen and Heat Networks

    Authors: Dafeng Zhu, Bo Yang, Qi Liu, Kai Ma, Shanying Zhu, ** Guan

    Abstract: The emerging paradigm of interconnected microgrids advocates energy trading or sharing among multiple microgrids. It helps make full use of the temporal availability of energy and diversity in operational costs when meeting various energy loads. However, energy trading might not completely absorb excess renewable energy. A multi-energy management framework including fuel cell vehicles, energy stor… ▽ More

    Submitted 11 June, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

  46. arXiv:2004.03042  [pdf, other

    eess.IV cs.CV cs.LG

    COVID-MobileXpert: On-Device COVID-19 Patient Triage and Follow-up using Chest X-rays

    Authors: Xin Li, Chengyin Li, Dongxiao Zhu

    Abstract: During the COVID-19 pandemic, there has been an emerging need for rapid, dedicated, and point-of-care COVID-19 patient disposition techniques to optimize resource utilization and clinical workflow. In view of this need, we present COVID-MobileXpert: a lightweight deep neural network (DNN) based mobile app that can use chest X-ray (CXR) for COVID-19 case screening and radiological trajectory predic… ▽ More

    Submitted 7 September, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: COVID-19, SARS-CoV-2, On-device Machine Learning, Chest X-Ray (CXR)

  47. arXiv:1912.11774  [pdf, other

    cs.RO cs.CV eess.IV

    Autonomous Removal of Perspective Distortion for Robotic Elevator Button Recognition

    Authors: Delong Zhu, Jianbang Liu, Nachuan Ma, Zhe Min, Max Q. -H. Meng

    Abstract: Elevator button recognition is considered an indispensable function for enabling the autonomous elevator operation of mobile robots. However, due to unfavorable image conditions and various image distortions, the recognition accuracy remains to be improved. In this paper, we present a novel algorithm that can autonomously correct perspective distortions of elevator panel images. The algorithm firs… ▽ More

    Submitted 25 December, 2019; originally announced December 2019.

  48. arXiv:1912.05971  [pdf, other

    eess.IV cs.CV cs.HC

    Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos

    Authors: Yucheng Zhu, Xiongkuo Min, DanDan Zhu, Ke Gu, Jiantao Zhou, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang

    Abstract: Augmented reality (AR) overlays digital content onto the reality. In AR system, correct and precise estimations of user's visual fixations and head movements can enhance the quality of experience by allocating more computation resources on the areas of interest. However, there is inadequate research about understanding the visual exploration of users when using an AR system or modeling AR visual a… ▽ More

    Submitted 20 July, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  49. arXiv:1909.12983  [pdf, other

    eess.IV cs.CV cs.LG

    MGBPv2: Scaling Up Multi-Grid Back-Projection Networks

    Authors: Pablo Navarrete Michelini, Wenbin Chen, Hanwen Liu, Dan Zhu

    Abstract: Here, we describe our solution for the AIM-2019 Extreme Super-Resolution Challenge, where we won the 1st place in terms of perceptual quality (MOS) similar to the ground truth and achieved the 5th place in terms of high-fidelity (PSNR). To tackle this challenge, we introduce the second generation of MultiGrid BackProjection networks (MGBPv2) whose major modifications make the system scalable and m… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

    Comments: In ICCV 2019 Workshops. Winner of Perceptual track in AIM Extreme Super-Resolution Challenge 2019. Code available at https://github.com/pnavarre/mgbpv2

  50. arXiv:1906.03691  [pdf, other

    eess.IV cs.LG stat.ML

    Interpreting Age Effects of Human Fetal Brain from Spontaneous fMRI using Deep 3D Convolutional Neural Networks

    Authors: Xiangrui Li, Jasmine Hect, Moriah Thomason, Dongxiao Zhu

    Abstract: Understanding human fetal neurodevelopment is of great clinical importance as abnormal development is linked to adverse neuropsychiatric outcomes after birth. Recent advances in functional Magnetic Resonance Imaging (fMRI) have provided new insight into development of the human brain before birth, but these studies have predominately focused on brain functional connectivity (i.e. Fisher z-score),… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 9 pages