Skip to main content

Showing 1–48 of 48 results for author: Pan, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.17004  [pdf, other

    cs.CV eess.IV

    Efficient Visual Fault Detection for Freight Train via Neural Architecture Search with Data Volume Robustness

    Authors: Yang Zhang, Mingying Li, Huilin Pan, Moyun Liu, Yang Zhou

    Abstract: Deep learning-based fault detection methods have achieved significant success. In visual fault detection of freight trains, there exists a large characteristic difference between inter-class components (scale variance) but intra-class on the contrary, which entails scale-awareness for detectors. Moreover, the design of task-specific networks heavily relies on human expertise. As a consequence, neu… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures

  2. arXiv:2405.13901  [pdf, other

    cs.CV cs.LG eess.SP

    DCT-Based Decorrelated Attention for Vision Transformers

    Authors: Hongyi Pan, Emadeldeen Hamdan, Xin Zhu, Koushik Biswas, Ahmet Enis Cetin, Ulas Bagci

    Abstract: Central to the Transformer architectures' effectiveness is the self-attention mechanism, a function that maps queries, keys, and values into a high-dimensional vector space. However, training the attention weights of queries, keys, and values is non-trivial from a state of random initialization. In this paper, we propose two methods. (i) We first address the initialization problem of Vision Transf… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.12367  [pdf, other

    eess.IV cs.CV

    Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning

    Authors: Zheyuan Zhang, Elif Keles, Gorkem Durak, Yavuz Taktak, Onkar Susladkar, Vandan Gorade, Debesh Jha, Asli C. Ormeci, Alpay Medetalibeyoglu, Lanhong Yao, Bin Wang, Ilkin Sevgi Isler, Linkai Peng, Hongyi Pan, Camila Lopes Vendrami, Amir Bourhani, Yury Velichko, Boqing Gong, Concetto Spampinato, Ayis Pyrros, Pallavi Tiwari, Derk C. F. Klatte, Megan Engels, Sanne Hoogenboom, Candice W. Bolan , et al. (13 additional authors not shown)

    Abstract: Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st… ▽ More

    Submitted 25 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: under review version

  4. arXiv:2405.06166  [pdf, other

    eess.IV cs.CV

    MDNet: Multi-Decoder Network for Abdominal CT Organs Segmentation

    Authors: Debesh Jha, Nikhil Kumar Tomar, Koushik Biswas, Gorkem Durak, Matthew Antalek, Zheyuan Zhang, Bin Wang, Md Mostafijur Rahman, Hongyi Pan, Alpay Medetalibeyoglu, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci

    Abstract: Accurate segmentation of organs from abdominal CT scans is essential for clinical applications such as diagnosis, treatment planning, and patient monitoring. To handle challenges of heterogeneity in organ shapes, sizes, and complex anatomical relationships, we propose a \textbf{\textit{\ac{MDNet}}}, an encoder-decoder network that uses the pre-trained \textit{MiT-B2} as the encoder and multiple di… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  5. arXiv:2405.01503  [pdf, other

    eess.IV cs.CV

    PAM-UNet: Shifting Attention on Region of Interest in Medical Images

    Authors: Abhijit Das, Debesh Jha, Vandan Gorade, Koushik Biswas, Hongyi Pan, Zheyuan Zhang, Daniela P. Ladner, Yury Velichko, Amir Borhani, Ulas Bagci

    Abstract: Computer-aided segmentation methods can assist medical personnel in improving diagnostic outcomes. While recent advancements like UNet and its variants have shown promise, they face a critical challenge: balancing accuracy with computational efficiency. Shallow encoder architectures in UNets often struggle to capture crucial spatial features, leading in inaccurate and sparse segmentation. To addre… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at 2024 IEEE EMBC

  6. arXiv:2403.15828  [pdf, other

    eess.SY

    TJCCT: A Two-timescale Approach for UAV-assisted Mobile Edge Computing

    Authors: Zemin Sun, Geng Sun, Qingqing Wu, Long He, Shuang Liang, Hongyang Pan, Dusit Niyato, Chau Yuen, Victor C. M. Leung

    Abstract: Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) is emerging as a promising paradigm to provide aerial-terrestrial computing services in close proximity to mobile devices (MDs). However, meeting the demands of computation-intensive and delay-sensitive tasks for MDs poses several challenges, including the demand-supply contradiction between MDs and MEC servers, the demand-supply h… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  7. arXiv:2403.12985  [pdf, other

    cs.IT eess.SP

    Multi-objective Optimization for Data Collection in UAV-assisted Agricultural IoT

    Authors: Lingling Liu, Aimin Wang, Geng Sun, Jiahui Li, Hongyang Pan, Tony Q. S. Quek

    Abstract: The ground fixed base stations (BSs) are often deployed inflexibly, and have high overheads, as well as are susceptible to the damage from natural disasters, making it impractical for them to continuously collect data from sensor devices. To improve the network coverage and performance of wireless communication, unmanned aerial vehicles (UAVs) have been introduced in diverse wireless networks, the… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 13 pages, 7 figures, 4 tables

  8. arXiv:2403.06532  [pdf, other

    eess.IV cs.CV q-bio.NC

    Reconstructing Visual Stimulus Images from EEG Signals Based on Deep Visual Representation Model

    Authors: Hongguang Pan, Zhuoyi Li, Yunpeng Fu, Xuebin Qin, Jianchen Hu

    Abstract: Reconstructing visual stimulus images is a significant task in neural decoding, and up to now, most studies consider the functional magnetic resonance imaging (fMRI) as the signal source. However, the fMRI-based image reconstruction methods are difficult to widely applied because of the complexity and high cost of the acquisition equipments. Considering the advantages of low cost and easy portabil… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  9. arXiv:2403.05024  [pdf, other

    eess.IV cs.CV cs.LG

    A Probabilistic Hadamard U-Net for MRI Bias Field Correction

    Authors: Xin Zhu, Hongyi Pan, Yury Velichko, Adam B. Murphy, Ashley Ross, Baris Turkbey, Ahmet Enis Cetin, Ulas Bagci

    Abstract: Magnetic field inhomogeneity correction remains a challenging task in MRI analysis. Most established techniques are designed for brain MRI by supposing that image intensities in the identical tissue follow a uniform distribution. Such an assumption cannot be easily applied to other organs, especially those that are small in size and heterogeneous in texture (large variations in intensity), such as… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  10. arXiv:2312.05832  [pdf, other

    cs.CV eess.IV

    Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

    Authors: Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

    Abstract: Despite the successful application of convolutional neural networks (CNNs) in object detection tasks, their efficiency in detecting faults from freight train images remains inadequate for implementation in real-world engineering scenarios. Existing modeling shortcomings of spatial invariance and pooling layers in conventional CNNs often ignore the neglect of crucial global information, resulting i… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 10 pages, 6 figures

  11. arXiv:2311.16116  [pdf, ps, other

    cs.NI eess.SY

    Resource Scheduling for UAVs-aided D2D Networks: A Multi-objective Optimization Approach

    Authors: Hongyang Pan, Yanheng Liu, Geng Sun, Pengfei Wang, Chau Yuen

    Abstract: Unmanned aerial vehicles (UAVs)-aided device-todevice (D2D) networks have attracted great interests with the development of 5G/6G communications, while there are several challenges about resource scheduling in UAVs-aided D2D networks. In this work, we formulate a UAVs-aided D2D network resource scheduling optimization problem (NetResSOP) to comprehensively consider the number of deployed UAVs, UAV… ▽ More

    Submitted 30 September, 2023; originally announced November 2023.

  12. arXiv:2310.02862  [pdf, other

    cs.LG cs.AI eess.SP

    A novel asymmetrical autoencoder with a sparsifying discrete cosine Stockwell transform layer for gearbox sensor data compression

    Authors: Xin Zhu, Daoguang Yang, Hongyi Pan, Hamid Reza Karimi, Didem Ozevin, Ahmet Enis Cetin

    Abstract: The lack of an efficient compression model remains a challenge for the wireless transmission of gearbox data in non-contact gear fault diagnosis problems. In this paper, we present a signal-adaptive asymmetrical autoencoder with a transform domain layer to compress sensor signals. First, a new discrete cosine Stockwell transform (DCST) layer is introduced to replace linear layers in a multi-layer… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  13. arXiv:2310.00396  [pdf, other

    eess.SY

    Joint Scheduling and Trajectory Optimization of Charging UAV in Wireless Rechargeable Sensor Networks

    Authors: Yanheng Liu, Hongyang Pan, Geng Sun, Aimin Wang, Jiahui Li, Shuang Liang

    Abstract: Wireless rechargeable sensor networks with a charging unmanned aerial vehicle (CUAV) have the broad application prospects in the power supply of the rechargeable sensor nodes (SNs). However, how to schedule a CUAV and design the trajectory to improve the charging efficiency of the entire system is still a vital problem. In this paper, we formulate a joint-CUAV scheduling and trajectory optimizatio… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  14. arXiv:2310.00384  [pdf, ps, other

    eess.SY

    Joint Power and 3D Trajectory Optimization for UAV-enabled Wireless Powered Communication Networks with Obstacles

    Authors: Hongyang Pan, Yanheng Liu, Geng Sun, Junsong Fan, Shuang Liang, Chau Yuen

    Abstract: Unmanned aerial vehicle (UAV)-enabled wireless powered communication networks (WPCNs) are promising technologies in 5G/6G wireless communications, while there are several challenges about UAV power allocation and scheduling to enhance the energy utilization efficiency, considering the existence of obstacles. In this work, we consider a UAV-enabled WPCN scenario that a UAV needs to cover the ground… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  15. arXiv:2309.12201  [pdf, other

    eess.SP cs.AI cs.LG

    Electroencephalogram Sensor Data Compression Using An Asymmetrical Sparse Autoencoder With A Discrete Cosine Transform Layer

    Authors: Xin Zhu, Hongyi Pan, Shuaiang Rong, Ahmet Enis Cetin

    Abstract: Electroencephalogram (EEG) data compression is necessary for wireless recording applications to reduce the amount of data that needs to be transmitted. In this paper, an asymmetrical sparse autoencoder with a discrete cosine transform (DCT) layer is proposed to compress EEG signals. The encoder module of the autoencoder has a combination of a fully connected linear layer and the DCT layer to reduc… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  16. arXiv:2309.09866  [pdf, other

    eess.IV cs.LG

    Domain Generalization with Fourier Transform and Soft Thresholding

    Authors: Hongyi Pan, Bin Wang, Zheyuan Zhang, Xin Zhu, Debesh Jha, Ahmet Enis Cetin, Concetto Spampinato, Ulas Bagci

    Abstract: Domain generalization aims to train models on multiple source domains so that they can generalize well to unseen target domains. Among many domain generalization methods, Fourier-transform-based domain generalization methods have gained popularity primarily because they exploit the power of Fourier transformation to capture essential patterns and regularities in the data, making the model more rob… ▽ More

    Submitted 12 December, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: This paper was accepted to ICASSP 2024

  17. arXiv:2309.08782  [pdf, other

    eess.SP

    Stein Variational Gradient Descent-based Detection For Random Access With Preambles In MTC

    Authors: Xin Zhu, Hongyi Pan, Salih Atici, Ahmet Enis Cetin

    Abstract: Traditional preamble detection algorithms have low accuracy in the grant-based random access scheme in massive machine-type communication (mMTC). We present a novel preamble detection algorithm based on Stein variational gradient descent (SVGD) at the second step of the random access procedure. It efficiently leverages deterministic updates of particles for continuous inference. To further enhance… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  18. arXiv:2307.02779  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Large Language Models Empowered Autonomous Edge AI for Connected Intelligence

    Authors: Yifei Shen, Jiawei Shao, Xinjie Zhang, Zehong Lin, Hao Pan, Dongsheng Li, Jun Zhang, Khaled B. Letaief

    Abstract: The evolution of wireless networks gravitates towards connected intelligence, a concept that envisions seamless interconnectivity among humans, objects, and intelligence in a hyper-connected cyber-physical world. Edge artificial intelligence (Edge AI) is a promising solution to achieve connected intelligence by delivering high-quality, low-latency, and privacy-preserving AI services at the network… ▽ More

    Submitted 25 December, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: IEEE Communication Magazine

  19. arXiv:2307.00701  [pdf, other

    cs.CV eess.IV

    Efficient Visual Fault Detection for Freight Train Braking System via Heterogeneous Self Distillation in the Wild

    Authors: Yang Zhang, Huilin Pan, Yang Zhou, Mingying Li, Guodong Sun

    Abstract: Efficient visual fault detection of freight trains is a critical part of ensuring the safe operation of railways under the restricted hardware environment. Although deep learning-based approaches have excelled in object detection, the efficiency of freight train fault detection is still insufficient to apply in real-world engineering. This paper proposes a heterogeneous self-distillation framework… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: 12 pages, 9 figures

  20. arXiv:2306.09650  [pdf, other

    cs.IT eess.SP

    Reconfigurable Intelligent Surface Assisted Semantic Communication Systems

    Authors: Jiajia Shi, Tse-Tin Chan, Haoyuan Pan, Tat-Ming Lok

    Abstract: Semantic communication, which focuses on conveying the meaning of information rather than exact bit reconstruction, has gained considerable attention in recent years. Meanwhile, reconfigurable intelligent surface (RIS) is a promising technology that can achieve high spectral and energy efficiency by dynamically reflecting incident signals through programmable passive components. In this paper, we… ▽ More

    Submitted 29 June, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  21. arXiv:2305.17510  [pdf, other

    cs.CV eess.SP

    A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer

    Authors: Hongyi Pan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

    Abstract: In this paper, we propose a novel Hadamard Transform (HT)-based neural network layer for hybrid quantum-classical computing. It implements the regular convolutional layers in the Hadamard transform domain. The idea is based on the HT convolution theorem which states that the dyadic convolution between two vectors is equivalent to the element-wise multiplication of their HT representation. Computin… ▽ More

    Submitted 22 February, 2024; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: To be presented at International Conference on Machine Learning (ICML), 2023

  22. arXiv:2305.11651  [pdf, other

    cs.IT cs.MA cs.PF eess.SY

    Channel Cycle Time: A New Measure of Short-term Fairness

    Authors: Pengfei Shen, Yulin Shao, Haoyuan Pan, Lu Lu, Yonina C. Eldar

    Abstract: This paper puts forth a new metric, dubbed channel cycle time (CCT), to measure the short-term fairness of communication networks. CCT characterizes the average duration between two consecutive successful transmissions of a user, during which all other users successfully accessed the channel at least once. In contrast to existing short-term fairness measures, CCT provides more comprehensive insigh… ▽ More

    Submitted 14 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  23. arXiv:2304.09373  [pdf, other

    eess.IV cs.CV

    Multi-scale Adaptive Fusion Network for Hyperspectral Image Denoising

    Authors: Haodong Pan, Feng Gao, Junyu Dong, Qian Du

    Abstract: Removing the noise and improving the visual quality of hyperspectral images (HSIs) is challenging in academia and industry. Great efforts have been made to leverage local, global or spectral context information for HSI denoising. However, existing methods still have limitations in feature interaction exploitation among multiple scales and rich spectral structure preservation. In view of this, we p… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: IEEE JSTASRS 2023, code at: https://github.com/summitgao/MAFNet

  24. arXiv:2304.05119  [pdf, other

    cs.IT eess.SP

    Device Activity Detection in mMTC with Low-Resolution ADC: A New Protocol

    Authors: Zhaorui Wang, Ya-Feng Liu, Ziyue Wang, Liang Liu, Haoyuan Pan, Shuguang Cui

    Abstract: This paper investigates the effect of low-resolution analog-to-digital converters (ADCs) on device activity detection in massive machine-type communications (mMTC). The low-resolution ADCs induce two challenges on the device activity detection compared with the traditional setup with the assumption of infinite ADC resolution. First, the codebook design for signal quantization by the low-resolution… ▽ More

    Submitted 13 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Submitted to IEEE for possible publication

  25. arXiv:2303.06797  [pdf, other

    cs.CV eess.IV eess.SP

    Multichannel Orthogonal Transform-Based Perceptron Layers for Efficient ResNets

    Authors: Hongyi Pan, Emadeldeen Hamdan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

    Abstract: In this paper, we propose a set of transform-based neural network layers as an alternative to the $3\times3$ Conv2D layers in Convolutional Neural Networks (CNNs). The proposed layers can be implemented based on orthogonal transforms such as the Discrete Cosine Transform (DCT), Hadamard transform (HT), and biorthogonal Block Wavelet Transform (BWT). Furthermore, by taking advantage of the convolut… ▽ More

    Submitted 22 April, 2024; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: This work is accepted to IEEE Transactions on Neural Networks and Learning Systems. The initial title is "Orthogonal Transform Domain Approaches for the Convolutional Layer". We changed it to "Multichannel Orthogonal Transform-Based Perceptron Layers for Efficient ResNets" based on reviewer's comment. arXiv admin note: text overlap with arXiv:2211.08577

  26. arXiv:2212.09921  [pdf, other

    cs.LG eess.SP

    Input Normalized Stochastic Gradient Descent Training of Deep Neural Networks

    Authors: Salih Atici, Hongyi Pan, Ahmet Enis Cetin

    Abstract: In this paper, we propose a novel optimization algorithm for training machine learning models called Input Normalized Stochastic Gradient Descent (INSGD), inspired by the Normalized Least Mean Squares (NLMS) algorithm used in adaptive filtering. When training complex models on large datasets, the choice of optimizer parameters, particularly the learning rate, is crucial to avoid divergence. Our al… ▽ More

    Submitted 26 June, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  27. arXiv:2212.00595  [pdf, other

    cs.CV eess.IV

    Ghost-free High Dynamic Range Imaging via Hybrid CNN-Transformer and Structure Tensor

    Authors: Yu Yuan, Jiaqi Wu, Zhongliang **g, Henry Leung, Han Pan

    Abstract: Eliminating ghosting artifacts due to moving objects is a challenging problem in high dynamic range (HDR) imaging. In this letter, we present a hybrid model consisting of a convolutional encoder and a Transformer decoder to generate ghost-free HDR images. In the encoder, a context aggregation network and non-local attention block are adopted to optimize multi-scale features and capture both global… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  28. arXiv:2211.14522  [pdf, other

    cs.CV eess.IV

    Visual Fault Detection of Multi-scale Key Components in Freight Trains

    Authors: Yang Zhang, Yang Zhou, Huilin Pan, Bo Wu, Guodong Sun

    Abstract: Fault detection for key components in the braking system of freight trains is critical for ensuring railway transportation safety. Despite the frequently employed methods based on deep learning, these fault detectors are highly reliant on hardware resources and are complex to implement. In addition, no train fault detectors consider the drop in accuracy induced by scale variation of fault parts. T… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: 9 pages, 4 figures

  29. arXiv:2211.09206  [pdf, other

    cs.CV eess.IV

    Learning to Kindle the Starlight

    Authors: Yu Yuan, Jiaqi Wu, Lindong Wang, Zhongliang **g, Henry Leung, Shuyuan Zhu, Han Pan

    Abstract: Capturing highly appreciated star field images is extremely challenging due to light pollution, the requirements of specialized hardware, and the high level of photographic skills needed. Deep learning-based techniques have achieved remarkable results in low-light image enhancement (LLIE) but have not been widely applied to star field image enhancement due to the lack of training data. To address… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  30. arXiv:2211.08577  [pdf, other

    cs.CV eess.IV

    DCT Perceptron Layer: A Transform Domain Approach for Convolution Layer

    Authors: Hongyi Pan, Xin Zhu, Salih Atici, Ahmet Enis Cetin

    Abstract: In this paper, we propose a novel Discrete Cosine Transform (DCT)-based neural network layer which we call DCT-perceptron to replace the $3\times3$ Conv2D layers in the Residual neural Network (ResNet). Convolutional filtering operations are performed in the DCT domain using element-wise multiplications by taking advantage of the Fourier and DCT Convolution theorems. A trainable soft-thresholding… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  31. arXiv:2211.08505  [pdf, other

    eess.IV

    Classification of the Cervical Vertebrae Maturation (CVM) stages Using the Tripod Network

    Authors: Salih Atici, Hongyi Pan, Mohammed H. Elnagar, Veerasathpurush Allareddy, Omar Suhaym, Rashid Ansari, Ahmet Enis Cetin

    Abstract: We present a novel deep learning method for fully automated detection and classification of the Cervical Vertebrae Maturation (CVM) stages. The deep convolutional neural network consists of three parallel networks (TriPodNet) independently trained with different initialization parameters. They also have a built-in set of novel directional filters that highlight the Cervical Verte edges in X-ray im… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  32. arXiv:2211.08491  [pdf, other

    eess.SP

    Real-time Wireless ECG-derived Respiration Rate Estimation Using an Autoencoder with a DCT Layer

    Authors: Hongyi Pan, Xin Zhu, Zhilu Ye, Pai-Yen Chen, Ahmet Enis Cetin

    Abstract: In this paper, we present a wireless ECG-derived Respiration Rate (RR) estimation using an autoencoder with a DCT Layer. The wireless wearable system records the ECG data of the subject and the respiration rate is determined from the variations in the baseline level of the ECG data. A straightforward Fourier analysis of the ECG data obtained using the wireless wearable system may lead to incorrect… ▽ More

    Submitted 16 February, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: This paper was accepted to ICASSP 2023

  33. A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains

    Authors: Guodong Sun, Yang Zhou, Huilin Pan, Bo Wu, Ye Hu, Yang Zhang

    Abstract: Real-time vision-based system of fault detection (RVBS-FD) for freight trains is an essential part of ensuring railway transportation safety. Most existing vision-based methods still have high computational costs based on convolutional neural networks. The computational cost is mainly reflected in the backbone, neck, and post-processing, i.e., non-maximum suppression (NMS). In this paper, we propo… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 11 pages, 5 figures, accepted by IEEE Transactions on Instrumentation and Measurement

  34. arXiv:2203.16954  [pdf, other

    cs.CL cs.SD eess.AS

    An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice Transformer

    Authors: Wenlin Dai, Changhe Song, Xiang Li, Zhiyong Wu, Huashan Pan, Xiulin Li, Helen Meng

    Abstract: Text normalization, defined as a procedure transforming non standard words to spoken-form words, is crucial to the intelligibility of synthesized speech in text-to-speech system. Rule-based methods without considering context can not eliminate ambiguation, whereas sequence-to-sequence neural network based methods suffer from the unexpected and uninterpretable errors problem. Recently proposed hybr… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted by ICASSP 2022

  35. arXiv:2203.02100  [pdf, other

    eess.IV cs.CV cs.LG

    Learning Incrementally to Segment Multiple Organs in a CT Image

    Authors: Pengbo Liu, Xia Wang, Mengsi Fan, Hongli Pan, Minmin Yin, Xiaohong Zhu, Dandan Du, Xiaoying Zhao, Li Xiao, Lian Ding, Xingwang Wu, S. Kevin Zhou

    Abstract: There exists a large number of datasets for organ segmentation, which are partially annotated and sequentially constructed. A typical dataset is constructed at a certain time by curating medical images and annotating the organs of interest. In other words, new datasets with annotations of new organ categories are built over time. To unleash the potential behind these partially labeled, sequentiall… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2103.04526

  36. arXiv:2201.02711  [pdf, other

    cs.LG cs.CV eess.IV

    Block Walsh-Hadamard Transform Based Binary Layers in Deep Neural Networks

    Authors: Hongyi Pan, Diaa Badawi, Ahmet Enis Cetin

    Abstract: Convolution has been the core operation of modern deep neural networks. It is well-known that convolutions can be implemented in the Fourier Transform domain. In this paper, we propose to use binary block Walsh-Hadamard transform (WHT) instead of the Fourier transform. We use WHT-based binary layers to replace some of the regular convolution layers in deep neural networks. We utilize both one-dime… ▽ More

    Submitted 27 January, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: This paper has been accepted by ACM Transactions on Embedded Computing Systems

  37. arXiv:2201.02709  [pdf, other

    eess.SP

    Detecting Anomaly in Chemical Sensors via L1-Kernels based Principal Component Analysis

    Authors: Hongyi Pan, Diaa Badawi, Ishaan Bassi, Sule Ozev, Ahmet Enis Cetin

    Abstract: We propose a kernel-PCA based method to detect anomaly in chemical sensors. We use temporal signals produced by chemical sensors to form vectors to perform the Principal Component Analysis (PCA). We estimate the kernel-covariance matrix of the sensor data and compute the eigenvector corresponding to the largest eigenvalue of the covariance matrix. The anomaly can be detected by comparing the diffe… ▽ More

    Submitted 28 September, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: This paper has been accepted to IEEE Sensors Letters

  38. arXiv:2111.02977  [pdf, other

    eess.SY

    Safe, efficient and socially-compatible decision of automated vehicles: a case study of unsignalized intersection driving

    Authors: Daofei Li, Ao Liu, Hao Pan, Wentao Chen

    Abstract: Safe and smooth interacting with other vehicles is one of the ultimate goals of driving automation. However, recent reports of demonstrative deployments of automated vehicles (AVs) indicate that AVs are still difficult to meet the expectation of other interacting drivers, which leads to several AV accidents involving human-driven vehicles (HVs). This is most likely due to the lack of understanding… ▽ More

    Submitted 10 May, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 23 pages,15 figures

  39. arXiv:2110.12065  [pdf, other

    eess.SP cs.LG

    Multiplication-Avoiding Variant of Power Iteration with Applications

    Authors: Hongyi Pan, Diaa Badawi, Runxuan Miao, Erdem Koyuncu, Ahmet Enis Cetin

    Abstract: Power iteration is a fundamental algorithm in data analysis. It extracts the eigenvector corresponding to the largest eigenvalue of a given matrix. Applications include ranking algorithms, recommendation systems, principal component analysis (PCA), among many others. In this paper, we introduce multiplication-avoiding power iteration (MAPI), which replaces the standard $\ell_2$-inner products that… ▽ More

    Submitted 31 January, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: This is the technique report for the paper "MULTIPLICATION-AVOIDING VARIANT OF POWER ITERATION WITH APPLICATIONS", which has been accepted by ICASSP 2022

  40. arXiv:2109.02920  [pdf, other

    eess.IV cs.CV

    FDA: Feature Decomposition and Aggregation for Robust Airway Segmentation

    Authors: Minghui Zhang, Xin Yu, Hanxiao Zhang, Hao Zheng, Weihao Yu, Hong Pan, Xiangran Cai, Yun Gu

    Abstract: 3D Convolutional Neural Networks (CNNs) have been widely adopted for airway segmentation. The performance of 3D CNNs is greatly influenced by the dataset while the public airway datasets are mainly clean CT scans with coarse annotation, thus difficult to be generalized to noisy CT scans (e.g. COVID-19 CT scans). In this work, we proposed a new dual-stream network to address the variability between… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: Accepted at MICCAI2021-DART

  41. arXiv:2105.11634  [pdf, other

    cs.LG eess.IV

    Robust Principal Component Analysis Using a Novel Kernel Related with the L1-Norm

    Authors: Hongyi Pan, Diaa Badawi, Erdem Koyuncu, A. Enis Cetin

    Abstract: We consider a family of vector dot products that can be implemented using sign changes and addition operations only. The dot products are energy-efficient as they avoid the multiplication operation entirely. Moreover, the dot products induce the $\ell_1$-norm, thus providing robustness to impulsive noise. First, we analytically prove that the dot products yield symmetric, positive semi-definite ge… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: 6 pages, 3 tables and one figure

  42. arXiv:2104.07085  [pdf, other

    cs.CV eess.IV

    Fast Walsh-Hadamard Transform and Smooth-Thresholding Based Binary Layers in Deep Neural Networks

    Authors: Hongyi Pan, Diaa Dabawi, Ahmet Enis Cetin

    Abstract: In this paper, we propose a novel layer based on fast Walsh-Hadamard transform (WHT) and smooth-thresholding to replace $1\times 1$ convolution layers in deep neural networks. In the WHT domain, we denoise the transform domain coefficients using the new smooth-thresholding non-linearity, a smoothed version of the well-known soft-thresholding operator. We also introduce a family of multiplication-f… ▽ More

    Submitted 29 October, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: The paper (v1) has been accepted to CVPR 2021 BiVision Workshop. We notice the final Conv2D is also a 1x1 convolution layer so we update the result with changing the layer in v2. In v3, we update citation 37 because its authorship changes. In v4, we propose the improved version of smooth thresholding called "weighted smooth thresholding"

  43. arXiv:2010.00893  [pdf, other

    eess.IV cs.CV cs.LG

    Weight Encode Reconstruction Network for Computed Tomography in a Semi-Case-Wise and Learning-Based Way

    Authors: Hujie Pan, Xuesong Li, Min Xu

    Abstract: Classic algebraic reconstruction technology (ART) for computed tomography requires pre-determined weights of the voxels for projecting pixel values. However, such weight cannot be accurately obtained due to the limitation of the physical understanding and computation resources. In this study, we propose a semi-case-wise learning-based method named Weight Encode Reconstruction Network (WERNet) to t… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

  44. arXiv:2009.13015  [pdf

    eess.IV cs.CV

    Cloud Removal for Remote Sensing Imagery via Spatial Attention Generative Adversarial Network

    Authors: Heng Pan

    Abstract: Optical remote sensing imagery has been widely used in many fields due to its high resolution and stable geometric properties. However, remote sensing imagery is inevitably affected by climate, especially clouds. Removing the cloud in the high-resolution remote sensing satellite image is an indispensable pre-processing step before analyzing it. For the sake of large-scale training data, neural net… ▽ More

    Submitted 14 November, 2020; v1 submitted 27 September, 2020; originally announced September 2020.

  45. Is Multichannel Access Useful in Timely Information Update?

    Authors: Jiaxin Liang, Haoyuan Pan, Soung Chang Liew

    Abstract: This paper investigates information freshness of multichannel access in information update systems. Age of information (AoI) is a fundamentally important metric to characterize information freshness, defined as the time elapsed since the generation of the last successfully received update. When multiple devices share the same wireless channel to send updates to a common receiver, an interesting qu… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: 13 pages, 6 figures, submitted to Wireless Communication Letter

  46. arXiv:2007.01001  [pdf, other

    eess.IV cs.CV cs.NE

    PGD-UNet: A Position-Guided Deformable Network for Simultaneous Segmentation of Organs and Tumors

    Authors: Ziqiang Li, Hong Pan, Ya** Zhu, A. K. Qin

    Abstract: Precise segmentation of organs and tumors plays a crucial role in clinical applications. It is a challenging task due to the irregular shapes and various sizes of organs and tumors as well as the significant class imbalance between the anatomy of interest (AOI) and the background region. In addition, in most situation tumors and normal organs often overlap in medical images, but current approaches… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: Accepted by the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

  47. arXiv:1911.02241  [pdf, other

    cs.IT eess.SP

    Information Update: TDMA or FDMA?

    Authors: Haoyuan Pan, Soung Chang Liew

    Abstract: This paper studies information freshness in information update systems operated with TDMA and FDMA. Information freshness is characterized by a recently introduced metric, age of information (AoI), defined as the time elapsed since the generation of the last successfully received update. In an update system with multiple users sharing the same wireless channel to send updates to a common receiver,… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

  48. HeartBEAT: Heart Beat Estimation through Adaptive Tracking

    Authors: Huijie Pan, Dogancan Temel, Ghassan AlRegib

    Abstract: In this paper, we propose an algorithm denoted as HeartBEAT that tracks heart rate from wrist-type photoplethysmography (PPG) signals and simultaneously recorded three-axis acceleration data. HeartBEAT contains three major parts: spectrum estimation of PPG signals and acceleration data, elimination of motion artifacts in PPG signals using recursive least Square (RLS) adaptive filters, and auxiliar… ▽ More

    Submitted 13 November, 2018; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: 4 pages, 5 figures, 2 tables

    Journal ref: H. Pan, D. Temel and G. AlRegib, "HeartBEAT: Heart beat estimation through adaptive tracking," 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), Las Vegas, NV, 2016, pp. 587-590