Showing 1–23 of 23 results for author: Shu, H

  1. arXiv:2403.14803  [pdf, other]

    eess.SY

    Transmission Benefits and Cost Allocation under Ambiguity

    Authors: Han Shu, Jacob Mays

    Abstract: Disputes over cost allocation can present a significant barrier to investment in shared infrastructure. While it may be desirable to allocate cost in a way that corresponds to expected benefits, investments in long-lived projects are made under conditions of substantial uncertainty. In the context of electricity transmission, uncertainty combined with the inherent complexity of power systems analy…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 32 pages, 7 figures, 7 tables

  2. arXiv:2402.03492  [pdf, other]

    eess.IV cs.CV

    Beyond Strong labels: Weakly-supervised Learning Based on Gaussian Pseudo Labels for The Segmentation of Ellipse-like Vascular Structures in Non-contrast CTs

    Authors: Qixiang Ma, Antoine Łucas, Huazhong Shu, Adrien Kaladji, Pascal Haigron

    Abstract: Deep-learning-based automated segmentation of vascular structures in preoperative CT scans contributes to computer-assisted diagnosis and intervention procedure in vascular diseases. While CT angiography (CTA) is the common standard, non-contrast CT imaging is significant as a contrast-risk-free alternative, avoiding complications associated with contrast agents. However, the challenges of labor-i…

    Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2309.00885  [pdf, other]

    eess.IV cs.CV cs.LG

    A Generic Fundus Image Enhancement Network Boosted by Frequency Self-supervised Representation Learning

    Authors: Heng Li, Haofeng Liu, Huazhu Fu, Yanwu Xu, Hui Shu, Ke Niu, Yan Hu, Jiang Liu

    Abstract: Fundus photography is prone to suffer from image quality degradation that impacts clinical examination performed by ophthalmologists or intelligent systems. Though enhancement algorithms have been developed to promote fundus observation on degraded images, high data demands and limited applicability hinder their clinical deployment. To circumvent this bottleneck, a generic fundus image enhancement…

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: Accepted by Medical Image Analysis in August, 2023

    Journal ref: Medical Image Analysis, 2023, 90:102945

  4. Dual-Scale Single Image Dehazing Via Neural Augmentation

    Authors: Zhengguo Li, Chaobing Zheng, Haiyan Shu, Shiqian Wu

    Abstract: Model-based single image dehazing algorithms restore haze-free images with sharp edges and rich details for real-world hazy images at the expense of low PSNR and SSIM values for synthetic hazy images. Data-driven ones restore haze-free images with high PSNR and SSIM values for synthetic hazy images but with low contrast, and even some remaining haze for real world hazy images. In this paper, a nov…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Single image dehazing, dual-scale, neural augmentation, haze line averaging, generative adversarial network. arXiv admin note: substantial text overlap with arXiv:2111.10943

  5. arXiv:2206.04684  [pdf, other]

    eess.IV cs.CV

    Structure-consistent Restoration Network for Cataract Fundus Image Enhancement

    Authors: Heng Li, Haofeng Liu, Huazhu Fu, Hai Shu, Yitian Zhao, Xiaoling Luo, Yan Hu, Jiang Liu

    Abstract: Fundus photography is a routine examination in clinics to diagnose and monitor ocular diseases. However, for cataract patients, the fundus image always suffers quality degradation caused by the clouding lens. The degradation prevents reliable diagnosis by ophthalmologists or computer-aided systems. To improve the certainty in clinical diagnosis, restoration algorithms have been proposed to enhance…

    Submitted 8 June, 2022; originally announced June 2022.

  6. arXiv:2205.14833  [pdf, other]

    cs.LG cs.DC eess.SY

    Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning

    Authors: Chengfei Lv, Chaoyue Niu, Renjie Gu, Xiaotang Jiang, Zhaode Wang, Bin Liu, Ziqi Wu, Qiulin Yao, Congyu Huang, Panos Huang, Tao Huang, Hui Shu, Jinde Song, Bin Zou, Peng Lan, Guohuan Xu, Fei Wu, Shaojie Tang, Fan Wu, Guihai Chen

    Abstract: To break the bottlenecks of mainstream cloud-based machine learning (ML) paradigm, we adopt device-cloud collaborative ML and build the first end-to-end and general-purpose system, called Walle, as the foundation. Walle consists of a deployment platform, distributing ML tasks to billion-scale devices in time; a data pipeline, efficiently preparing task input; and a compute container, providing a c…

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: Accepted by OSDI 2022

  7. arXiv:2205.04846  [pdf, other]

    eess.IV cs.CV

    MNet: Rethinking 2D/3D Networks for Anisotropic Medical Image Segmentation

    Authors: Zhangfu Dong, Yuting He, Xiaoming Qi, Yang Chen, Huazhong Shu, Jean-Louis Coatrieux, Guanyu Yang, Shuo Li

    Abstract: The nature of thick-slice scanning causes severe inter-slice discontinuities of 3D medical images, and the vanilla 2D/3D convolutional neural networks (CNNs) fail to represent sparse inter-slice information and dense intra-slice information in a balanced way, leading to severe underfitting to inter-slice features (for vanilla 2D CNNs) and overfitting to noise from long-range slices (for vanilla 3D…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCAI 2022

  8. arXiv:2111.00242  [pdf]

    eess.AS cs.SD

    Self-Supervised Speech Denoising Using Only Noisy Audio Signals

    Authors: Jiasong Wu, Qingchun Li, Guanyu Yang, Lei Li, Lotfi Senhadji, Huazhong Shu

    Abstract: In traditional speech denoising tasks, clean audio signals are often used as the training target, but absolutely clean signals must be collected with expensive recording equipment or in studios with strictly controlled environments. To overcome this drawback, we propose an end-to-end self-supervised speech denoising training scheme using only noisy audio signals, named Only-Noisy Training (ONT), without extra…

    Submitted 19 January, 2023; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: 11 pages, 4 figures, 6 tables

  9. arXiv:2109.12271  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    BiTr-Unet: a CNN-Transformer Combined Network for MRI Brain Tumor Segmentation

    Authors: Qiran Jia, Hai Shu

    Abstract: Convolutional neural networks (CNNs) have achieved remarkable success in automatically segmenting organs or lesions on 3D medical images. Recently, vision transformer networks have exhibited exceptional performance in 2D image classification tasks. Compared with CNNs, transformer networks have an appealing advantage of extracting long-range features due to their self-attention algorithm. Therefore…

    Submitted 30 December, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

    Comments: Accepted by MICCAI BrainLes 2021

    Journal ref: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (BrainLes 2021). LNCS 12963, pp. 3-14, 2022

  10. arXiv:2106.04130  [pdf, other]

    eess.IV cs.CV

    EnMcGAN: Adversarial Ensemble Learning for 3D Complete Renal Structures Segmentation

    Authors: Yuting He, Rongjun Ge, Xiaoming Qi, Guanyu Yang, Yang Chen, Youyong Kong, Huazhong Shu, Jean-Louis Coatrieux, Shuo Li

    Abstract: 3D complete renal structures (CRS) segmentation aims to segment the kidneys, tumors, renal arteries and veins in one inference. Once successful, it will provide preoperative plans and intraoperative guidance for laparoscopic partial nephrectomy (LPN), playing a key role in the renal cancer treatment. However, no success has been reported in 3D CRS segmentation due to the complex shapes of rena…

    Submitted 8 June, 2021; originally announced June 2021.

    Journal ref: Information Processing in Medical Imaging (IPMI) 2021

  11. A Two-Stage Cascade Model with Variational Autoencoders and Attention Gates for MRI Brain Tumor Segmentation

    Authors: Chenggang Lyu, Hai Shu

    Abstract: Automatic MRI brain tumor segmentation is of vital importance for the disease diagnosis, monitoring, and treatment planning. In this paper, we propose a two-stage encoder-decoder based model for brain tumor subregional segmentation. Variational autoencoder regularization is utilized in both stages to prevent the overfitting issue. The second-stage network adopts attention gates and is trained addi…

    Submitted 28 November, 2020; v1 submitted 4 November, 2020; originally announced November 2020.

    Journal ref: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (BrainLes 2020)

  12. arXiv:2010.14841  [pdf, other]

    cs.SD cs.CL eess.AS

    INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices

    Authors: Yiwu Yao, Yuchao Li, Chengyu Wang, Tianhang Yu, Houjiang Chen, Xiaotang Jiang, Jun Yang, Jun Huang, Wei Lin, Hui Shu, Chengfei Lv

    Abstract: The intensive computation of Automatic Speech Recognition (ASR) models obstructs them from being deployed on mobile devices. In this paper, we present a novel quantized Winograd optimization pipeline, which combines the quantization and fast convolution to achieve efficient inference acceleration on mobile devices for ASR models. To avoid the information loss due to the combination of quantization…

    Submitted 28 October, 2020; originally announced October 2020.

  13. arXiv:2007.14177  [pdf]

    cs.CV eess.IV

    Generative networks as inverse problems with fractional wavelet scattering networks

    Authors: Jiasong Wu, Jing Zhang, Fuzhi Wu, Youyong Kong, Guanyu Yang, Lotfi Senhadji, Huazhong Shu

    Abstract: Deep learning is a hot research topic in the field of machine learning methods and applications. Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) provide impressive image generations from Gaussian white noise, but both of them are difficult to train since they need to train the generator (or encoder) and the discriminator (or decoder) simultaneously, which is easy to cau…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 27 pages, 13 figures, 6 tables

  14. arXiv:2007.10629  [pdf]

    eess.AS cs.CV cs.SD

    CSLNSpeech: solving extended speech separation problem with the help of Chinese sign language

    Authors: Jiasong Wu, Xuan Li, Taotao Li, Fanman Meng, Youyong Kong, Guanyu Yang, Lotfi Senhadji, Huazhong Shu

    Abstract: Previous audio-visual speech separation methods use the synchronization of the speaker's facial movement and speech in the video to supervise the speech separation in a self-supervised way. In this paper, we propose a model to solve the speech separation problem assisted by both face and sign language, which we call the extended speech separation problem. We design a general deep learning network…

    Submitted 2 November, 2023; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: 13 pages, 6 figures, 5 tables

  15. arXiv:2003.09279  [pdf, other]

    eess.SY math.OC

    Control Reconfiguration of Dynamical Systems for Improved Performance via Reverse- and Forward-engineering

    Authors: Han Shu, Xuan Zhang, Na Li, Antonis Papachristodoulou

    Abstract: This paper presents a control reconfiguration approach to improve the performance of two classes of dynamical systems. Motivated by recent research on re-engineering cyber-physical systems, we propose a three-step control retrofit procedure. First, we reverse-engineer a dynamical system to dig out an optimization problem it actually solves. Second, we forward-engineer the system by applying a corr…

    Submitted 5 January, 2021; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: 20 pages, 3 figures

  16. arXiv:2003.03519  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Distilling portable Generative Adversarial Networks for Image Translation

    Authors: Hanting Chen, Yunhe Wang, Han Shu, Changyuan Wen, Chunjing Xu, Boxin Shi, Chao Xu, Chang Xu

    Abstract: Although Generative Adversarial Networks (GANs) have been widely used in various image-to-image translation tasks, they can hardly be applied on mobile devices due to their heavy computation and storage cost. Traditional network compression methods focus on visual recognition tasks, but never deal with generation tasks. Inspired by knowledge distillation, a student generator of fewer parameters i…

    Submitted 7 March, 2020; originally announced March 2020.

    Journal ref: AAAI 2020

  17. arXiv:2002.11581  [pdf, other]

    eess.IV cs.CV

    Automatically Searching for U-Net Image Translator Architecture

    Authors: Han Shu, Yunhe Wang

    Abstract: Image translators have been successfully applied to many important low-level image processing tasks. However, the classical network architecture of image translators, such as U-Net, is borrowed from other vision tasks like biomedical image segmentation. This straightforward adaptation may not be optimal and could cause redundancy in the network structure. In this paper, we propose an automatic architecture…

    Submitted 26 February, 2020; originally announced February 2020.

  18. arXiv:1911.10145  [pdf]

    physics.med-ph eess.IV

    Machine-learning-based Classification of Lower-grade gliomas and High-grade gliomas using Radiomic Features in Multi-parametric MRI

    Authors: Ge Cui, Jiwoong Jeong, Bob Press, Yang Lei, Hui-Kuo Shu, Tian Liu, Walter Curran, Hui Mao, Xiaofeng Yang

    Abstract: Objectives: Glioblastomas are the most aggressive brain and central nervous system (CNS) tumors with poor prognosis in adults. The purpose of this study is to develop a machine-learning-based classification method using radiomic features of multi-parametric MRI to classify high-grade gliomas (HGG) and low-grade gliomas (LGG). Methods: Multi-parametric MRI of 80 patients, 40 HGG and 40 LGG, with g…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 14 pages, 5 figures

  19. arXiv:1911.09264  [pdf]

    physics.med-ph eess.IV

    Air, bone and soft-tissue Segmentation on 3D brain MRI Using Semantic Classification Random Forest with Auto-Context Model

    Authors: Xue Dong, Yang Lei, Sibo Tian, Yingzi Liu, Tonghe Wang, Tian Liu, Walter J. Curran, Hui Mao, Hui-Kuo Shu, Xiaofeng Yang

    Abstract: As bone and air produce weak signals with conventional MR sequences, segmentation of these tissues is particularly difficult in MRI. We propose to integrate patch-based anatomical signatures and an auto-context model into a machine learning framework to iteratively segment MRI into air, bone and soft tissue. The proposed semantic classification random forest (SCRF) method consists of a training stage…

    Submitted 22 November, 2019; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: 18 pages, 8 figures

  20. arXiv:1907.11837  [pdf, other]

    cs.CV cs.LG eess.IV

    Attribute Aware Pooling for Pedestrian Attribute Recognition

    Authors: Kai Han, Yunhe Wang, Han Shu, Chuanjian Liu, Chunjing Xu, Chang Xu

    Abstract: This paper expands the strength of deep convolutional neural networks (CNNs) to the pedestrian attribute recognition problem by devising a novel attribute aware pooling algorithm. Existing vanilla CNNs cannot be straightforwardly applied to handle multi-attribute data because of the larger label space as well as the attribute entanglement and correlations. We tackle these challenges that hamper t…

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: Accepted by IJCAI 2019

  21. arXiv:1907.10804  [pdf, other]

    cs.CV cs.LG eess.IV

    Co-Evolutionary Compression for Unpaired Image Translation

    Authors: Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu

    Abstract: Generative adversarial networks (GANs) have been successfully used for a considerable number of computer vision tasks, especially image-to-image translation. However, generators in these networks have complicated architectures with a large number of parameters and huge computational complexity. Existing methods are mainly designed for compressing and speeding-up deep neural networks in the classificatio…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: Accepted by ICCV 2019

  22. arXiv:1509.07951  [pdf]

    eess.SY

    Error Gradient-based Variable-Lp Norm Constraint LMS Algorithm for Sparse System Identification

    Authors: Yong Feng, Fei Chen, Rui Zeng, Jiasong Wu, Huazhong Shu

    Abstract: Sparse adaptive filtering has gained much attention due to its wide applicability in the field of signal processing. Among the main algorithm families, sparse norm constraint adaptive filters have developed rapidly in recent years. However, when applied for system identification, most prior work in sparse norm constraint adaptive filtering suffers from the difficulty of adaptability to the sparsity of t…

    Submitted 26 September, 2015; originally announced September 2015.

    Comments: Submitted to 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 5 pages, 2 tables, 2 figures, 15 equations, 15 references

  23. arXiv:1503.01185  [pdf]

    eess.SY

    Gradient Compared Lp-LMS Algorithms for Sparse System Identification

    Authors: Yong Feng, Jiasong Wu, Rui Zeng, Limin Luo, Huazhong Shu

    Abstract: In this paper, we propose two novel p-norm penalty least mean square (Lp-LMS) algorithms as supplements of the conventional Lp-LMS algorithm established for sparse adaptive filtering recently. A gradient comparator is employed to selectively apply the zero attractor of p-norm constraint for only those taps that have the same polarity as that of the gradient of the squared instantaneous error, whic…

    Submitted 10 March, 2015; v1 submitted 3 March, 2015; originally announced March 2015.

    Comments: Submitted to 27th Chinese Control and Decision Conference (CCDC 2015), 5 pages, 4 tables, 5 figures, 7 equations, 11 references