Skip to main content

Showing 1–50 of 52 results for author: Peng, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.07255  [pdf, other

    cs.CV eess.IV

    Towards Realistic Data Generation for Real-World Super-Resolution

    Authors: Long Peng, Wenbo Li, Ren**g Pei, **g**g Ren, Xueyang Fu, Yang Wang, Yang Cao, Zheng-Jun Zha

    Abstract: Existing image super-resolution (SR) techniques often fail to generalize effectively in complex real-world settings due to the significant divergence between training data and practical scenarios. To address this challenge, previous efforts have either manually simulated intricate physical-based degradations or utilized learning-based techniques, yet these approaches remain inadequate for producin… ▽ More

    Submitted 11 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.12367  [pdf, other

    eess.IV cs.CV

    Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning

    Authors: Zheyuan Zhang, Elif Keles, Gorkem Durak, Yavuz Taktak, Onkar Susladkar, Vandan Gorade, Debesh Jha, Asli C. Ormeci, Alpay Medetalibeyoglu, Lanhong Yao, Bin Wang, Ilkin Sevgi Isler, Linkai Peng, Hongyi Pan, Camila Lopes Vendrami, Amir Bourhani, Yury Velichko, Boqing Gong, Concetto Spampinato, Ayis Pyrros, Pallavi Tiwari, Derk C. F. Klatte, Megan Engels, Sanne Hoogenboom, Candice W. Bolan , et al. (13 additional authors not shown)

    Abstract: Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st… ▽ More

    Submitted 25 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: under review version

  3. arXiv:2405.11856  [pdf, other

    cs.RO eess.SY

    Modeling and simulation of a mechanism for suppressing the flip** problem of a jum** robot

    Authors: Qi Li, Liang Peng, Zhiyuan Wu, Pengda Ye, Weitao Zhang, Yi Xu, Qing Shi

    Abstract: In order to solve the problem of stable jum** of micro robot, we design a special mechanism: elastic passive joint (EPJ). EPJ can assist in achieving smooth jum** through the opening-closing process when the robot jumps. First, we introduce the composition and operation principle of EPJ, and perform a dynamic modeling of the robot's jum** process. Then, in order to verify the effectiveness o… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  4. arXiv:2405.07023  [pdf, other

    eess.IV cs.CV

    Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution

    Authors: Long Peng, Yang Cao, Ren**g Pei, Wenbo Li, Jiaming Guo, Xueyang Fu, Yang Wang, Zheng-Jun Zha

    Abstract: Real-SR endeavors to produce high-resolution images with rich details while mitigating the impact of multiple degradation factors. Although existing methods have achieved impressive achievements in detail recovery, they still fall short when addressing regions with complex gradient arrangements due to the intensity-based linear weighting feature extraction manner. Moreover, the stochastic artifact… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  5. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, **shan Pan, Jiangxin Dong, **hui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi **, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  6. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  7. arXiv:2402.11419  [pdf, other

    eess.SP

    A Self-Healing Magnetic-Array-Type Current Sensor with Data-Driven Identification of Abnormal Magnetic Measurement Units

    Authors: Xiaohu Liu, Wei Zhao, Kang Ma, Jian Liu, Lisha Peng, Songling Huang, Shisong Li

    Abstract: Magnetic-array-type current sensors have garnered increasing popularity owing to their notable advantages, including broadband functionality, a large dynamic range, cost-effectiveness, and compact dimensions. However, the susceptibility of the measurement error of one or more magnetic measurement units (MMUs) within the current sensor to drift significantly from the nominal value due to environmen… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  8. arXiv:2309.04780  [pdf, other

    cs.CV eess.IV

    Latent Degradation Representation Constraint for Single Image Deraining

    Authors: Yuhong He, Long Peng, Lu Wang, Jun Cheng

    Abstract: Since rain streaks show a variety of shapes and directions, learning the degradation representation is extremely challenging for single image deraining. Existing methods are mainly targeted at designing complicated modules to implicitly learn latent degradation representation from coupled rainy images. This way, it is hard to decouple the content-independent degradation representation due to the l… ▽ More

    Submitted 18 January, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: This paper is accepted to ICASSP 2024

  9. arXiv:2308.14536  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Spoken Language Intelligence of Large Language Models for Language Learning

    Authors: Linkai Peng, Baorian Nuchged, Yingming Gao

    Abstract: People have long hoped for a conversational system that can assist in real-life situations, and recent progress on large language models (LLMs) is bringing this idea closer to reality. While LLMs are often impressive in performance, their efficacy in real-world scenarios that demand expert knowledge remains unclear. LLMs are believed to hold the most potential and value in education, especially in… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 28 pages, 7 figures, Preprint

  10. arXiv:2307.12266  [pdf, other

    cs.CL eess.SP

    Transformer-based Joint Source Channel Coding for Textual Semantic Communication

    Authors: Shicong Liu, Zhen Gao, Gaojie Chen, Yu Su, Lu Peng

    Abstract: The Space-Air-Ground-Sea integrated network calls for more robust and secure transmission techniques against jamming. In this paper, we propose a textual semantic transmission framework for robust transmission, which utilizes the advanced natural language processing techniques to model and encode sentences. Specifically, the textual sentences are firstly split into tokens using wordpiece algorithm… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures. Accepted by IEEE/CIC ICCC 2023

  11. arXiv:2306.06865  [pdf, other

    cs.LG cs.AI eess.SP

    Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula

    Authors: Li-Chin Chen, Yi-Heng Lin, Li-Ning Peng, Feng-Ming Wang, Yu-Hsin Chen, Po-Hsun Huang, Shang-Feng Yang, Yu Tsao

    Abstract: Clinical guidelines underscore the importance of regularly monitoring and surveilling arteriovenous fistula (AVF) access in hemodialysis patients to promptly detect any dysfunction. Although phono-angiography/sound analysis overcomes the limitations of standardized AVF stenosis diagnosis tool, prior studies have depended on conventional feature extraction methods, restricting their applicability i… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  12. Synthetic Datasets for Autonomous Driving: A Survey

    Authors: Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and chan… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures

    Journal ref: in IEEE Transactions on Intelligent Vehicles, vol. 9, no. 1, pp. 1847-1864, Jan. 2024

  13. arXiv:2303.06811  [pdf, other

    eess.AS

    The NPU-Elevoc Personalized Speech Enhancement System for ICASSP2023 DNS Challenge

    Authors: Xiaopeng Yan, Yindi Yang, Zhihao Guo, Liangliang Peng, Lei Xie

    Abstract: This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge at ICASSP 2023. Based on the superior two-stage model TEA-PSE 2.0, our system particularly explores better strategy for speaker embedding fusion, optimizes the model training pipeline, and leverages adversarial training and multi-scale loss. According to the results, our… ▽ More

    Submitted 15 March, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

  14. arXiv:2212.13654  [pdf

    physics.optics cs.CV eess.IV

    Large-scale single-photon imaging

    Authors: Liheng Bian, Haoze Song, Lintao Peng, Xuyang Chang, Xi Yang, Roarke Horstmeyer, Lin Ye, Tong Qin, Dezhi Zheng, Jun Zhang

    Abstract: Benefiting from its single-photon sensitivity, single-photon avalanche diode (SPAD) array has been widely applied in various fields such as fluorescence lifetime imaging and quantum computing. However, large-scale high-fidelity single-photon imaging remains a big challenge, due to the complex hardware manufacture craft and heavy noise disturbance of SPAD arrays. In this work, we introduce deep lea… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  15. arXiv:2212.05566  [pdf, other

    cs.CV eess.IV

    YoloCurvSeg: You Only Label One Noisy Skeleton for Vessel-style Curvilinear Structure Segmentation

    Authors: Li Lin, Linkai Peng, Huaqing He, Pu** Cheng, Jiewei Wu, Kenneth K. Y. Wong, Xiaoying Tang

    Abstract: Weakly-supervised learning (WSL) has been proposed to alleviate the conflict between data annotation cost and model performance through employing sparsely-grained (i.e., point-, box-, scribble-wise) supervision and has shown promising performance, particularly in the image segmentation field. However, it is still a very challenging task due to the limited supervision, especially when only a small… ▽ More

    Submitted 18 August, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 20 pages, 15 figures, MEDIA accepted

  16. arXiv:2208.11184  [pdf, other

    eess.IV cs.CV

    AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

    Authors: Ren Yang, Radu Timofte, Xin Li, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei Li, **gzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota , et al. (28 additional authors not shown)

    Abstract: This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track~2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 3… ▽ More

    Submitted 25 August, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Camera-ready version

  17. arXiv:2207.08998  [pdf

    eess.IV cs.CV cs.LG q-bio.QM

    Discovering novel systemic biomarkers in photos of the external eye

    Authors: Boris Babenko, Ilana Traynis, Christina Chen, Preeti Singh, Akib Uddin, Jorge Cuadros, Lauren P. Daskivich, April Y. Maa, Ramasamy Kim, Eugene Yu-Chuan Kang, Yossi Matias, Greg S. Corrado, Lily Peng, Dale R. Webster, Christopher Semturs, Jonathan Krause, Avinash V. Varadarajan, Naama Hammel, Yun Liu

    Abstract: External eye photos were recently shown to reveal signs of diabetic retinal disease and elevated HbA1c. In this paper, we evaluate if external eye photos contain information about additional systemic medical conditions. We developed a deep learning system (DLS) that takes external eye photos as input and predicts multiple systemic parameters, such as those related to the liver (albumin, AST); kidn… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  18. arXiv:2206.07289  [pdf, other

    cs.SD cs.AI eess.AS

    Text-Aware End-to-end Mispronunciation Detection and Diagnosis

    Authors: Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, **song Zhang

    Abstract: Mispronunciation detection and diagnosis (MDD) technology is a key component of computer-assisted pronunciation training system (CAPT). In the field of assessing the pronunciation quality of constrained speech, the given transcriptions can play the role of a teacher. Conventional methods have fully utilized the prior texts for the model construction or improving the system performance, e.g. forced… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Rejected by Interspeech2022

  19. arXiv:2206.04948  [pdf, other

    eess.SY

    A Holistic Robust Motion Controller Framework for Autonomous Platooning

    Authors: Hong Wang, Li-Ming Peng, Zi-Chun Wei, Kai Yang, Xian-Xu Bai, Luo Jiang, Ehsan Hashemi

    Abstract: Safety is the foremost concern for autonomous platooning. The vehicle-to-vehicle (V2V) communication delay and the sudden appearance of obstacles will trigger the safety of the intended functionality (SOTIF) issues for autonomous platooning. This research proposes a holistic robust motion controller framework (MCF) for an intelligent and connected vehicle platoon system. The MCF utilizes a hierarc… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 13 pages, 20 figures

  20. Towards a median signal detector through the total Bregman divergence and its robustness analysis

    Authors: Yusuke Ono, Linyu Peng

    Abstract: A novel family of geometric signal detectors are proposed through medians of the total Bregman divergence (TBD), which are shown advantageous over the conventional methods and their mean counterparts. By interpreting the observation data as Hermitian positive-definite matrices, their mean or median play an essential role in signal detection. As is difficult to be solved analytically, we propose nu… ▽ More

    Submitted 14 July, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 15 pages, 3 figures

    Journal ref: Signal Processing 201, 108728, 2022

  21. arXiv:2204.11278  [pdf, ps, other

    eess.SP cs.IT stat.ML

    Unsupervised Learning Discriminative MIG Detectors in Nonhomogeneous Clutter

    Authors: Xiaoqiang Hua, Yusuke Ono, Linyu Peng, Yuting Xu

    Abstract: Principal component analysis (PCA) is a commonly used pattern analysis method that maps high-dimensional data into a lower-dimensional space maximizing the data variance, that results in the promotion of separability of data. Inspired by the principle of PCA, a novel type of learning discriminative matrix information geometry (MIG) detectors in the unsupervised scenario are developed, and applied… ▽ More

    Submitted 8 May, 2022; v1 submitted 24 April, 2022; originally announced April 2022.

    Comments: 14 pages, 6 figures

    Journal ref: IEEE Transactions on Communications 70, 4107-4120, 2022

  22. arXiv:2203.10139  [pdf

    cs.LG cs.AI cs.CV eess.IV

    AI system for fetal ultrasound in low-resource settings

    Authors: Ryan G. Gomes, Bellington Vwalika, Chace Lee, Angelica Willis, Marcin Sieniek, Joan T. Price, Christina Chen, Margaret P. Kasaro, James A. Taylor, Elizabeth M. Stringer, Scott Mayer McKinney, Ntazana Sindano, George E. Dahl, William Goodnight III, Justin Gilmer, Benjamin H. Chi, Charles Lau, Terry Spitz, T Saensuksopa, Kris Liu, Jonny Wong, Rory Pilgrim, Akib Uddin, Greg Corrado, Lily Peng , et al. (4 additional authors not shown)

    Abstract: Despite considerable progress in maternal healthcare, maternal and perinatal deaths remain high in low-to-middle income countries. Fetal ultrasound is an important component of antenatal care, but shortage of adequately trained healthcare workers has limited its adoption. We developed and validated an artificial intelligence (AI) system that uses novice-acquired "blind sweep" ultrasound videos to… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  23. GATE: Graph CCA for Temporal SElf-supervised Learning for Label-efficient fMRI Analysis

    Authors: Liang Peng, Nan Wang, Jie Xu, Xiaofeng Zhu, Xiaoxiao Li

    Abstract: In this work, we focus on the challenging task, neuro-disease classification, using functional magnetic resonance imaging (fMRI). In population graph-based disease analysis, graph convolutional neural networks (GCNs) have achieved remarkable success. However, these achievements are inseparable from abundant labeled data and sensitive to spurious signals. To improve fMRI representation learning and… ▽ More

    Submitted 27 August, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Journal ref: IEEE Transactions on Medical Imaging 2022

  24. arXiv:2203.04586  [pdf, other

    eess.IV cs.CV

    Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion

    Authors: Ziqi Huang, Li Lin, Pu** Cheng, Linkai Peng, Xiaoying Tang

    Abstract: Multi-modal magnetic resonance (MR) imaging provides great potential for diagnosing and analyzing brain gliomas. In clinical scenarios, common MR sequences such as T1, T2 and FLAIR can be obtained simultaneously in a single scanning process. However, acquiring contrast enhanced modalities such as T1ce requires additional time, cost, and injection of contrast agent. As such, it is clinically meanin… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 6 pages, 5 figures, submitted to ICPR 2022

  25. arXiv:2203.03631  [pdf, other

    eess.IV cs.CV

    Student Becomes Decathlon Master in Retinal Vessel Segmentation via Dual-teacher Multi-target Domain Adaptation

    Authors: Linkai Peng, Li Lin, Pu** Cheng, Huaqing He, Xiaoying Tang

    Abstract: Unsupervised domain adaptation has been proposed recently to tackle the so-called domain shift between training data and test data with different distributions. However, most of them only focus on single-target domain adaptation and cannot be applied to the scenario with multiple target domains. In this paper, we propose RVms, a novel unsupervised multi-target domain adaptation approach to segment… ▽ More

    Submitted 11 October, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: To be published in MICCAI-MLMI 2022

  26. arXiv:2201.04812  [pdf, other

    eess.IV cs.CV

    Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning

    Authors: Linkai Peng, Li Lin, Pu** Cheng, Ziqi Huang, Xiaoying Tang

    Abstract: Various deep learning models have been developed to segment anatomical structures from medical images, but they typically have poor performance when tested on another target domain with different data distribution. Recently, unsupervised domain adaptation methods have been proposed to alleviate this so-called domain shift issue, but most of them are designed for scenarios with relatively small dom… ▽ More

    Submitted 20 January, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: To be published in ISBI 2022

  27. U-shape Transformer for Underwater Image Enhancement

    Authors: Lintao Peng, Chunli Zhu, Liheng Bian

    Abstract: The light absorption and scattering of underwater impurities lead to poor underwater imaging quality. The existing data-driven based underwater image enhancement (UIE) techniques suffer from the lack of a large-scale dataset containing various underwater scenes and high-fidelity reference images. Besides, the inconsistent attenuation in different color channels and space areas is not fully conside… ▽ More

    Submitted 12 June, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: under review

  28. arXiv:2111.01544  [pdf

    eess.IV cs.CV physics.med-ph

    Comprehensive and Clinically Accurate Head and Neck Organs at Risk Delineation via Stratified Deep Learning: A Large-scale Multi-Institutional Study

    Authors: Dazhou Guo, Jia Ge, Xianghua Ye, Senxiang Yan, Yi Xin, Yuchen Song, Bing-shen Huang, Tsung-Min Hung, Zhuotun Zhu, Ling Peng, Yan** Ren, Rui Liu, Gong Zhang, Mengyuan Mao, Xiaohua Chen, Zhongjie Lu, Wenxiang Li, Yuzhen Chen, Lingyun Huang, **g Xiao, Adam P. Harrison, Le Lu, Chien-Yu Lin, Dakai **, Tsung-Ying Ho

    Abstract: Accurate organ at risk (OAR) segmentation is critical to reduce the radiotherapy post-treatment complications. Consensus guidelines recommend a set of more than 40 OARs in the head and neck (H&N) region, however, due to the predictable prohibitive labor-cost of this task, most institutions choose a substantially simplified protocol by delineating a smaller subset of OARs and neglecting the dose di… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  29. arXiv:2110.12271  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Self-Validation: Early Stop** for Single-Instance Deep Generative Priors

    Authors: Taihui Li, Zhong Zhuang, Hengyue Liang, Le Peng, Hengkang Wang, Ju Sun

    Abstract: Recent works have shown the surprising effectiveness of deep generative models in solving numerous image reconstruction (IR) tasks, even without training data. We call these models, such as deep image prior and deep decoder, collectively as single-instance deep generative priors (SIDGPs). The successes, however, often hinge on appropriate early stop** (ES), which by far has largely been handled… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: To appear in British Machine Vision Conference (BMVC) 2021

  30. arXiv:2109.09271  [pdf, ps, other

    eess.IV cs.CV

    DeepStationing: Thoracic Lymph Node Station Parsing in CT Scans using Anatomical Context Encoding and Key Organ Auto-Search

    Authors: Dazhou Guo, Xianghua Ye, Jia Ge, Xing Di, Le Lu, Lingyun Huang, Guotong Xie, **g Xiao, Zhongjie Liu, Ling Peng, Senxiang Yan, Dakai **

    Abstract: Lymph node station (LNS) delineation from computed tomography (CT) scans is an indispensable step in radiation oncology workflow. High inter-user variabilities across oncologists and prohibitive laboring costs motivated the automated approach. Previous works exploit anatomical priors to infer LNS based on predefined ad-hoc margins. However, without voxel-level supervision, the performance is sever… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

  31. SALIENCE: An Unsupervised User Adaptation Model for Multiple Wearable Sensors Based Human Activity Recognition

    Authors: Ling Chen, Yi Zhang, Shenghuan Miao, Sirou Zhu, Rong Hu, Liangying Peng, Mingqi Lv

    Abstract: Unsupervised user adaptation aligns the feature distributions of the data from training users and the new user, so a well-trained wearable human activity recognition (WHAR) model can be well adapted to the new user. With the development of wearable sensors, multiple wearable sensors based WHAR is gaining more and more attention. In order to address the challenge that the transferabilities of diffe… ▽ More

    Submitted 27 April, 2022; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE Transactions on Mobile Computing

  32. arXiv:2108.00911  [pdf, ps, other

    eess.IV cs.CV

    Multi-phase Liver Tumor Segmentation with Spatial Aggregation and Uncertain Region Inpainting

    Authors: Yue Zhang, Chengtao Peng, Liying Peng, Huimin Huang, Ruofeng Tong, Lanfen Lin, **gsong Li, Yen-Wei Chen, Qingqing Chen, Hongjie Hu, Zhiyi Peng

    Abstract: Multi-phase computed tomography (CT) images provide crucial complementary information for accurate liver tumor segmentation (LiTS). State-of-the-art multi-phase LiTS methods usually fused cross-phase features through phase-weighted summation or channel-attention based concatenation. However, these methods ignored the spatial (pixel-wise) relationships between different phases, hence leading to ins… ▽ More

    Submitted 5 August, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: To appear in MICCAI 2021

  33. arXiv:2106.09891  [pdf, ps, other

    eess.SP cs.IT

    ICINet: ICI-Aware Neural Network Based Channel Estimation for Rapidly Time-Varying OFDM Systems

    Authors: Yi Sun, Hong Shen, Zhenguo Du, Lan Peng, Chunming Zhao

    Abstract: A novel intercarrier interference (ICI)-aware orthogonal frequency division multiplexing (OFDM) channel estimation network ICINet is presented for rapidly time-varying channels. ICINet consists of two components: a preprocessing deep neural subnetwork (PreDNN) and a cascaded residual learning-based neural subnetwork (CasResNet). By fully taking into account the impact of ICI, the proposed PreDNN f… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  34. arXiv:2106.05152  [pdf, other

    eess.IV cs.CV cs.LG

    Rethinking Transfer Learning for Medical Image Classification

    Authors: Le Peng, Hengyue Liang, Gaoxiang Luo, Taihui Li, Ju Sun

    Abstract: Transfer learning (TL) from pretrained deep models is a standard practice in modern medical image classification (MIC). However, what levels of features to be reused are problem-dependent, and uniformly finetuning all layers of pretrained models may be suboptimal. This insight has partly motivated the recent differential TL strategies, such as TransFusion (TF) and layer-wise finetuning (LWFT), whi… ▽ More

    Submitted 26 May, 2024; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted by BMVC2023 (oral)

  35. arXiv:2106.05082  [pdf, other

    cs.CV eess.IV

    Agile wide-field imaging with selective high resolution

    Authors: Lintao Peng, Liheng Bian, Tiexin Liu, Jun Zhang

    Abstract: Wide-field and high-resolution (HR) imaging is essential for various applications such as aviation reconnaissance, topographic map** and safety monitoring. The existing techniques require a large-scale detector array to capture HR images of the whole field, resulting in high complexity and heavy cost. In this work, we report an agile wide-field imaging framework with selective high resolution th… ▽ More

    Submitted 11 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 12pages,6figures

  36. arXiv:2106.02118  [pdf

    eess.IV cs.CV cs.LG

    A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals

    Authors: Ju Sun, Le Peng, Taihui Li, Dyah Adila, Zach Zaiman, Genevieve B. Melton, Nicholas Ingraham, Eric Murray, Daniel Boley, Sean Switzer, John L. Burns, Kun Huang, Tadashi Allen, Scott D. Steenburg, Judy Wawira Gichoya, Erich Kummerfeld, Christopher Tignanelli

    Abstract: Importance: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and int… ▽ More

    Submitted 6 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Check out the medRxiv version at https://doi.org/10.1101/2021.06.04.21258316 for updates

  37. MIG Median Detectors with Manifold Filter

    Authors: Xiaoqiang Hua, Linyu Peng

    Abstract: In this paper, we propose a class of median-based matrix information geometry (MIG) detectors with a manifold filter and apply them to signal detection in nonhomogeneous environments. As customary, the sample data is assumed to be modeled as Hermitian positive-definite (HPD) matrices, and the geometric median of a set of HPD matrices is interpreted as an estimate of the clutter covariance matrix (… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: 22 pages, 12 figures

    Journal ref: Signal Processing 188, 108176, 2021

  38. arXiv:2105.07540  [pdf

    eess.IV cs.AI cs.CV

    Deep learning for detecting pulmonary tuberculosis via chest radiography: an international study across 10 countries

    Authors: Sahar Kazemzadeh, ** Yu, Shahar Jamshy, Rory Pilgrim, Zaid Nabulsi, Christina Chen, Neeral Beladia, Charles Lau, Scott Mayer McKinney, Thad Hughes, Atilla Kiraly, Sreenivasa Raju Kalidindi, Monde Muyoyeta, Jameson Malemela, Ting Shih, Greg S. Corrado, Lily Peng, Katherine Chou, Po-Hsuan Cameron Chen, Yun Liu, Krish Eswaran, Daniel Tse, Shravya Shetty, Shruthi Prabhakara

    Abstract: Tuberculosis (TB) is a top-10 cause of death worldwide. Though the WHO recommends chest radiographs (CXRs) for TB screening, the limited availability of CXR interpretation is a barrier. We trained a deep learning system (DLS) to detect active pulmonary TB using CXRs from 9 countries across Africa, Asia, and Europe, and utilized large-scale CXR pretraining, attention pooling, and noisy student semi… ▽ More

    Submitted 29 October, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

  39. arXiv:2105.06270  [pdf, other

    cs.LG cs.RO eess.SP

    Group Feature Learning and Domain Adversarial Neural Network for aMCI Diagnosis System Based on EEG

    Authors: Chen-Chen Fan, Haiqun Xie, Liang Peng, Hongjun Yang, Zhen-Liang Ni, Guan'an Wang, Yan-Jie Zhou, Sheng Chen, Zhijie Fang, Shuyun Huang, Zeng-Guang Hou

    Abstract: Medical diagnostic robot systems have been paid more and more attention due to its objectivity and accuracy. The diagnosis of mild cognitive impairment (MCI) is considered an effective means to prevent Alzheimer's disease (AD). Doctors diagnose MCI based on various clinical examinations, which are expensive and the diagnosis results rely on the knowledge of doctors. Therefore, it is necessary to d… ▽ More

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: This paper has been accepted by 2021 International Conference on Robotics and Automation (ICRA 2021)

  40. arXiv:2101.01668  [pdf, other

    eess.SP cs.LG

    Radio Frequency Fingerprint Identification for LoRa Using Spectrogram and CNN

    Authors: Guanxiong Shen, Junqing Zhang, Alan Marshall, Linning Peng, Xianbin Wang

    Abstract: Radio frequency fingerprint identification (RFFI) is an emerging device authentication technique that relies on intrinsic hardware characteristics of wireless devices. We designed an RFFI scheme for Long Range (LoRa) systems based on spectrogram and convolutional neural network (CNN). Specifically, we used spectrogram to represent the fine-grained time-frequency characteristics of LoRa signals. In… ▽ More

    Submitted 30 December, 2020; originally announced January 2021.

    Comments: Accepted for publication in IEEE INFOCOM 2021

  41. Target Detection within Nonhomogeneous Clutter via Total Bregman Divergence-Based Matrix Information Geometry Detectors

    Authors: Xiaoqiang Hua, Yusuke Ono, Linyu Peng, Yongqiang Cheng, Hongqiang Wang

    Abstract: Information divergences are commonly used to measure the dissimilarity of two elements on a statistical manifold. Differentiable manifolds endowed with different divergences may possess different geometric properties, which can result in totally different performances in many practical applications. In this paper, we propose a total Bregman divergence-based matrix information geometry (TBD-MIG) de… ▽ More

    Submitted 7 August, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: 15 pages, 8 figures

    Journal ref: IEEE Transactions on Signal Processing, 69, 4326-4340, 2021

  42. arXiv:2011.11732  [pdf

    eess.IV cs.CV cs.LG

    Detecting hidden signs of diabetes in external eye photographs

    Authors: Boris Babenko, Akinori Mitani, Ilana Traynis, Naho Kitade, Preeti Singh, April Maa, Jorge Cuadros, Greg S. Corrado, Lily Peng, Dale R. Webster, Avinash Varadarajan, Naama Hammel, Yun Liu

    Abstract: Diabetes-related retinal conditions can be detected by examining the posterior of the eye. By contrast, examining the anterior of the eye can reveal conditions affecting the front of the eye, such as changes to the eyelids, cornea, or crystalline lens. In this work, we studied whether external photographs of the front of the eye can reveal insights into both diabetic retinal diseases and blood glu… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Journal ref: Nature Biomedical Engineering 2022

  43. Interpretable Survival Prediction for Colorectal Cancer using Deep Learning

    Authors: Ellery Wulczyn, David F. Steiner, Melissa Moran, Markus Plass, Robert Reihs, Fraser Tan, Isabelle Flament-Auvigne, Trissia Brown, Peter Regitnig, Po-Hsuan Cameron Chen, Narayan Hegde, Apaar Sadhwani, Robert MacDonald, Benny Ayalew, Greg S. Corrado, Lily H. Peng, Daniel Tse, Heimo Müller, Zhaoyang Xu, Yun Liu, Martin C. Stumpe, Kurt Zatloukal, Craig H. Mermel

    Abstract: Deriving interpretable prognostic features from deep-learning-based prognostic histopathology models remains a challenge. In this study, we developed a deep learning system (DLS) for predicting disease specific survival for stage II and III colorectal cancer using 3,652 cases (27,300 slides). When evaluated on two validation datasets containing 1,239 cases (9,340 slides) and 738 cases (7,140 slide… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Journal ref: Nature Partner Journal Digital Medicine (2021)

  44. arXiv:2010.11375  [pdf

    eess.IV cs.CV cs.LG

    Deep Learning for Distinguishing Normal versus Abnormal Chest Radiographs and Generalization to Unseen Diseases

    Authors: Zaid Nabulsi, Andrew Sellergren, Shahar Jamshy, Charles Lau, Edward Santos, Atilla P. Kiraly, Wenxing Ye, Jie Yang, Rory Pilgrim, Sahar Kazemzadeh, ** Yu, Sreenivasa Raju Kalidindi, Mozziyar Etemadi, Florencia Garcia-Vicente, David Melnick, Greg S. Corrado, Lily Peng, Krish Eswaran, Daniel Tse, Neeral Beladia, Yun Liu, Po-Hsuan Cameron Chen, Shravya Shetty

    Abstract: Chest radiography (CXR) is the most widely-used thoracic clinical imaging modality and is crucial for guiding the management of cardiothoracic conditions. The detection of specific CXR findings has been the main focus of several artificial intelligence (AI) systems. However, the wide range of possible CXR abnormalities makes it impractical to build specific systems to detect every possible conditi… ▽ More

    Submitted 29 October, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Journal ref: Nature Scientific Reports (2021)

  45. Predicting Risk of Develo** Diabetic Retinopathy using Deep Learning

    Authors: Ashish Bora, Siva Balasubramanian, Boris Babenko, Sunny Virmani, Subhashini Venugopalan, Akinori Mitani, Guilherme de Oliveira Marinho, Jorge Cuadros, Paisan Ruamviboonsuk, Greg S Corrado, Lily Peng, Dale R Webster, Avinash V Varadarajan, Naama Hammel, Yun Liu, Pinal Bavishi

    Abstract: Diabetic retinopathy (DR) screening is instrumental in preventing blindness, but faces a scaling challenge as the number of diabetic patients rises. Risk stratification for the development of DR may help optimize screening intervals to reduce costs while improving vision-related outcomes. We created and validated two versions of a deep learning system (DLS) to predict the development of mild-or-wo… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Journal ref: The Lancet Digital Health (2021)

  46. Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks

    Authors: Haoqiang Guo, Lu Peng, Jian Zhang, Fang Qi, Lide Duan

    Abstract: Recent studies identify that Deep learning Neural Networks (DNNs) are vulnerable to subtle perturbations, which are not perceptible to human visual system but can fool the DNN models and lead to wrong outputs. A class of adversarial attack network algorithms has been proposed to generate robust physical perturbations under different circumstances. These algorithms are the first efforts to move for… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: IGSC'2019 (https://shirazi21.wixsite.com/igsc2019archive) Best paper award

    MSC Class: 68-06 ACM Class: C.3

    Journal ref: 2019 Tenth International Green and Sustainable Computing Conference (IGSC)

  47. arXiv:2007.05500  [pdf, other

    cs.CV cs.LG eess.IV

    Scientific Discovery by Generating Counterfactuals using Image Translation

    Authors: Arunachalam Narayanaswamy, Subhashini Venugopalan, Dale R. Webster, Lily Peng, Greg Corrado, Paisan Ruamviboonsuk, Pinal Bavishi, Rory Sayres, Abigail Huang, Siva Balasubramanian, Michael Brenner, Philip Nelson, Avinash V. Varadarajan

    Abstract: Model explanation techniques play a critical role in understanding the source of a model's performance and making its decisions transparent. Here we investigate if explanation techniques can also be used as a mechanism for scientific discovery. We make three contributions: first, we propose a framework to convert predictions from explanation techniques to a mechanism of discovery. Second, we show… ▽ More

    Submitted 19 July, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted at MICCAI 2020. This version combines camera-ready and supplement

    Journal ref: MICCAI 2020

  48. arXiv:2006.11493  [pdf, other

    eess.SY

    Real-time LCC-HVDC Maximum Emergency Power Capacity Estimation Based on Local PMU Measurements

    Authors: Long Peng, Junbo Zhao, Yong Tang, Lamine Mili, Zhuoyuan Gu, Zongsheng Zheng

    Abstract: The adjustable capacity of a line-commutated-converter High Voltage Direct Current (LCC-HVDC) connected to a power system, called the LCC-HVDC maximum emergency power capability or HVDC-MC for short, plays an important role in determining the response of that system to a large disturbance. However, it is a challenging task to obtain an accurate HVDC-MC due to system model uncertainties as well as… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

    Comments: 11 pages, 17 figures

  49. arXiv:2006.11423  [pdf, other

    eess.SY

    An Adaptive MMC Synchronous Stability Control Method Based on Local PMU measurements

    Authors: Long Peng, Yong Tang, Lamine Mili, Yingbiao Li, Bing Zhao, Yijun Xu, Fan Cheng

    Abstract: Reducing the current is a common method to ensure the synchronous stability of a modular multilevel converter (MMC) when there is a short-circuit fault at its AC side. However, the uncertainty of the fault location of the AC system leads to a significant difference in the maximum allowable stable operating current during the fault. This paper proposes an adaptive MMC fault-current control method u… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 8 pages, 15 figures

  50. arXiv:2004.13761  [pdf

    eess.SP

    A Method for Vehicle Collision Risk Assessment through Inferring Driver's Braking Actions in Near-Crash Situations

    Authors: Liqun Peng, Miguel Angel Sotelo, Yi He, Yunfei Ai, Zhixiong Li

    Abstract: Driving information and data under potential vehicle crashes create opportunities for extensive real-world observations of driver behaviors and relevant factors that significantly influence the driving safety in emergency scenarios. Furthermore, the availability of such data also enhances the collision avoidance systems (CASs) by evaluating driver's actions in near-crash scenarios and providing ti… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 14 pages