Skip to main content

Showing 1–50 of 94 results for author: Mao, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  2. arXiv:2405.12569  [pdf, other

    eess.SP

    TypeII-CsiNet: CSI Feedback with TypeII Codebook

    Authors: Yiliang Sang, Ke Ma, Yang Ming, ** Lian, Zhaocheng Wang

    Abstract: The latest TypeII codebook selects partial strongest angular-delay ports for the feedback of downlink channel state information (CSI), whereas its performance is limited due to the deficiency of utilizing the correlations among the port coefficients. To tackle this issue, we propose a tailored autoencoder named TypeII-CsiNet to effectively integrate the TypeII codebook with deep learning, wherein… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2404.01672  [pdf, other

    cs.IT eess.SP

    The Meta Distribution of the SIR in Joint Communication and Sensing Networks

    Authors: Kun Ma, Chenyuan Feng, Giovanni Geraci, Howard H. Yang

    Abstract: In this paper, we introduce a novel mathematical framework for assessing the performance of joint communication and sensing (JCAS) in wireless networks, employing stochastic geometry as an analytical tool. We focus on deriving the meta distribution of the signal-to-interference ratio (SIR) for JCAS networks. This approach enables a fine-grained quantification of individual user or radar performanc… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  4. arXiv:2404.00252  [pdf, other

    eess.IV cs.CV

    Learned Scanpaths Aid Blind Panoramic Video Quality Assessment

    Authors: Kanglong Fan, Wen Wen, Mu Li, Yifan Peng, Kede Ma

    Abstract: Panoramic videos have the advantage of providing an immersive and interactive viewing experience. Nevertheless, their spherical nature gives rise to various and uncertain user viewing behaviors, which poses significant challenges for panoramic video quality assessment (PVQA). In this work, we propose an end-to-end optimized, blind PVQA method with explicit modeling of user viewing patterns through… ▽ More

    Submitted 15 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  5. arXiv:2403.12369  [pdf, other

    eess.SP

    Block-Dominant Compressed Sensing for Near-Field Communications: Fundamentals, Solutions and Future Directions

    Authors: Liyang Lu, Ke Ma, Zhaocheng Wang

    Abstract: Near-field (NF) communications draw much attention in the context of extremely large-scale antenna arrays (ELAA). Owing to a large number of antennas and high carrier frequency, the NF coverage distance is quite substantial, where the electromagnetic radiation propagates by spherical waves, in contrast to the conventional planar waves of the far-field. Motivated by these facts, the block-dominant… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE for possible publication

  6. arXiv:2402.19276  [pdf, other

    eess.IV cs.CV

    Modular Blind Video Quality Assessment

    Authors: Wen Wen, Mu Li, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang, Kede Ma

    Abstract: Blind video quality assessment (BVQA) plays a pivotal role in evaluating and improving the viewing experience of end-users across a wide range of video-based platforms and services. Contemporary deep learning-based models primarily analyze video content in its aggressively subsampled format, while being blind to the impact of the actual spatial resolution and frame rate on video quality. In this p… ▽ More

    Submitted 31 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR 2024; Camera-ready version

  7. arXiv:2402.11419  [pdf, other

    eess.SP

    A Self-Healing Magnetic-Array-Type Current Sensor with Data-Driven Identification of Abnormal Magnetic Measurement Units

    Authors: Xiaohu Liu, Wei Zhao, Kang Ma, Jian Liu, Lisha Peng, Songling Huang, Shisong Li

    Abstract: Magnetic-array-type current sensors have garnered increasing popularity owing to their notable advantages, including broadband functionality, a large dynamic range, cost-effectiveness, and compact dimensions. However, the susceptibility of the measurement error of one or more magnetic measurement units (MMUs) within the current sensor to drift significantly from the nominal value due to environmen… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  8. arXiv:2402.11250  [pdf, other

    eess.IV cs.CV cs.MM

    Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression

    Authors: Dingquan Li, Kede Ma, **g Wang, Ge Li

    Abstract: The Geometry-based Point Cloud Compression (G-PCC) has been developed by the Moving Picture Experts Group to compress point clouds. In its lossy mode, the reconstructed point cloud by G-PCC often suffers from noticeable distortions due to the naïve geometry quantization (i.e., grid downsampling). This paper proposes a hierarchical prior-based super resolution method for point cloud geometry compre… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  9. arXiv:2402.05817  [pdf

    eess.IV cs.CV cs.LG

    Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging

    Authors: Pouria Yazdian Anari, Fiona Obiezu, Nathan Lay, Fatemeh Dehghani Firouzabadi, Aditi Chaurasia, Mahshid Golagha, Shiva Singh, Fatemeh Homayounieh, Aryan Zahergivar, Stephanie Harmon, Evrim Turkbey, Rabindra Gautam, Kevin Ma, Maria Merino, Elizabeth C. Jones, Mark W. Ball, W. Marston Linehan, Baris Turkbey, Ashkan A. Malayeri

    Abstract: Introduction This study explores the use of the latest You Only Look Once (YOLO V7) object detection method to enhance kidney detection in medical imaging by training and testing a modified YOLO V7 on medical image formats. Methods Study includes 878 patients with various subtypes of renal cell carcinoma (RCC) and 206 patients with normal kidneys. A total of 5657 MRI scans for 1084 patients were r… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  10. Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters

    Authors: Dafeng Zhu, Bo Yang, Yu Wu, Haoran Deng, Zhaoyang Dong, Kai Ma, ** Guan

    Abstract: This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Smart Grid, 2023

  11. arXiv:2312.01679  [pdf, other

    eess.IV cs.CV cs.LG

    Adversarial Medical Image with Hierarchical Feature Hiding

    Authors: Qingsong Yao, Zecheng He, Yuexiang Li, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

    Abstract: Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon an… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Our code is available at \url{https://github.com/qsyao/Hierarchical_Feature_Constraint}. arXiv admin note: text overlap with arXiv:2012.09501

  12. arXiv:2310.12877  [pdf, other

    eess.IV cs.CV

    Perceptual Assessment and Optimization of HDR Image Rendering

    Authors: Peibei Cao, Rafal K. Mantiuk, Kede Ma

    Abstract: High dynamic range (HDR) rendering has the ability to faithfully reproduce the wide luminance ranges in natural scenes, but how to accurately assess the rendering quality is relatively underexplored. Existing quality models are mostly designed for low dynamic range (LDR) images, and do not align well with human perception of HDR image quality. To fill this gap, we propose a family of HDR quality m… ▽ More

    Submitted 16 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  13. arXiv:2310.05962  [pdf, other

    cs.IT cs.LG eess.SP

    Improving the Performance of R17 Type-II Codebook with Deep Learning

    Authors: Ke Ma, Yiliang Sang, Yang Ming, ** Lian, Chang Tian, Zhaocheng Wang

    Abstract: The Type-II codebook in Release 17 (R17) exploits the angular-delay-domain partial reciprocity between uplink and downlink channels to select part of angular-delay-domain ports for measuring and feeding back the downlink channel state information (CSI), where the performance of existing deep learning enhanced CSI feedback methods is limited due to the deficiency of sparse structures. To address th… ▽ More

    Submitted 13 September, 2023; originally announced October 2023.

    Comments: Accepted by IEEE GLOBECOM 2023, conference version of Arxiv:2305.08081

  14. arXiv:2309.12461  [pdf, other

    eess.SY cs.NI

    Knowledge Base Aware Semantic Communication in Vehicular Networks

    Authors: Le Xia, Yao Sun, Dusit Niyato, Kairong Ma, Jiawen Kang, Muhammad Ali Imran

    Abstract: Semantic communication (SemCom) has recently been considered a promising solution for the inevitable crisis of scarce communication resources. This trend stimulates us to explore the potential of applying SemCom to vehicular networks, which normally consume a tremendous amount of resources to achieve stringent requirements on high reliability and low latency. Unfortunately, the unique background k… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted for publication by 2023 IEEE International Conference on Communications (ICC 2023). Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: substantial text overlap with arXiv:2302.11993

  15. Artificial-Intelligence-Based Triple Phase Shift Modulation for Dual Active Bridge Converter with Minimized Current Stress

    Authors: Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

    Abstract: The dual active bridge (DAB) converter has been popular in many applications for its outstanding power density and bidirectional power transfer capacity. Up to now, triple phase shift (TPS) can be considered as one of the most advanced modulation techniques for DAB converter. It can widen zero voltage switching range and improve power efficiency significantly. Currently, current stress of the DAB… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 12 pages, 29 figures

  16. Artificial-Intelligence-Based Hybrid Extended Phase Shift Modulation for the Dual Active Bridge Converter with Full ZVS Range and Optimal Efficiency

    Authors: Xinze Li, Xin Zhang, Fanfan Lin, Changjiang Sun, Kezhi Mao

    Abstract: Dual active bridge (DAB) converter is the key enabler in many popular applications such as wireless charging, electric vehicle and renewable energy. ZVS range and efficiency are two significant performance indicators for DAB converter. To obtain the desired ZVS and efficiency performance, modulation should be carefully designed. Hybrid modulation considers several single modulation strategies to a… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 13 pages, 32 figures

  17. arXiv:2307.13981  [pdf, other

    cs.CV cs.MM eess.IV

    Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models

    Authors: Wei Sun, Wen Wen, Xiongkuo Min, Long Lan, Guangtao Zhai, Kede Ma

    Abstract: Blind video quality assessment (BVQA) plays an indispensable role in monitoring and improving the end-users' viewing experience in various real-world video-enabled media applications. As an experimental field, the improvements of BVQA models have been measured primarily on a few human-rated VQA datasets. Thus, it is crucial to gain a better understanding of existing VQA datasets in order to proper… ▽ More

    Submitted 3 April, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

  18. arXiv:2307.09570  [pdf, other

    eess.IV cs.CV

    SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology

    Authors: **gwei Zhang, Ke Ma, Saarthak Kapse, Joel Saltz, Maria Vakalopoulou, Prateek Prasanna, Dimitris Samaras

    Abstract: Semantic segmentations of pathological entities have crucial clinical value in computational pathology workflows. Foundation models, such as the Segment Anything Model (SAM), have been recently proposed for universal use in segmentation tasks. SAM shows remarkable promise in instance segmentation on natural images. However, the applicability of SAM to computational pathology tasks is limited due t… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Submitted to MedAGI 2023

  19. arXiv:2305.10353  [pdf, other

    eess.SP cs.LG cs.NI

    An Ensemble Learning Approach for Exercise Detection in Type 1 Diabetes Patients

    Authors: Ke Ma, Hongkai Chen, Shan Lin

    Abstract: Type 1 diabetes is a serious disease in which individuals are unable to regulate their blood glucose levels, leading to various medical complications. Artificial pancreas (AP) systems have been developed as a solution for type 1 diabetic patients to mimic the behavior of the pancreas and regulate blood glucose levels. However, current AP systems lack detection capabilities for exercise-induced glu… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 10 pages, 7 figures, 2 tables

    MSC Class: 68T07 (Primary) 34A05 (Secondary) ACM Class: J.3

  20. arXiv:2305.00837  [pdf, other

    eess.IV cs.CV cs.LG

    LCAUnet: A skin lesion segmentation network with enhanced edge and body fusion

    Authors: Qisen Ma, Keming Mao, Gao Wang, Lisheng Xu, Yuhai Zhao

    Abstract: Accurate segmentation of skin lesions in dermatoscopic images is crucial for the early diagnosis of skin cancer and improving the survival rate of patients. However, it is still a challenging task due to the irregularity of lesion areas, the fuzziness of boundaries, and other complex interference factors. In this paper, a novel LCAUnet is proposed to improve the ability of complementary representa… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 14 pages, 10 figures

  21. arXiv:2303.15043  [pdf, other

    cs.CV eess.IV

    Joint Video Multi-Frame Interpolation and Deblurring under Unknown Exposure Time

    Authors: Wei Shang, Dongwei Ren, Yi Yang, Hongzhi Zhang, Kede Ma, Wangmeng Zuo

    Abstract: Natural videos captured by consumer cameras often suffer from low framerate and motion blur due to the combination of dynamic scene complexity, lens and sensor imperfection, and less than ideal exposure setting. As a result, computational methods that jointly perform video frame interpolation and deblurring begin to emerge with the unrealistic assumption that the exposure time is known and fixed.… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023, available at https://github.com/shangwei5/VIDUE

    ACM Class: I.4.3

  22. arXiv:2303.14964  [pdf, other

    cs.CV eess.IV

    Learning a Deep Color Difference Metric for Photographic Images

    Authors: Haoyu Chen, Zhihua Wang, Yang Yang, Qilin Sun, Kede Ma

    Abstract: Most well-established and widely used color difference (CD) metrics are handcrafted and subject-calibrated against uniformly colored patches, which do not generalize well to photographic images characterized by natural scene complexities. Constructing CD formulae for photographic images is still an active research topic in imaging/illumination, vision science, and color science communities. In thi… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  23. arXiv:2303.09400  [pdf, other

    eess.SP

    Enhancing Vital Sign Estimation Performance of FMCW MIMO Radar by Prior Human Shape Recognition

    Authors: Hadi Alidoustaghdam, Min Chen, Ben Willetts, Kai Mao, André Kokkeler, Yang Miao

    Abstract: Radio technology enabled contact-free human posture and vital sign estimation is promising for health monitoring. Radio systems at millimeter-wave (mmWave) frequencies advantageously bring large bandwidth, multi-antenna array and beam steering capability. \textit{However}, the human point cloud obtained by mmWave radar and utilized for posture estimation is likely to be sparse and incomplete. Addi… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted for presentation at the IEEE ICC 2023 conference

  24. arXiv:2212.13059  [pdf

    eess.IV cs.CV

    OMSN and FAROS: OCTA Microstructure Segmentation Network and Fully Annotated Retinal OCTA Segmentation Dataset

    Authors: Peng Xiao, Xiaodong Hu, Ke Ma, Gengyuan Wang, Ziqing Feng, Yuancong Huang, ** Yuan

    Abstract: The lack of efficient segmentation methods and fully-labeled datasets limits the comprehensive assessment of optical coherence tomography angiography (OCTA) microstructures like retinal vessel network (RVN) and foveal avascular zone (FAZ), which are of great value in ophthalmic and systematic diseases evaluation. Here, we introduce an innovative OCTA microstructure segmentation network (OMSN) by c… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: 10 pages, 6 figures, submitted to IEEE Transactions on Medical Imaging (TMI)

  25. arXiv:2212.02764  [pdf, other

    eess.IV cs.CV cs.LG

    A Trustworthy Framework for Medical Image Analysis with Deep Learning

    Authors: Kai Ma, Siyuan He, Pengcheng Xi, Ashkan Ebadi, Stéphane Tremblay, Alexander Wong

    Abstract: Computer vision and machine learning are playing an increasingly important role in computer-assisted diagnosis; however, the application of deep learning to medical imaging has challenges in data availability and data imbalance, and it is especially important that models for medical imaging are built to be trustworthy. Therefore, we propose TRUDLMIA, a trustworthy deep learning framework for medic… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  26. arXiv:2210.03904  [pdf, other

    cs.CV eess.IV

    LW-ISP: A Lightweight Model with ISP and Deep Learning

    Authors: Hongyang Chen, Kaisheng Ma

    Abstract: The deep learning (DL)-based methods of low-level tasks have many advantages over the traditional camera in terms of hardware prospects, error accumulation and imaging effects. Recently, the application of deep learning to replace the image signal processing (ISP) pipeline has appeared one after another; however, there is still a long way to go towards real landing. In this paper, we show the poss… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 16 PAGES, ACCEPTED AS A CONFERENCE PAPER AT: BMVC 2022

  27. arXiv:2210.02245  [pdf, other

    eess.SP eess.IV

    Channel Modeling for UAV-to-Ground Communications with Posture Variation and Fuselage Scattering Effect

    Authors: Boyu Hua, Haoran Ni, Qiuming Zhu, Cheng-Xiang Wang, Tongtong Zhou, Kai Mao, Junwei Bao, Xiaofei Zhang

    Abstract: Unmanned aerial vehicle (UAV)-to-ground (U2G) channel models play a pivotal role for reliable communications between UAV and ground terminal. This paper proposes a three-dimensional (3D) non-stationary hybrid model including both large-scale and small-scale fading for U2G multiple-input-multiple-output (MIMO) channels. Distinctive channel characteristics under U2G scenarios, i.e., 3D trajectory an… ▽ More

    Submitted 13 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  28. arXiv:2210.00933  [pdf, other

    cs.CV eess.IV

    Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

    Authors: Weixia Zhang, Dingquan Li, Xiongkuo Min, Guangtao Zhai, Guodong Guo, Xiaokang Yang, Kede Ma

    Abstract: No-reference image quality assessment (NR-IQA) aims to quantify how humans perceive visual distortions of digital images without access to their undistorted references. NR-IQA models are extensively studied in computational vision, and are widely used for performance evaluation and perceptual optimization of man-made vision systems. Here we make one of the first attempts to examine the perceptual… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  29. arXiv:2209.08800  [pdf, ps, other

    eess.SP

    A Realistic 3D Non-Stationary Channel Model for UAV-to-Vehicle Communications Incorporating Fuselage Posture

    Authors: Boyu Hua, Tongtong Zhou, Qiuming Zhu, Kai Mao, Junwei Bao, Weizhi Zhong, Naeem Ahmed

    Abstract: Considering the unmanned aerial vehicle (UAV) three-dimensional (3D) posture, a novel 3D non-stationary geometry-based stochastic model (GBSM) is proposed for multiple-input multiple-output (MIMO) UAV-to-vehicle (U2V) channels. It consists of a line-of-sight (LoS) and non-line-of-sight (NLoS) components. The factor of fuselage posture is considered by introducing a time-variant 3D posture matrix.… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 12 pages, 8 figures, CNCOM

  30. arXiv:2207.09312  [pdf, other

    eess.IV cs.CV cs.LG

    Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for COVID-19 Screening With Chest Radiography

    Authors: Kai Ma, Pengcheng Xi, Karim Habashy, Ashkan Ebadi, Stéphane Tremblay, Alexander Wong

    Abstract: Building AI models with trustworthiness is important especially in regulated areas such as healthcare. In tackling COVID-19, previous work uses convolutional neural networks as the backbone architecture, which has shown to be prone to over-caution and overconfidence in making decisions, rendering them less trustworthy -- a crucial flaw in the context of medical imaging. In this study, we propose a… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted to 39th International Conference on Machine Learning, Workshop on Healthcare AI and COVID-19

  31. arXiv:2206.09146  [pdf, other

    eess.IV cs.AI cs.CV

    A Perceptually Optimized and Self-Calibrated Tone Map** Operator

    Authors: Peibei Cao, Chenyang Le, Yuming Fang, Kede Ma

    Abstract: With the increasing popularity and accessibility of high dynamic range (HDR) photography, tone map** operators (TMOs) for dynamic range compression are practically demanding. In this paper, we develop a two-stage neural network-based TMO that is self-calibrated and perceptually optimized. In Stage one, motivated by the physiology of the early stages of the human visual system, we first decompose… ▽ More

    Submitted 25 August, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 15 pages,17 figures

  32. arXiv:2206.08751  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment of Virtual Reality Videos in the Wild

    Authors: Wen Wen, Mu Li, Yiru Yao, Xiangjie Sui, Yabin Zhang, Long Lan, Yuming Fang, Kede Ma

    Abstract: Investigating how people perceive virtual reality (VR) videos in the wild (i.e., those captured by everyday users) is a crucial and challenging task in VR-related applications due to complex authentic distortions localized in space and time. Existing panoramic video databases only consider synthetic distortions, assume fixed viewing conditions, and are limited in size. To overcome these shortcomin… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  33. arXiv:2205.13489  [pdf, other

    cs.CV cs.GR eess.IV

    Measuring Perceptual Color Differences of Smartphone Photographs

    Authors: Zhihua Wang, Keshuo Xu, Yang Yang, Jianlei Dong, Shuhang Gu, Lihao Xu, Yuming Fang, Kede Ma

    Abstract: Measuring perceptual color differences (CDs) is of great importance in modern smartphone photography. Despite the long history, most CD measures have been constrained by psychophysical data of homogeneous color patches or a limited number of simplistic natural photographic images. It is thus questionable whether existing CD measures generalize in the age of smartphone photography characterized by… ▽ More

    Submitted 31 March, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 10 figures, 8 tables, 14 pages

  34. arXiv:2204.10090  [pdf, other

    eess.IV cs.CV

    Learn from Unpaired Data for Image Restoration: A Variational Bayes Approach

    Authors: Dihan Zheng, Xiaowen Zhang, Kaisheng Ma, Chenglong Bao

    Abstract: Collecting paired training data is difficult in practice, but the unpaired samples broadly exist. Current approaches aim at generating synthesized training data from unpaired samples by exploring the relationship between the corrupted and clean data. This work proposes LUD-VAE, a deep generative method to learn the joint probability density function from data sampled from marginal distributions. O… ▽ More

    Submitted 11 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  35. arXiv:2204.04088  [pdf, other

    eess.SY

    Stochastic Gradient-based Fast Distributed Multi-Energy Management for an Industrial Park with Temporally-Coupled Constraints

    Authors: Dafeng Zhu, Bo Yang, Chengbin Ma, Zhaojian Wang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Contemporary industrial parks are challenged by the growing concerns about high cost and low efficiency of energy supply. Moreover, in the case of uncertain supply/demand, how to mobilize delay-tolerant elastic loads and compensate real-time inelastic loads to match multi-energy generation/storage and minimize energy cost is a key issue. Since energy management is hardly to be implemented offline… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by Applied Energy

  36. arXiv:2203.07659  [pdf

    eess.IV cs.CV

    Breast Cancer Molecular Subtypes Prediction on Pathological Images with Discriminative Patch Selecting and Multi-Instance Learning

    Authors: Hong Liu, Wen-Dong Xu, Zi-Hao Shang, Xiang-Dong Wang, Hai-Yan Zhou, Ke-Wen Ma, Huan Zhou, Jia-Lin Qi, Jia-Rui Jiang, Li-Lan Tan, Hui-Min Zeng, Hui-Juan Cai, Kuan-Song Wang, Yue-Liang Qian

    Abstract: Molecular subtypes of breast cancer are important references to personalized clinical treatment. For cost and labor savings, only one of the patient's paraffin blocks is usually selected for subsequent immunohistochemistry (IHC) to obtain molecular subtypes. Inevitable sampling error is risky due to tumor heterogeneity and could result in a delay in treatment. Molecular subtype prediction from con… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  37. Conquering Data Variations in Resolution: A Slice-Aware Multi-Branch Decoder Network

    Authors: Shuxin Wang, Shilei Cao, Zhizhong Chai, Dong Wei, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Fully convolutional neural networks have made promising progress in joint liver and liver tumor segmentation. Instead of following the debates over 2D versus 3D networks (for example, pursuing the balance between large-scale 2D pretraining and 3D context), in this paper, we novelly identify the wide variation in the ratio between intra- and inter-slice resolutions as a crucial obstacle to the perf… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Published by IEEE TMI

  38. Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retina OCT Images

    Authors: Hong Liu, Dong Wei, Donghuan Lu, Yuexiang Li, Kai Ma, Liansheng Wang, Yefeng Zheng

    Abstract: Automated surface segmentation of retinal layer is important and challenging in analyzing optical coherence tomography (OCT). Recently, many deep learning based methods have been developed for this task and yield remarkable performance. However, due to large spatial gap and potential mismatch between the B-scans of OCT data, all of them are based on 2D segmentation of individual B-scans, which may… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: Presented at MICCAI 2021

  39. arXiv:2203.00270  [pdf, other

    cs.GT eess.SP

    Bidirectional Pricing and Demand Response for Nanogrids with HVAC Systems

    Authors: Jiaxin Cao, Bo Yang, Shanying Zhu, Kai Ma, ** Guan

    Abstract: Owing to the fluctuant renewable generation and power demand, the energy surplus or deficit in each nanogrid is embodied differently across time. To stimulate local renewable energy consumption and minimize the long-term energy cost, some issues still remain to be explored: when and how the energy demand and bidirectional trading prices are scheduled considering personal comfort preferences and en… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  40. Visual attention analysis of pathologists examining whole slide images of Prostate cancer

    Authors: Souradeep Chakraborty, Ke Ma, Rajarsi Gupta, Beatrice Knudsen, Gregory J. Zelinsky, Joel H. Saltz, Dimitris Samaras

    Abstract: We study the attention of pathologists as they examine whole-slide images (WSIs) of prostate cancer tissue using a digital microscope. To the best of our knowledge, our study is the first to report in detail how pathologists navigate WSIs of prostate cancer as they accumulate information for their diagnoses. We collected slide navigation data (i.e., viewport location, magnification level, and time… ▽ More

    Submitted 2 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: ISBI 2022 (Oral presentation)

  41. arXiv:2202.08195  [pdf, other

    eess.IV cs.CV q-bio.QM

    Nuclei Segmentation with Point Annotations from Pathology Images via Self-Supervised Learning and Co-Training

    Authors: Yi Lin, Zhiyong Qu, Hao Chen, Zhongke Gao, Yuexiang Li, Lili Xia, Kai Ma, Yefeng Zheng, Kwang-Ting Cheng

    Abstract: Nuclei segmentation is a crucial task for whole slide image analysis in digital pathology. Generally, the segmentation performance of fully-supervised learning heavily depends on the amount and quality of the annotated data. However, it is time-consuming and expensive for professional pathologists to provide accurate pixel-level ground truth, while it is much easier to get coarse labels such as po… ▽ More

    Submitted 17 August, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by MedIA

  42. Energy Management Based on Multi-Agent Deep Reinforcement Learning for A Multi-Energy Industrial Park

    Authors: Dafeng Zhu, Bo Yang, Yuxiang Liu, Zhaojian Wang, Kai Ma, ** Guan

    Abstract: Owing to large industrial energy consumption, industrial production has brought a huge burden to the grid in terms of renewable energy access and power supply. Due to the coupling of multiple energy sources and the uncertainty of renewable energy and demand, centralized methods require large calculation and coordination overhead. Thus, this paper proposes a multi-energy management framework achiev… ▽ More

    Submitted 11 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted by Applied Energy

    Journal ref: Applied Energy 311 (2022) 118636

  43. arXiv:2201.08388  [pdf, other

    eess.IV cs.CV

    Steerable Pyramid Transform Enables Robust Left Ventricle Quantification

    Authors: Xiangyang Zhu, Kede Ma, Wufeng Xue

    Abstract: Predicting cardiac indices has long been a focal point in the medical imaging community. While various deep learning models have demonstrated success in quantifying cardiac indices, they remain susceptible to mild input perturbations, e.g., spatial transformations, image distortions, and adversarial attacks. This vulnerability undermines confidence in using learning-based automated systems for dia… ▽ More

    Submitted 2 July, 2024; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Code is available at https://github.com/yangyangyang127/RobustLV

  44. arXiv:2112.13227  [pdf, other

    eess.IV cs.CV

    Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression

    Authors: Mu Li, Kede Ma, **xing Li, David Zhang

    Abstract: Although equirectangular projection (ERP) is a convenient form to store omnidirectional images (also known as 360-degree images), it is neither equal-area nor conformal, thus not friendly to subsequent visual communication. In the context of image compression, ERP will over-sample and deform things and stuff near the poles, making it difficult for perceptually optimal bit allocation. In convention… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.

  45. arXiv:2110.14209  [pdf, ps, other

    eess.SY

    Fast Distributed Stochastic Scheduling for A Multi-Energy Industrial Park

    Authors: Dafeng Zhu, Bo Yang, Zhaojian Wang, Chengbin Ma, Kai Ma, Shanying Zhu

    Abstract: The multi-energy management framework of industrial parks advocates energy conversion and scheduling, which takes full advantage of the compensation and temporal availability of multiple energy. However, how to exploit elastic loads and compensate inelastic loads to match multiple generators and storage is still a key problem under the uncertainty of demand and supply. To solve the issue, the ener… ▽ More

    Submitted 24 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  46. arXiv:2110.09699  [pdf, ps, other

    cs.CV eess.IV

    Image Quality Assessment in the Modern Age

    Authors: Kede Ma, Yuming Fang

    Abstract: This tutorial provides the audience with the basic theories, methodologies, and current progresses of image quality assessment (IQA). From an actionable perspective, we will first revisit several subjective quality assessment methodologies, with emphasis on how to properly select visual stimuli. We will then present in detail the design principles of objective quality assessment models, supplement… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: ACM Multimedia 2021 Tutorial

  47. Locally Adaptive Structure and Texture Similarity for Image Quality Assessment

    Authors: Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma

    Abstract: The latest advances in full-reference image quality assessment (IQA) involve unifying structure and texture similarity based on deep representations. The resulting Deep Image Structure and Texture Similarity (DISTS) metric, however, makes rather global quality measurements, ignoring the fact that natural photographic images are locally structured and textured across space and scale. In this paper,… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 29th ACM International Conference on Multimedia, 2021

  48. arXiv:2109.13930  [pdf, other

    eess.IV cs.CV

    All-Around Real Label Supervision: Cyclic Prototype Consistency Learning for Semi-supervised Medical Image Segmentation

    Authors: Zhe Xu, Yixin Wang, Donghuan Lu, Lequan Yu, Jiangpeng Yan, Jie Luo, Kai Ma, Yefeng Zheng, Raymond Kai-yu Tong

    Abstract: Semi-supervised learning has substantially advanced medical image segmentation since it alleviates the heavy burden of acquiring the costly expert-examined annotations. Especially, the consistency-based approaches have attracted more attention for their superior performance, wherein the real labels are only utilized to supervise their paired images via supervised loss while the unlabeled images ar… ▽ More

    Submitted 15 March, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: 11 pages

  49. arXiv:2109.11715  [pdf, other

    eess.IV cs.CV

    Training Automatic View Planner for Cardiac MR Imaging via Self-Supervision by Spatial Relationship between Views

    Authors: Dong Wei, Kai Ma, Yefeng Zheng

    Abstract: View planning for the acquisition of cardiac magnetic resonance imaging (CMR) requires acquaintance with the cardiac anatomy and remains a challenging task in clinical practice. Existing approaches to its automation relied either on an additional volumetric image not typically acquired in clinic routine, or on laborious manual annotations of cardiac structural landmarks. This work presents a clini… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Accepted by MICCAI 2021

  50. arXiv:2109.05298  [pdf, other

    eess.IV

    InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

    Authors: Hong Wang, Yuexiang Li, Haimiao Zhang, Jiawei Chen, Kai Ma, Deyu Meng, Yefeng Zheng

    Abstract: For the task of metal artifact reduction (MAR), although deep learning (DL)-based methods have achieved promising performances, most of them suffer from two problems: 1) the CT imaging geometry constraint is not fully embedded into the network during training, leaving room for further performance improvement; 2) the model interpretability is lack of sufficient consideration. Against these issues,… ▽ More

    Submitted 11 September, 2021; originally announced September 2021.

    Journal ref: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2021