Skip to main content

Showing 1–31 of 31 results for author: Sheng, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12357  [pdf

    eess.IV cs.CV

    Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI

    Authors: Di Xu, Xin Miao, Hengjie Liu, Jessica E. Scholey, Wensha Yang, Mary Feng, Michael Ohliger, Hui Lin, Yi Lao, Yang Yang, Ke Sheng

    Abstract: Purpose: 4D MRI with high spatiotemporal resolution is desired for image-guided liver radiotherapy. Acquiring densely sampling k-space data is time-consuming. Accelerated acquisition with sparse samples is desirable but often causes degraded image quality or long reconstruction time. We propose the Reconstruct Paired Conditional Generative Adversarial Network (Re-Con-GAN) to shorten the 4D MRI rec… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2405.02008  [pdf, other

    cs.CV

    DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

    Authors: Pei** Jia, Tuopu Wen, Ziang Luo, Mengmeng Yang, Kun Jiang, Zhiquan Lei, Xuewei Tang, Ziyuan Liu, Le Cui, Kehua Sheng, Bo Zhang, Diange Yang

    Abstract: Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV) perception. However, existing models still encounter challenges in producing realistic and consistent semantic map layouts. One prominent issue is the limited… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2401.06893  [pdf, other

    eess.IV cs.CV

    Local Gamma Augmentation for Ischemic Stroke Lesion Segmentation on MRI

    Authors: Jon Middleton, Marko Bauer, Kaining Sheng, Jacob Johansen, Mathias Perslev, Silvia Ingala, Mads Nielsen, Akshay Pai

    Abstract: The identification and localisation of pathological tissues in medical images continues to command much attention among deep learning practitioners. When trained on abundant datasets, deep neural networks can match or exceed human performance. However, the scarcity of annotated data complicates the training of these models. Data augmentation techniques can compensate for a lack of training samples… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Camera-ready version for Northern Lights Deep Learning Conference 2024, 7 pages, 2 figures

  4. arXiv:2401.02192  [pdf

    eess.IV cs.CV cs.LG

    Nodule detection and generation on chest X-rays: NODE21 Challenge

    Authors: Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy

    Abstract: Pulmonary nodules may be an early manifestation of lung cancer, the leading cause of cancer-related deaths among both men and women. Numerous studies have established that deep learning methods can yield high-performance levels in the detection of lung nodules in chest X-rays. However, the lack of gold-standard public datasets slows down the progression of the research and prevents benchmarking of… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures

  5. arXiv:2312.07221  [pdf, other

    cs.CV cs.AI

    Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

    Authors: Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu

    Abstract: Traditional 3D segmentation methods can only recognize a fixed range of classes that appear in the training set, which limits their application in real-world scenarios due to the lack of generalization ability. Large-scale visual-language pre-trained models, such as CLIP, have shown their generalization ability in the zero-shot 2D vision tasks, but are still unable to be applied to 3D semantic seg… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  6. arXiv:2311.02880  [pdf, other

    cs.LG

    MultiSPANS: A Multi-range Spatial-Temporal Transformer Network for Traffic Forecast via Structural Entropy Optimization

    Authors: Dongcheng Zou, Senzhang Wang, Xuefeng Li, Hao Peng, Yuandong Wang, Chunyang Liu, Kehua Sheng, Bo Zhang

    Abstract: Traffic forecasting is a complex multivariate time-series regression task of paramount importance for traffic management and planning. However, existing approaches often struggle to model complex multi-range dependencies using local spatiotemporal features and road network hierarchical knowledge. To address this, we propose MultiSPANS. First, considering that an individual recording point cannot r… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures, conference. The work has been accepted by WSDM2024

  7. arXiv:2309.10230  [pdf, other

    cs.CV

    Learning Point-wise Abstaining Penalty for Point Cloud Anomaly Detection

    Authors: Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao

    Abstract: LiDAR-based semantic scene understanding is an important module in the modern autonomous driving perception stack. However, identifying Out-Of-Distribution (OOD) points in a LiDAR point cloud is challenging as point clouds lack semantically rich features when compared with RGB images. We revisit this problem from the perspective of selective classification, which introduces a selective function in… ▽ More

    Submitted 19 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: codes is available at https://github.com/Daniellli/PAD.git

  8. arXiv:2309.10227  [pdf

    eess.IV cs.CV

    Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer

    Authors: Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng

    Abstract: Dynamic magnetic resonance imaging (DMRI) is an effective imaging tool for diagnosis tasks that require motion tracking of a certain anatomy. To speed up DMRI acquisition, k-space measurements are commonly undersampled along spatial or spatial-temporal domains. The difficulty of recovering useful information increases with increasing undersampling ratios. Compress sensing was invented for this pur… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 Workshop

  9. arXiv:2304.08491  [pdf, other

    cs.CV

    Delving into Shape-aware Zero-shot Semantic Segmentation

    Authors: Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

    Abstract: Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy. However, translating this success to semantic segmentation is not trivial, because this dense prediction task requires not only accurate semantic understanding but also fine shape delineation an… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023, code: https://github.com/Liuxinyv/SAZS

  10. arXiv:2302.09696  [pdf

    eess.IV cs.CV

    An Efficient and Robust Method for Chest X-Ray Rib Suppression that Improves Pulmonary Abnormality Diagnosis

    Authors: Di Xu, Qifan Xu, Kevin Nhieu, Dan Ruan, Ke Sheng

    Abstract: Suppression of thoracic bone shadows on chest X-rays (CXRs) has been indicated to improve the diagnosis of pulmonary disease. Previous approaches can be categorized as unsupervised physical and supervised deep learning models. Nevertheless, with physical models able to preserve morphological details but at the cost of extremely long processing time, existing DL methods face challenges of gathering… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

  11. Generative Domain Adaptation for Face Anti-Spoofing

    Authors: Qianyu Zhou, Ke-Yue Zhang, Tai** Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma

    Abstract: Face anti-spoofing (FAS) approaches based on unsupervised domain adaption (UDA) have drawn growing attention due to promising performances for target scenarios. Most existing UDA FAS methods typically fit the trained models to the target domain via aligning the distribution of semantic high-level features. However, insufficient supervision of unlabeled target domains and neglect of low-level featu… ▽ More

    Submitted 11 September, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to European Conference on Computer Vision (ECCV), 2022

  12. arXiv:2206.11134  [pdf, other

    cs.CV

    Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization

    Authors: Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke Li

    Abstract: Open-vocabulary object detection (OVD) aims to scale up vocabulary size to detect objects of novel categories beyond the training vocabulary. Recent work resorts to the rich knowledge in pre-trained vision-language models. However, existing methods are ineffective in proposal-level vision-language alignment. Meanwhile, the models usually suffer from confidence bias toward base categories and perfo… ▽ More

    Submitted 24 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  13. arXiv:2206.06829  [pdf, other

    cs.CV

    Efficient Decoder-free Object Detection with Transformers

    Authors: Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

    Abstract: Vision transformers (ViTs) are changing the landscape of object detection approaches. A natural usage of ViTs in detection is to replace the CNN-based backbone with a transformer-based backbone, which is straightforward and effective, with the price of bringing considerable computation burden for inference. More subtle usage is the DETR family, which eliminates the need for many hand-designed comp… ▽ More

    Submitted 16 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Update metadata, 10 pages

  14. arXiv:2203.12217  [pdf, other

    cs.CV

    Training-free Transformer Architecture Search

    Authors: Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji

    Abstract: Recently, Vision Transformer (ViT) has achieved remarkable success in several computer vision tasks. The progresses are highly relevant to the architecture design, then it is worthwhile to propose Transformer Architecture Search (TAS) to search for better ViTs automatically. However, current TAS methods are time-consuming and existing zero-cost proxies in CNN do not generalize well to the ViT sear… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  15. arXiv:2203.10812  [pdf, other

    cs.CV

    ARM: Any-Time Super-Resolution Method

    Authors: Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji

    Abstract: This paper proposes an Any-time super-Resolution Method (ARM) to tackle the over-parameterized single image super-resolution (SISR) models. Our ARM is motivated by three observations: (1) The performance of different image patches varies with SISR networks of different sizes. (2) There is a tradeoff between computation overhead and performance of the reconstructed image. (3) Given an input image,… ▽ More

    Submitted 18 July, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022

  16. arXiv:2202.07173  [pdf, other

    eess.IV cs.CV

    To what extent can Plug-and-Play methods outperform neural networks alone in low-dose CT reconstruction

    Authors: Qifan Xu, Qihui Lyu, Dan Ruan, Ke Sheng

    Abstract: The Plug-and-Play (PnP) framework was recently introduced for low-dose CT reconstruction to leverage the interpretability and the flexibility of model-based methods to incorporate various plugins, such as trained deep learning (DL) neural networks. However, the benefits of PnP vs. state-of-the-art DL methods have not been clearly demonstrated. In this work, we proposed an improved PnP framework to… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to IEEE ISBI 2022

  17. arXiv:2202.02457  [pdf, other

    cs.NI eess.SP

    Path Planning for the Dynamic UAV-Aided Wireless Systems using Monte Carlo Tree Search

    Authors: Yuwen Qian, Kexin Sheng, Chuan Ma, Jun Li, Ming Ding, Mahbub Hassan

    Abstract: For UAV-aided wireless systems, online path planning attracts much attention recently. To better adapt to the real-time dynamic environment, we, for the first time, propose a Monte Carlo Tree Search (MCTS)-based path planning scheme. In details, we consider a single UAV acts as a mobile server to provide computation tasks offloading services for a set of mobile users on the ground, where the movem… ▽ More

    Submitted 13 January, 2022; originally announced February 2022.

  18. arXiv:2112.10474  [pdf, other

    cs.CV

    Reciprocal Normalization for Domain Adaptation

    Authors: Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Tai** Yao, Weiming Dong, Dengwen Zhou, Xing Sun

    Abstract: Batch normalization (BN) is widely used in modern deep neural networks, which has been shown to represent the domain-related knowledge, and thus is ineffective for cross-domain tasks like unsupervised domain adaptation (UDA). Existing BN variant methods aggregate source and target domain knowledge in the same channel in normalization module. However, the misalignment between the features of corres… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: The best feature normalization module for domain adaptation

  19. arXiv:2108.01390  [pdf, other

    cs.CV

    Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

    Authors: Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun

    Abstract: Vision transformers (ViTs) have recently received explosive popularity, but the huge computational cost is still a severe issue. Since the computation complexity of ViT is quadratic with respect to the input sequence length, a mainstream paradigm for computation reduction is to reduce the number of tokens. Existing designs include structured spatial compression that uses a progressive shrinking py… ▽ More

    Submitted 6 December, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: We propose a novel and effective design for dynamic vision transformer to achieve better computational efficiency. The code is available at https://github.com/YifanXu74/Evo-ViT

  20. arXiv:2106.16128  [pdf, other

    cs.CV

    Dual Reweighting Domain Generalization for Face Presentation Attack Detection

    Authors: Shubao Liu, Ke-Yue Zhang, Tai** Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Yuan Xie, Lizhuang Ma

    Abstract: Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios. Previous methods treat each sample from multiple domains indiscriminately during the training process, and endeavor to extract a common feature space to improve the generalization. However, due to complex and biased data distribution, directly treating them e… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: accepted on IJCAI 2021

  21. arXiv:2105.02453  [pdf, other

    cs.CV

    Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing

    Authors: Zhihong Chen, Tai** Yao, Kekai Sheng, Shouhong Ding, Ying Tai, Jilin Li, Feiyue Huang, Xinyu **

    Abstract: Face anti-spoofing approach based on domain generalization(DG) has drawn growing attention due to its robustness forunseen scenarios. Existing DG methods assume that the do-main label is known.However, in real-world applications, thecollected dataset always contains mixture domains, where thedomain label is unknown. In this case, most of existing meth-ods may not work. Further, even if we can obta… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in AAAI2021

  22. arXiv:2104.10376  [pdf, other

    cs.CV cs.LG

    Towards Corruption-Agnostic Robust Domain Adaptation

    Authors: Yifan Xu, Kekai Sheng, Weiming Dong, Baoyuan Wu, Changsheng Xu, Bao-Gang Hu

    Abstract: Big progress has been achieved in domain adaptation in decades. Existing works are always based on an ideal assumption that testing target domain are i.i.d. with training target domains. However, due to unpredictable corruptions (e.g., noise and blur) in real data like web images, domain adaptation methods are increasingly required to be corruption robust on target domains. In this paper, we inves… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: The first literature to investigate the topic of corruption-agnostic robust domain adaptation, a new practical and challenging domain adaptation setting

  23. arXiv:2103.13561  [pdf, other

    cs.CV

    On Evolving Attention Towards Domain Adaptation

    Authors: Kekai Sheng, Ke Li, Xiawu Zheng, Jian Liang, Weiming Dong, Feiyue Huang, Rongrong Ji, Xing Sun

    Abstract: Towards better unsupervised domain adaptation (UDA). Recently, researchers propose various domain-conditioned attention modules and make promising progresses. However, considering that the configuration of attention, i.e., the type and the position of attention module, affects the performance significantly, it is more generalized to optimize the attention configuration automatically to be speciali… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Among the first to study arbitrary domain adaptation from the perspective of network architecture design

  24. arXiv:2012.02621  [pdf, other

    cs.CV

    Effective Label Propagation for Discriminative Semi-Supervised Domain Adaptation

    Authors: Zhiyong Huang, Kekai Sheng, Weiming Dong, Xing Mei, Chongyang Ma, Feiyue Huang, Dengwen Zhou, Changsheng Xu

    Abstract: Semi-supervised domain adaptation (SSDA) methods have demonstrated great potential in large-scale image classification tasks when massive labeled data are available in the source domain but very few labeled samples are provided in the target domain. Existing solutions usually focus on feature alignment between the two domains while paying little attention to the discrimination capability of learne… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  25. arXiv:2011.05323  [pdf, other

    cs.RO

    Robotic Exploration of Unknown 2D Environment Using a Frontier-based Automatic-Differentiable Information Gain Measure

    Authors: Di Deng, Runlin Duan, Jiahong Liu, Kuangjie Sheng, Kenji Shimada

    Abstract: At the heart of path-planning methods for autonomous robotic exploration is a heuristic which encourages exploring unknown regions of the environment. Such heuristics are typically computed using frontier-based or information-theoretic methods. Frontier-based methods define the information gain of an exploration path as the number of boundary cells, or frontiers, which are visible from the path. H… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  26. arXiv:2010.08209  [pdf, other

    cs.CV

    Human Perception-based Evaluation Criterion for Ultra-high Resolution Cell Membrane Segmentation

    Authors: Ruohua Shi, Wenyao Wang, Zhixuan Li, Liuyuan He, Kaiwen Sheng, Lei Ma, Kai Du, Tingting Jiang, Tiejun Huang

    Abstract: Computer vision technology is widely used in biological and medical data analysis and understanding. However, there are still two major bottlenecks in the field of cell membrane segmentation, which seriously hinder further research: lack of sufficient high-quality data and lack of suitable evaluation criteria. In order to solve these two problems, this paper first proposes an Ultra-high Resolution… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: submitted to ICLR 2021

  27. arXiv:2005.10052  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Lung Segmentation from Chest X-rays using Variational Data Imputation

    Authors: Raghavendra Selvan, Erik B. Dam, Nicki S. Detlefsen, Sofus Rischel, Kaining Sheng, Mads Nielsen, Akshay Pai

    Abstract: Pulmonary opacification is the inflammation in the lungs caused by many respiratory ailments, including the novel corona virus disease 2019 (COVID-19). Chest X-rays (CXRs) with such opacifications render regions of lungs imperceptible, making it difficult to perform automated image analysis on them. In this work, we focus on segmenting lungs from such abnormal CXRs as part of a pipeline aimed at a… ▽ More

    Submitted 7 July, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to be presented at the first Workshop on the Art of Learning with Missing Values (Artemiss) hosted by the 37th International Conference on Machine Learning (ICML). Source code, training data and the trained models are available here: https://github.com/raghavian/lungVAE/

  28. arXiv:2005.09973  [pdf, other

    cs.CV

    Dynamic Refinement Network for Oriented and Densely Packed Object Detection

    Authors: Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

    Abstract: Object detection has achieved remarkable progress in the past decade. However, the detection of oriented and densely packed objects remains challenging because of following inherent reasons: (1) receptive fields of neurons are all axis-aligned and of the same shape, whereas objects are usually of diverse shapes and align along various directions; (2) detection models are typically trained with gen… ▽ More

    Submitted 10 June, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted by CVPR 2020 as Oral

  29. arXiv:1911.11419  [pdf, other

    cs.CV

    Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning

    Authors: Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma

    Abstract: Visual aesthetic assessment has been an active research field for decades. Although latest methods have achieved promising performance on benchmark datasets, they typically rely on a large number of manual annotations including both aesthetic labels and related image attributes. In this paper, we revisit the problem of image aesthetic assessment from the self-supervised feature learning perspectiv… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: AAAI Conference on Artificial Intelligence, 2020, accepted

    Journal ref: Proceedings of AAAI Conference on Articial Intelligence 2020

  30. arXiv:1906.01795  [pdf, other

    cs.CV

    Fully Automated Pancreas Segmentation with Two-stage 3D Convolutional Neural Networks

    Authors: Ningning Zhao, Nuo Tong, Dan Ruan, Ke Sheng

    Abstract: Due to the fact that pancreas is an abdominal organ with very large variations in shape and size, automatic and accurate pancreas segmentation can be challenging for medical image analysis. In this work, we proposed a fully automated two stage framework for pancreas segmentation based on convolutional neural networks (CNN). In the first stage, a U-Net is trained for the down-sampled 3D volume segm… ▽ More

    Submitted 25 July, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: This paper has been accepted by MICCAI 2019

  31. arXiv:1707.07089  [pdf, other

    cs.CV

    Motion Compensated Dynamic MRI Reconstruction with Local Affine Optical Flow Estimation

    Authors: Ningning Zhao, Daniel O'Connor, Adrian Basarab, Dan Ruan, Peng Hu, Ke Sheng

    Abstract: This paper proposes a novel framework to reconstruct the dynamic magnetic resonance images (DMRI) with motion compensation (MC). Due to the inherent motion effects during DMRI acquisition, reconstruction of DMRI using motion estimation/compensation (ME/MC) has been studied under a compressed sensing (CS) scheme. In this paper, by embedding the intensity-based optical flow (OF) constraint into the… ▽ More

    Submitted 13 February, 2019; v1 submitted 21 July, 2017; originally announced July 2017.