Showing 1–50 of 52 results for author: Shao, L

Searching in archive eess.
  1. arXiv:2312.06454  [pdf, other]

    eess.IV cs.CV cs.LG

    Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images

    Authors: Bao Li, Zhenyu Liu, Lizhi Shao, Bensheng Qiu, Hong Bu, Jie Tian

    Abstract: Directly predicting human epidermal growth factor receptor 2 (HER2) status from widely available hematoxylin and eosin (HE)-stained whole slide images (WSIs) can reduce technical costs and expedite treatment selection. Accurately predicting HER2 requires large collections of multi-site WSIs. Federated learning enables collaborative training of these WSIs without gigabyte-size WSIs transportation a…

    Submitted 27 February, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  2. Synergistic Perception and Control Simplex for Verifiable Safe Vertical Landing

    Authors: Ayoosh Bansal, Yang Zhao, James Zhu, Sheng Cheng, Yuliang Gu, Hyung-Jin Yoon, Hunmin Kim, Naira Hovakimyan, Lui Sha

    Abstract: Perception, Planning, and Control form the essential components of autonomy in advanced air mobility. This work advances the holistic integration of these components to enhance the performance and robustness of the complete cyber-physical system. We adapt Perception Simplex, a system for verifiable collision avoidance amidst obstacle detection faults, to the vertical landing maneuver for autonomou…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: To appear in AIAA SciTech 2024

    ACM Class: C.3; C.4; J.7

    Journal ref: AIAA SCITECH 2024 Forum, p. 1167

  3. arXiv:2309.04710  [pdf, other]

    cs.RO cs.AI cs.CV cs.GR eess.SY

    Jade: A Differentiable Physics Engine for Articulated Rigid Bodies with Intersection-Free Frictional Contact

    Authors: Gang Yang, Siyuan Luo, Lin Shao

    Abstract: We present Jade, a differentiable physics engine for articulated rigid bodies. Jade models contacts as the Linear Complementarity Problem (LCP). Compared to existing differentiable simulations, Jade offers features including intersection-free collision simulation and stable LCP solutions for multiple frictional contacts. We use continuous collision detection to detect the time of impact and adopt…

    Submitted 9 September, 2023; originally announced September 2023.

  4. arXiv:2306.06102  [pdf, other]

    eess.SY

    Backup Plan Constrained Model Predictive Control with Guaranteed Stability

    Authors: Ran Tao, Hunmin Kim, Hyung-Jin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes and evaluates a new safety concept called backup plan safety for path planning of autonomous vehicles under mission uncertainty using model predictive control (MPC). Backup plan safety is defined as the ability to complete an alternative mission when the primary mission is aborted. To include this new safety concept in control problems, we formulate a feasibility maximization…

    Submitted 6 October, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  5. arXiv:2303.16860  [pdf, other]

    cs.LG eess.SY

    Physical Deep Reinforcement Learning Towards Safety Guarantee

    Authors: Hongpeng Cao, Yanbing Mao, Lui Sha, Marco Caccamo

    Abstract: Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces. However, the safety and stability still remain major concerns that hinder the applications of DRL to safety-critical autonomous systems. To address the concerns, we proposed the Phy-DRL: a physical deep reinforcement learnin…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Working Paper

  6. arXiv:2212.14735  [pdf]

    eess.SP

    Compressed domain vibration detection and classification for distributed acoustic sensing

    Authors: Xingliang Shen, Huan Wu, Kun Zhu, Yujia Li, Hua Zheng, Jialong Li, Liyang Shao, Perry Ping Shum, Chao Lu

    Abstract: Distributed acoustic sensing (DAS) is a novel enabling technology that can turn existing fibre optic networks to distributed acoustic sensors. However, it faces the challenges of transmitting, storing, and processing massive streams of data which are orders of magnitude larger than that collected from point sensors. The gap between intensive data generated by DAS and modern computing system with l…

    Submitted 27 December, 2022; originally announced December 2022.

  7. arXiv:2209.01710  [pdf, other]

    cs.RO cs.LG eess.SY

    Perception Simplex: Verifiable Collision Avoidance in Autonomous Vehicles Amidst Obstacle Detection Faults

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Advances in deep learning have revolutionized cyber-physical applications, including the development of Autonomous Vehicles. However, real-world collisions involving autonomous control of vehicles have raised significant safety concerns regarding the use of Deep Neural Networks (DNN) in safety-critical tasks, particularly Perception. The inherent unverifiability of DNNs poses a key challenge in en…

    Submitted 28 November, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14403

    ACM Class: D.2.11; I.2.9; C.4; J.7

    Journal ref: Software Testing, Verification and Reliability. 2024. e1879

  8. Verifiable Obstacle Detection

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Perception of obstacles remains a critical safety concern for autonomous vehicles. Real-world collisions have shown that the autonomy faults leading to fatal collisions originate from obstacle existence detection. Open source autonomous driving implementations show a perception pipeline with complex interdependent Deep Neural Networks. These networks are not fully verifiable, making them unsuitabl…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at ISSRE 2022

    ACM Class: D.2.4; I.2.9; I.4.8

    Journal ref: 33rd International Symposium on Software Reliability Engineering (ISSRE), pp. 61-72. IEEE, 2022

  9. arXiv:2205.01649  [pdf, other]

    eess.IV cs.CV

    Learning Enriched Features for Fast Image Restoration and Enhancement

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: Given a degraded input image, image restoration aims to recover the missing high-quality image content. Numerous applications demand effective image restoration, e.g., computational photography, surveillance, autonomous vehicles, and remote sensing. Significant advances in image restoration have been made in recent years, dominated by convolutional neural networks (CNNs). The widely-used CNN-based…

    Submitted 19 April, 2022; originally announced May 2022.

    Comments: This article supersedes arXiv:2003.06792. Accepted for publication in TPAMI

  10. arXiv:2112.05752  [pdf, other]

    eess.IV cs.CV

    Specificity-Preserving Federated Learning for MR Image Reconstruction

    Authors: Chun-Mei Feng, Yunlu Yan, Shanshan Wang, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Federated learning (FL) can be used to improve data privacy and efficiency in magnetic resonance (MR) image reconstruction by enabling multiple institutions to collaborate without needing to aggregate local data. However, the domain shift caused by different MR imaging protocols can substantially degrade the performance of FL models. Recent FL techniques tend to solve this by enhancing the general…

    Submitted 22 August, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 12 pages, 8 figures Code: https://github.com/chunmeifeng/FedMRI

    Journal ref: IEEE Transactions on Medical Imaging, 2022

  11. arXiv:2110.08080  [pdf, other]

    eess.IV cs.CV

    Deep multi-modal aggregation network for MR image reconstruction with auxiliary modality

    Authors: Chun-Mei Feng, Huazhu Fu, Tianfei Zhou, Yong Xu, Ling Shao, David Zhang

    Abstract: Magnetic resonance (MR) imaging produces detailed images of organs and tissues with better contrast, but it suffers from a long acquisition time, which makes the image quality vulnerable to say motion artifacts. Recently, many approaches have been developed to reconstruct full-sampled images from partially observed measurements to accelerate MR imaging. However, most approaches focused on reconstr…

    Submitted 21 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  12. arXiv:2109.01664  [pdf, other]

    eess.IV cs.CV

    Exploring Separable Attention for Multi-Contrast MR Image Super-Resolution

    Authors: Chun-Mei Feng, Yunlu Yan, Kai Yu, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Super-resolving the Magnetic Resonance (MR) image of a target contrast under the guidance of the corresponding auxiliary contrast, which provides additional anatomical information, is a new and effective solution for fast MR imaging. However, current multi-contrast super-resolution (SR) methods tend to concatenate different contrasts directly, ignoring their relationships in different clues, e.g.,…

    Submitted 21 August, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:2105.08949 https://github.com/chunmeifeng/SANet

  13. Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

    Authors: Bo Dong, Wenhai Wang, Deng-Ping Fan, Jinpeng Li, Huazhu Fu, Ling Shao

    Abstract: Most polyp segmentation methods use CNNs as their backbone, leading to two key issues when exchanging information between the encoder and decoder: 1) taking into account the differences in contribution between different-level features and 2) designing an effective mechanism for fusing these features. Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and…

    Submitted 19 February, 2024; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted to CAAI AIR 2023

    Journal ref: CAAI Artificial Intelligence Research, 2023, 2: 9150015

  14. arXiv:2107.07314  [pdf, other]

    cs.CV cs.LG eess.IV

    Variational Topic Inference for Chest X-Ray Report Generation

    Authors: Ivona Najdenkoska, Xiantong Zhen, Marcel Worring, Ling Shao

    Abstract: Automating report generation for medical imaging promises to reduce workload and assist diagnosis in clinical practice. Recent work has shown that deep learning models can successfully caption natural images. However, learning from medical data is challenging due to the diversity and uncertainty inherent in the reports written by different radiologists with discrepant expertise and experience. To…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: To be published in the International Conference on Medical Image Computing and Computer Assisted Intervention 2021

  15. arXiv:2106.14248  [pdf, other]

    eess.IV cs.CV

    Multi-Modal Transformer for Accelerated MR Imaging

    Authors: Chun-Mei Feng, Yunlu Yan, Geng Chen, Yong Xu, Ling Shao, Huazhu Fu

    Abstract: Accelerated multi-modal magnetic resonance (MR) imaging is a new and effective solution for fast MR imaging, providing superior performance in restoring the target modality from its undersampled counterpart with guidance from an auxiliary modality. However, existing works simply combine the auxiliary modality as prior information, lacking in-depth investigations on the potential mechanisms for fus…

    Submitted 11 May, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: https://github.com/chunmeifeng/MTrans

  16. arXiv:2105.05980  [pdf, other]

    eess.IV cs.CV

    DONet: Dual-Octave Network for Fast MR Image Reconstruction

    Authors: Chun-Mei Feng, Zhanyuan Yang, Huazhu Fu, Yong Xu, Jian Yang, Ling Shao

    Abstract: Magnetic resonance (MR) image acquisition is an inherently prolonged process, whose acceleration has long been the subject of research. This is commonly achieved by obtaining multiple undersampled images, simultaneously, through parallel imaging. In this paper, we propose the Dual-Octave Network (DONet), which is capable of learning multi-scale spatial-frequency features from both the real and ima…

    Submitted 12 June, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2104.05345

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2021

  17. arXiv:2104.05345  [pdf, other]

    eess.IV cs.CV

    Dual-Octave Convolution for Accelerated Parallel MR Image Reconstruction

    Authors: Chun-Mei Feng, Zhanyuan Yang, Geng Chen, Yong Xu, Ling Shao

    Abstract: Magnetic resonance (MR) image acquisition is an inherently prolonged process, whose acceleration by obtaining multiple undersampled images simultaneously through parallel imaging has always been the subject of research. In this paper, we propose the Dual-Octave Convolution (Dual-OctConv), which is capable of learning multi-scale spatial-frequency features from both real and imaginary components, f…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI) 2021

    Journal ref: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI) 2021

  18. arXiv:2103.14819  [pdf, other]

    eess.SY

    Backup Plan Constrained Model Predictive Control

    Authors: Hunmin Kim, Hyungjin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes a new safety concept: backup plan safety. The backup plan safety is defined as the ability to complete one of the alternative missions in the case of primary mission abortion. To incorporate this new safety concept in control problems, we formulate a feasibility maximization problem that adopts additional (virtual) input horizons toward the alternative missions on top of the…

    Submitted 27 March, 2021; originally announced March 2021.

  19. arXiv:2103.11587  [pdf, other]

    cs.CV eess.IV

    Brain Image Synthesis with Unsupervised Multivariate Canonical CSC$\ell_4$Net

    Authors: Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao

    Abstract: Recent advances in neuroscience have highlighted the effectiveness of multi-modal medical data for investigating certain pathologies and understanding human cognition. However, obtaining full sets of different modalities is limited by various factors, such as long acquisition times, high examination costs and artifact suppression. In addition, the complexity, high dimensionality and heterogeneity…

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures CVPR2021 oral

  20. arXiv:2103.10825  [pdf, other]

    eess.IV cs.CV

    Variational Knowledge Distillation for Disease Classification in Chest X-Rays

    Authors: Tom van Sonsbeek, Xiantong Zhen, Marcel Worring, Ling Shao

    Abstract: Disease classification relying solely on imaging data attracts great interest in medical image analysis. Current models could be further improved, however, by also employing Electronic Health Records (EHRs), which contain rich information on patients and findings from clinicians. It is challenging to incorporate this information into disease classification due to the high reliance on clinician inp…

    Submitted 19 March, 2021; originally announced March 2021.

  21. arXiv:2012.02776  [pdf, other]

    cs.CV cs.LG eess.IV

    Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

    Authors: Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

    Abstract: Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, ha…

    Submitted 30 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: Accepted by CVPR2021

  22. arXiv:2010.06616  [pdf, ps, other]

    eess.SY

    Finite-Time Model Inference From A Single Noisy Trajectory

    Authors: Yanbing Mao, Naira Hovakimyan, Petros Voulgaris, Lui Sha

    Abstract: This paper proposes a novel model inference procedure to identify system matrix from a single noisy trajectory over a finite-time interval. The proposed inference procedure comprises an observation data processor, a redundant data processor and an ordinary least-square estimator, wherein the data processors mitigate the influence of observation noise on inference error. We first systematically inv…

    Submitted 1 January, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Submitted

  23. arXiv:2009.12349  [pdf, other]

    eess.SY

    Robust Vehicle Lane Keeping Control with Networked Proactive Adaptation

    Authors: Hunmin Kim, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: Road condition is an important environmental factor for autonomous vehicle control. A dramatic change in the road condition from the nominal status is a source of uncertainty that can lead to a system failure. Once the vehicle encounters an uncertain environment, such as hitting an ice patch, it is too late to reduce the speed, and the vehicle can lose control. To cope with future uncertainties in…

    Submitted 28 September, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

  24. arXiv:2009.08973  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY stat.ML

    GRAC: Self-Guided and Self-Regularized Actor-Critic

    Authors: Lin Shao, Yifan You, Mengyuan Yan, Qingyun Sun, Jeannette Bohg

    Abstract: Deep reinforcement learning (DRL) algorithms have successfully been demonstrated on a range of challenging decision making and control tasks. One dominant component of recent deep reinforcement learning algorithms is the target network which mitigates the divergence when learning the Q function. However, target networks can slow down the learning process due to delayed function updates. Our main c…

    Submitted 10 November, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  25. arXiv:2008.02101  [pdf, other]

    eess.IV cs.CV

    Structure Preserving Stain Normalization of Histopathology Images Using Self-Supervised Semantic Guidance

    Authors: Dwarikanath Mahapatra, Behzad Bozorgtabar, Jean-Philippe Thiran, Ling Shao

    Abstract: Although generative adversarial network (GAN) based style transfer is state of the art in histopathology color-stain normalization, they do not explicitly integrate structural information of tissues. We propose a self-supervised approach to incorporate semantic guidance into a GAN based stain normalization framework and preserve detailed structural information. Our method does not require manual s…

    Submitted 3 June, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

  26. arXiv:2008.01627  [pdf, ps, other]

    eess.SY

    SL1-Simplex: Safe Velocity Regulation of Self-Driving Vehicles in Dynamic and Unforeseen Environments

    Authors: Yanbing Mao, Yuliang Gu, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This paper proposes a novel extension of the Simplex architecture with model switching and model learning to achieve safe velocity regulation of self-driving vehicles in dynamic and unforeseen environments. To guarantee the reliability of autonomous vehicles, an $\mathcal{L}_{1}$ adaptive controller that compensates for uncertainties and disturbances is employed by the Simplex architecture as a ve…

    Submitted 1 February, 2022; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Submitted to ACM Transactions on Cyber-Physical Systems

  27. arXiv:2006.11538  [pdf, other]

    cs.CV cs.LG eess.IV

    Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition

    Authors: Ionut Cosmin Duta, Li Liu, Fan Zhu, Ling Shao

    Abstract: This work introduces pyramidal convolution (PyConv), which is capable of processing the input at multiple filter scales. PyConv contains a pyramid of kernels, where each level involves different types of filters with varying size and depth, which are able to capture different levels of details in the scene. On top of these improved recognition capabilities, PyConv is also efficient and, with our f…

    Submitted 20 June, 2020; originally announced June 2020.

  28. arXiv:2006.11392  [pdf, other]

    eess.IV cs.CV

    PraNet: Parallel Reverse Attention Network for Polyp Segmentation

    Authors: Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Colonoscopy is an effective technique for detecting colorectal polyps, which are highly related to colorectal cancer. In clinical practice, segmenting polyps from colonoscopy images is of great importance since it provides valuable information for diagnosis and surgery. However, accurate polyp segmentation is a challenging task, for two major reasons: (i) the same type of polyps has a diversity of…

    Submitted 3 July, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted to MICCAI 2020

  29. arXiv:2006.10135  [pdf, other]

    eess.IV cs.CV cs.LG

    M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients

    Authors: Tao Zhou, Huazhu Fu, Yu Zhang, Changqing Zhang, Xiankai Lu, Jianbing Shen, Ling Shao

    Abstract: Early and accurate prediction of overall survival (OS) time can help to obtain better treatment planning for brain tumor patients. Although many OS time prediction methods have been developed and obtain promising results, there are still several issues. First, conventional prediction methods rely on radiomic features at the local lesion area of a magnetic resonance (MR) volume, which may not repre…

    Submitted 14 July, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted by MICCAI'20

  30. arXiv:2005.07697  [pdf, other]

    eess.SY cs.MA

    Safety Constrained Multi-UAV Time Coordination: A Bi-level Control Framework in GPS Denied Environment

    Authors: Wenbin Wan, Hunmin Kim, Yikun Cheng, Naira Hovakimyan, Petros G. Voulgaris, Lui Sha

    Abstract: Unmanned aerial vehicles (UAVs) suffer from sensor drifts in GPS denied environments, which can cause safety issues. To avoid intolerable sensor drifts while completing the time-critical coordination task for multi-UAV systems, we propose a safety constrained bi-level control framework. The first level is the time-critical coordination level that achieves a consensus of coordination states and pro…

    Submitted 19 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.10826

  31. arXiv:2005.05594  [pdf, other]

    eess.IV cs.CV

    Modeling and Enhancing Low-quality Retinal Fundus Images

    Authors: Ziyi Shen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Retinal fundus images are widely used for the clinical screening and diagnosis of eye diseases. However, fundus images captured by operators with various levels of experience have a large variation in quality. Low-quality fundus images increase uncertainty in clinical observation and lead to the risk of misdiagnosis. However, due to the special optical beam of fundus imaging and structure of the r…

    Submitted 9 December, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

  32. arXiv:2004.14133  [pdf, other]

    eess.IV cs.CV cs.LG

    Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images

    Authors: Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

    Abstract: Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causing the world to face an existential health crisis. Automated detection of lung infections from computed tomography (CT) images offers a great potential to augment the traditional healthcare strategy for tackling COVID-19. However, segmenting infected regions from CT slices faces several challenges, including high variation in…

    Submitted 21 May, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: To appear in IEEE TMI. The code is released in: https://github.com/DengPingFan/Inf-Net

  33. arXiv:2004.08499  [pdf, other]

    cs.RO cs.LG eess.SY

    Design and Control of Roller Grasper V2 for In-Hand Manipulation

    Authors: Shenli Yuan, Lin Shao, Connor L. Yako, Alex Gruebele, J. Kenneth Salisbury

    Abstract: The ability to perform in-hand manipulation still remains an unsolved problem; having this capability would allow robots to perform sophisticated tasks requiring repositioning and reorienting of grasped objects. In this work, we present a novel non-anthropomorphic robot grasper with the ability to manipulate objects by means of active surfaces at the fingertips. Active surfaces are achieved by sph…

    Submitted 17 November, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) October 25-29, 2020, Las Vegas, NV, USA (Virtual)

  34. arXiv:2004.04491  [pdf, other]

    cs.CV cs.LG eess.IV

    Multi-Granularity Canonical Appearance Pooling for Remote Sensing Scene Classification

    Authors: S. Wang, Y. Guan, L. Shao

    Abstract: Recognising remote sensing scene images remains challenging due to large visual-semantic discrepancies. These mainly arise due to the lack of detailed annotations that can be employed to align pixel-level representations with high-level semantic labels. As the tagging process is labour-intensive and subjective, we hereby propose a novel Multi-Granularity Canonical Appearance Pooling (MG-CAP) to au…

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: This paper is going to be published by IEEE Transactions on Image Processing

    Journal ref: IEEE Transactions on Image Processing 29, 5396--5407 (2020)

  35. arXiv:2003.14119  [pdf, other]

    eess.IV cs.CV

    Pathological Retinal Region Segmentation From OCT Images Using Geometric Relation Based Augmentation

    Authors: Dwarikanath Mahapatra, Behzad Bozorgtabar, Jean-Philippe Thiran, Ling Shao

    Abstract: Medical image segmentation is an important task for computer aided diagnosis. Pixelwise manual annotations of large datasets require high expertise and is time consuming. Conventional data augmentations have limited benefit by not fully representing the underlying distribution of the training set, thus affecting model robustness when tested on images captured from different sources. Prior work lev…

    Submitted 25 April, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

  36. arXiv:2003.07761  [pdf, other]

    eess.IV cs.CV

    CycleISP: Real Image Restoration via Improved Data Synthesis

    Authors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao

    Abstract: The availability of large-scale datasets has helped unleash the true potential of deep convolutional neural networks (CNNs). However, for the single-image denoising problem, capturing a real dataset is an unacceptably expensive and cumbersome procedure. Consequently, image denoising algorithms are mostly developed and evaluated on synthetic data that is usually generated with a widespread assumpti…

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: CVPR 2020 (Oral)

  37. arXiv:2003.04253  [pdf, other]

    cs.CV cs.LG eess.IV

    Motion-Attentive Transition for Zero-Shot Video Object Segmentation

    Authors: Tianfei Zhou, Shunzhou Wang, Yi Zhou, Yazhou Yao, Jianwu Li, Ling Shao

    Abstract: In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder, which transforms appearance features into motion-attenti…

    Submitted 9 July, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: AAAI 2020. Code: https://github.com/tfzhou/MATNet

  38. arXiv:2002.05000  [pdf, other]

    cs.CV eess.IV

    Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis

    Authors: Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao

    Abstract: Magnetic resonance imaging (MRI) is a widely used neuroimaging technique that can provide images of different contrasts (i.e., modalities). Fusing this multi-modal data has proven particularly effective for boosting model performance in many tasks. However, due to poor data quality and frequent patient dropout, collecting all modalities for every patient remains a challenge. Medical image synthesi…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: has been accepted by IEEE TMI

  39. DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images

    Authors: Yi Zhou, Boyang Wang, Xiaodong He, Shanshan Cui, Ling Shao

    Abstract: Diabetic retinopathy (DR) is a complication of diabetes that severely affects eyes. It can be graded into five levels of severity according to international protocol. However, optimizing a grading model to have strong generalizability requires a large amount of balanced training data, which is difficult to collect particularly for the high severity levels. Typical data augmentation methods, includ…

    Submitted 11 November, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Extension work of our MICCAI paper

    Journal ref: IEEE Journal of Biomedical and Health Informatics 2020

  40. arXiv:1911.04470  [pdf, other]

    cs.CV cs.LG eess.IV

    Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval

    Authors: Jianjun Lei, Yuxin Song, Bo Peng, Zhanyu Ma, Ling Shao, Yi-Zhe Song

    Abstract: Sketch-based image retrieval (SBIR) is a challenging task due to the large cross-domain gap between sketches and natural images. How to align abstract sketches and natural images into a common high-level semantic space remains a key problem in SBIR. In this paper, we propose a novel semi-heterogeneous three-way joint embedding network (Semi3-Net), which integrates three branches (a sketch branch,…

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  41. arXiv:1911.00969  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Learning to Scaffold the Development of Robotic Manipulation Skills

    Authors: Lin Shao, Toki Migimatsu, Jeannette Bohg

    Abstract: Learning contact-rich, robotic manipulation skills is a challenging problem due to the high-dimensionality of the state and action space as well as uncertainty from noisy sensors and inaccurate motor control. To combat these factors and achieve more robust manipulation, humans actively exploit contact constraints in the environment. By adopting a similar strategy, robots can also achieve more robu…

    Submitted 5 October, 2020; v1 submitted 3 November, 2019; originally announced November 2019.

    Comments: Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2020

  42. arXiv:1910.10826  [pdf, other]

    eess.SY

    A Safety Constrained Control Framework for UAVs in GPS Denied Environment

    Authors: Wenbin Wan, Hunmin Kim, Naira Hovakimyan, Lui Sha, Petros G. Voulgaris

    Abstract: Unmanned aerial vehicles (UAVs) suffer from sensor drifts in GPS denied environments, which can lead to potentially dangerous situations. To avoid intolerable sensor drifts in the presence of GPS spoofing attacks, we propose a safety constrained control framework that adapts the UAV at a path re-planning level to support resilient state estimation against GPS spoofing attacks. The attack detector…

    Submitted 12 April, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

  43. arXiv:1909.03749  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Learning Visual Dynamics Models of Rigid Objects using Relational Inductive Biases

    Authors: Fabio Ferreira, Lin Shao, Tamim Asfour, Jeannette Bohg

    Abstract: Endowing robots with human-like physical reasoning abilities remains challenging. We argue that existing methods often disregard spatio-temporal relations and by using Graph Neural Networks (GNNs) that incorporate a relational inductive bias, we can shift the learning process towards exploiting relations. In this work, we learn action-conditional forward dynamics models of a simulated manipulation…

    Submitted 23 October, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: short paper (4 pages, two figures), accepted to NeurIPS 2019 Graph Representation Learning workshop

  44. arXiv:1907.05598  [pdf, other]

    eess.IV cs.CV

    Coupled-Projection Residual Network for MRI Super-Resolution

    Authors: Chun-Mei Feng, Kai Wang, Shijian Lu, Yong Xu, Heng Kong, Ling Shao

    Abstract: Magnetic Resonance Imaging (MRI) has been widely used in clinical application and pathology research by helping doctors make more accurate diagnoses. On the other hand, accurate diagnosis by MRI remains a great challenge as images obtained via present MRI techniques usually have low resolutions. Improving MRI image quality and resolution thus becomes a critically important task. This paper presents…

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: Our source code will be publicly available at http://www.yongxu.org/lunwen.html

  45. Noisy-As-Clean: Learning Self-supervised Denoising from the Corrupted Image

    Authors: Jun Xu, Yuan Huang, Ming-Ming Cheng, Li Liu, Fan Zhu, Zhou Xu, Ling Shao

    Abstract: Supervised deep networks have achieved promising performance on image denoising, by learning image priors and noise statistics on plenty pairs of noisy and clean images. Unsupervised denoising networks are trained with only noisy images. However, for an unseen corrupted image, both supervised and unsupervised networks ignore either its particular image prior, the noise statistics, or both. That is, t…

    Submitted 9 May, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: 12 pages, 9 figures, 6 tables, the first two authors contribute equally

  46. arXiv:1906.05348  [pdf, other]

    eess.SY cs.RO

    Towards Resilient UAV: Escape Time in GPS Denied Environment with Sensor Drift

    Authors: Hyung-Jin Yoon, Wenbin Wan, Hunmin Kim, Naira Hovakimyan, Lui Sha, Petros G. Voulgaris

    Abstract: This paper considers a resilient state estimation framework for unmanned aerial vehicles (UAVs) that integrates a Kalman filter-like state estimator and an attack detector. When an attack is detected, the state estimator uses only IMU signals as the GPS signals do not contain legitimate information. This limited sensor availability induces a sensor drift problem questioning the reliability of the…

    Submitted 11 June, 2019; originally announced June 2019.

  47. arXiv:1811.08064  [pdf, other]

    cs.SE cs.FL cs.LO eess.SY

    Model and Integrate Medical Resource Availability into Verifiably Correct Executable Medical Guidelines - Technical Report

    Authors: Chunhui Guo, Zhicheng Fu, Zhenyu Zhang, Shangping Ren, Lui Sha

    Abstract: Improving effectiveness and safety of patient care is an ultimate objective for medical cyber-physical systems. A recent study shows that the patients' death rate can be reduced by computerizing medical guidelines. Most existing medical guideline models are validated and/or verified based on the assumption that all necessary medical resources needed for a patient care are always available. However…

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: full version, 8 pages. arXiv admin note: substantial text overlap with arXiv:1811.08061

    Journal ref: IEEE/ACM 36th International Conference on Computer-Aided Design (ICCAD), 2017

  48. arXiv:1811.08061  [pdf, other]

    cs.SE cs.FL cs.LO eess.SY

    Model and Integrate Medical Resource Available Times and Relationships in Verifiably Correct Executable Medical Best Practice Guideline Models (Extended Version)

    Authors: Chunhui Guo, Zhicheng Fu, Zhenyu Zhang, Shangping Ren, Lui Sha

    Abstract: Improving patient care safety is an ultimate objective for medical cyber-physical systems. A recent study shows that the patients' death rate is significantly reduced by computerizing medical best practice guidelines. Recent data also show that some morbidity and mortality in emergency care are directly caused by delayed or interrupted treatment due to lack of medical resources. However, medical g…

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: full version, 12 pages

    Journal ref: ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS), 2018

  49. arXiv:1811.00694  [pdf, other]

    cs.SE cs.FL eess.SY

    Design Verifiably Correct Model Patterns to Facilitate Modeling Medical Best Practice Guidelines with Statecharts (Technical Report)

    Authors: Chunhui Guo, Zhicheng Fu, Zhenyu Zhang, Shangping Ren, Lui Sha

    Abstract: Improving patient care safety is an ultimate objective for medical cyber-physical systems. A recent study shows that the patients' death rate can be significantly reduced by computerizing medical best practice guidelines. To facilitate the development of computerized medical best practice guidelines, statecharts are often used as a modeling tool because of their high resemblances to disease and tr…

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: full version, 14 pages

    Journal ref: IEEE Internet of Things Journal, 2018

  50. arXiv:1810.12126  [pdf, other]

    eess.IV cs.CV

    ActionXPose: A Novel 2D Multi-view Pose-based Algorithm for Real-time Human Action Recognition

    Authors: Federico Angelini, Zeyu Fu, Yang Long, Ling Shao, Syed Mohsen Naqvi

    Abstract: We present ActionXPose, a novel 2D pose-based algorithm for posture-level Human Action Recognition (HAR). The proposed approach exploits 2D human poses provided by OpenPose detector from RGB videos. ActionXPose aims to process poses data to be provided to a Long Short-Term Memory Neural Network and to a 1D Convolutional Neural Network, which solve the classification problem. ActionXPose is one of…

    Submitted 29 October, 2018; originally announced October 2018.