Showing 1–50 of 67 results for author: Yuille, A L

  1. arXiv:2308.16139  [pdf, other]

    cs.CV cs.DB cs.LG

    MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision

    Authors: Jianning Li, Zongwei Zhou, Jiancheng Yang, Antonio Pepe, Christina Gsaxner, Gijs Luijten, Chongyu Qu, Tiezheng Zhang, Xiaoxi Chen, Wenxuan Li, Marek Wodzinski, Paul Friedrich, Kangxian Xie, Yuan Jin, Narmada Ambigapathy, Enrico Nasca, Naida Solak, Gian Marco Melito, Viet Duc Vu, Afaque R. Memon, Christopher Schlachta, Sandrine De Ribaupierre, Rajnikant Patel, Roy Eagleson, Xiaojun Chen, et al. (132 additional authors not shown)

    Abstract: Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape…

    Submitted 12 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 16 pages

    MSC Class: 68T01

  2. arXiv:2210.02442  [pdf, other]

    cs.CV cs.AI cs.LG

    Making Your First Choice: To Address Cold Start Problem in Vision Active Learning

    Authors: Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L. Yuille, Zongwei Zhou

    Abstract: Active learning promises to improve annotation efficiency by iteratively selecting the most important data to be annotated first. However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices. We identify this as the cold start problem in vision active learning, caused by a biased and outlier initial q…

    Submitted 5 October, 2022; originally announced October 2022.

  3. External Attention Assisted Multi-Phase Splenic Vascular Injury Segmentation with Limited Data

    Authors: Yuyin Zhou, David Dreizin, Yan Wang, Fengze Liu, Wei Shen, Alan L. Yuille

    Abstract: The spleen is one of the most commonly injured solid organs in blunt abdominal trauma. The development of automatic segmentation systems from multi-phase CT for splenic vascular injury can augment severity grading for improving clinical decision support and outcome prediction. However, accurate segmentation of splenic vascular injury is challenging for the following reasons: 1) Splenic vascular in…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: IEEE TMI

  4. arXiv:2111.13495  [pdf, other]

    cs.CV

    SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection

    Authors: Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan L. Yuille, Chaoyi Zhang, Weidong Cai, Zongwei Zhou

    Abstract: Radiography imaging protocols focus on particular body regions, therefore producing images of great similarity and yielding recurrent anatomical structures across patients. To exploit this structured information, we propose the use of Space-aware Memory Queues for In-painting and Detecting anomalies from radiography images (abbreviated as SQUID). We show that SQUID can taxonomize the ingrained ana…

    Submitted 24 March, 2023; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: CVPR 2023

  5. arXiv:2109.12265  [pdf, other]

    cs.CV cs.AI

    Label-Assemble: Leveraging Multiple Datasets with Partial Labels

    Authors: Mintong Kang, Bowen Li, Zengle Zhu, Yongyi Lu, Elliot K. Fishman, Alan L. Yuille, Zongwei Zhou

    Abstract: The success of deep learning relies heavily on large labeled datasets, but we often only have access to several small datasets associated with partial labels. To address this problem, we propose a new initiative, "Label-Assemble", that aims to unleash the full potential of partial labels from an assembly of public datasets. We discovered that learning from negative examples facilitates both comput…

    Submitted 14 May, 2023; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: ISBI 2023

  6. arXiv:2106.09748  [pdf, other]

    cs.CV

    DeepLab2: A TensorFlow Library for Deep Labeling

    Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

    Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel prediction problems in computer vision. DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code, allowing the community to reproduce and further improve upon the sta…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 4-page technical report. The first three authors contributed equally to this work

  7. Learning Inductive Attention Guidance for Partially Supervised Pancreatic Ductal Adenocarcinoma Prediction

    Authors: Yan Wang, Peng Tang, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: Pancreatic ductal adenocarcinoma (PDAC) is the third most common cause of cancer death in the United States. Predicting tumors like PDACs (including both classification and segmentation) from medical images by deep learning is becoming a growing trend, but usually a large number of annotated data are required for training, which is very labor-intensive and time-consuming. In this paper, we conside…

    Submitted 31 May, 2021; originally announced May 2021.

  8. arXiv:2103.05525  [pdf, other]

    eess.IV cs.CV

    Multi-phase Deformable Registration for Time-dependent Abdominal Organ Variations

    Authors: Seyoun Park, Elliot K. Fishman, Alan L. Yuille

    Abstract: Human body is a complex dynamic system composed of various sub-dynamic parts. Especially, thoracic and abdominal organs have complex internal shape variations with different frequencies by various reasons such as respiration with fast motion and peristalsis with slower motion. CT protocols for abdominal lesions are multi-phase scans for various tumor detection to use different vascular contrast, h…

    Submitted 8 March, 2021; originally announced March 2021.

  9. arXiv:2103.05170  [pdf, other]

    cs.CV

    Sequential Learning on Liver Tumor Boundary Semantics and Prognostic Biomarker Mining

    Authors: Jieneng Chen, Ke Yan, Yu-Dong Zhang, Youbao Tang, Xun Xu, Shuwen Sun, Qiuping Liu, Lingyun Huang, Jing Xiao, Alan L. Yuille, Ya Zhang, Le Lu

    Abstract: The boundary of tumors (hepatocellular carcinoma, or HCC) contains rich semantics: capsular invasion, visibility, smoothness, folding and protuberance, etc. Capsular invasion on tumor boundary has proven to be clinically correlated with the prognostic indicator, microvascular invasion (MVI). Investigating tumor boundary semantics has tremendous clinical values. In this paper, we propose the first…

    Submitted 8 March, 2021; originally announced March 2021.

  10. arXiv:2102.04306  [pdf, other]

    cs.CV

    TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

    Authors: Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L. Yuille, Yuyin Zhou

    Abstract: Medical image segmentation is an essential prerequisite for developing healthcare systems, especially for disease diagnosis and treatment planning. On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard and achieved tremendous success. However, due to the intrinsic locality of convolution operations, U-Net generally demonstrate…

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 13 pages, 3 figures

  11. arXiv:2012.00088  [pdf, other]

    cs.CV cs.RO

    Nothing But Geometric Constraints: A Model-Free Method for Articulated Object Pose Estimation

    Authors: Qihao Liu, Weichao Qiu, Weiyao Wang, Gregory D. Hager, Alan L. Yuille

    Abstract: We propose an unsupervised vision-based system to estimate the joint configurations of the robot arm from a sequence of RGB or RGB-D images without knowing the model a priori, and then adapt it to the task of category-independent articulated object pose estimation. We combine a classical geometric formulation with deep learning and extend the use of epipolar constraint to multi-rigid-body systems…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: 10 pages, 3 figures

  12. Volumetric Medical Image Segmentation: A 3D Deep Coarse-to-fine Framework and Its Adversarial Examples

    Authors: Yingwei Li, Zhuotun Zhu, Yuyin Zhou, Yingda Xia, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: Although deep neural networks have been a dominant method for many 2D vision tasks, it is still challenging to apply them to 3D tasks, such as medical image segmentation, due to the limited amount of annotated 3D data and limited computational resources. In this chapter, by rethinking the strategy to apply 3D Convolutional Neural Networks to segment medical images, we propose a novel 3D-based coar…

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1712.00201

  13. arXiv:2004.02021  [pdf, other]

    eess.IV cs.CV

    Segmentation for Classification of Screening Pancreatic Neuroendocrine Tumors

    Authors: Zhuotun Zhu, Yongyi Lu, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: This work presents comprehensive results to detect in the early stage the pancreatic neuroendocrine tumors (PNETs), a group of endocrine tumors arising in the pancreas, which are the second common type of pancreatic cancer, by checking the abdominal CT scans. To the best of our knowledge, this task has not been studied before as a computational task. To provide radiologists with tumor locations, w…

    Submitted 4 April, 2020; originally announced April 2020.

  14. arXiv:2003.12798  [pdf, other]

    cs.CV

    CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks

    Authors: Qihang Yu, Yingwei Li, Jieru Mei, Yuyin Zhou, Alan L. Yuille

    Abstract: 3D Convolution Neural Networks (CNNs) have been widely applied to 3D scene understanding, such as video analysis and volumetric image recognition. However, 3D networks can easily lead to over-parameterization which incurs expensive computation cost. In this paper, we propose Channel-wise Automatic KErnel Shrinking (CAKES), to enable efficient 3D learning by shrinking standard 3D convolutions into…

    Submitted 16 December, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

    Comments: AAAI 2021

  15. arXiv:2003.08441  [pdf, other]

    eess.IV cs.CV

    Detecting Pancreatic Ductal Adenocarcinoma in Multi-phase CT Scans via Alignment Ensemble

    Authors: Yingda Xia, Qihang Yu, Wei Shen, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille

    Abstract: Pancreatic ductal adenocarcinoma (PDAC) is one of the most lethal cancers among the population. Screening for PDACs in dynamic contrast-enhanced CT is beneficial for early diagnosis. In this paper, we investigate the problem of automated detecting PDACs in multi-phase (arterial and venous) CT scans. Multiple phases provide more information than single phase, but they are unaligned and inhomogeneou…

    Submitted 1 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: The first two authors contributed equally to this work. Accepted to MICCAI 2020

  16. arXiv:1912.09628  [pdf, other]

    cs.CV

    C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation

    Authors: Qihang Yu, Dong Yang, Holger Roth, Yutong Bai, Yixiao Zhang, Alan L. Yuille, Daguang Xu

    Abstract: 3D convolution neural networks (CNN) have been proved very successful in parsing organs or tumours in 3D medical images, but it remains sophisticated and time-consuming to choose or design proper 3D networks given different task contexts. Recently, Neural Architecture Search (NAS) is proposed to solve this problem by searching for the best network architecture automatically. However, the inconsist…

    Submitted 19 April, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: CVPR 2020

  17. arXiv:1912.04363  [pdf, other]

    cs.CV

    Car Pose in Context: Accurate Pose Estimation with Ground Plane Constraints

    Authors: Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille

    Abstract: Scene context is a powerful constraint on the geometry of objects within the scene in cases, such as surveillance, where the camera geometry is unknown and image quality may be poor. In this paper, we describe a method for estimating the pose of cars in a scene jointly with the ground plane that supports them. We formulate this as a joint optimization that accounts for varying car shape using a st…

    Submitted 9 December, 2019; originally announced December 2019.

  18. arXiv:1912.03383  [pdf, other]

    cs.CV

    Deep Distance Transform for Tubular Structure Segmentation in CT Scans

    Authors: Yan Wang, Xu Wei, Fengze Liu, Jieneng Chen, Yuyin Zhou, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: Tubular structure segmentation in medical images, e.g., segmenting vessels in CT scans, serves as a vital step in the use of computers to aid in screening early stages of related diseases. But automatic tubular structure segmentation in CT scans is a challenging problem, due to issues such as poor contrast, noise and complicated background. A tubular structure usually has a cylinder-like shape whi…

    Submitted 6 December, 2019; originally announced December 2019.

  19. arXiv:1906.00335  [pdf, other]

    cs.CV

    Adversarial Examples for Edge Detection: They Exist, and They Transfer

    Authors: Christian Cosgrove, Alan L. Yuille

    Abstract: Convolutional neural networks have recently advanced the state of the art in many tasks including edge and object boundary detection. However, in this paper, we demonstrate that these edge detectors inherit a troubling property of neural networks: they can be fooled by adversarial examples. We show that adding small perturbations to an image causes HED, a CNN-based edge detection model, to fail to…

    Submitted 1 June, 2019; originally announced June 2019.

  20. arXiv:1905.08231  [pdf, other]

    cs.CV

    Patch-based 3D Human Pose Refinement

    Authors: Qingfu Wan, Weichao Qiu, Alan L. Yuille

    Abstract: State-of-the-art 3D human pose estimation approaches typically estimate pose from the entire RGB image in a single forward run. In this paper, we develop a post-processing step to refine 3D human pose estimation from body part patches. Using local patches as input has two advantages. First, the fine details around body parts are zoomed in to high resolution for preciser 3D pose prediction. Second,…

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: Accepted by CVPR 2019 Augmented Human: Human-centric Understanding and 2D/3D Synthesis, and the third Look Into Person (LIP) Challenge Workshop

  21. arXiv:1904.01150  [pdf, other]

    cs.CV

    Thickened 2D Networks for Efficient 3D Medical Image Segmentation

    Authors: Qihang Yu, Yingda Xia, Lingxi Xie, Elliot K. Fishman, Alan L. Yuille

    Abstract: There has been a debate in 3D medical image segmentation on whether to use 2D or 3D networks, where both pipelines have advantages and disadvantages. 2D methods enjoy a low inference time and greater transfer-ability while 3D methods are superior in performance for hard targets requiring contextual information. This paper investigates efficient 3D segmentation from another perspective, which uses…

    Submitted 22 November, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

  22. arXiv:1904.00979  [pdf, other]

    cs.CV

    Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses

    Authors: Yingwei Li, Song Bai, Cihang Xie, Zhenyu Liao, Xiaohui Shen, Alan L. Yuille

    Abstract: This paper focuses on learning transferable adversarial examples specifically against defense models (models to defense adversarial attacks). In particular, we show that a simple universal perturbation can fool a series of state-of-the-art defenses. Adversarial examples generated by existing attacks are generally hard to transfer to defense models. We observe the property of regional homogeneity…

    Submitted 30 July, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: ECCV 2020. Project page: https://github.com/LiYingwei/Regional-Homogeneity

  23. arXiv:1812.00725  [pdf, other]

    cs.CV

    CRAVES: Controlling Robotic Arm with a Vision-based Economic System

    Authors: Yiming Zuo, Weichao Qiu, Lingxi Xie, Fangwei Zhong, Yizhou Wang, Alan L. Yuille

    Abstract: Training a robotic arm to accomplish real-world tasks has been attracting increasing attention in both academia and industry. This work discusses the role of computer vision algorithms in this field. We focus on low-cost arms on which no sensors are equipped and thus all decisions are made upon visual recognition, e.g., real-time 3D pose estimation. This requires annotating a lot of training data,…

    Submitted 2 July, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: 10 pages, 6 figures

    Journal ref: In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2019) 4214-4223

  24. arXiv:1812.00518  [pdf, other]

    cs.CV

    Elastic Boundary Projection for 3D Medical Image Segmentation

    Authors: Tianwei Ni, Lingxi Xie, Huangjie Zheng, Elliot K. Fishman, Alan L. Yuille

    Abstract: We focus on an important yet challenging problem: using a 2D deep network to deal with 3D segmentation for medical image analysis. Existing approaches either applied multi-view planar (2D) networks or directly used volumetric (3D) networks for this purpose, but both of them are not ideal: 2D networks cannot capture 3D contexts effectively, and 3D networks are both memory-consuming and less stable…

    Submitted 6 June, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: Accepted to CVPR 2019

  25. arXiv:1812.00329  [pdf, other]

    cs.CV

    Iterative Reorganization with Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning

    Authors: Chen Wei, Lingxi Xie, Xutong Ren, Yingda Xia, Chi Su, Jiaying Liu, Qi Tian, Alan L. Yuille

    Abstract: Learning visual features from unlabeled image data is an important yet challenging task, which is often achieved by training a model on some annotation-free information. We consider spatial contexts, for which we solve so-called jigsaw puzzles, i.e., each image is cut into grids and then disordered, and the goal is to recover the correct configuration. Existing approaches formulated it as a classi…

    Submitted 2 December, 2018; originally announced December 2018.

  26. arXiv:1812.00123  [pdf, other]

    cs.CV

    Snapshot Distillation: Teacher-Student Optimization in One Generation

    Authors: Chenglin Yang, Lingxi Xie, Chi Su, Alan L. Yuille

    Abstract: Optimizing a deep neural network is a fundamental task in computer vision, yet direct training methods often suffer from over-fitting. Teacher-student optimization aims at providing complementary cues from a model trained previously, but these approaches are often considerably slow due to the pipeline of training a few generations in sequence, i.e., time complexity is increased by several times.…

    Submitted 30 November, 2018; originally announced December 2018.

  27. arXiv:1811.12047  [pdf, other]

    cs.CV

    Generalized Coarse-to-Fine Visual Recognition with Progressive Training

    Authors: Xutong Ren, Lingxi Xie, Chen Wei, Siyuan Qiao, Chi Su, Jiaying Liu, Qi Tian, Elliot K. Fishman, Alan L. Yuille

    Abstract: Computer vision is difficult, partly because the desired mathematical function connecting input and output data is often complex, fuzzy and thus hard to learn. Coarse-to-fine (C2F) learning is a promising direction, but it remains unclear how it is applied to a wide range of vision problems. This paper presents a generalized C2F framework by making two technical contributions. First, we provide…

    Submitted 15 April, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

  28. arXiv:1811.11814  [pdf, other]

    cs.CV

    Phase Collaborative Network for Two-Phase Medical Image Segmentation

    Authors: Huangjie Zheng, Lingxi Xie, Tianwei Ni, Ya Zhang, Yan-Feng Wang, Qi Tian, Elliot K. Fishman, Alan L. Yuille

    Abstract: In real-world practice, medical images acquired in different phases possess complementary information, e.g., radiologists often refer to both arterial and venous scans in order to make the diagnosis. However, in medical image analysis, fusing prediction from two phases is often difficult, because (i) there is a domain gap between two phases, and (ii) the semantic labels are not pixel-wise co…

    Submitted 12 September, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

  29. arXiv:1810.09807  [pdf, other]

    cs.CL cs.AI cs.LG

    PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution

    Authors: Hong Chen, Zhenhua Fan, Hao Lu, Alan L. Yuille, Shu Rong

    Abstract: We introduce PreCo, a large-scale English dataset for coreference resolution. The dataset is designed to embody the core challenges in coreference, such as entity representation, by alleviating the challenge of low overlap between training and test sets and enabling separated analysis of mention detection and mention clustering. To strengthen the training-test overlap, we collect a large corpus of…

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: EMNLP 2018

  30. arXiv:1807.02941  [pdf, other]

    cs.CV

    Multi-Scale Coarse-to-Fine Segmentation for Screening Pancreatic Ductal Adenocarcinoma

    Authors: Zhuotun Zhu, Yingda Xia, Lingxi Xie, Elliot K. Fishman, Alan L. Yuille

    Abstract: We propose an intuitive approach of detecting pancreatic ductal adenocarcinoma (PDAC), the most common type of pancreatic cancer, by checking abdominal CT scans. Our idea is named multi-scale segmentation-for-classification, which classifies volumes by checking if at least a sufficient number of voxels is segmented as tumors, by which we can provide radiologists with tumor locations. In order to d…

    Submitted 8 August, 2019; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Accepted by MICCAI 2019, 4 figures, 2 tables, 9 pages

  31. arXiv:1805.04025  [pdf, other]

    cs.CV cs.AI cs.LG

    Deep Nets: What have they ever done for Vision?

    Authors: Alan L. Yuille, Chenxi Liu

    Abstract: This is an opinion paper about the strengths and weaknesses of Deep Nets for vision. They are at the heart of the enormous recent progress in artificial intelligence and are of growing importance in cognitive science and neuroscience. They have had many successes but also have several limitations and there is limited understanding of their inner workings. At present Deep Nets perform very well on…

    Submitted 25 November, 2020; v1 submitted 10 May, 2018; originally announced May 2018.

    Comments: To appear in IJCV

  32. arXiv:1804.10684  [pdf, other]

    cs.CV

    Joint Shape Representation and Classification for Detecting PDAC

    Authors: Fengze Liu, Lingxi Xie, Yingda Xia, Elliot K. Fishman, Alan L. Yuille

    Abstract: We aim to detect pancreatic ductal adenocarcinoma (PDAC) in abdominal CT scans, which sheds light on early diagnosis of pancreatic cancer. This is a 3D volume classification task with little training data. We propose a two-stage framework, which first segments the pancreas into a binary mask, then compresses the mask into a shape vector and performs abnormality classification. Shape representation…

    Submitted 20 August, 2019; v1 submitted 27 April, 2018; originally announced April 2018.

    Comments: Accepted to MICCAI 2019 Workshop (MLMI) (8 pages, 3 figures)

  33. Abdominal multi-organ segmentation with organ-attention networks and statistical fusion

    Authors: Yan Wang, Yuyin Zhou, Wei Shen, Seyoun Park, Elliot K. Fishman, Alan L. Yuille

    Abstract: Accurate and robust segmentation of abdominal organs on CT is essential for many clinical applications such as computer-aided diagnosis and computer-aided surgery. But this task is challenging due to the weak boundaries of organs, the complexity of the background, and the variable sizes of different organs. To address these challenges, we introduce a novel framework for multi-organ segmentation by…

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: 21 pages, 11 figures

    Journal ref: Medical Image Analysis, 2019

  34. arXiv:1804.02595  [pdf, other]

    cs.CV

    Training Multi-organ Segmentation Networks with Sample Selection by Relaxed Upper Confident Bound

    Authors: Yan Wang, Yuyin Zhou, Peng Tang, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: Deep convolutional neural networks (CNNs), especially fully convolutional networks, have been widely applied to automatic medical image segmentation problems, e.g., multi-organ segmentation. Existing CNN-based segmentation methods mainly focus on looking for increasingly powerful network architectures, but pay less attention to data sampling strategies for training networks more effectively. In th…

    Submitted 7 April, 2018; originally announced April 2018.

    Comments: Submitted to MICCAI 2018

  35. arXiv:1804.02586  [pdf, other]

    cs.CV

    Semi-Supervised Multi-Organ Segmentation via Deep Multi-Planar Co-Training

    Authors: Yuyin Zhou, Yan Wang, Peng Tang, Song Bai, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: In multi-organ segmentation of abdominal CT scans, most existing fully supervised deep learning algorithms require lots of voxel-wise annotations, which are usually difficult, expensive, and slow to obtain. In comparison, massive unlabeled 3D CT volumes are usually easily accessible. Current mainstream works to address the semi-supervised biomedical image segmentation problem are mostly graph-base…

    Submitted 19 November, 2018; v1 submitted 7 April, 2018; originally announced April 2018.

    Comments: Accepted by WACV 2019

  36. arXiv:1804.00787  [pdf, other]

    cs.CV

    Multi-Scale Spatially-Asymmetric Recalibration for Image Classification

    Authors: Yan Wang, Lingxi Xie, Siyuan Qiao, Ya Zhang, Wenjun Zhang, Alan L. Yuille

    Abstract: Convolution is spatially-symmetric, i.e., the visual features are independent of its position in the image, which limits its ability to utilize contextual cues for visual recognition. This paper addresses this issue by introducing a recalibration process, which refers to the surrounding region of each neuron, computes an importance value and multiplies it to the original neural response. Our appro…

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: 17 pages, 5 figures, submitted to ECCV 2018

  37. arXiv:1804.00392  [pdf, other]

    cs.CV

    Bridging the Gap Between 2D and 3D Organ Segmentation with Volumetric Fusion Net

    Authors: Yingda Xia, Lingxi Xie, Fengze Liu, Zhuotun Zhu, Elliot K. Fishman, Alan L. Yuille

    Abstract: There has been a debate on whether to use 2D or 3D deep neural networks for volumetric organ segmentation. Both 2D and 3D models have their advantages and disadvantages. In this paper, we present an alternative framework, which trains 2D networks on different viewpoints for segmentation, and builds a 3D Volumetric Fusion Net (VFN) to fuse the 2D segmentation results. VFN is relatively shallow and…

    Submitted 9 June, 2018; v1 submitted 1 April, 2018; originally announced April 2018.

    Comments: 8 pages, 2 figures, accepted to MICCAI 2018

  38. arXiv:1801.08297  [pdf, other]

    cs.CV cs.LG

    NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction

    Authors: Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille

    Abstract: In this paper, we propose a novel Convolutional Neural Network (CNN) structure for general-purpose multi-task learning (MTL), which enables automatic feature fusing at every layer from different tasks. This is in contrast with the most widely used MTL CNN structures which empirically or heuristically share features on some specific layers (e.g., share all the features except the last convolutional…

    Submitted 4 April, 2019; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: 11 pages, 3 figures, 9 tables

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition, 2019

  39. arXiv:1712.00433  [pdf, other]

    cs.CV

    Single-Shot Object Detection with Enriched Semantics

    Authors: Zhishuai Zhang, Siyuan Qiao, Cihang Xie, Wei Shen, Bo Wang, Alan L. Yuille

    Abstract: We propose a novel single shot object detection network named Detection with Enriched Semantics (DES). Our motivation is to enrich the semantics of object detection features within a typical deep detector, by a semantic segmentation branch and a global activation module. The segmentation branch is supervised by weak segmentation ground-truth, i.e., no extra annotation is required. In conjunction w…

    Submitted 7 April, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

  40. arXiv:1712.00201  [pdf, other]

    cs.CV

    A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

    Authors: Zhuotun Zhu, Yingda Xia, Wei Shen, Elliot K. Fishman, Alan L. Yuille

    Abstract: In this paper, we adopt 3D Convolutional Neural Networks to segment volumetric medical images. Although deep neural networks have been proven to be very effective on many 2D vision tasks, it is still challenging to apply them to 3D tasks due to the limited amount of annotated 3D data and limited computational resources. We propose a novel 3D-based coarse-to-fine framework to effectively and effici…

    Submitted 1 August, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: 9 pages, 4 figures, Accepted to 3DV

  41. arXiv:1711.07183  [pdf, other]

    cs.CV

    Adversarial Attacks Beyond the Image Space

    Authors: Xiaohui Zeng, Chenxi Liu, Yu-Siang Wang, Weichao Qiu, Lingxi Xie, Yu-Wing Tai, Chi Keung Tang, Alan L. Yuille

    Abstract: Generating adversarial examples is an intriguing problem and an important way of understanding the working mechanism of deep neural networks. Most existing approaches generated perturbations in the image space, i.e., each pixel can be modified independently. However, in this paper we pay special attention to the subset of adversarial examples that correspond to meaningful changes in 3D physical pr…

    Submitted 6 April, 2019; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: To appear in CVPR 2019 as oral

  42. arXiv:1709.04577  [pdf, other]

    cs.CV

    DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection under Partial Occlusion

    Authors: Zhishuai Zhang, Cihang Xie, Jianyu Wang, Lingxi Xie, Alan L. Yuille

    Abstract: In this paper, we study the task of detecting semantic parts of an object, e.g., a wheel of a car, under partial occlusion. We propose that all models should be trained without seeing occlusions while being able to transfer the learned knowledge to deal with occlusions. This setting alleviates the difficulty in collecting an exponentially large dataset to cover occlusion patterns and is more essen…

    Submitted 29 March, 2018; v1 submitted 13 September, 2017; originally announced September 2017.

  43. arXiv:1709.04518  [pdf, other]

    cs.CV

    Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation

    Authors: Qihang Yu, Lingxi Xie, Yan Wang, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille

    Abstract: We aim at segmenting small organs (e.g., the pancreas) from abdominal CT scans. As the target often occupies a relatively small region in the input image, deep neural networks can be easily confused by the complex and variable background. To alleviate this, researchers proposed a coarse-to-fine approach, which used prediction from the first (coarse) stage to indicate a smaller input region for the…

    Submitted 7 April, 2018; v1 submitted 13 September, 2017; originally announced September 2017.

    Comments: Accepted to CVPR 2018 (10 pages, 6 figures)

  44. arXiv:1706.07346  [pdf, other]

    cs.CV

    Deep Supervision for Pancreatic Cyst Segmentation in Abdominal CT Scans

    Authors: Yuyin Zhou, Lingxi Xie, Elliot K. Fishman, Alan L. Yuille

    Abstract: Automatic segmentation of an organ and its cystic region is a prerequisite of computer-aided diagnosis. In this paper, we focus on pancreatic cyst segmentation in abdominal CT scan. This task is important and very useful in clinical practice yet challenging due to the low contrast in boundary, the variability in location, shape and the different stages of the pancreatic cancer. Inspired by the hig…

    Submitted 22 June, 2017; originally announced June 2017.

    Comments: Accepted to MICCAI 2017 (8 pages, 3 figures)

  45. NormFace: L2 Hypersphere Embedding for Face Verification

    Authors: Feng Wang, Xiang Xiang, Jian Cheng, Alan L. Yuille

    Abstract: Thanks to the recent developments of Convolutional Neural Networks, the performance of face verification methods has increased rapidly. In a typical face verification method, feature normalization is a critical step for boosting performance. This motivates us to introduce and study the effect of normalization during training. But we find this is non-trivial, despite normalization being differentia…

    Submitted 26 July, 2017; v1 submitted 20 April, 2017; originally announced April 2017.

    Comments: camera-ready version

  46. arXiv:1702.07432  [pdf, other]

    cs.CV

    Multi-Context Attention for Human Pose Estimation

    Authors: Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, Alan L. Yuille, Xiaogang Wang

    Abstract: In this paper, we propose to incorporate convolutional neural networks with a multi-context attention mechanism into an end-to-end framework for human pose estimation. We adopt stacked hourglass networks to generate attention maps from features at multiple resolutions with various semantics. The Conditional Random Field (CRF) is utilized to model the correlations among neighboring regions in the a…

    Submitted 23 February, 2017; originally announced February 2017.

    Comments: The first two authors contribute equally to this work

  47. arXiv:1702.06925  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Regularizing Face Verification Nets For Pain Intensity Regression

    Authors: Feng Wang, Xiang Xiang, Chang Liu, Trac D. Tran, Austin Reiter, Gregory D. Hager, Harry Quon, Jian Cheng, Alan L. Yuille

    Abstract: Limited labeled data are available for the research of estimating facial expression intensities. For instance, the ability to train deep networks for automated pain assessment is limited by small datasets with labels of patient-reported pain intensities. Fortunately, fine-tuning from a data-extensive pre-trained domain, such as face verification, can alleviate this problem. In this paper, we propo…

    Submitted 1 June, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: 5 pages, 3 figure; Camera-ready version to appear at IEEE ICIP 2017

  48. arXiv:1702.02258  [pdf, other]

    cs.CV cs.AI cs.MM stat.ML

    Generating Multiple Diverse Hypotheses for Human 3D Pose Consistent with 2D Joint Detections

    Authors: Ehsan Jahangiri, Alan L. Yuille

    Abstract: We propose a method to generate multiple diverse and valid human pose hypotheses in 3D all consistent with the 2D detection of joints in a monocular RGB image. We use a novel generative model uniform (unbiased) in the space of anatomically plausible 3D poses. Our model is compositional (produces a pose by combining parts) and since it is restricted only by anatomical constraints it can generalize…

    Submitted 20 August, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

    Comments: accepted to ICCV 2017 (PeopleCap)

  49. arXiv:1612.08230  [pdf, other]

    cs.CV

    A Fixed-Point Model for Pancreas Segmentation in Abdominal CT Scans

    Authors: Yuyin Zhou, Lingxi Xie, Wei Shen, Yan Wang, Elliot K. Fishman, Alan L. Yuille

    Abstract: Deep neural networks have been widely adopted for automatic organ segmentation from abdominal CT scans. However, the segmentation accuracy of some small organs (e.g., the pancreas) is sometimes below satisfaction, arguably because deep networks are easily disrupted by the complex and variable background regions which occupies a large fraction of the input volume. In this paper, we formulate this p…

    Submitted 21 June, 2017; v1 submitted 24 December, 2016; originally announced December 2016.

    Comments: Accepted to MICCAI 2017 (8 pages, 3 figures)

  50. arXiv:1611.06596  [pdf, other]

    cs.CV

    Object Recognition with and without Objects

    Authors: Zhuotun Zhu, Lingxi Xie, Alan L. Yuille

    Abstract: While recent deep neural networks have achieved a promising performance on object recognition, they rely implicitly on the visual contents of the whole image. In this paper, we train deep neural networks on the foreground (object) and background (context) regions of images respectively. Considering human recognition in the same situations, networks trained on the pure background without ob-…

    Submitted 25 May, 2017; v1 submitted 20 November, 2016; originally announced November 2016.

    Comments: To Appear in IJCAI 2017