Showing 1–39 of 39 results for author: Terzopoulos, D

  1. arXiv:2407.01146  [pdf, other]

    eess.IV cs.CV

    Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Kaifeng Pang, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: Current deep learning-based models typically analyze medical images in either 2D or 3D albeit disregarding volumetric information or suffering sub-optimal performance due to the anisotropic resolution of MR data. Furthermore, providing an accurate uncertainty estimation is beneficial to clinicians, as it indicates how confident a model is about its prediction. We propose a novel 2.5D cross-slice a…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2403.00833  [pdf, other]

    cs.AI

    Position Paper: Agent AI Towards a Holistic Intelligence

    Authors: Qiuyuan Huang, Naoki Wake, Bidipta Sarkar, Zane Durante, Ran Gong, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Noboru Kuno, Ade Famoti, Ashley Llorens, John Langford, Hoi Vo, Li Fei-Fei, Katsu Ikeuchi, Jianfeng Gao

    Abstract: Recent advancements in large foundation models have remarkably enhanced our understanding of sensory information in open-world environments. In leveraging the power of foundation models, it is crucial for AI research to pivot away from excessive reductionism and toward an emphasis on systems that function as cohesive wholes. Specifically, we emphasize developing Agent AI -- an embodied system that…

    Submitted 28 February, 2024; originally announced March 2024.

    Comments: 22 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2401.03568

  3. arXiv:2402.05929  [pdf, other]

    cs.AI cs.LG cs.RO

    An Interactive Agent Foundation Model

    Authors: Zane Durante, Bidipta Sarkar, Ran Gong, Rohan Taori, Yusuke Noda, Paul Tang, Ehsan Adeli, Shrinidhi Kowshika Lakshmikanth, Kevin Schulman, Arnold Milstein, Demetri Terzopoulos, Ade Famoti, Noboru Kuno, Ashley Llorens, Hoi Vo, Katsu Ikeuchi, Li Fei-Fei, Jianfeng Gao, Naoki Wake, Qiuyuan Huang

    Abstract: The development of artificial intelligence systems is transitioning from creating static, task-specific models to dynamic, agent-based systems capable of performing well in a wide range of applications. We propose an Interactive Agent Foundation Model that uses a novel multi-task agent training paradigm for training AI agents across a wide range of domains, datasets, and tasks. Our training paradi…

    Submitted 17 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2401.03568  [pdf, other]

    cs.AI cs.HC cs.LG

    Agent AI: Surveying the Horizons of Multimodal Interaction

    Authors: Zane Durante, Qiuyuan Huang, Naoki Wake, Ran Gong, Jae Sung Park, Bidipta Sarkar, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Yejin Choi, Katsushi Ikeuchi, Hoi Vo, Li Fei-Fei, Jianfeng Gao

    Abstract: Multi-modal AI systems will likely become a ubiquitous presence in our everyday lives. A promising approach to making these systems more interactive is to embody them as agents within physical and virtual environments. At present, systems leverage existing foundation models as the basic building blocks for the creation of embodied agents. Embedding agents within such environments facilitates the a…

    Submitted 25 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  5. arXiv:2312.10338  [pdf, other]

    cs.CE

    Material Point Methods on Unstructured Tessellations: A Stable Kernel Approach With Continuous Gradient Reconstruction

    Authors: Yadi Cao, Yidong Zhao, Minchen Li, Yin Yang, Jinhyun Choo, Demetri Terzopoulos, Chenfanfu Jiang

    Abstract: The Material Point Method (MPM) is a hybrid Eulerian-Lagrangian simulation technique for solid mechanics with significant deformation. Structured background grids are commonly employed in the standard MPM, but they may give rise to several accuracy problems in handling complex geometries. When using (2D) unstructured triangular or (3D) tetrahedral background elements, however, significant challeng…

    Submitted 16 December, 2023; originally announced December 2023.

  6. arXiv:2312.05503  [pdf, other]

    cs.CL cs.AI cs.LG

    Aligner: One Global Token is Worth Millions of Parameters When Aligning Large Language Models

    Authors: Zhou Ziheng, Yingnian Wu, Song-Chun Zhu, Demetri Terzopoulos

    Abstract: We introduce Aligner, a novel Parameter-Efficient Fine-Tuning (PEFT) method for aligning multi-billion-parameter-sized Large Language Models (LLMs). Aligner employs a unique design that constructs a globally shared set of tunable tokens that modify the attention of every layer. Remarkably with this method, even when using one token accounting for a mere 5,000 parameters, Aligner can still perform…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 81 pages, 77 figures

    ACM Class: I.2; I.2.6; I.2.7

  7. arXiv:2311.04942  [pdf, other]

    eess.IV cs.CV

    CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: A large portion of volumetric medical data, especially magnetic resonance imaging (MRI) data, is anisotropic, as the through-plane resolution is typically much lower than the in-plane resolution. Both 3D and purely 2D deep learning-based segmentation methods are deficient in dealing with such volumetric data since the performance of 3D methods suffers when confronting anisotropic data, and 2D meth…

    Submitted 26 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  8. arXiv:2309.09971  [pdf, other]

    cs.AI cs.HC cs.MA

    MindAgent: Emergent Gaming Interaction

    Authors: Ran Gong, Qiuyuan Huang, Xiaojian Ma, Hoi Vo, Zane Durante, Yusuke Noda, Zilong Zheng, Song-Chun Zhu, Demetri Terzopoulos, Li Fei-Fei, Jianfeng Gao

    Abstract: Large Language Models (LLMs) have the capacity of performing complex scheduling in a multi-agent system and can coordinate these agents into completing sophisticated tasks that require extensive collaboration. However, despite the introduction of numerous gaming frameworks, the community has insufficient benchmarks towards building general multi-agents collaboration infrastructure that encompass b…

    Submitted 19 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: The first three authors contributed equally. 28 pages

  9. arXiv:2304.05047  [pdf, other]

    cs.CV

    Semi-Supervised Relational Contrastive Learning

    Authors: Attiano Purpura-Pontoniere, Demetri Terzopoulos, Adam Wang, Abdullah-Al-Zubaer Imran

    Abstract: Disease diagnosis from medical images via supervised learning is usually dependent on tedious, error-prone, and costly image labeling by medical experts. Alternatively, semi-supervised learning and self-supervised learning offer effectiveness through the acquisition of valuable insights from readily available unlabeled images. We present Semi-Supervised Relational Contrastive Learning (SRCL), a no…

    Submitted 13 June, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures, 2 tables

  10. arXiv:2304.04321  [pdf, other]

    cs.AI cs.CL cs.CV cs.RO

    ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes

    Authors: Ran Gong, Jiangyong Huang, Yizhou Zhao, Haoran Geng, Xiaofeng Gao, Qingyang Wu, Wensi Ai, Ziheng Zhou, Demetri Terzopoulos, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

    Abstract: Understanding the continuous states of objects is essential for task learning and planning in the real world. However, most existing task learning benchmarks assume discrete (e.g., binary) object goal states, which poses challenges for the learning of complex tasks and transferring learned policy from simulated environments to the real world. Furthermore, state discretization limits a robot's abil…

    Submitted 11 September, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: The first two authors contributed equally; 20 pages; 17 figures; project available: https://arnold-benchmark.github.io/ ICCV 2023

  11. mBEST: Realtime Deformable Linear Object Detection Through Minimal Bending Energy Skeleton Pixel Traversals

    Authors: Andrew Choi, Dezhong Tong, Brian Park, Demetri Terzopoulos, Jungseock Joo, Mohammad Khalid Jawed

    Abstract: Robotic manipulation of deformable materials is a challenging task that often requires realtime visual feedback. This is especially true for deformable linear objects (DLOs) or "rods", whose slender and flexible structures make proper tracking and detection nontrivial. To address this challenge, we present mBEST, a robust algorithm for the realtime detection of DLOs that is capable of producing an…

    Submitted 19 February, 2024; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: IEEE Robotics and Automation Letters (RA-L 2023). YouTube video: https://youtu.be/q84I9i0DOK4

  12. Learning Neural Force Manifolds for Sim2Real Robotic Symmetrical Paper Folding

    Authors: Andrew Choi, Dezhong Tong, Demetri Terzopoulos, Jungseock Joo, M. Khalid Jawed

    Abstract: Robotic manipulation of slender objects is challenging, especially when the induced deformations are large and nonlinear. Traditionally, learning-based control approaches, such as imitation learning, have been used to address deformable material manipulation. These approaches lack generality and often suffer critical failure from a simple switch of material, geometric, and/or environmental (e.g.,…

    Submitted 19 February, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: IEEE Transactions on Automation Science and Engineering (T-ASE 2024). First two authors have equal contribution. Supplementary video is available on YouTube: https://youtu.be/k0nexYGy-P4

  13. arXiv:2212.02575  [pdf, other]

    cs.LG cs.AI cs.SI

    A Mobility-Aware Deep Learning Model for Long-Term COVID-19 Pandemic Prediction and Policy Impact Analysis

    Authors: Danfeng Guo, Zijie Huang, Junheng Hao, Yizhou Sun, Wei Wang, Demetri Terzopoulos

    Abstract: Pandemic (epidemic) modeling, aiming at disease spreading analysis, has always been a popular research topic, especially following the outbreak of COVID-19 in 2019. Some representative models including SIR-based deep learning prediction models have shown satisfactory performance. However, one major drawback for them is that they fall short in their long-term predictive ability. Although graph convol…

    Submitted 5 December, 2022; originally announced December 2022.

  14. arXiv:2203.15163  [pdf, other]

    eess.IV cs.CV

    CAT-Net: A Cross-Slice Attention Transformer Model for Prostate Zonal Segmentation in MRI

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: Prostate cancer is the second leading cause of cancer death among men in the United States. The diagnosis of prostate MRI often relies on the accurate prostate zonal segmentation. However, state-of-the-art automatic segmentation methods often fail to produce well-contained volumetric segmentation of the prostate zones since certain slices of prostate MRI, such as base and apex slices, are harder t…

    Submitted 16 June, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

  15. arXiv:2203.14928  [pdf, other]

    eess.IV cs.CV cs.LG

    RAVIR: A Dataset and Methodology for the Semantic Segmentation and Quantitative Analysis of Retinal Arteries and Veins in Infrared Reflectance Imaging

    Authors: Ali Hatamizadeh, Hamid Hosseini, Niraj Patel, Jinseo Choi, Cameron C. Pole, Cory M. Hoeferlin, Steven D. Schwartz, Demetri Terzopoulos

    Abstract: The retinal vasculature provides important clues in the diagnosis and monitoring of systemic diseases including hypertension and diabetes. The microvascular system is of primary involvement in such conditions, and the retina is the only anatomical site where the microvasculature can be directly observed. The objective assessment of retinal vessels has long been considered a surrogate biomarker for…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Paper accepted to IEEE Journal of Biomedical Health Informatics (JBHI)

  16. arXiv:2111.06517  [pdf, other]

    cs.GR cs.CV

    Neuromuscular Control of the Face-Head-Neck Biomechanical Complex With Learning-Based Expression Transfer From Images and Videos

    Authors: Xiao S. Zeng, Surya Dwarakanath, Wuyue Lu, Masaki Nakada, Demetri Terzopoulos

    Abstract: The transfer of facial expressions from people to 3D face models is a classic computer graphics problem. In this paper, we present a novel, learning-based approach to transferring facial expressions and head movements from images and videos to a biomechanical model of the face-head-neck complex. Leveraging the Facial Action Coding System (FACS) as an intermediate representation of the expression s…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 12 pages, 7 figures, 2 tables

  17. Generalized Multi-Task Learning from Substantially Unlabeled Multi-Source Medical Image Data

    Authors: Ayaan Haque, Abdullah-Al-Zubaer Imran, Adam Wang, Demetri Terzopoulos

    Abstract: Deep learning-based models, when trained in a fully-supervised manner, can be effective in performing complex image analysis tasks, although contingent upon the availability of large labeled datasets. Especially in the medical imaging domain, however, expert image annotation is expensive, time-consuming, and prone to variability. Semi-supervised learning from limited quantities of labeled data has…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/

  18. arXiv:2103.10178  [pdf, other]

    cs.CV

    A Location-Sensitive Local Prototype Network for Few-Shot Medical Image Segmentation

    Authors: Qinji Yu, Kang Dang, Nima Tajbakhsh, Demetri Terzopoulos, Xiaowei Ding

    Abstract: Despite the tremendous success of deep neural networks in medical image segmentation, they typically require a large amount of costly, expert-level annotated data. Few-shot segmentation approaches address this issue by learning to transfer knowledge from limited quantities of labeled examples. Incorporating appropriate prior knowledge is critical in designing high-performance few-shot segmentation…

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: ISBI2021 accepted

  19. arXiv:2010.14731  [pdf, other]

    cs.CV cs.LG

    MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images

    Authors: Ayaan Haque, Abdullah-Al-Zubaer Imran, Adam Wang, Demetri Terzopoulos

    Abstract: Semi-supervised learning via learning from limited quantities of labeled data has been investigated as an alternative to supervised counterparts. Maximizing knowledge gains from copious unlabeled data benefits semi-supervised learning settings. Moreover, learning multiple tasks within the same model further improves model generalizability. We propose a novel multitask learning model, namely MultiMi…

    Submitted 1 April, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Accepted to IEEE International Symposium on Biomedical Imaging (ISBI) 2021

  20. arXiv:2007.11691  [pdf, other]

    cs.CV

    End-to-End Trainable Deep Active Contour Models for Automated Image Segmentation: Delineating Buildings in Aerial Imagery

    Authors: Ali Hatamizadeh, Debleena Sengupta, Demetri Terzopoulos

    Abstract: The automated segmentation of buildings in remote sensing imagery is a challenging task that requires the accurate delineation of multiple building instances over typically large image areas. Manual methods are often laborious and current deep-learning-based approaches fail to delineate all building instances and do so with adequate accuracy. As a solution, we present Trainable Deep Active Contour…

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: Accepted to European Conference on Computer Vision (ECCV) 2020

  21. arXiv:2005.14330  [pdf, other]

    eess.IV cs.CV cs.LG

    Bipartite Distance for Shape-Aware Landmark Detection in Spinal X-Ray Images

    Authors: Abdullah-Al-Zubaer Imran, Chao Huang, Hui Tang, Wei Fan, Kenneth M. C. Cheung, Michael To, Zhen Qian, Demetri Terzopoulos

    Abstract: Scoliosis is a congenital disease that causes lateral curvature in the spine. Its assessment relies on the identification and localization of vertebrae in spinal X-ray images, conventionally via tedious and time-consuming manual radiographic procedures that are prone to subjectivity and observational variability. Reliability can be improved through the automatic detection and localization of spina…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Presented at Med-NeurIPS 2019

  22. arXiv:2005.04311  [pdf, other]

    eess.IV cs.CV

    Progressive Adversarial Semantic Segmentation

    Authors: Abdullah-Al-Zubaer Imran, Demetri Terzopoulos

    Abstract: Medical image computing has advanced rapidly with the advent of deep learning techniques such as convolutional neural networks. Deep convolutional neural networks can perform exceedingly well given full supervision. However, the success of such fully-supervised models for various image analysis tasks (e.g., anatomy or lesion segmentation from medical images) is limited to the availability of massi…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 9 pages, 5 figures, 12 tables

  23. arXiv:2005.02523  [pdf, other]

    cs.CV

    Partly Supervised Multitask Learning

    Authors: Abdullah-Al-Zubaer Imran, Chao Huang, Hui Tang, Wei Fan, Yuan Xiao, Dingjun Hao, Zhen Qian, Demetri Terzopoulos

    Abstract: Semi-supervised learning has recently been attracting attention as an alternative to fully supervised models that require large pools of labeled data. Moreover, optimizing a model for multiple tasks can provide better generalizability than single-task learning. Leveraging self-supervision and adversarial training, we propose a novel general purpose semi-supervised, multiple-task model---namely, se…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 10 pages, 8 figures, 3 tables

  24. arXiv:2004.06887  [pdf, other]

    eess.IV cs.CV

    Analysis of Scoliosis From Spinal X-Ray Images

    Authors: Abdullah-Al-Zubaer Imran, Chao Huang, Hui Tang, Wei Fan, Kenneth M. C. Cheung, Michael To, Zhen Qian, Demetri Terzopoulos

    Abstract: Scoliosis is a congenital disease in which the spine is deformed from its normal shape. Measurement of scoliosis requires labeling and identification of vertebrae in the spine. Spine radiographs are the most cost-effective and accessible modality for imaging the spine. Reliable and accurate vertebrae segmentation in spine radiographs is crucial in image-guided spinal assessment, disease diagnosis,…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: 6 pages, 6 figures, 3 tables

  25. arXiv:2002.04207  [pdf, other]

    eess.IV cs.CV

    Edge-Gated CNNs for Volumetric Semantic Segmentation of Medical Images

    Authors: Ali Hatamizadeh, Demetri Terzopoulos, Andriy Myronenko

    Abstract: Textures and edges contribute different information to image recognition. Edges and boundaries encode shape information, while textures manifest the appearance of regions. Despite the success of Convolutional Neural Networks (CNNs) in computer vision and medical image analysis applications, predominantly only texture abstractions are learned, which often leads to imprecise boundary delineations. I…

    Submitted 11 February, 2020; originally announced February 2020.

  26. arXiv:2001.05566  [pdf, other]

    cs.CV cs.LG

    Image Segmentation Using Deep Learning: A Survey

    Authors: Shervin Minaee, Yuri Boykov, Fatih Porikli, Antonio Plaza, Nasser Kehtarnavaz, Demetri Terzopoulos

    Abstract: Image segmentation is a key topic in image processing and computer vision with applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among many others. Various algorithms for image segmentation have been developed in the literature. Recently, due to the success of deep learning models in a wide range of v…

    Submitted 14 November, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  27. arXiv:1909.13359  [pdf, other]

    cs.CV

    End-to-End Deep Convolutional Active Contours for Image Segmentation

    Authors: Ali Hatamizadeh, Debleena Sengupta, Demetri Terzopoulos

    Abstract: The Active Contour Model (ACM) is a standard image analysis technique whose numerous variants have attracted an enormous amount of research attention across multiple fields. Incorrectly, however, the ACM's differential-equation-based formulation and prototypical dependence on user initialization have been regarded as being largely incompatible with the recently popular deep learning approaches to…

    Submitted 4 October, 2019; v1 submitted 29 September, 2019; originally announced September 2019.

  28. arXiv:1908.08071  [pdf, other]

    cs.CV cs.LG eess.IV

    End-to-End Boundary Aware Networks for Medical Image Segmentation

    Authors: Ali Hatamizadeh, Demetri Terzopoulos, Andriy Myronenko

    Abstract: Fully convolutional neural networks (CNNs) have proven to be effective at representing and classifying textural information, thus transforming image intensity into output class masks that achieve semantic image segmentation. In medical image analysis, however, expert manual segmentation often relies on the boundaries of anatomical structures of interest. We propose boundary aware CNNs for medical…

    Submitted 10 September, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Accepted to MICCAI Machine Learning in Medical Imaging (MLMI 2019)

    Journal ref: MLMI 2019

  29. arXiv:1908.06933  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Active Lesion Segmentation

    Authors: Ali Hatamizadeh, Assaf Hoogi, Debleena Sengupta, Wuyue Lu, Brian Wilcox, Daniel Rubin, Demetri Terzopoulos

    Abstract: Lesion segmentation is an important problem in computer-assisted diagnosis that remains challenging due to the prevalence of low contrast, irregular boundaries that are unamenable to shape priors. We introduce Deep Active Lesion Segmentation (DALS), a fully automated segmentation framework that leverages the powerful nonlinear feature extraction abilities of fully Convolutional Neural Networks…

    Submitted 30 August, 2020; v1 submitted 19 August, 2019; originally announced August 2019.

    Comments: Accepted to Machine Learning in Medical Imaging (MLMI 2019). Link to source code added

    Journal ref: MLMI 2019

  30. arXiv:1908.03693  [pdf, other]

    eess.IV cs.CV

    Semi-Supervised Multi-Task Learning With Chest X-Ray Images

    Authors: Abdullah-Al-Zubaer Imran, Demetri Terzopoulos

    Abstract: Discriminative models that require full supervision are inefficacious in the medical imaging domain when large labeled datasets are unavailable. By contrast, generative modeling---i.e., learning data generation and classification---facilitates semi-supervised training with limited labeled data. Moreover, generative modeling can be advantageous in accomplishing multiple objectives for better genera…

    Submitted 26 August, 2019; v1 submitted 10 August, 2019; originally announced August 2019.

    Comments: Accepted to Machine Learning in Medical Imaging (MLMI 2019)

  31. arXiv:1906.06430  [pdf, other]

    cs.LG cs.CV stat.ML

    Multi-Adversarial Variational Autoencoder Networks

    Authors: Abdullah-Al-Zubaer Imran, Demetri Terzopoulos

    Abstract: The unsupervised training of GANs and VAEs has enabled them to generate realistic images mimicking real-world distributions and perform image-based unsupervised clustering or semi-supervised classification. Combining the power of these two generative models, we introduce Multi-Adversarial Variational autoEncoder Networks (MAVENs), a novel network architecture that incorporates an ensemble of discr…

    Submitted 14 June, 2019; originally announced June 2019.

  32. arXiv:1905.12120  [pdf, other]

    eess.IV cs.CV

    Deep Dilated Convolutional Nets for the Automatic Segmentation of Retinal Vessels

    Authors: Ali Hatamizadeh, Hamid Hosseini, Zhengyuan Liu, Steven D. Schwartz, Demetri Terzopoulos

    Abstract: The reliable segmentation of retinal vasculature can provide the means to diagnose and monitor the progression of a variety of diseases affecting the blood vessel network, including diabetes and hypertension. We leverage the power of convolutional neural networks to devise a reliable and fully automated method that can accurately detect, segment, and analyze retinal vessels. In particular, we prop…

    Submitted 20 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  33. Automatic Segmentation of Pulmonary Lobes Using a Progressive Dense V-Network

    Authors: Abdullah-Al-Zubaer Imran, Ali Hatamizadeh, Shilpa P. Ananth, Xiaowei Ding, Demetri Terzopoulos, Nima Tajbakhsh

    Abstract: Reliable and automatic segmentation of lung lobes is important for diagnosis, assessment, and quantification of pulmonary diseases. The existing techniques are prohibitively slow, undesirably rely on prior (airway/vessel) segmentation, and/or require user interactions for optimal results. This work presents a reliable, fast, and fully automated lung lobe segmentation based on a progressive dense V…

    Submitted 17 February, 2019; originally announced February 2019.

  34. arXiv:1901.08707  [pdf, other]

    cs.CV

    Surrogate Supervision for Medical Image Analysis: Effective Deep Learning From Limited Quantities of Labeled Data

    Authors: Nima Tajbakhsh, Yufei Hu, Junli Cao, Xingjian Yan, Yi Xiao, Yong Lu, Jianming Liang, Demetri Terzopoulos, Xiaowei Ding

    Abstract: We investigate the effectiveness of a simple solution to the common problem of deep learning in medical image analysis with limited quantities of labeled training data. The underlying idea is to assign artificial labels to abundantly available unlabeled medical images and, through a process known as surrogate supervision, pre-train a deep neural network model for the target medical image analysis…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Accepted in IEEE International Symposium on Biomedical Imaging (ISBI 2019)

  35. arXiv:1810.05977  [pdf, other]

    cs.CV

    Learning to Sketch with Deep Q Networks and Demonstrated Strokes

    Authors: Tao Zhou, Chen Fang, Zhaowen Wang, Jimei Yang, Byungmoon Kim, Zhili Chen, Jonathan Brandt, Demetri Terzopoulos

    Abstract: Doodling is a useful and common intelligent skill that people can learn and master. In this work, we propose a two-stage learning framework to teach a machine to doodle in a simulated painting environment via Stroke Demonstration and deep Q-learning (SDQ). The developed system, Doodle-SDQ, generates a sequence of pen actions to reproduce a reference drawing and mimics the behavior of human painter…

    Submitted 14 October, 2018; originally announced October 2018.

  36. Fast and Scalable Position-Based Layout Synthesis

    Authors: Tomer Weiss, Alan Litteneker, Noah Duncan, Masaki Nakada, Chenfanfu Jiang, Lap-Fai Yu, Demetri Terzopoulos

    Abstract: The arrangement of objects into a layout can be challenging for non-experts, as is affirmed by the existence of interior design professionals. Recent research into the automation of this task has yielded methods that can synthesize layouts of objects respecting aesthetic and functional constraints that are non-linear and competing. These methods usually adopt a stochastic optimization scheme, whic…

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: 13 pages

    Journal ref: Transactions on Visualization and Computer Graphics, 21 August 2018

  37. Position-Based Multi-Agent Dynamics for Real-Time Crowd Simulation (MiG paper)

    Authors: Tomer Weiss, Alan Litteneker, Chenfanfu Jiang, Demetri Terzopoulos

    Abstract: Exploiting the efficiency and stability of Position-Based Dynamics (PBD), we introduce a novel crowd simulation method that runs at interactive rates for hundreds of thousands of agents. Our method enables the detailed modeling of per-agent behavior in a Lagrangian formulation. We model short-range and long-range collision avoidance to simulate both sparse and dense crowds. On the particles repres…

    Submitted 19 February, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: 9 pages

    Journal ref: MIG 2017 Proceedings of the Tenth International Conference on Motion in Games

  38. arXiv:1705.08923  [pdf, other]

    cs.CV

    Attention-based Natural Language Person Retrieval

    Authors: Tao Zhou, Muhao Chen, Jie Yu, Demetri Terzopoulos

    Abstract: Following the recent progress in image classification and captioning using deep learning, we develop a novel natural language person retrieval system based on an attention mechanism. More specifically, given the description of a person, the goal is to localize the person in an image. To this end, we first construct a benchmark dataset for natural language person retrieval. To do so, we generate bo…

    Submitted 24 May, 2017; originally announced May 2017.

    Comments: CVPR 2017 Workshop (vision meets cognition)

  39. Configurable 3D Scene Synthesis and 2D Image Rendering with Per-Pixel Ground Truth using Stochastic Grammars

    Authors: Chenfanfu Jiang, Siyuan Qi, Yixin Zhu, Siyuan Huang, Jenny Lin, Lap-Fai Yu, Demetri Terzopoulos, Song-Chun Zhu

    Abstract: We propose a systematic learning-based approach to the generation of massive quantities of synthetic 3D scenes and arbitrary numbers of photorealistic 2D images thereof, with associated ground truth information, for the purposes of training, benchmarking, and diagnosing learning-based computer vision and robotics algorithms. In particular, we devise a learning-based pipeline of algorithms capable…

    Submitted 20 June, 2018; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: Accepted in IJCV 2018