Skip to main content

Showing 1–22 of 22 results for author: Gee, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18344  [pdf, other

    cs.CV

    AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

    Authors: Huzheng Yang, James Gee, Jianbo Shi

    Abstract: We study the intriguing connection between visual data, deep networks, and the brain. Our method creates a universal channel alignment by using brain voxel fMRI response prediction as the training objective. We discover that deep networks, trained with different objectives, share common feature channels across various models. These channels can be clustered into recurring sets, corresponding to di… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.07361  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Implicit Optimization for Robust and Flexible Image Registration

    Authors: Rohit Jena, Pratik Chaudhari, James C. Gee

    Abstract: Deep Learning in Image Registration (DLIR) methods have been tremendously successful in image registration due to their speed and ability to incorporate weak label supervision at training time. However, DLIR methods forego many of the benefits of classical optimization-based methods. The functional nature of deep networks do not guarantee that the predicted transformation is a local minima of the… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2405.14839  [pdf, other

    cs.CV cs.CL

    A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis

    Authors: Yue Yang, Mona Gandhi, Yufei Wang, Yifan Wu, Michael S. Yao, Chris Callison-Burch, James C. Gee, Mark Yatskar

    Abstract: While deep networks have achieved broad success in analyzing natural images, when applied to medical scans, they often fail in unexcepted situations. We investigate this challenge and focus on model sensitivity to domain shifts, such as data sampled from different hospitals or data confounded by demographic variables such as sex, race, etc, in the context of chest X-rays and skin lesion images. A… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 23 pages, 9 figures, 12 tables, project page: https://yueyang1996.github.io/knobo/

  4. arXiv:2404.02106  [pdf, other

    cs.CV cs.CE

    Neural Ordinary Differential Equation based Sequential Image Registration for Dynamic Characterization

    Authors: Yifan Wu, Meng** Dong, Rohit Jena, Chen Qin, James C. Gee

    Abstract: Deformable image registration (DIR) is crucial in medical image analysis, enabling the exploration of biological dynamics such as organ motions and longitudinal changes in imaging. Leveraging Neural Ordinary Differential Equations (ODE) for registration, this extension work discusses how this framework can aid in the characterization of sequential biological processes. Utilizing the Neural ODE's a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Journal extension of NODEO: A Neural Ordinary Differential Equation Based Optimization Framework for Deformable Image Registration, CVPR 2022

  5. arXiv:2404.01249  [pdf, other

    cs.CV

    FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Registration

    Authors: Rohit Jena, Pratik Chaudhari, James C. Gee

    Abstract: Diffeomorphic Image Registration is a critical part of the analysis in various imaging modalities and downstream tasks like image translation, segmentation, and atlas building. Registration algorithms based on optimization have stood the test of time in terms of accuracy, reliability, and robustness across a wide spectrum of modalities and acquisition settings. However, these algorithms converge s… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  6. arXiv:2403.05606  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data

    Authors: Yifan Wu, Yang Liu, Yue Yang, Michael S. Yao, Wenli Yang, Xuehui Shi, Lihong Yang, Dongjun Li, Yueming Liu, James C. Gee, Xuan Yang, Wenbin Wei, Shi Gu

    Abstract: Diagnosing rare diseases presents a common challenge in clinical practice, necessitating the expertise of specialists for accurate identification. The advent of machine learning offers a promising solution, while the development of such technologies is hindered by the scarcity of data on rare conditions and the demand for models that are both interpretable and trustworthy in a clinical context. In… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2402.06532  [pdf, other

    cs.LG cs.AI

    Generative Adversarial Bayesian Optimization for Surrogate Objectives

    Authors: Michael S. Yao, Yimeng Zeng, Hamsa Bastani, Jacob Gardner, James C. Gee, Osbert Bastani

    Abstract: Offline model-based policy optimization seeks to optimize a learned surrogate objective function without querying the true oracle objective during optimization. However, inaccurate surrogate model predictions are frequently encountered along the optimization trajectory. To address this limitation, we propose generative adversarial Bayesian optimization (GABO) using adaptive source critic regulariz… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 15 pages, 3 figures

  8. arXiv:2312.11593  [pdf, other

    cs.CV

    Towards Establishing Dense Correspondence on Multiview Coronary Angiography: From Point-to-Point to Curve-to-Curve Query Matching

    Authors: Yifan Wu, Rohit Jena, Mehmet Gulsun, Vivek Singh, Puneet Sharma, James C. Gee

    Abstract: Coronary angiography is the gold standard imaging technique for studying and diagnosing coronary artery disease. However, the resulting 2D X-ray projections lose 3D information and exhibit visual ambiguities. In this work, we aim to establish dense correspondence in multi-view angiography, serving as a fundamental basis for various clinical applications and downstream tasks. To overcome the challe… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2312.01280  [pdf, other

    cs.CV

    Brain Decodes Deep Nets

    Authors: Huzheng Yang, James Gee, Jianbo Shi

    Abstract: We developed a tool for visualizing and analyzing large pre-trained vision models by map** them onto the brain, thus exposing their hidden inside. Our innovation arises from a surprising usage of brain encoding: predicting brain fMRI measurements in response to images. We report two findings. First, explicit map** between the brain and deep-network features across dimensions of space, layers,… ▽ More

    Submitted 29 March, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Website: see https://huzeyann.github.io/brain-decodes-deep-nets . Code: see https://github.com/huzeyann/BrainDecodesDeepNets

  10. arXiv:2311.10812  [pdf, other

    cs.CV cs.GR cs.LG

    SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

    Authors: Rohit Jena, Ganesh Subramanian Iyer, Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, James Gee

    Abstract: We propose SplatArmor, a novel approach for recovering detailed and animatable human models by `armoring' a parameterized body model with 3D Gaussians. Our approach represents the human as a set of 3D Gaussians within a canonical space, whose articulation is defined by extending the skinning of the underlying SMPL geometry to arbitrary locations in the canonical space. To account for pose-dependen… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  11. arXiv:2311.09193  [pdf, other

    cs.CL cs.AI cs.CV

    The Role of Chain-of-Thought in Complex Vision-Language Reasoning Task

    Authors: Yifan Wu, Pengchuan Zhang, Wenhan Xiong, Barlas Oguz, James C. Gee, Yixin Nie

    Abstract: The study explores the effectiveness of the Chain-of-Thought approach, known for its proficiency in language tasks by breaking them down into sub-tasks and intermediate steps, in improving vision-language tasks that demand sophisticated perception and reasoning. We present the "Description then Decision" strategy, which is inspired by how humans process signals. This strategy significantly improve… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  12. arXiv:2308.01175  [pdf, other

    cs.CV

    Memory Encoding Model

    Authors: Huzheng Yang, James Gee, Jianbo Shi

    Abstract: We explore a new class of brain encoding model by adding memory-related information as input. Memory is an essential brain mechanism that works alongside visual stimuli. During a vision-memory cognitive task, we found the non-visual brain is largely predictable using previously seen images. Our Memory Encoding Model (Mem) won the Algonauts 2023 visual brain competition even without model ensemble… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  13. arXiv:2307.14021  [pdf, other

    cs.CV

    Retinotopy Inspired Brain Encoding Model and the All-for-One Training Recipe

    Authors: Huzheng Yang, Jianbo Shi, James Gee

    Abstract: Brain encoding models aim to predict brain voxel-wise responses to stimuli images, replicating brain signals captured by neuroimaging techniques. There is a large volume of publicly available data, but training a comprehensive brain encoding model is challenging. The main difficulties stem from a) diversity within individual brain, with functional heterogeneous brain regions; b) diversity of brain… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  14. arXiv:2303.08808  [pdf, other

    cs.CV

    Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos

    Authors: Rohit Jena, Pratik Chaudhari, James Gee, Ganesh Iyer, Siddharth Choudhary, Brandon M. Smith

    Abstract: Human reconstruction and synthesis from monocular RGB videos is a challenging problem due to clothing, occlusion, texture discontinuities and sharpness, and framespecific pose changes. Many methods employ deferred rendering, NeRFs and implicit methods to represent clothed humans, on the premise that mesh-based representations cannot capture complex clothing and textures from RGB, silhouettes, and… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  15. arXiv:2209.10043  [pdf, other

    cs.LG cs.AI eess.IV q-bio.QM

    SynthA1c: Towards Clinically Interpretable Patient Representations for Diabetes Risk Stratification

    Authors: Michael S. Yao, Allison Chae, Matthew T. MacLean, Anurag Verma, Jeffrey Duda, James Gee, Drew A. Torigian, Daniel Rader, Charles Kahn, Walter R. Witschey, Hersh Sagreiya

    Abstract: Early diagnosis of Type 2 Diabetes Mellitus (T2DM) is crucial to enable timely therapeutic interventions and lifestyle modifications. As the time available for clinical office visits shortens and medical imaging data become more widely available, patient image data could be used to opportunistically identify patients for additional T2DM diagnostic workup by physicians. We investigated whether imag… ▽ More

    Submitted 27 July, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: 12 pages. Accepted to PRIME MICCAI 2023

  16. arXiv:2207.01614  [pdf, other

    cs.CV cs.LG

    Beyond mAP: Towards better evaluation of instance segmentation

    Authors: Rohit Jena, Lukas Zhornyak, Nehal Doiphode, Pratik Chaudhari, Vivek Buch, James Gee, Jianbo Shi

    Abstract: Correctness of instance segmentation constitutes counting the number of objects, correctly localizing all predictions and classifying each localized prediction. Average Precision is the de-facto metric used to measure all these constituents of segmentation. However, this metric does not penalize duplicate predictions in the high-recall range, and cannot distinguish instances that are localized cor… ▽ More

    Submitted 20 March, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at CVPR 2023

  17. arXiv:2112.06979  [pdf, other

    eess.IV cs.CV

    The Brain Tumor Sequence Registration (BraTS-Reg) Challenge: Establishing Correspondence Between Pre-Operative and Follow-up MRI Scans of Diffuse Glioma Patients

    Authors: Bhakti Baheti, Satrajit Chakrabarty, Hamed Akbari, Michel Bilello, Benedikt Wiestler, Julian Schwarting, Evan Calabrese, Jeffrey Rudie, Syed Abidi, Mina Mousa, Javier Villanueva-Meyer, Brandon K. K. Fields, Florian Kofler, Russell Takeshi Shinohara, Juan Eugenio Iglesias, Tony C. W. Mok, Albert C. S. Chung, Marek Wodzinski, Artur Jurgas, Niccolo Marini, Manfredo Atzori, Henning Muller, Christoph Grobroehmer, Hanna Siebert, Lasse Hansen , et al. (48 additional authors not shown)

    Abstract: Registration of longitudinal brain MRI scans containing pathologies is challenging due to dramatic changes in tissue appearance. Although there has been progress in develo** general-purpose medical image registration techniques, they have not yet attained the requisite precision and reliability for this task, highlighting its inherent complexity. Here we describe the Brain Tumor Sequence Registr… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 December, 2021; originally announced December 2021.

  18. arXiv:2108.03443  [pdf, other

    cs.CV

    NODEO: A Neural Ordinary Differential Equation Based Optimization Framework for Deformable Image Registration

    Authors: Yifan Wu, Tom Z. Jiahao, Jiancong Wang, Paul A. Yushkevich, M. Ani Hsieh, James C. Gee

    Abstract: Deformable image registration (DIR), aiming to find spatial correspondence between images, is one of the most critical problems in the domain of medical image analysis. In this paper, we present a novel, generic, and accurate diffeomorphic image registration framework that utilizes neural ordinary differential equations (NODEs). We model each voxel as a moving particle and consider the set of all… ▽ More

    Submitted 6 February, 2023; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted by the IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022

  19. arXiv:2010.05154  [pdf, other

    cs.LG cs.AI stat.ML

    Lambda Learner: Fast Incremental Learning on Data Streams

    Authors: Rohan Ramanath, Konstantin Salomatin, Jeffrey D. Gee, Kirill Talanine, Onkar Dalal, Gungor Polatkan, Sara Smoot, Deepak Kumar

    Abstract: One of the most well-established applications of machine learning is in deciding what content to show website visitors. When observation data comes from high-velocity, user-generated data streams, machine learning methods perform a balancing act between model complexity, training time, and computational costs. Furthermore, when model freshness is critical, the training of models becomes time-const… ▽ More

    Submitted 28 June, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

  20. arXiv:2006.02038  [pdf, other

    cs.CV

    Nested Scale Editing for Conditional Image Synthesis

    Authors: Lingzhi Zhang, Jiancong Wang, Yinshuang Xu, Jie Min, Tarmily Wen, James C. Gee, Jianbo Shi

    Abstract: We propose an image synthesis approach that provides stratified navigation in the latent code space. With a tiny amount of partial or very low-resolution image, our approach can consistently out-perform state-of-the-art counterparts in terms of generating the closest sampled image to the ground truth. We achieve this through scale-independent editing while expanding scale-specific diversity. Scale… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

  21. arXiv:1907.04835  [pdf, other

    eess.IV cs.CV

    Enhanced generative adversarial network for 3D brain MRI super-resolution

    Authors: Jiancong Wang, Yuhua Chen, Yifan Wu, Jianbo Shi, James Gee

    Abstract: Single image super-resolution (SISR) reconstruction for magnetic resonance imaging (MRI) has generated significant interest because of its potential to not only speed up imaging but to improve quantitative processing and analysis of available image data. Generative Adversarial Networks (GAN) have proven to perform well in recovering image texture detail, and many variants have therefore been propo… ▽ More

    Submitted 15 July, 2019; v1 submitted 10 July, 2019; originally announced July 2019.

  22. arXiv:1907.04834  [pdf, other

    cs.CV

    Barnes-Hut Approximation for Point SetGeodesic Shooting

    Authors: Jiancong Wang, Long Xie, Paul Yushkevich, James Gee

    Abstract: Geodesic shooting has been successfully applied to diffeo-morphic registration of point sets. Exact computation of the geodesicshooting between point sets, however, requiresO(N2) calculations each time step on the number of points in the point set. We proposean approximation approach based on the Barnes-Hut algorithm to speedup point set geodesic shooting. This approximation can reduce the al-gori… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.