Skip to main content

Showing 1–48 of 48 results for author: Guttag, J

.
  1. arXiv:2401.13650  [pdf, other

    eess.IV cs.CV

    Tyche: Stochastic In-Context Learning for Medical Image Segmentation

    Authors: Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca

    Abstract: Existing learning-based solutions to medical image segmentation have two important shortcomings. First, for most new segmentation task, a new model has to be trained or fine-tuned. This requires extensive resources and machine learning expertise, and is therefore often infeasible for medical researchers and clinicians. Second, most existing segmentation methods produce a single deterministic segme… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  2. arXiv:2312.07381  [pdf, other

    cs.CV eess.IV

    ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image

    Authors: Hallee E. Wong, Marianne Rakic, John Guttag, Adrian V. Dalca

    Abstract: Biomedical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific biomedical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present ScribblePrompt, a flexible neural network bas… ▽ More

    Submitted 12 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Project Website: https://scribbleprompt.csail.mit.edu Keywords: Interactive Segmentation, Medical Imaging, Segment Anything Model, SAM, Scribble Annotations, Prompt

  3. arXiv:2307.11315  [pdf, other

    cs.CV cs.CL

    GIST: Generating Image-Specific Text for Fine-grained Object Classification

    Authors: Kathleen M. Lewis, Emily Mu, Adrian V. Dalca, John Guttag

    Abstract: Recent vision-language models outperform vision-only models on many image classification tasks. However, because of the absence of paired text/image descriptions, it remains difficult to fine-tune these models for fine-grained image classification. In this work, we propose a method, GIST, for generating image-specific fine-grained text descriptions from image-only datasets, and show that these tex… ▽ More

    Submitted 4 August, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: The first two authors contributed equally to this work and are listed in alphabetical order

  4. arXiv:2307.10923  [pdf, other

    cs.LG

    Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series

    Authors: Aniruddh Raghu, Payal Chandak, Ridwan Alam, John Guttag, Collin M. Stultz

    Abstract: Self-supervised learning (SSL) for clinical time series data has received significant attention in recent literature, since these data are highly rich and provide important information about a patient's physiological state. However, most existing SSL methods for clinical time series are limited in that they are designed for unimodal time series, such as a sequence of structured features (e.g., lab… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  5. arXiv:2307.02712  [pdf, other

    cs.LG

    Multi-Similarity Contrastive Learning

    Authors: Emily Mu, John Guttag, Maggie Makar

    Abstract: Given a similarity metric, contrastive methods learn a representation in which examples that are similar are pushed together and examples that are dissimilar are pulled apart. Contrastive learning techniques have been utilized extensively to learn representations for tasks ranging from image classification to caption generation. However, existing contrastive learning approaches can fail to general… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  6. arXiv:2304.09270  [pdf, other

    cs.CY cs.LG stat.AP

    Coarse race data conceals disparities in clinical risk score performance

    Authors: Rajiv Movva, Divya Shanmugam, Kaihua Hou, Priya Pathak, John Guttag, Nikhil Garg, Emma Pierson

    Abstract: Healthcare data in the United States often records only a patient's coarse race group: for example, both Indian and Chinese patients are typically coded as "Asian." It is unknown, however, whether this coarse coding conceals meaningful disparities in the performance of clinical risk scores across granular race groups. Here we show that it does. Using data from 418K emergency department visits, we… ▽ More

    Submitted 24 August, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Published at MLHC 2023. v2 includes minor changes from the camera-ready, such as a link to code. Code is available at https://github.com/rmovva/granular-race-disparities_MLHC23

    ACM Class: J.3; K.4.2

  7. arXiv:2304.07645  [pdf, other

    cs.LG cs.AI

    Magnitude Invariant Parametrizations Improve Hypernetwork Learning

    Authors: Jose Javier Gonzalez Ortiz, John Guttag, Adrian Dalca

    Abstract: Hypernetworks, neural networks that predict the parameters of another neural network, are powerful models that have been successfully used in diverse applications from image generation to multi-task learning. Unfortunately, existing hypernetworks are often challenging to train. Training typically converges far more slowly than for non-hypernetwork models, and the rate of convergence can be very se… ▽ More

    Submitted 29 June, 2023; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: Source code at https://github.com/JJGO/hyperlight

  8. arXiv:2304.06131  [pdf, other

    cs.CV cs.LG

    UniverSeg: Universal Medical Image Segmentation

    Authors: Victor Ion Butoi, Jose Javier Gonzalez Ortiz, Tianyu Ma, Mert R. Sabuncu, John Guttag, Adrian V. Dalca

    Abstract: While deep learning models have become the predominant method for medical image segmentation, they are typically not capable of generalizing to unseen segmentation tasks involving new anatomies, image modalities, or labels. Given a new segmentation task, researchers generally have to train or fine-tune models, which is time-consuming and poses a substantial barrier for clinical researchers, who of… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Victor and Jose Javier contributed equally to this work. Project Website: https://universeg.csail.mit.edu

  9. arXiv:2304.05448  [pdf, other

    cs.CV cs.LG

    Scale-Space Hypernetworks for Efficient Biomedical Imaging

    Authors: Jose Javier Gonzalez Ortiz, John Guttag, Adrian Dalca

    Abstract: Convolutional Neural Networks (CNNs) are the predominant model used for a variety of medical image analysis tasks. At inference time, these models are computationally intensive, especially with volumetric data. In principle, it is possible to trade accuracy for computational efficiency by manipulating the rescaling factor in the downsample and upsample layers of CNN architectures. However, properl… ▽ More

    Submitted 29 June, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Code available at https://github.com/JJGO/scale-space-hypernetworks

  10. arXiv:2211.02892  [pdf, other

    cs.CV

    SizeGAN: Improving Size Representation in Clothing Catalogs

    Authors: Kathleen M. Lewis, John Guttag

    Abstract: Online clothing catalogs lack diversity in body shape and garment size. Brands commonly display their garments on models of one or two sizes, rarely including plus-size models. To our knowledge, our paper presents the first method for generating images of garments and models in a new target size to tackle the size under-representation problem. Our primary technical contribution is a conditional ge… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

  11. arXiv:2207.04312  [pdf, other

    cs.CY

    At the Intersection of Deep Learning and Conceptual Art: The End of Signature

    Authors: Divya Shanmugam, Katie Lewis, Jose Javier Gonzalez-Ortiz, Agnieszka Kurant, John Guttag

    Abstract: MIT wanted to commission a large scale artwork that would serve to 'illuminate a new campus gateway, inaugurate a space of exchange between MIT and Cambridge, and inspire our students, faculty, visitors, and the surrounding community to engage with art in new ways and to have art be part of their daily lives.' Among other things, the art was to reflect the fact that scientific discovery is often t… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  12. arXiv:2206.13607  [pdf, other

    cs.LG cs.CL

    Improved Text Classification via Test-Time Augmentation

    Authors: Helen Lu, Divya Shanmugam, Harini Suresh, John Guttag

    Abstract: Test-time augmentation -- the aggregation of predictions across transformed examples of test inputs -- is an established technique to improve the performance of image classification models. Importantly, TTA can be used to improve model performance post-hoc, without additional training. Although test-time augmentation (TTA) can be applied to any data modality, it has seen limited adoption in NLP du… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  13. Saliency Cards: A Framework to Characterize and Compare Saliency Methods

    Authors: Angie Boggust, Harini Suresh, Hendrik Strobelt, John V. Guttag, Arvind Satyanarayan

    Abstract: Saliency methods are a common class of machine learning interpretability techniques that calculate how important each input feature is to a model's output. We find that, with the rapid pace of development, users struggle to stay informed of the strengths and limitations of new methods and, thus, choose methods for unprincipled reasons (e.g., popularity). Moreover, despite a corresponding rise in e… ▽ More

    Submitted 30 May, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Published at FAccT 2023, 19 pages, 8 figures, 2 tables

  14. arXiv:2204.04360  [pdf, other

    cs.LG

    Data Augmentation for Electrocardiograms

    Authors: Aniruddh Raghu, Divya Shanmugam, Eugene Pomerantsev, John Guttag, Collin M. Stultz

    Abstract: Neural network models have demonstrated impressive performance in predicting pathologies and outcomes from the 12-lead electrocardiogram (ECG). However, these models often need to be trained with large, labelled datasets, which are not available for many predictive tasks of interest. In this work, we perform an empirical study examining whether training time data augmentation methods can be used t… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Conference on Health, Inference, and Learning (CHIL) 2022

  15. arXiv:2203.16680  [pdf, other

    cs.CV cs.LG eess.IV

    Learning the Effect of Registration Hyperparameters with HyperMorph

    Authors: Andrew Hoopes, Malte Hoffmann, Douglas N. Greve, Bruce Fischl, John Guttag, Adrian V. Dalca

    Abstract: We introduce HyperMorph, a framework that facilitates efficient hyperparameter tuning in learning-based deformable image registration. Classical registration algorithms perform an iterative pair-wise optimization to compute a deformation field that aligns two images. Recent learning-based approaches leverage large image datasets to learn a function that rapidly estimates a deformation for a given… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) at https://www.melba-journal.org

  16. arXiv:2106.10860  [pdf, other

    cs.LG cs.AR cs.PF stat.ML

    Multiplying Matrices Without Multiplying

    Authors: Davis Blalock, John Guttag

    Abstract: Multiplying matrices is among the most fundamental and compute-intensive operations in machine learning. Consequently, there has been significant work on efficiently approximating matrix multiplies. We introduce a learning-based algorithm for this task that greatly outperforms existing methods. Experiments using hundreds of matrices from diverse domains show that it often runs $100\times$ faster t… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: To appear at ICML 2021

    Journal ref: PMLR 139:992-1004, 2021

  17. arXiv:2103.02768  [pdf, other

    cs.LG

    Learning to Predict with Supporting Evidence: Applications to Clinical Risk Prediction

    Authors: Aniruddh Raghu, John Guttag, Katherine Young, Eugene Pomerantsev, Adrian V. Dalca, Collin M. Stultz

    Abstract: The impact of machine learning models on healthcare will depend on the degree of trust that healthcare professionals place in the predictions made by these models. In this paper, we present a method to provide people with clinical expertise with domain-relevant evidence about why a prediction should be trusted. We first design a probabilistic model that relates meaningful latent concepts to predic… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: ACM Conference on Health, Learning, and Inference 2021

  18. arXiv:2102.08540  [pdf, other

    cs.HC cs.AI cs.LG

    Intuitively Assessing ML Model Reliability through Example-Based Explanations and Editing Model Inputs

    Authors: Harini Suresh, Kathleen M. Lewis, John V. Guttag, Arvind Satyanarayan

    Abstract: Interpretability methods aim to help users build trust in and understand the capabilities of machine learning models. However, existing approaches often rely on abstract, complex visualizations that poorly map to the task at hand or require non-trivial ML expertise to interpret. Here, we present two visual analytics modules that facilitate an intuitive assessment of model reliability. To help user… ▽ More

    Submitted 9 July, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  19. arXiv:2101.01035  [pdf, other

    cs.CV eess.IV

    HyperMorph: Amortized Hyperparameter Learning for Image Registration

    Authors: Andrew Hoopes, Malte Hoffmann, Bruce Fischl, John Guttag, Adrian V. Dalca

    Abstract: We present HyperMorph, a learning-based strategy for deformable image registration that removes the need to tune important registration hyperparameters during training. Classical registration methods solve an optimization problem to find a set of spatial correspondences between two images, while learning-based methods leverage a training dataset to learn a function that generates these corresponde… ▽ More

    Submitted 4 May, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: IPMI 2021: Information Processing in Medical Imaging. Keywords: Deformable Image Registration, Hyperparameter Search, Deep Learning, Hypernetworks, and Amortized Learning

  20. arXiv:2011.11156  [pdf, other

    cs.CV

    Better Aggregation in Test-Time Augmentation

    Authors: Divya Shanmugam, Davis Blalock, Guha Balakrishnan, John Guttag

    Abstract: Test-time augmentation -- the aggregation of predictions across transformed versions of a test input -- is a common practice in image classification. Traditionally, predictions are combined using a simple average. In this paper, we present 1) experimental analyses that shed light on cases in which the simple average is suboptimal and 2) a method to address these shortcomings. A key finding is that… ▽ More

    Submitted 11 October, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Journal ref: ICCV 2021

  21. arXiv:2007.10233  [pdf, other

    cs.CV cs.LG

    Unsupervised Domain Adaptation in the Absence of Source Data

    Authors: Roshni Sahoo, Divya Shanmugam, John Guttag

    Abstract: Current unsupervised domain adaptation methods can address many types of distribution shift, but they assume data from the source domain is freely available. As the use of pre-trained models becomes more prevalent, it is reasonable to assume that source data is unavailable. We propose an unsupervised method for adapting a source classifier to a target domain that varies from the source domain alon… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  22. arXiv:2006.00090  [pdf, other

    cs.CV eess.IV

    Anatomical Predictions using Subject-Specific Medical Data

    Authors: Marianne Rakic, John Guttag, Adrian V. Dalca

    Abstract: Changes over time in brain anatomy can provide important insight for treatment design or scientific analyses. We present a method that predicts how a brain MRI for an individual will change over time. We model changes using a diffeomorphic deformation field that we predict using function using convolutional neural networks. Given a predicted deformation field, a baseline scan can be warped to give… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: Accepted as a short paper to MIDL2020. Keywords: Medical Imaging, Multi-Modal, Prediction

    Report number: MIDL/2020/ExtendedAbstract/apwZYLKTCo

  23. arXiv:2003.03033  [pdf, other

    cs.LG stat.ML

    What is the State of Neural Network Pruning?

    Authors: Davis Blalock, Jose Javier Gonzalez Ortiz, Jonathan Frankle, John Guttag

    Abstract: Neural network pruning---the task of reducing the size of a network by removing parameters---has been the subject of a great deal of work in recent years. We provide a meta-analysis of the literature, including an overview of approaches to pruning and consistent findings in the literature. After aggregating results across 81 papers and pruning hundreds of models in controlled conditions, our clear… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Published in Proceedings of Machine Learning and Systems 2020 (MLSys 2020)

  24. arXiv:2001.01026  [pdf, other

    cs.GR cs.CV

    Painting Many Pasts: Synthesizing Time Lapse Videos of Paintings

    Authors: Amy Zhao, Guha Balakrishnan, Kathleen M. Lewis, Frédo Durand, John V. Guttag, Adrian V. Dalca

    Abstract: We introduce a new video synthesis task: synthesizing time lapse videos depicting how a given painting might have been created. Artists paint using unique combinations of brushes, strokes, and colors. There are often many possible ways to create a given painting. Our goal is to learn to capture this rich range of possibilities. Creating distributions of long-term videos is a challenge for learni… ▽ More

    Submitted 25 April, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: 10 pages, CVPR 2020

  25. arXiv:1912.00262  [pdf, other

    q-bio.QM cs.CV cs.LG eess.IV q-bio.TO

    Image segmentation of liver stage malaria infection with spatial uncertainty sampling

    Authors: Ava P. Soleimany, Harini Suresh, Jose Javier Gonzalez Ortiz, Divya Shanmugam, Nil Gural, John Guttag, Sangeeta N. Bhatia

    Abstract: Global eradication of malaria depends on the development of drugs effective against the silent, yet obligate liver stage of the disease. The gold standard in drug development remains microscopic imaging of liver stage parasites in in vitro cell culture models. Image analysis presents a major bottleneck in this pipeline since the parasite has significant variability in size, shape, and density in t… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

  26. arXiv:1910.04817  [pdf, other

    cs.LG stat.ML

    Estimation of Bounds on Potential Outcomes For Decision Making

    Authors: Maggie Makar, Fredrik D. Johansson, John Guttag, David Sontag

    Abstract: Estimation of individual treatment effects is commonly used as the basis for contextual decision making in fields such as healthcare, education, and economics. However, it is often sufficient for the decision maker to have estimates of upper and lower bounds on the potential outcomes of decision alternatives to assess risks and benefits. We show that, in such cases, we can improve sample efficienc… ▽ More

    Submitted 12 August, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Journal ref: ICML 2020

  27. arXiv:1909.00475  [pdf, other

    cs.CV

    Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions

    Authors: Guha Balakrishnan, Adrian V. Dalca, Amy Zhao, John V. Guttag, Fredo Durand, William T. Freeman

    Abstract: We introduce visual deprojection: the task of recovering an image or video that has been collapsed along a dimension. Projections arise in various contexts, such as long-exposure photography, where a dynamic scene is collapsed in time to produce a motion-blurred image, and corner cameras, where reflected light from a scene is collapsed along a spatial dimension because of an edge occluder to yield… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: ICCV 2019

  28. arXiv:1908.02738  [pdf, other

    cs.CV cs.LG eess.IV

    Learning Conditional Deformable Templates with Convolutional Networks

    Authors: Adrian V. Dalca, Marianne Rakic, John Guttag, Mert R. Sabuncu

    Abstract: We develop a learning framework for building deformable templates, which play a fundamental role in many image analysis and computational anatomy tasks. Conventional methods for template creation and image alignment to the template have undergone decades of rich technical development. In these frameworks, templates are constructed using an iterative process of template estimation and alignment, wh… ▽ More

    Submitted 11 October, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: NeurIPS 2019: Neural Information Processing Systems. Keywords: deformable templates, conditional atlases, diffeomorphic image registration, probabilistic models, neuroimaging

    Journal ref: NeurIPS: Thirty-third Conference on Neural Information Processing Systems, 2019

  29. Unsupervised Learning of Probabilistic Diffeomorphic Registration for Images and Surfaces

    Authors: Adrian V. Dalca, Guha Balakrishnan, John Guttag, Mert R. Sabuncu

    Abstract: Classical deformable registration techniques achieve impressive results and offer a rigorous theoretical treatment, but are computationally intensive since they solve an optimization problem for each image pair. Recently, learning-based methods have facilitated fast registration by learning spatial deformation functions. However, these approaches use restricted deformation models, require supervis… ▽ More

    Submitted 23 July, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: MedIA: Medical Image Analysis (MICCAI2018 Special Issue). Expands on MICCAI 2018 paper (arXiv:1805.04605) by introducing an extension to anatomical surface registration, new experiments, and analysis of diffeomorphic implementations. Keywords: medical image registration; diffeomorphic; invertible; probabilistic modeling; variational inference. Code available at http://voxelmorph.csail.mit.edu. arXiv admin note: text overlap with arXiv:1805.04605

  30. arXiv:1903.03503  [pdf, other

    cs.CV cs.LG

    Unsupervised Data Imputation via Variational Inference of Deep Subspaces

    Authors: Adrian V. Dalca, John Guttag, Mert R. Sabuncu

    Abstract: A wide range of systems exhibit high dimensional incomplete data. Accurate estimation of the missing data is often desired, and is crucial for many downstream analyses. Many state-of-the-art recovery methods involve supervised learning using datasets containing full observations. In contrast, we focus on unsupervised estimation of missing image data, where no full observations are available - a co… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

  31. Anatomical Priors in Convolutional Networks for Unsupervised Biomedical Segmentation

    Authors: Adrian V. Dalca, John Guttag, Mert R. Sabuncu

    Abstract: We consider the problem of segmenting a biomedical image into anatomical regions of interest. We specifically address the frequent scenario where we have no paired training data that contains images and their manual segmentations. Instead, we employ unpaired segmentation images to build an anatomical prior. Critically these segmentations can be derived from imaging data from a different dataset an… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

    Comments: Presented at CVPR 2018. IEEE CVPR proceedings pp. 9290-9299

  32. arXiv:1902.09383  [pdf, other

    cs.CV

    Data augmentation using learned transformations for one-shot medical image segmentation

    Authors: Amy Zhao, Guha Balakrishnan, Frédo Durand, John V. Guttag, Adrian V. Dalca

    Abstract: Image segmentation is an important task in many medical applications. Methods based on convolutional neural networks attain state-of-the-art accuracy; however, they typically rely on supervised training with large labeled datasets. Labeling medical images requires significant expertise and time, and typical hand-tuned approaches for data augmentation fail to capture the complex variations in such… ▽ More

    Submitted 6 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 9 pages, CVPR 2019

  33. A Framework for Understanding Sources of Harm throughout the Machine Learning Life Cycle

    Authors: Harini Suresh, John V. Guttag

    Abstract: As machine learning (ML) increasingly affects people and society, awareness of its potential unwanted consequences has also grown. To anticipate, prevent, and mitigate undesirable downstream consequences, it is critical that we understand when and how harm might be introduced throughout the ML life cycle. In this paper, we provide a framework that identifies seven distinct potential sources of dow… ▽ More

    Submitted 1 December, 2021; v1 submitted 28 January, 2019; originally announced January 2019.

    Journal ref: EAAMO 2021: Equity and Access in Algorithms, Mechanisms, and Optimization

  34. arXiv:1812.06932  [pdf, other

    cs.CV cs.LG q-bio.QM stat.ML

    Fast Learning-based Registration of Sparse 3D Clinical Images

    Authors: Kathleen M. Lewis, Natalia S. Rost, John Guttag, Adrian V. Dalca

    Abstract: We introduce SparseVM, a method that registers clinical-quality 3D MR scans both faster and more accurately than previously possible. Deformable alignment, or registration, of clinical scans is a fundamental task for many clinical neuroscience studies. However, most registration algorithms are designed for high-resolution research-quality scans. In contrast to research-quality scans, clinical scan… ▽ More

    Submitted 6 April, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: This version was accepted to CHIL. It builds on the previous version of the paper and includes more experimental results

  35. arXiv:1812.00475  [pdf, other

    cs.LG stat.ML

    Multiple Instance Learning for ECG Risk Stratification

    Authors: Divya Shanmugam, Davis Blalock, John Guttag

    Abstract: Patients who suffer an acute coronary syndrome are at elevated risk for adverse cardiovascular events such as myocardial infarction and cardiovascular death. Accurate assessment of this risk is crucial to their course of care. We focus on estimating a patient's risk of cardiovascular death after an acute coronary syndrome based on a patient's raw electrocardiogram (ECG) signal. Learning from this… ▽ More

    Submitted 25 March, 2020; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Healthcare Conference (MLHC 2019)

  36. VoxelMorph: A Learning Framework for Deformable Medical Image Registration

    Authors: Guha Balakrishnan, Amy Zhao, Mert R. Sabuncu, John Guttag, Adrian V. Dalca

    Abstract: We present VoxelMorph, a fast learning-based framework for deformable, pairwise medical image registration. Traditional registration methods optimize an objective function for each pair of images, which can be time-consuming for large datasets or rich deformation models. In contrast to this approach, and building on recent learning-based methods, we formulate registration as a function that maps a… ▽ More

    Submitted 1 September, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: Accepted to IEEE TMI ( (c) IEEE). This manuscript expands the CVPR 2018 paper (arXiv:1802.02604) by introducing an auxiliary model that uses segmentation maps during training, an amortized optimization analysis, and extensive model analysis. Code available at http://voxelmorph.csail.mit.edu

  37. Sprintz: Time Series Compression for the Internet of Things

    Authors: Davis Blalock, Samuel Madden, John Guttag

    Abstract: Thanks to the rapid proliferation of connected devices, sensor-generated time series constitute a large and growing portion of the world's data. Often, this data is collected from distributed, resource-constrained devices and centralized at one or more servers. A key challenge in this setup is reducing the size of the transmitted data without sacrificing its quality. Lower quality reduces the data… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

  38. Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU

    Authors: Harini Suresh, Jen J. Gong, John Guttag

    Abstract: Machine learning approaches have been effective in predicting adverse outcomes in different clinical settings. These models are often developed and evaluated on datasets with heterogeneous patient populations. However, good predictive performance on the aggregate population does not imply good performance for specific groups. In this work, we present a two-step framework to 1) learn relevant pat… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: KDD 2018

  39. arXiv:1806.00397  [pdf, other

    cs.CY

    Visualizing Patient Timelines in the Intensive Care Unit

    Authors: Dina Levy-Lambert, Jen J. Gong, Tristan Naumann, Tom J. Pollard, John V. Guttag

    Abstract: Electronic Health Records (EHRs) contain a large volume of heterogeneous patient data, which are useful at the point of care and for retrospective research. These data are typically stored in relational databases. Gaining an integrated view of these data for a single patient typically requires complex SQL queries joining multiple tables. In this work, we present a visualization tool that integrate… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  40. Unsupervised Learning for Fast Probabilistic Diffeomorphic Registration

    Authors: Adrian V. Dalca, Guha Balakrishnan, John Guttag, Mert R. Sabuncu

    Abstract: Traditional deformable registration techniques achieve impressive results and offer a rigorous theoretical treatment, but are computationally intensive since they solve an optimization problem for each image pair. Recently, learning-based methods have facilitated fast registration by learning spatial deformation functions. However, these approaches use restricted deformation models, require superv… ▽ More

    Submitted 14 September, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: MICCAI 2018 (Oral Presentation). Proceedings: LNCS 11070, pp 729-738

    Journal ref: LNCS 11070, pp 729-738, Springer. 2018

  41. arXiv:1804.07739  [pdf, other

    cs.CV

    Synthesizing Images of Humans in Unseen Poses

    Authors: Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Fredo Durand, John Guttag

    Abstract: We address the computational problem of novel human pose synthesis. Given an image of a person and a desired pose, we produce a depiction of that person in that pose, retaining the appearance of both the person and background. We present a modular generative neural network that synthesizes unseen poses using training pairs of images and poses taken from human action videos. Our network separates a… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: CVPR 2018

  42. An Unsupervised Learning Model for Deformable Medical Image Registration

    Authors: Guha Balakrishnan, Amy Zhao, Mert R. Sabuncu, John Guttag, Adrian V. Dalca

    Abstract: We present a fast learning-based algorithm for deformable, pairwise 3D medical image registration. Current registration methods optimize an objective function independently for each pair of images, which can be time-consuming for large data. We define registration as a parametric function, and optimize its parameters given a set of images from a collection of interest. Given a new pair of scans, w… ▽ More

    Submitted 20 April, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: 9 pages, in CVPR 2018

  43. arXiv:1712.00643  [pdf, other

    cs.SI physics.soc-ph

    Learning the Probability of Activation in the Presence of Latent Spreaders

    Authors: Maggie Makar, John Guttag, Jenna Wiens

    Abstract: When an infection spreads in a community, an individual's probability of becoming infected depends on both her susceptibility and exposure to the contagion through contact with others. While one often has knowledge regarding an individual's susceptibility, in many cases, whether or not an individual's contacts are contagious is unknown. We study the problem of predicting if an individual will adop… ▽ More

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: To appear in AAA1-18

  44. Bolt: Accelerated Data Mining with Fast Vector Compression

    Authors: Davis W Blalock, John V Guttag

    Abstract: Vectors of data are at the heart of machine learning and data mining. Recently, vector quantization methods have shown great promise in reducing both the time and space costs of operating on vectors. We introduce a vector quantization algorithm that can compress vectors over 12x faster than existing techniques while also accelerating approximate vector operations such as distance and dot product c… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: Research track paper at KDD 2017

  45. arXiv:1612.04007  [pdf, other

    cs.CV

    A Video-Based Method for Objectively Rating Ataxia

    Authors: Ronnachai Jaroensri, Amy Zhao, Guha Balakrishnan, Derek Lo, Jeremy Schmahmann, John Guttag, Fredo Durand

    Abstract: For many movement disorders, such as Parkinson's disease and ataxia, disease progression is visually assessed by a clinician using a numerical disease rating scale. These tests are subjective, time-consuming, and must be administered by a professional. This can be problematic where specialists are not available, or when a patient is not consistently evaluated by the same clinician. We present an a… ▽ More

    Submitted 7 September, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: MLHC 2017

  46. arXiv:1609.09196  [pdf, other

    stat.ML cs.DB cs.LG

    EXTRACT: Strong Examples from Weakly-Labeled Sensor Data

    Authors: Davis W. Blalock, John V. Guttag

    Abstract: Thanks to the rise of wearable and connected devices, sensor-generated time series comprise a large and growing fraction of the world's data. Unfortunately, extracting value from this data can be challenging, since sensors report low-level signals (e.g., acceleration), not the high-level events that are typically of interest (e.g., gestures). We introduce a technique to bridge this gap by automati… ▽ More

    Submitted 29 September, 2016; originally announced September 2016.

    Comments: To appear in IEEE International Conference on Data Mining 2016

  47. arXiv:1608.02301  [pdf, other

    cs.LG

    Uncovering Voice Misuse Using Symbolic Mismatch

    Authors: Marzyeh Ghassemi, Zeeshan Syed, Daryush D. Mehta, Jarrad H. Van Stan, Robert E. Hillman, John V. Guttag

    Abstract: Voice disorders affect an estimated 14 million working-aged Americans, and many more worldwide. We present the first large scale study of vocal misuse based on long-term ambulatory data collected by an accelerometer placed on the neck. We investigate an unsupervised data mining approach to uncovering latent information about voice misuse. We segment signals from over 253 days of data from 22 sub… ▽ More

    Submitted 7 August, 2016; originally announced August 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

  48. arXiv:1608.02071  [pdf, ps, other

    cs.LG cs.CL

    Transferring Knowledge from Text to Predict Disease Onset

    Authors: Yun Liu, Kun-Ta Chuang, Fu-Wen Liang, Huey-Jen Su, Collin M. Stultz, John V. Guttag

    Abstract: In many domains such as medicine, training data is in short supply. In such cases, external knowledge is often helpful in building predictive models. We propose a novel method to incorporate publicly available domain expertise to build accurate models. Specifically, we use word2vec models trained on a domain-specific corpus to estimate the relevance of each feature's text description to the predic… ▽ More

    Submitted 6 August, 2016; originally announced August 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

    Report number: Proceedings of Machine Learning Research Volume 56