Skip to main content

Showing 1–7 of 7 results for author: Lewis, K M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.18064  [pdf, other

    cs.CV

    GELDA: A generative language annotation framework to reveal visual biases in datasets

    Authors: Krish Kabra, Kathleen M. Lewis, Guha Balakrishnan

    Abstract: Bias analysis is a crucial step in the process of creating fair datasets for training and evaluating computer vision models. The bottleneck in dataset analysis is annotation, which typically requires: (1) specifying a list of attributes relevant to the dataset domain, and (2) classifying each image-attribute pair. While the second step has made rapid progress in automation, the first has remained… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 21 pages, 15 figures, 9 tables

  2. arXiv:2307.11315  [pdf, other

    cs.CV cs.CL

    GIST: Generating Image-Specific Text for Fine-grained Object Classification

    Authors: Kathleen M. Lewis, Emily Mu, Adrian V. Dalca, John Guttag

    Abstract: Recent vision-language models outperform vision-only models on many image classification tasks. However, because of the absence of paired text/image descriptions, it remains difficult to fine-tune these models for fine-grained image classification. In this work, we propose a method, GIST, for generating image-specific fine-grained text descriptions from image-only datasets, and show that these tex… ▽ More

    Submitted 4 August, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: The first two authors contributed equally to this work and are listed in alphabetical order

  3. arXiv:2211.02892  [pdf, other

    cs.CV

    SizeGAN: Improving Size Representation in Clothing Catalogs

    Authors: Kathleen M. Lewis, John Guttag

    Abstract: Online clothing catalogs lack diversity in body shape and garment size. Brands commonly display their garments on models of one or two sizes, rarely including plus-size models. To our knowledge, our paper presents the first method for generating images of garments and models in a new target size to tackle the size under-representation problem. Our primary technical contribution is a conditional ge… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

  4. arXiv:2102.08540  [pdf, other

    cs.HC cs.AI cs.LG

    Intuitively Assessing ML Model Reliability through Example-Based Explanations and Editing Model Inputs

    Authors: Harini Suresh, Kathleen M. Lewis, John V. Guttag, Arvind Satyanarayan

    Abstract: Interpretability methods aim to help users build trust in and understand the capabilities of machine learning models. However, existing approaches often rely on abstract, complex visualizations that poorly map to the task at hand or require non-trivial ML expertise to interpret. Here, we present two visual analytics modules that facilitate an intuitive assessment of model reliability. To help user… ▽ More

    Submitted 9 July, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  5. arXiv:2101.02285  [pdf, other

    cs.CV cs.GR

    TryOnGAN: Body-Aware Try-On via Layered Interpolation

    Authors: Kathleen M Lewis, Srivatsan Varadharajan, Ira Kemelmacher-Shlizerman

    Abstract: Given a pair of images-target person and garment on another person-we automatically generate the target person in the given garment. Previous methods mostly focused on texture transfer via paired data training, while overlooking body shape deformations, skin color, and seamless blending of garment with the person. This work focuses on those three components, while also not requiring paired data tr… ▽ More

    Submitted 2 June, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

  6. arXiv:2001.01026  [pdf, other

    cs.GR cs.CV

    Painting Many Pasts: Synthesizing Time Lapse Videos of Paintings

    Authors: Amy Zhao, Guha Balakrishnan, Kathleen M. Lewis, Frédo Durand, John V. Guttag, Adrian V. Dalca

    Abstract: We introduce a new video synthesis task: synthesizing time lapse videos depicting how a given painting might have been created. Artists paint using unique combinations of brushes, strokes, and colors. There are often many possible ways to create a given painting. Our goal is to learn to capture this rich range of possibilities. Creating distributions of long-term videos is a challenge for learni… ▽ More

    Submitted 25 April, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: 10 pages, CVPR 2020

  7. arXiv:1812.06932  [pdf, other

    cs.CV cs.LG q-bio.QM stat.ML

    Fast Learning-based Registration of Sparse 3D Clinical Images

    Authors: Kathleen M. Lewis, Natalia S. Rost, John Guttag, Adrian V. Dalca

    Abstract: We introduce SparseVM, a method that registers clinical-quality 3D MR scans both faster and more accurately than previously possible. Deformable alignment, or registration, of clinical scans is a fundamental task for many clinical neuroscience studies. However, most registration algorithms are designed for high-resolution research-quality scans. In contrast to research-quality scans, clinical scan… ▽ More

    Submitted 6 April, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: This version was accepted to CHIL. It builds on the previous version of the paper and includes more experimental results