Skip to main content

Showing 1–13 of 13 results for author: Teterwak, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01449  [pdf, other

    cs.CV

    SLANT: Spurious Logo ANalysis Toolkit

    Authors: Maan Qraitem, Piotr Teterwak, Kate Saenko, Bryan A. Plummer

    Abstract: Online content is filled with logos, from ads and social media posts to website branding and product placements. Consequently, these logos are prevalent in the extensive web-scraped datasets used to pretrain Vision-Language Models, which are used for a wide array of tasks (content moderation, object classification). While these models have been shown to learn harmful correlations in various tasks,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2402.00626  [pdf, other

    cs.CV cs.CR cs.LG

    Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks

    Authors: Maan Qraitem, Nazia Tasnim, Piotr Teterwak, Kate Saenko, Bryan A. Plummer

    Abstract: Typographic Attacks, which involve pasting misleading text onto an image, were noted to harm the performance of Vision-Language Models like CLIP. However, the susceptibility of recent Large Vision-Language Models to these attacks remains understudied. Furthermore, prior work's Typographic attacks against CLIP randomly sample a misleading class from a predefined set of categories. However, this sim… ▽ More

    Submitted 16 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  3. arXiv:2312.01629  [pdf, other

    cs.CV

    CLAMP: Contrastive LAnguage Model Prompt-tuning

    Authors: Piotr Teterwak, Ximeng Sun, Bryan A. Plummer, Kate Saenko, Ser-Nam Lim

    Abstract: Large language models (LLMs) have emerged as powerful general-purpose interfaces for many machine learning problems. Recent work has adapted LLMs to generative visual tasks like image captioning, visual question answering, and visual chat, using a relatively small amount of instruction-tuning data. In this paper, we explore whether modern LLMs can also be adapted to classifying an image into a set… ▽ More

    Submitted 26 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  4. arXiv:2312.01274  [pdf, other

    cs.CV

    Learning to Compose SuperWeights for Neural Parameter Allocation Search

    Authors: Piotr Teterwak, Soren Nelson, Nikoli Dryden, Dina Bashkirova, Kate Saenko, Bryan A. Plummer

    Abstract: Neural parameter allocation search (NPAS) automates parameter sharing by obtaining weights for a network given an arbitrary, fixed parameter budget. Prior work has two major drawbacks we aim to address. First, there is a disconnect in the sharing pattern between the search and training steps, where weights are warped for layers of different sizes during the search to measure similarity, but not du… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at IEEE Winter Conference on Applications of Computer Vision (WACV) 2024

  5. arXiv:2311.04251  [pdf, other

    cs.LG cs.AI cs.CV

    MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters

    Authors: Chau Pham, Piotr Teterwak, Soren Nelson, Bryan A. Plummer

    Abstract: Most deep neural networks are trained under fixed network architectures and require retraining when the architecture changes. If expanding the network's size is needed, it is necessary to retrain from scratch, which is expensive. To avoid this, one can grow from a small network by adding random weights over time to gradually achieve the target network size. However, this naive approach falls short… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE Winter Conference on Applications of Computer Vision (WACV) 2024

  6. arXiv:2304.01973  [pdf, other

    cs.LG cs.CV

    ERM++: An Improved Baseline for Domain Generalization

    Authors: Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis, Kate Saenko, Bryan A. Plummer

    Abstract: Domain Generalization (DG) measures a classifier's ability to generalize to new distributions of data it was not trained on. Recent work has shown that a hyperparameter-tuned Empirical Risk Minimization (ERM) training procedure, that is simply minimizing the empirical risk on the source domains, can outperform most existing DG methods. ERM has achieved such strong results while only tuning hyper-p… ▽ More

    Submitted 26 March, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: An improved baseline for Domain Generalization

  7. arXiv:2303.14828  [pdf, other

    cs.CV

    VisDA 2022 Challenge: Domain Adaptation for Industrial Waste Sorting

    Authors: Dina Bashkirova, Samarth Mishra, Diala Lteif, Piotr Teterwak, Donghyun Kim, Fadi Alladkani, James Akl, Berk Calli, Sarah Adel Bargal, Kate Saenko, Daehan Kim, Minseok Seo, Young** Jeon, Dong-Geol Choi, Shahaf Ettedgui, Raja Giryes, Shady Abu-Hussein, Binhui Xie, Shuang Li

    Abstract: Label-efficient and reliable semantic segmentation is essential for many real-life applications, especially for industrial settings with high visual diversity, such as waste sorting. In industrial waste sorting, one of the biggest challenges is the extreme diversity of the input stream depending on factors like the location of the sorting facility, the equipment available in the facility, and the… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Proceedings of Machine Learning Research

  8. arXiv:2303.14744  [pdf, other

    cs.CV

    Mind the Backbone: Minimizing Backbone Distortion for Robust Object Detection

    Authors: Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Rogerio Feris, Kate Saenko

    Abstract: Building object detectors that are robust to domain shifts is critical for real-world applications. Prior approaches fine-tune a pre-trained backbone and risk overfitting it to in-distribution (ID) data and distorting features useful for out-of-distribution (OOD) generalization. We propose to use Relative Gradient Norm (RGN) as a way to measure the vulnerability of a backbone to feature distortion… ▽ More

    Submitted 15 May, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Project page: http://ai.bu.edu/mind_back/

  9. arXiv:2108.10860  [pdf, other

    cs.CV

    Tune it the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density

    Authors: Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Stan Sclaroff, Trevor Darrell, Kate Saenko

    Abstract: Unsupervised domain adaptation (UDA) methods can dramatically improve generalization on unlabeled target domains. However, optimal hyper-parameter selection is critical to achieving high accuracy and avoiding negative transfer. Supervised hyper-parameter validation is not possible without labeled target data, which raises the question: How can we validate unsupervised adaptation techniques in a re… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: ICCV2021

  10. arXiv:2107.11011  [pdf, other

    cs.LG

    VisDA-2021 Competition Universal Domain Adaptation to Improve Performance on Out-of-Distribution Data

    Authors: Dina Bashkirova, Dan Hendrycks, Donghyun Kim, Samarth Mishra, Kate Saenko, Kuniaki Saito, Piotr Teterwak, Ben Usman

    Abstract: Progress in machine learning is typically measured by training and testing a model on the same distribution of data, i.e., the same domain. This over-estimates future accuracy on out-of-distribution data. The Visual Domain Adaptation (VisDA) 2021 competition tests models' ability to adapt to novel test distributions and handle distributional shift. We set up unsupervised domain adaptation challeng… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Neurips 2021 Competition Track

  11. arXiv:2103.07470  [pdf, other

    cs.LG

    Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers

    Authors: Piotr Teterwak, Chiyuan Zhang, Dilip Krishnan, Michael C. Mozer

    Abstract: A discriminatively trained neural net classifier can fit the training data perfectly if all information about its input other than class membership has been discarded prior to the output layer. Surprisingly, past research has discovered that some extraneous visual detail remains in the logit vector. This finding is based on inversion techniques that map deep embeddings back to images. We explore t… ▽ More

    Submitted 21 July, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: Camera Ready ICML 2021

  12. arXiv:2004.11362  [pdf, other

    cs.LG cs.CV stat.ML

    Supervised Contrastive Learning

    Authors: Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, Dilip Krishnan

    Abstract: Contrastive learning applied to self-supervised representation learning has seen a resurgence in recent years, leading to state of the art performance in the unsupervised training of deep image models. Modern batch contrastive approaches subsume or significantly outperform traditional contrastive losses such as triplet, max-margin and the N-pairs loss. In this work, we extend the self-supervised b… ▽ More

    Submitted 10 March, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

  13. arXiv:1908.07007  [pdf, other

    cs.CV

    Boundless: Generative Adversarial Networks for Image Extension

    Authors: Piotr Teterwak, Aaron Sarna, Dilip Krishnan, Aaron Maschinot, David Belanger, Ce Liu, William T. Freeman

    Abstract: Image extension models have broad applications in image editing, computational photography and computer graphics. While image inpainting has been extensively studied in the literature, it is challenging to directly apply the state-of-the-art inpainting methods to image extension as they tend to generate blurry or repetitive pixels with inconsistent semantics. We introduce semantic conditioning to… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.