Skip to main content

Showing 1–17 of 17 results for author: Tzeng, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.12244  [pdf, other

    cs.CV cs.AI cs.LG

    Large-scale Reinforcement Learning for Diffusion Models

    Authors: Yinan Zhang, Eric Tzeng, Yilun Du, Dmitry Kislyuk

    Abstract: Text-to-image diffusion models are a class of deep generative models that have demonstrated an impressive capacity for high-quality image generation. However, these models are susceptible to implicit biases that arise from web-scale text-image training pairs and may inaccurately model aspects of images we care about. This can result in suboptimal samples, model bias, and images that do not align w… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  2. arXiv:2012.09958  [pdf, other

    cs.CV cs.AI cs.LG

    Toward Transformer-Based Object Detection

    Authors: Josh Beal, Eric Kim, Eric Tzeng, Dong Huk Park, Andrew Zhai, Dmitry Kislyuk

    Abstract: Transformers have become the dominant model in natural language processing, owing to their ability to pretrain on massive amounts of data, then transfer to smaller, more specific tasks via fine-tuning. The Vision Transformer was the first major attempt to apply a pure transformer model directly to images as input, demonstrating that as compared to convolutional networks, transformer-based architec… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  3. arXiv:2004.00137  [pdf, other

    cs.CV

    Revisiting Few-shot Activity Detection with Class Similarity Control

    Authors: Huijuan Xu, Ximeng Sun, Eric Tzeng, Abir Das, Kate Saenko, Trevor Darrell

    Abstract: Many interesting events in the real world are rare making preannotated machine learning ready videos a rarity in consequence. Thus, temporal activity detection models that are able to learn from a few examples are desirable. In this paper, we present a conceptually simple and general yet novel framework for few-shot temporal activity detection based on proposal regression which detects the start a… ▽ More

    Submitted 31 March, 2020; originally announced April 2020.

  4. arXiv:1911.11357  [pdf, other

    cs.LG cs.CV stat.ML

    Semantic Bottleneck Scene Generation

    Authors: Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic

    Abstract: Coupling the high-fidelity generation capabilities of label-conditional image synthesis methods with the flexibility of unconditional generative models, we propose a semantic bottleneck GAN model for unconditional synthesis of complex scenes. We assume pixel-wise segmentation labels are available during training and use them to learn the scene structure. During inference, our model first synthesiz… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

  5. arXiv:1909.11825  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Domain Adaptation through Self-Supervision

    Authors: Yu Sun, Eric Tzeng, Trevor Darrell, Alexei A. Efros

    Abstract: This paper addresses unsupervised domain adaptation, the setting where labeled training data is available on a source domain, but the goal is to have good performance on a target domain with only unlabeled data. Like much of previous work, we seek to align the learned representations of the source and target domains while preserving discriminability. The way we accomplish alignment is by learning… ▽ More

    Submitted 29 September, 2019; v1 submitted 25 September, 2019; originally announced September 2019.

  6. arXiv:1908.01707  [pdf, other

    cs.CV

    Learning a Unified Embedding for Visual Search at Pinterest

    Authors: Andrew Zhai, Hao-Yu Wu, Eric Tzeng, Dong Huk Park, Charles Rosenberg

    Abstract: At Pinterest, we utilize image embeddings throughout our search and recommendation systems to help our users navigate through visual content by powering experiences like browsing of related content and searching for exact products for shop**. In this work we describe a multi-task deep metric learning system to learn a single unified image embedding which can be used to power our multiple visual… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge and Discovery and Data Mining, 2019

  7. arXiv:1812.00929  [pdf, other

    cs.CV

    SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection

    Authors: Eric Tzeng, Kaylee Burns, Kate Saenko, Trevor Darrell

    Abstract: Domain adaptation of visual detectors is a critical challenge, yet existing methods have overlooked pixel appearance transformations, focusing instead on bootstrap** and/or domain confusion losses. We propose a Semantic Pixel-Level Adaptation Transform (SPLAT) approach to detector adaptation that efficiently generates cross-domain image pairs. Our model uses aligned-pair and/or pseudo-label loss… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  8. arXiv:1711.03213  [pdf, other

    cs.CV

    CyCADA: Cycle-Consistent Adversarial Domain Adaptation

    Authors: Judy Hoffman, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola, Kate Saenko, Alexei A. Efros, Trevor Darrell

    Abstract: Domain adaptation is critical for success in new, unseen environments. Adversarial adaptation models applied in feature spaces discover domain invariant representations, but are difficult to visualize and sometimes fail to capture pixel-level and low-level domain shifts. Recent work has shown that generative adversarial networks combined with cycle-consistency constraints are surprisingly effectiv… ▽ More

    Submitted 29 December, 2017; v1 submitted 8 November, 2017; originally announced November 2017.

  9. arXiv:1702.05464  [pdf, other

    cs.CV

    Adversarial Discriminative Domain Adaptation

    Authors: Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

    Abstract: Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They also can improve recognition despite the presence of domain shift or dataset bias: several adversarial approaches to unsupervised domain adaptation have recently been introduced, which reduce the difference between the training and test domain distri… ▽ More

    Submitted 17 February, 2017; originally announced February 2017.

  10. arXiv:1702.04680  [pdf, other

    cs.CV

    Visual Discovery at Pinterest

    Authors: Andrew Zhai, Dmitry Kislyuk, Yushi **g, Michael Feng, Eric Tzeng, Jeff Donahue, Yue Li Du, Trevor Darrell

    Abstract: Over the past three years Pinterest has experimented with several visual search and recommendation services, including Related Pins (2014), Similar Looks (2015), Flashlight (2016) and Lens (2017). This paper presents an overview of our visual discovery engine powering these services, and shares the rationales behind our technical and product decisions such as the use of object detection and intera… ▽ More

    Submitted 25 March, 2017; v1 submitted 15 February, 2017; originally announced February 2017.

  11. arXiv:1511.07111  [pdf, other

    cs.CV

    Adapting Deep Visuomotor Representations with Weak Pairwise Constraints

    Authors: Eric Tzeng, Coline Devin, Judy Hoffman, Chelsea Finn, Pieter Abbeel, Sergey Levine, Kate Saenko, Trevor Darrell

    Abstract: Real-world robotics problems often occur in domains that differ significantly from the robot's prior training environment. For many robotic control tasks, real world experience is expensive to obtain, but data is easy to collect in either an instrumented environment or in simulation. We propose a novel domain adaptation approach for robot perception that adapts visual representations learned on a… ▽ More

    Submitted 25 May, 2017; v1 submitted 23 November, 2015; originally announced November 2015.

  12. arXiv:1511.04003  [pdf, other

    cs.CV

    Human Curation and Convnets: Powering Item-to-Item Recommendations on Pinterest

    Authors: Dmitry Kislyuk, Yuchen Liu, David Liu, Eric Tzeng, Yushi **g

    Abstract: This paper presents Pinterest Related Pins, an item-to-item recommendation system that combines collaborative filtering with content-based ranking. We demonstrate that signals derived from user curation, the activity of users organizing content, are highly effective when used in conjunction with content-based ranking. This paper also demonstrates the effectiveness of visual features, such as image… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

  13. arXiv:1510.02192  [pdf, other

    cs.CV

    Simultaneous Deep Transfer Across Domains and Tasks

    Authors: Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko

    Abstract: Recent reports suggest that a generic supervised deep CNN model trained on a large-scale dataset reduces, but does not remove, dataset bias. Fine-tuning deep models in a new domain can require a significant amount of labeled data, which for many applications is simply not available. We propose a new CNN architecture to exploit unlabeled and sparsely labeled target domain data. Our approach simulta… ▽ More

    Submitted 7 October, 2015; originally announced October 2015.

  14. arXiv:1412.3474  [pdf, other

    cs.CV

    Deep Domain Confusion: Maximizing for Domain Invariance

    Authors: Eric Tzeng, Judy Hoffman, Ning Zhang, Kate Saenko, Trevor Darrell

    Abstract: Recent reports suggest that a generic supervised deep CNN model trained on a large-scale dataset reduces, but does not remove, dataset bias on a standard benchmark. Fine-tuning deep models in a new domain can require a significant amount of data, which for many applications is simply not available. We propose a new CNN architecture which introduces an adaptation layer and an additional domain conf… ▽ More

    Submitted 10 December, 2014; originally announced December 2014.

  15. arXiv:1407.5035  [pdf, other

    cs.CV

    LSDA: Large Scale Detection Through Adaptation

    Authors: Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff Donahue, Ross Girshick, Trevor Darrell, Kate Saenko

    Abstract: A major challenge in scaling object detection is the difficulty of obtaining labeled images for large numbers of categories. Recently, deep convolutional neural networks (CNNs) have emerged as clear winners on object classification benchmarks, in part due to training with 1.2M+ labeled classification images. Unfortunately, only a small fraction of those labels are available for the detection task.… ▽ More

    Submitted 31 October, 2014; v1 submitted 18 July, 2014; originally announced July 2014.

    Journal ref: Neural Information Processing Systems (NIPS) 2014

  16. arXiv:1312.6204  [pdf, other

    cs.CV cs.LG cs.NE

    One-Shot Adaptation of Supervised Deep Convolutional Models

    Authors: Judy Hoffman, Eric Tzeng, Jeff Donahue, Yangqing Jia, Kate Saenko, Trevor Darrell

    Abstract: Dataset bias remains a significant barrier towards solving real world computer vision tasks. Though deep convolutional networks have proven to be a competitive approach for image classification, a question remains: have these models have solved the dataset bias problem? In general, training or fine-tuning a state-of-the-art deep model on a new domain requires a significant amount of data, which fo… ▽ More

    Submitted 17 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Journal ref: ICLR Workshop 2014

  17. arXiv:1310.1531  [pdf, other

    cs.CV

    DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

    Authors: Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, Trevor Darrell

    Abstract: We evaluate whether features extracted from the activation of a deep convolutional network trained in a fully supervised fashion on a large, fixed set of object recognition tasks can be re-purposed to novel generic tasks. Our generic tasks may differ significantly from the originally trained tasks and there may be insufficient labeled or unlabeled data to conventionally train or adapt a deep archi… ▽ More

    Submitted 5 October, 2013; originally announced October 2013.