Showing 1–19 of 19 results for author: Epstein, D

Searching in archive cs.
  1. arXiv:2402.16936

    cs.CV cs.LG

    Disentangled 3D Scene Generation with Layout Learning

    Authors: Dave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski

    Abstract: We introduce a method to generate 3D scenes that are disentangled into their component objects. This disentanglement is unsupervised, relying only on the knowledge of a large pretrained text-to-image model. Our key insight is that objects can be discovered by finding parts of a 3D scene that, when rearranged spatially, still produce valid configurations of the same scene. Concretely, our method jo…

    Submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2402.11353

    cs.HC cs.AI cs.CL

    Understanding the Impact of Long-Term Memory on Self-Disclosure with Large Language Model-Driven Chatbots for Public Health Intervention

    Authors: Eunkyung Jo, Yuin Jeong, SoHyun Park, Daniel A. Epstein, Young-Ho Kim

    Abstract: Recent large language models (LLMs) offer the potential to support public health monitoring by facilitating health disclosure through open-ended conversations but rarely preserve the knowledge gained about individuals across repeated interactions. Augmenting LLMs with long-term memory (LTM) presents an opportunity to improve engagement and self-disclosure, but we lack an understanding of how LTM i…

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: Accepted to ACM CHI 2024 as a full paper

    ACM Class: H.5.2; I.2.7

    Journal ref: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA. ACM, New York, NY, USA

  3. arXiv:2310.15150

    cs.CV cs.GR cs.LG

    Online Detection of AI-Generated Images

    Authors: David C. Epstein, Ishan Jain, Oliver Wang, Richard Zhang

    Abstract: With advancements in AI-generated images coming on a continuous basis, it is increasingly difficult to distinguish traditionally-sourced images (e.g., photos, artwork) from AI-generated ones. Previous detection methods study the generalization from a single generator to another in isolation. However, in reality, new generators are released on a streaming basis. We study generalization in this sett…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: ICCV DeepFake Analysis and Detection Workshop, 2023

  4. arXiv:2308.14411

    cs.HC

    Community College Articulation Agreement Websites: Students' Suggestions for New Academic Advising Software Features

    Authors: David V. Nguyen, Shayan Doroudi, Daniel A. Epstein

    Abstract: Articulation agreements provide more transparency about how community college courses will transfer and fulfill university requirements. However, the literature displays conflicting results on whether articulation agreements improve transfer-related outcomes; perhaps one contributor to these conflicting research results is the subpar user experience of articulation agreement reports and the websit…

    Submitted 30 April, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

  5. arXiv:2307.04500

    cs.HC

    Optimal Academic Plan Derived from Articulation Agreements: A Preliminary Experiment on Human-Generated and (Hypothetical) Algorithm-Generated Academic Plans

    Authors: David V. Nguyen, Shayan Doroudi, Daniel A. Epstein

    Abstract: Objective: Community college students typically submit transfer applications to multiple universities. However, each university may have differing lower-division major requirements in order to transfer. Accordingly, our study examined one pain point users may have with ASSIST, which is California's official statewide database of articulation agreements. That pain point is cross-referencing multipl…

    Submitted 30 April, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

  6. arXiv:2306.00986

    cs.CV cs.LG stat.ML

    Diffusion Self-Guidance for Controllable Image Generation

    Authors: Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander Holynski

    Abstract: Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that provides greater control over generated images by guiding the internal representations of diffusion models. We demonstrate that properties such as the shape, locati…

    Submitted 11 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Project page at https://dave.ml/selfguidance/

  7. Revisiting Piggyback Prototyping: Examining Benefits and Tradeoffs in Extending Existing Social Computing Systems

    Authors: Daniel A. Epstein, Fannie Liu, Andrés Monroy-Hernández, Dennis Wang

    Abstract: The CSCW community has a history of designing, implementing, and evaluating novel social interactions in technology, but the process requires significant technical effort for uncertain value. We discuss the opportunities and applications of "piggyback prototyping", building and evaluating new ideas for social computing on top of existing ones, expanding on its potential to contribute design recomm…

    Submitted 23 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: To appear at the 25th ACM Conference On Computer-Supported Cooperative Work And Social Computing (CSCW '22)

    ACM Class: H.5.3

  8. arXiv:2205.02837

    cs.CV

    BlobGAN: Spatially Disentangled Scene Representations

    Authors: Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros

    Abstract: We propose an unsupervised, mid-level representation for a generative model of scenes. The representation is mid-level in that it is neither per-pixel nor per-image; rather, scenes are modeled as a collection of spatial, depth-ordered "blobs" of features. Blobs are differentiably placed onto a feature grid that is decoded into an image by a generative adversarial network. Due to the spatial unifor…

    Submitted 29 July, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: ECCV 2022. Project webpage available at https://www.dave.ml/blobgan

  9. arXiv:2101.02337

    cs.CV cs.LG

    Learning Temporal Dynamics from Cycles in Narrated Video

    Authors: Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun

    Abstract: Learning to model how the world changes as time elapses has proven a challenging problem for the computer vision community. We propose a self-supervised solution to this problem using temporal cycle consistency jointly in vision and language, training on narrated video. Our model learns modality-agnostic functions to predict forward and backward in time, which must undo each other when composed. T…

    Submitted 12 September, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

    Comments: ICCV 2021

  10. arXiv:2012.04631

    cs.CL cs.CV cs.LG

    Globetrotter: Connecting Languages by Connecting Images

    Authors: Dídac Surís, Dave Epstein, Carl Vondrick

    Abstract: Machine translation between many languages at once is highly challenging, since training with ground truth requires supervision between all language pairs, which is difficult to obtain. Our key insight is that, while languages may vary drastically, the underlying visual appearance of the world remains consistent. We introduce a method that uses visual observations to bridge the gap between languag…

    Submitted 31 March, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: CVPR 2022 (Oral)

  11. arXiv:2006.15657

    cs.CV

    Learning Goals from Failure

    Authors: Dave Epstein, Carl Vondrick

    Abstract: We introduce a framework that predicts the goals behind observable human action in video. Motivated by evidence in developmental psychology, we leverage video of unintentional action to learn video representations of goals without direct supervision. Our approach models videos as contextual trajectories that represent both low-level motion and high-level action features. Experiments and visualizat…

    Submitted 12 December, 2020; v1 submitted 28 June, 2020; originally announced June 2020.

  12. arXiv:2004.03037

    eess.IV cs.CV

    Dense Steerable Filter CNNs for Exploiting Rotational Symmetry in Histology Images

    Authors: Simon Graham, David Epstein, Nasir Rajpoot

    Abstract: Histology images are inherently symmetric under rotation, where each orientation is equally as likely to appear. However, this rotational symmetry is not widely utilised as prior knowledge in modern Convolutional Neural Networks (CNNs), resulting in data hungry models that learn independent features at each orientation. Allowing CNNs to be rotation-equivariant removes the necessity to learn this s…

    Submitted 20 July, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  13. arXiv:1911.11237

    cs.CL cs.CV cs.LG

    Learning to Learn Words from Visual Scenes

    Authors: Dídac Surís, Dave Epstein, Heng Ji, Shih-Fu Chang, Carl Vondrick

    Abstract: Language acquisition is the process of learning words from the surrounding scene. We introduce a meta-learning framework that learns how to learn word representations from unconstrained scenes. We leverage the natural compositional structure of language to create training episodes that cause a meta-learner to learn strong policies for language acquisition. Experiments on two datasets show that our…

    Submitted 12 July, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 26 pages, 12 figures

    Journal ref: European Conference on Computer Vision (ECCV), 2020

  14. arXiv:1911.11206

    cs.CV cs.LG eess.IV

    Oops! Predicting Unintentional Action in Video

    Authors: Dave Epstein, Boyuan Chen, Carl Vondrick

    Abstract: From just a short glance at a video, we can often tell whether a person's action is intentional or not. Can we train a model to recognize this? We introduce a dataset of in-the-wild videos of unintentional action, as well as a suite of tasks for recognizing, localizing, and anticipating its onset. We train a supervised neural network as a baseline and analyze its performance compared to human cons…

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: 11 pages, 9 figures

  15. arXiv:1807.05620

    cs.CR cs.LG

    NEUZZ: Efficient Fuzzing with Neural Program Smoothing

    Authors: Dongdong She, Kexin Pei, Dave Epstein, Junfeng Yang, Baishakhi Ray, Suman Jana

    Abstract: Fuzzing has become the de facto standard technique for finding software vulnerabilities. However, even state-of-the-art fuzzers are not very efficient at finding hard-to-trigger software bugs. Most popular fuzzers use evolutionary guidance to generate inputs that can trigger different bugs. Such evolutionary algorithms, while fast and simple to implement, often get stuck in fruitless sequences of…

    Submitted 12 July, 2019; v1 submitted 15 July, 2018; originally announced July 2018.

    Comments: To appear in the 40th IEEE Symposium on Security and Privacy, May 20-22, 2019, San Francisco, CA, USA

  16. arXiv:1805.03699

    cs.CV

    Fast and Accurate Tumor Segmentation of Histology Images using Persistent Homology and Deep Convolutional Features

    Authors: Talha Qaiser, Yee-Wah Tsang, Daiki Taniyama, Naoya Sakamoto, Kazuaki Nakane, David Epstein, Nasir Rajpoot

    Abstract: Tumor segmentation in whole-slide images of histology slides is an important step towards computer-assisted diagnosis. In this work, we propose a tumor segmentation framework based on the novel concept of persistent homology profiles (PHPs). For a given image patch, the homology profiles are derived by efficient computation of persistent homology, which is an algebraic tool from homology theory. W…

    Submitted 9 May, 2018; originally announced May 2018.

  17. Micro-Net: A unified model for segmentation of various objects in microscopy images

    Authors: Shan E Ahmed Raza, Linda Cheung, Muhammad Shaban, Simon Graham, David Epstein, Stella Pelengaris, Michael Khan, Nasir M. Rajpoot

    Abstract: Object segmentation and structure localization are important steps in automated image analysis pipelines for microscopy images. We present a convolution neural network (CNN) based deep learning architecture for segmentation of objects in microscopy images. The proposed network can be used to segment cells, nuclei and glands in fluorescence microscopy and histology images after slight tuning of inp…

    Submitted 22 January, 2019; v1 submitted 22 April, 2018; originally announced April 2018.

    Journal ref: Medical Image Analysis. 52 (2019) 160-173

  18. arXiv:1801.07451

    cs.CV eess.IV q-bio.TO

    Novel digital tissue phenotypic signatures of distant metastasis in colorectal cancer

    Authors: Korsuk Sirinukunwattana, David Snead, David Epstein, Zia Aftab, Imaad Mujeeb, Yee Wah Tsang, Ian Cree, Nasir Rajpoot

    Abstract: Distant metastasis is the major cause of death in colorectal cancer (CRC). Patients at high risk of developing distant metastasis could benefit from appropriate adjuvant and follow-up treatments if stratified accurately at an early stage of the disease. Studies have increasingly recognized the role of diverse cellular components within the tumor microenvironment in the development and progression…

    Submitted 23 January, 2018; originally announced January 2018.

  19. arXiv:1703.08658

    cs.DS

    Maximizing the area of intersection of rectangles

    Authors: David B. A. Epstein, Mike Paterson

    Abstract: This paper attacks the following problem. We are given a large number $N$ of rectangles in the plane, each with horizontal and vertical sides, and also a number $r<N$. The given list of $N$ rectangles may contain duplicates. The problem is to find $r$ of these rectangles, such that, if they are discarded, then the intersection of the remaining $(N-r)$ rectangles has an intersection with as large a…

    Submitted 25 March, 2017; originally announced March 2017.

    Comments: 16 pages, 1 figure

    ACM Class: F.2.2