Showing 1–15 of 15 results for author: Seitz, S M

  1. arXiv:2405.08210  [pdf, other]

    cs.CV

    Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis

    Authors: Yifan Wang, Aleksander Holynski, Brian L. Curless, Steven M. Seitz

    Abstract: We present Infinite Texture, a method for generating arbitrarily large texture images from a text prompt. Our approach fine-tunes a diffusion model on a single texture, and learns to embed that statistical distribution in the output domain of the model. We seed this fine-tuning process with a sample texture patch, which can be optionally generated from a text-to-image model like DALL-E 2. At gener…

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2404.17104  [pdf, other]

    cs.HC cs.CV

    Don't Look at the Camera: Achieving Perceived Eye Contact

    Authors: Alice Gao, Samyukta Jayakumar, Marcello Maniglia, Brian Curless, Ira Kemelmacher-Shlizerman, Aaron R. Seitz, Steven M. Seitz

    Abstract: We consider the question of how to best achieve the perception of eye contact when a person is captured by camera and then rendered on a 2D display. For single subjects photographed by a camera, conventional wisdom tells us that looking directly into the camera achieves eye contact. Through empirical user studies, we show that it is instead preferable to look just below the camera lens. We q…

    Submitted 25 April, 2024; originally announced April 2024.

  3. arXiv:2308.14740  [pdf, other]

    cs.CV cs.GR cs.LG

    Total Selfie: Generating Full-Body Selfies

    Authors: Bowei Chen, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz

    Abstract: We present a method to generate full-body selfies from photographs originally taken at arm's length. Because self-captured photos are typically taken close up, they have limited field of view and exaggerated perspective that distorts facial shapes. We instead seek to generate the photo someone else would take of you from a few feet away. Our approach takes as input four selfies of your face and bo…

    Submitted 3 April, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Project page: https://homes.cs.washington.edu/~boweiche/project_page/totalselfie/

  4. ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement

    Authors: Ishan Chatterjee, Maruchi Kim, Vivek Jayaram, Shyamnath Gollakota, Ira Kemelmacher-Shlizerman, Shwetak Patel, Steven M. Seitz

    Abstract: We present ClearBuds, the first hardware and software system that utilizes a neural network to enhance speech streamed from two wireless earbuds. Real-time speech enhancement for wireless earbuds requires high-quality sound separation and background cancellation, operating in real-time and on a mobile phone. ClearBuds bridges state-of-the-art deep learning for blind audio source separation and in…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 12 pages, Published in Mobisys 2022

  5. arXiv:2111.07986  [pdf, other]

    cs.RO cs.LG eess.SY

    Nonprehensile Riemannian Motion Predictive Control

    Authors: Hamid Izadinia, Byron Boots, Steven M. Seitz

    Abstract: Nonprehensile manipulation involves long horizon underactuated object interactions and physical contact with different objects that can inherently introduce a high degree of uncertainty. In this work, we introduce a novel Real-to-Sim reward analysis technique, called Riemannian Motion Predictive Control (RMPC), to reliably imagine and predict the outcome of taking possible actions for a real robot…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: To appear at International Symposium on Experimental Robotics (ISER)

  6. arXiv:2106.13228  [pdf, other]

    cs.CV cs.GR

    HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields

    Authors: Keunhong Park, Utkarsh Sinha, Peter Hedman, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Ricardo Martin-Brualla, Steven M. Seitz

    Abstract: Neural Radiance Fields (NeRF) are able to reconstruct scenes with unprecedented fidelity, and various recent works have extended NeRF to handle dynamic scenes. A common approach to reconstruct such non-rigid scenes is through the use of a learned deformation field mapping from coordinates in each input image into a canonical template coordinate space. However, these deformation-based approaches st…

    Submitted 10 September, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: SIGGRAPH Asia 2021, Project page: https://hypernerf.github.io/

  7. arXiv:2103.16183  [pdf, other]

    cs.CV

    Repopulating Street Scenes

    Authors: Yifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely

    Abstract: We present a framework for automatically reconfiguring images of street scenes by populating, depopulating, or repopulating them with objects such as pedestrians or vehicles. Applications of this method include anonymizing images to enhance privacy, generating data augmentations for perception tasks like autonomous driving, and composing scenes to achieve a certain ambiance, such as empty streets…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  8. Time-Travel Rephotography

    Authors: Xuan Luo, Xuaner Zhang, Paul Yoo, Ricardo Martin-Brualla, Jason Lawrence, Steven M. Seitz

    Abstract: Many historical people were only ever captured by old, faded, black-and-white photos that are distorted due to the limitations of early cameras and the passage of time. This paper simulates traveling back in time with a modern camera to rephotograph famous subjects. Unlike conventional image restoration filters which apply independent operations like denoising, colorization, and superresolution,…

    Submitted 13 December, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: SIGGRAPH Asia 2021. Project Page: https://time-travel-rephotography.github.io Video: https://youtu.be/ceIopN2UZ_s

    Journal ref: ACM Transactions on Graphics. 40 (2021) 1-12

  9. arXiv:2011.15128  [pdf, other]

    cs.CV cs.GR

    Animating Pictures with Eulerian Motion Fields

    Authors: Aleksander Holynski, Brian Curless, Steven M. Seitz, Richard Szeliski

    Abstract: In this paper, we demonstrate a fully automatic method for converting a still image into a realistic animated looping video. We target scenes with continuous fluid motion, such as flowing water and billowing smoke. Our method relies on the observation that this type of natural motion can be convincingly reproduced from a static Eulerian motion description, i.e. a single, temporally constant flow f…

    Submitted 30 November, 2020; originally announced November 2020.

  10. arXiv:2011.12948  [pdf, other]

    cs.CV cs.GR

    Nerfies: Deformable Neural Radiance Fields

    Authors: Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M. Seitz, Ricardo Martin-Brualla

    Abstract: We present the first method capable of photorealistically reconstructing deformable scenes using photos/videos captured casually from mobile phones. Our approach augments neural radiance fields (NeRF) by optimizing an additional continuous volumetric deformation field that warps each observed point into a canonical 5D NeRF. We observe that these NeRF-like deformation fields are prone to local mini…

    Submitted 9 September, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: ICCV 2021, Project page with videos: https://nerfies.github.io/

  11. arXiv:1906.03539  [pdf, other]

    cs.CV

    Structure from Motion for Panorama-Style Videos

    Authors: Chris Sweeney, Aleksander Holynski, Brian Curless, Steve M Seitz

    Abstract: We present a novel Structure from Motion pipeline that is capable of reconstructing accurate camera poses for panorama-style video capture without prior camera intrinsic calibration. While panorama-style capture is common and convenient, previous reconstruction methods fail to obtain accurate reconstructions due to the rotation-dominant motion and small baseline between views. Our method is built…

    Submitted 8 June, 2019; originally announced June 2019.

  12. arXiv:1812.05583  [pdf, other]

    cs.CV cs.LG

    Scene Recomposition by Learning-based ICP

    Authors: Hamid Izadinia, Steven M. Seitz

    Abstract: By moving a depth sensor around a room, we compute a 3D CAD model of the environment, capturing the room shape and contents such as chairs, desks, sofas, and tables. Rather than reconstructing geometry, we match, place, and align each object in the scene to thousands of CAD models of objects. In addition to the fully automatic system, the key technical contribution is a novel approach for aligning…

    Submitted 7 April, 2020; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: To appear at CVPR 2020

  13. PhotoShape: Photorealistic Materials for Large-Scale Shape Collections

    Authors: Keunhong Park, Konstantinos Rematas, Ali Farhadi, Steven M. Seitz

    Abstract: Existing online 3D shape repositories contain thousands of 3D models but lack photorealistic appearance. We present an approach to automatically assign high-quality, realistic appearance models to large scale 3D shape collections. The key idea is to jointly leverage three types of online data -- shape collections, material collections, and photo collections, using the photos as reference to guide…

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: To be presented at SIGGRAPH Asia 2018. Project page: https://keunhong.com/publications/photoshape/

  14. arXiv:1608.05137  [pdf, other]

    cs.CV

    IM2CAD

    Authors: Hamid Izadinia, Qi Shan, Steven M. Seitz

    Abstract: Given a single photo of a room and a large database of furniture CAD models, our goal is to reconstruct a scene that is as similar as possible to the scene depicted in the photograph, and composed of objects drawn from the database. We present a completely automatic system to address this IM2CAD problem that produces high quality results on challenging imagery from interior home design and remodel…

    Submitted 23 April, 2017; v1 submitted 17 August, 2016; originally announced August 2016.

    Comments: To appear at CVPR 2017

  15. arXiv:1511.03019  [pdf, other]

    cs.CV

    3D Time-lapse Reconstruction from Internet Photos

    Authors: Ricardo Martin-Brualla, David Gallup, Steven M. Seitz

    Abstract: Given an Internet photo collection of a landmark, we compute a 3D time-lapse video sequence where a virtual camera moves continuously in time and space. While previous work assumed a static camera, the addition of camera motion during the time-lapse creates a very compelling impression of parallax. Achieving this goal, however, requires addressing multiple technical challenges, including solving f…

    Submitted 21 February, 2020; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: To appear in ICCV'15. Supplementary video at: http://grail.cs.washington.edu/projects/timelapse3d/