Skip to main content

Showing 1–6 of 6 results for author: Van Hoorick, B

.
  1. arXiv:2405.14868  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

    Authors: Basile Van Hoorick, Rundi Wu, Ege Ozguroglu, Kyle Sargent, Ruoshi Liu, Pavel Tokmakov, Achal Dave, Changxi Zheng, Carl Vondrick

    Abstract: Accurate reconstruction of complex dynamic scenes from just a single viewpoint continues to be a challenging task in computer vision. Current dynamic novel view synthesis methods typically require videos from many different camera viewpoints, necessitating careful recording setups, and significantly restricting their utility in the wild as well as in terms of embodied AI applications. In this pape… ▽ More

    Submitted 5 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted to ECCV 2024. Project webpage is available at: https://gcd.cs.columbia.edu/

  2. arXiv:2305.03052  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Tracking through Containers and Occluders in the Wild

    Authors: Basile Van Hoorick, Pavel Tokmakov, Simon Stent, Jie Li, Carl Vondrick

    Abstract: Tracking objects with persistence in cluttered and dynamic environments remains a difficult challenge for computer vision systems. In this paper, we introduce $\textbf{TCOW}$, a new benchmark and model for visual tracking through heavy occlusion and containment. We set up a task where the goal is to, given a video sequence, segment both the projected extent of the target object, as well as the sur… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted at CVPR 2023. Project webpage is available at: https://tcow.cs.columbia.edu/

  3. arXiv:2303.11328  [pdf, other

    cs.CV cs.GR cs.RO

    Zero-1-to-3: Zero-shot One Image to 3D Object

    Authors: Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, Carl Vondrick

    Abstract: We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an object given just a single RGB image. To perform novel view synthesis in this under-constrained setting, we capitalize on the geometric priors that large-scale diffusion models learn about natural images. Our conditional diffusion model uses a synthetic dataset to learn controls of the relative camera viewpoint, which al… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Website: https://zero123.cs.columbia.edu/

  4. arXiv:2204.10916  [pdf, other

    cs.CV cs.LG

    Revealing Occlusions with 4D Neural Fields

    Authors: Basile Van Hoorick, Purva Tendulkar, Didac Suris, Dennis Park, Simon Stent, Carl Vondrick

    Abstract: For computer vision systems to operate in dynamic situations, they need to be able to represent and reason about object permanence. We introduce a framework for learning to estimate 4D visual representations from monocular RGB-D, which is able to persist objects, even once they become obstructed by occlusions. Unlike traditional video representations, we encode point clouds into a continuous repre… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: CVPR 2022 (Oral)

  5. arXiv:2011.11831  [pdf, other

    cs.CV

    Dissecting Image Crops

    Authors: Basile Van Hoorick, Carl Vondrick

    Abstract: The elementary operation of crop** underpins nearly every computer vision system, ranging from data augmentation and translation invariance to computational photography and representation learning. This paper investigates the subtle traces introduced by this operation. For example, despite refinements to camera optics, lenses will leave behind certain clues, notably chromatic aberration and vign… ▽ More

    Submitted 5 September, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Updated smartphone datasets & table; some rewording

  6. arXiv:1912.10960  [pdf, other

    cs.CV

    Image Outpainting and Harmonization using Generative Adversarial Networks

    Authors: Basile Van Hoorick

    Abstract: Although the inherently ambiguous task of predicting what resides beyond all four edges of an image has rarely been explored before, we demonstrate that GANs hold powerful potential in producing reasonable extrapolations. Two outpainting methods are proposed that aim to instigate this line of research: the first approach uses a context encoder inspired by common inpainting architectures and paradi… ▽ More

    Submitted 15 February, 2020; v1 submitted 23 December, 2019; originally announced December 2019.