Skip to main content

Showing 1–3 of 3 results for author: Gould, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.01518  [pdf, other

    cs.CV cs.LG eess.IV

    Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation

    Authors: Ming Xu, Stephen Gould

    Abstract: We propose a novel approach to the action segmentation task for long, untrimmed videos, based on solving an optimal transport problem. By encoding a temporal consistency prior into a Gromov-Wasserstein problem, we are able to decode a temporally consistent segmentation from a noisy affinity/matching cost matrix between video frames and action classes. Unlike previous approaches, our method does no… ▽ More

    Submitted 8 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024 (Oral)

  2. arXiv:2310.14468  [pdf, other

    cs.LG cs.RO eess.SY

    Revisiting Implicit Differentiation for Learning Problems in Optimal Control

    Authors: Ming Xu, Timothy Molloy, Stephen Gould

    Abstract: This paper proposes a new method for differentiating through optimal trajectories arising from non-convex, constrained discrete-time optimal control (COC) problems using the implicit function theorem (IFT). Previous works solve a differential Karush-Kuhn-Tucker (KKT) system for the trajectory derivative, and achieve this efficiently by solving an auxiliary Linear Quadratic Regulator (LQR) problem.… ▽ More

    Submitted 24 October, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023 (poster)

  3. arXiv:2212.03848  [pdf, other

    cs.CV cs.GR eess.IV

    NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing

    Authors: Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould

    Abstract: We present NeRFEditor, an efficient learning framework for 3D scene editing, which takes a video captured over 360° as input and outputs a high-quality, identity-preserving stylized 3D scene. Our method supports diverse types of editing such as guided by reference images, text prompts, and user interactions. We achieve this by encouraging a pre-trained StyleGAN model and a NeRF model to learn from… ▽ More

    Submitted 8 December, 2022; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: Project page: https://chuny1.github.io/NeRFEditor/nerfeditor.html