Skip to main content

Showing 1–7 of 7 results for author: Cashman, T

.
  1. arXiv:2204.02776  [pdf, other

    cs.CV

    3D face reconstruction with dense landmarks

    Authors: Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, **g**g Shen, Nikola Milosavljevic, Daniel Wilde, Stephan Garbin, Chirag Raman, Jamie Shotton, Toby Sharp, Ivan Stojiljkovic, Tom Cashman, Julien Valentin

    Abstract: Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering. Can we keep things simple by just using more landmarks? In answer, we present the first m… ▽ More

    Submitted 20 July, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: ECCV 2022

  2. arXiv:2203.05789  [pdf, other

    cs.CV cs.LG

    FLAG: Flow-based 3D Avatar Generation from Sparse Observations

    Authors: Sadegh Aliakbarian, Pashmina Cameron, Federica Bogo, Andrew Fitzgibbon, Thomas J. Cashman

    Abstract: To represent people in mixed reality applications for collaboration and communication, we need to generate realistic and faithful avatar poses. However, the signal streams that can be applied for this task from head-mounted devices (HMDs) are typically limited to head pose and hand pose estimates. While these signals are valuable, they are an incomplete representation of the human body, making it… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  3. arXiv:2109.15102  [pdf, other

    cs.CV

    Fake It Till You Make It: Face analysis in the wild using synthetic data alone

    Authors: Erroll Wood, Tadas Baltrušaitis, Charlie Hewitt, Sebastian Dziadzio, Matthew Johnson, Virginia Estellers, Thomas J. Cashman, Jamie Shotton

    Abstract: We demonstrate that it is possible to perform face-related computer vision in the wild using synthetic data alone. The community has long enjoyed the benefits of synthesizing training data with graphics, but the domain gap between real and synthetic data has remained a problem, especially for human faces. Researchers have tried to bridge this gap with data mixing, domain adaptation, and domain-adv… ▽ More

    Submitted 5 October, 2021; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: ICCV 2021. Amended acknowledgements

  4. arXiv:2008.11239  [pdf, other

    cs.CV

    HoloLens 2 Research Mode as a Tool for Computer Vision Research

    Authors: Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

    Abstract: Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research. In this technical report, we present HoloLens 2 Research Mode, an API and a set of tools enabling access to the raw sensor streams. We provide an overview of the API and explain how it can be used to build mixed… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

  5. arXiv:2007.08364  [pdf, other

    cs.CV cs.LG

    A high fidelity synthetic face framework for computer vision

    Authors: Tadas Baltrusaitis, Erroll Wood, Virginia Estellers, Charlie Hewitt, Sebastian Dziadzio, Marek Kowalski, Matthew Johnson, Thomas J. Cashman, Jamie Shotton

    Abstract: Analysis of faces is one of the core applications of computer vision, with tasks ranging from landmark alignment, head pose estimation, expression recognition, and face recognition among others. However, building reliable methods requires time-consuming data collection and often even more time-consuming manual annotation, which can be unreliable. In our work we propose synthesizing such facial dat… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

  6. arXiv:2007.04940  [pdf, other

    cs.CV

    The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization

    Authors: **g**g Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew William Fitzgibbon, Jamie Shotton

    Abstract: Realtime perceptual and interaction capabilities in mixed reality require a range of 3D tracking problems to be solved at low latency on resource-constrained hardware such as head-mounted devices. Indeed, for devices such as HoloLens 2 where the CPU and GPU are left available for applications, multiple tracking subsystems are required to run on a continuous, real-time basis while sharing a single… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Journal ref: ECCV2020

  7. arXiv:1802.03773  [pdf, other

    math.NA cs.MS

    QRkit: Sparse, Composable QR Decompositions for Efficient and Stable Solutions to Problems in Computer Vision

    Authors: Jan Svoboda, Thomas Cashman, Andrew Fitzgibbon

    Abstract: Embedded computer vision applications increasingly require the speed and power benefits of single-precision (32 bit) floating point. However, applications which make use of Levenberg-like optimization can lose significant accuracy when reducing to single precision, sometimes unrecoverably so. This accuracy can be regained using solvers based on QR rather than Cholesky decomposition, but the absenc… ▽ More

    Submitted 11 February, 2018; originally announced February 2018.