Skip to main content

Showing 1–15 of 15 results for author: Cosker, D

.
  1. arXiv:2312.00870  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing

    Authors: Balamurugan Thambiraja, Sadegh Aliakbarian, Darren Cosker, Justus Thies

    Abstract: We present 3DiFACE, a novel method for personalized speech-driven 3D facial animation and editing. While existing methods deterministically predict facial animations from speech, they overlook the inherent one-to-many relationship between speech and facial expressions, i.e., there are multiple reasonable facial expression animations matching an audio input. It is especially important in content cr… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Project page: https://balamuruganthambiraja.github.io/3DiFACE/

  2. arXiv:2308.11261  [pdf, other

    cs.CV

    HMD-NeMo: Online 3D Avatar Motion Generation From Sparse Observations

    Authors: Sadegh Aliakbarian, Fatemeh Saleh, David Collier, Pashmina Cameron, Darren Cosker

    Abstract: Generating both plausible and accurate full body avatar motion is the key to the quality of immersive experiences in mixed reality scenarios. Head-Mounted Devices (HMDs) typically only provide a few input signals, such as head and hands 6-DoF. Recently, different approaches achieved impressive performance in generating full body motion given only head and hands signal. However, to the best of our… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023

  3. arXiv:2304.06024  [pdf, other

    cs.CV cs.AI

    Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views

    Authors: Siwei Zhang, Qianli Ma, Yan Zhang, Sadegh Aliakbarian, Darren Cosker, Siyu Tang

    Abstract: Automatic perception of human behaviors during social interactions is crucial for AR/VR applications, and an essential component is estimation of plausible 3D human pose and shape of our social partners from the egocentric view. One of the biggest challenges of this task is severe body truncation due to close social distances in egocentric scenarios, which brings large pose ambiguities for unseen… ▽ More

    Submitted 16 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Camera ready version for ICCV 2023, appendix included

  4. arXiv:2301.00023  [pdf, other

    cs.CV

    Imitator: Personalized Speech-driven 3D Facial Animation

    Authors: Balamurugan Thambiraja, Ikhsanul Habibie, Sadegh Aliakbarian, Darren Cosker, Christian Theobalt, Justus Thies

    Abstract: Speech-driven 3D facial animation has been widely explored, with applications in gaming, character animation, virtual reality, and telepresence systems. State-of-the-art methods deform the face topology of the target actor to sync the input audio without considering the identity-specific speaking style and facial idiosyncrasies of the target actor, thus, resulting in unrealistic and inaccurate lip… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: https://youtu.be/JhXTdjiUCUw

  5. arXiv:2107.07330  [pdf, other

    cs.CV

    DynaDog+T: A Parametric Animal Model for Synthetic Canine Image Generation

    Authors: Jake Deane, Sinead Kearney, Kwang In Kim, Darren Cosker

    Abstract: Synthetic data is becoming increasingly common for training computer vision models for a variety of tasks. Notably, such data has been applied in tasks related to humans such as 3D pose estimation where data is either difficult to create or obtain in realistic settings. Comparatively, there has been less work into synthetic animal data and it's uses for training models. Consequently, we introduce… ▽ More

    Submitted 20 July, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: CV4Animals Workshop in CVPR 2021. Update to correct minor spelling and grammer mistakes in supplementary material

  6. arXiv:2107.00480  [pdf, other

    cs.GR cs.HC

    EmoGen: Quantifiable Emotion Generation and Analysis for Experimental Psychology

    Authors: Nadejda Roubtsova, Martin Parsons, Nicola Binetti, Isabelle Mareschal, Essi Viding, Darren Cosker

    Abstract: 3D facial modelling and animation in computer vision and graphics traditionally require either digital artist's skill or complex pipelines with objective-function-based solvers to fit models to motion capture. This inaccessibility of quality modelling to a non-expert is an impediment to effective quantitative study of facial stimuli in experimental psychology. The EmoGen methodology we present in… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

  7. arXiv:2004.07788  [pdf, other

    cs.CV

    RGBD-Dog: Predicting Canine Pose from RGBD Sensors

    Authors: Sinead Kearney, Wenbin Li, Martin Parsons, Kwang In Kim, Darren Cosker

    Abstract: The automatic extraction of animal \reb{3D} pose from images without markers is of interest in a range of scientific fields. Most work to date predicts animal pose from RGB images, based on 2D labelling of joint positions. However, due to the difficult nature of obtaining training data, no ground truth dataset of 3D animal motion is available to quantitatively evaluate these approaches. In additio… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: 18 pages, 16 figures, to be published in CVPR 2020

  8. arXiv:1806.02311  [pdf, other

    cs.CV cs.AI

    Unsupervised Attention-guided Image to Image Translation

    Authors: Youssef A. Mejjati, Christian Richardt, James Tompkin, Darren Cosker, Kwang In Kim

    Abstract: Current unsupervised image-to-image translation techniques struggle to focus their attention on individual objects without altering the background or the way multiple objects interact within a scene. Motivated by the important role of attention in human perception, we tackle this limitation by introducing unsupervised attention mechanisms that are jointly adversarialy trained with the generators a… ▽ More

    Submitted 8 November, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

    Journal ref: NIPS 2018

  9. Automatic Structural Scene Digitalization

    Authors: Rui Tang, Yuhan Wang, Darren Cosker, Wenbin Li

    Abstract: In this paper, we present an automatic system for the analysis and labeling of structural scenes, floor plan drawings in Computer-aided Design (CAD) format. The proposed system applies a fusion strategy to detect and recognize various components of CAD floor plans, such as walls, doors, windows and other ambiguous assets. Technically, a general rule-based filter parsing method is fist adopted to e… ▽ More

    Submitted 19 September, 2017; originally announced October 2017.

    Comments: paper submitted to PloS One

  10. arXiv:1704.05817  [pdf, other

    cs.CV

    Learn to Model Motion from Blurry Footages

    Authors: Wenbin Li, Da Chen, Zhihan Lv, Yan Yan, Darren Cosker

    Abstract: It is difficult to recover the motion field from a real-world footage given a mixture of camera shake and other photometric effects. In this paper we propose a hybrid framework by interleaving a Convolutional Neural Network (CNN) and a traditional optical flow energy. We first conduct a CNN architecture using a novel learnable directional filtering layer. Such layer encodes the angle and distance… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

    Comments: Preprint of our paper accepted by Pattern Recognition

  11. Interactive Removal and Ground Truth for Difficult Shadow Scenes

    Authors: Han Gong, Darren P. Cosker

    Abstract: A user-centric method for fast, interactive, robust and high-quality shadow removal is presented. Our algorithm can perform detection and removal in a range of difficult cases: such as highly textured and colored shadows. To perform detection an on-the-fly learning approach is adopted guided by two rough user inputs for the pixels of the shadow and the lit area. After detection, shadow removal is… ▽ More

    Submitted 2 August, 2016; originally announced August 2016.

    Comments: Accepted by JOSA A

  12. arXiv:1603.08124  [pdf, other

    cs.CV

    Video Interpolation using Optical Flow and Laplacian Smoothness

    Authors: Wenbin Li, Darren Cosker

    Abstract: Non-rigid video interpolation is a common computer vision task. In this paper we present an optical flow approach which adopts a Laplacian Cotangent Mesh constraint to enhance the local smoothness. Similar to Li et al., our approach adopts a mesh to the image with a resolution up to one vertex per pixel and uses angle constraints to ensure sensible local deformations between image pairs. The Lapla… ▽ More

    Submitted 26 March, 2016; originally announced March 2016.

  13. Nonrigid Optical Flow Ground Truth for Real-World Scenes with Time-Varying Shading Effects

    Authors: Wenbin Li, Darren Cosker, Zhihan Lv, Matthew Brown

    Abstract: In this paper we present a dense ground truth dataset of nonrigidly deforming real-world scenes. Our dataset contains both long and short video sequences, and enables the quantitatively evaluation for RGB based tracking and registration methods. To construct ground truth for the RGB sequences, we simultaneously capture Near-Infrared (NIR) image sequences where dense markers - visible only in NIR -… ▽ More

    Submitted 15 July, 2016; v1 submitted 26 March, 2016; originally announced March 2016.

    Comments: preprint of our paper accepted by RA-L'16

  14. arXiv:1603.02253  [pdf, other

    cs.CV

    Blur Robust Optical Flow using Motion Channel

    Authors: Wenbin Li, Yang Chen, JeeHang Lee, Gang Ren, Darren Cosker

    Abstract: It is hard to estimate optical flow given a realworld video sequence with camera shake and other motion blur. In this paper, we first investigate the blur parameterization for video footage using near linear motion elements. we then combine a commercial 3D pose sensor with an RGB camera, in order to film video footage of interest together with the camera motion. We illustrates that this additional… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

    Comments: Preprint of our paper accepted by Neurocomputing

  15. arXiv:1603.02252  [pdf, other

    cs.CV

    Drift Robust Non-rigid Optical Flow Enhancement for Long Sequences

    Authors: Wenbin Li, Darren Cosker, Matthew Brown

    Abstract: It is hard to densely track a nonrigid object in long term, which is a fundamental research issue in the computer vision community. This task often relies on estimating pairwise correspondences between images over time where the error is accumulated and leads to a drift issue. In this paper, we introduce a novel optimization framework with an Anchor Patch constraint. It is supposed to significantl… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

    Comments: Preprint version of our paper accepted by Journal of Intelligent and Fuzzy Systems