Deep Active Shape Model for Face Alignment and Pose Estimation in Mobile Environment

Fard, Ali Pourramezan; Abdollahi, Hojjat; Mahoor, Mohammad

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.00119v2 (cs)

[Submitted on 27 Feb 2021 (v1), revised 11 Mar 2021 (this version, v2), latest version 7 May 2021 (v3)]

Title:Deep Active Shape Model for Face Alignment and Pose Estimation in Mobile Environment

Authors:Ali Pourramezan Fard, Hojjat Abdollahi, Mohammad Mahoor

View PDF

Abstract:Active Shape Model (ASM) is a statistical model of object shapes that represents a target structure. ASM can guide machine learning algorithms to fit a set of points representing an object (e.g., face) onto an image. This paper presents a lightweight Convolutional Neural Network (CNN) architecture with a loss function being assisted by ASM for face alignment and estimating head pose in the wild. We use ASM to first guide the network towards learning the smoother distribution of the facial landmark points. Then, during the training process, inspired by the transfer learning, we gradually harden the regression problem and lead the network towards learning the original landmark points distribution. We define multi-tasks in our loss function that are responsible for detecting facial landmark points, as well as estimating face pose. Learning multiple correlated tasks simultaneously builds synergy and improves the performance of individual tasks. We compare the performance of our proposed CNN, ASMNet with MobileNetV2 (which is about 2 times bigger ASMNet) in both face alignment and pose estimation tasks. Experimental results on challenging datasets show that by using the proposed ASM assisted loss function, ASMNet performance is comparable with MobileNetV2 in face alignment task. Besides, for face pose estimation, ASMNet performs much better than MobileNetV2. Moreover, overall ASMNet achieves an acceptable performance for facial landmark points detection and pose estimation while having a significantly smaller number of parameters and floating-point operations comparing to many CNN-based proposed models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2103.00119 [cs.CV]
	(or arXiv:2103.00119v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.00119

Submission history

From: Ali Pourramezan Fard [view email]
[v1] Sat, 27 Feb 2021 03:46:54 UTC (15,247 KB)
[v2] Thu, 11 Mar 2021 18:40:12 UTC (10,465 KB)
[v3] Fri, 7 May 2021 17:44:58 UTC (10,060 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Active Shape Model for Face Alignment and Pose Estimation in Mobile Environment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Active Shape Model for Face Alignment and Pose Estimation in Mobile Environment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators