Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation

Wang, Yunhan; Shi, Xiangwei; De Mello, Shalini; Chang, Hyung **; Zhang, Xucong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.09593 (cs)

[Submitted on 18 Aug 2023]

Title:Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation

Authors:Yunhan Wang, Xiangwei Shi, Shalini De Mello, Hyung ** Chang, Xucong Zhang

View PDF

Abstract:With the rapid development of deep learning technology in the past decade, appearance-based gaze estimation has attracted great attention from both computer vision and human-computer interaction research communities. Fascinating methods were proposed with variant mechanisms including soft attention, hard attention, two-eye asymmetry, feature disentanglement, rotation consistency, and contrastive learning. Most of these methods take the single-face or multi-region as input, yet the basic architecture of gaze estimation has not been fully explored. In this paper, we reveal the fact that tuning a few simple parameters of a ResNet architecture can outperform most of the existing state-of-the-art methods for the gaze estimation task on three popular datasets. With our extensive experiments, we conclude that the stride number, input image resolution, and multi-region architecture are critical for the gaze estimation performance while their effectiveness dependent on the quality of the input face image. We obtain the state-of-the-art performances on three datasets with 3.64 on ETH-XGaze, 4.50 on MPIIFaceGaze, and 9.13 on Gaze360 degrees gaze estimation error by taking ResNet-50 as the backbone.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.09593 [cs.CV]
	(or arXiv:2308.09593v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.09593

Submission history

From: Yunhan Wang [view email]
[v1] Fri, 18 Aug 2023 14:41:51 UTC (32 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Investigation of Architectures and Receptive Fields for Appearance-based Gaze Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators