Web-Scale Training for Face Identification

Taigman, Yaniv; Yang, Ming; Ranzato, Marc'Aurelio; Wolf, Lior

Abstract: Scaling machine learning methods to massive datasets has attracted considerable attention in recent years, thanks to easy access to ubiquitous sensing and data from the web. Face recognition is a task of great practical interest for which (i) very large labeled datasets exist, containing billions of images; (ii) the number of classes can reach tens of millions or more; and (iii) complex features are necessary in order to encode subtle differences between subjects, while maintaining invariance to factors such as pose, illumination, and aging. We present an elaborate pipeline that consists of a crucial network compression step followed by a new bootstrapping scheme for selecting a challenging subset of the dataset for efficient training of a higher-capacity network. By using this approach, we are able to greatly improve face recognition accuracy on the widely used LFW benchmark. Moreover, as performance on supervised face verification (1:1) benchmarks saturates, we propose to shift the attention of the research community to the unsupervised Probe-Gallery (1:N) identification benchmarks. On this task, we bridge between the literature and the industry, for the first time, by directly comparing with the state-of-the-art Commercial Off-The-Shelf system, and we show a sizable leap in performance. Lastly, we demonstrate an intriguing trade-off between the number of training samples and the optimal size of the network.
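
To make the distinction between verification (1:1) and Probe-Gallery identification (1:N) concrete, the following is a minimal sketch and not the authors' method. It assumes each face has already been mapped to a fixed-length descriptor, and that descriptors are compared by cosine similarity. The names `cosine`, `verify`, `identify`, the threshold value, and the toy three-dimensional descriptors are all hypothetical choices made for illustration.

```python
import math

def cosine(a, b):
    """Cosine similarity between two descriptors (assumed comparison measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def verify(a, b, threshold=0.5):
    """1:1 verification: decide whether two descriptors show the same person.

    The threshold is an illustrative value, not one taken from the paper.
    """
    return cosine(a, b) >= threshold

def identify(probe, gallery):
    """1:N identification: return the gallery identity closest to the probe."""
    return max(gallery, key=lambda name: cosine(probe, gallery[name]))

# Toy gallery with one hypothetical descriptor per enrolled identity.
gallery = {
    "alice": [1.0, 0.0, 0.0],
    "bob": [0.0, 1.0, 0.0],
    "carol": [0.0, 0.0, 1.0],
}
probe = [0.9, 0.1, 0.0]

best_match = identify(probe, gallery)
same_as_alice = verify(probe, gallery["alice"])
same_as_bob = verify(probe, gallery["bob"])
```

In this sketch, verification answers a yes/no question about one pair, whereas identification must rank the probe against every gallery entry, so its difficulty grows with the size of the gallery.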

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1406.5266 [cs.CV]
	(or arXiv:1406.5266v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1406.5266
