Computer Science > Computer Vision and Pattern Recognition
[Submitted on 20 Jun 2014 (this version), latest version 18 Apr 2015 (v2)]
Title:Web-Scale Training for Face Identification
View PDFAbstract:Scaling machine learning methods to massive datasets has attracted considerable attention in recent years, thanks to easy access to ubiquitous sensing and data from the web. Face recognition is a task of great practical interest for which (i) very large labeled datasets exist, containing billions of images; (ii) the number of classes can reach tens of millions or more; and (iii) complex features are necessary in order to encode subtle differences between subjects, while maintaining invariance to factors such as pose, illumination, and aging. We present an elaborate pipeline that consists of a crucial network compression step followed by a new bootstrap** scheme for selecting a challenging subset of the dataset for efficient training of a higher capacity network. By using this approach, we are able to greatly improve face recognition accuracy on the widely used LFW benchmark. Moreover, as performance on supervised face verification (1:1) benchmarks saturates, we propose to shift the attention of the research community to the unsupervised Probe-Gallery (1:N) identification benchmarks. On this task, we bridge between the literature and the industry, for the first time, by directly comparing with the state of the art Commercially-Off-The-Shelf system and show a sizable leap in performance. Lastly, we demonstrate an intriguing trade-off between the number of training samples and the optimal size of the network.
Submission history
From: Yaniv Taigman [view email][v1] Fri, 20 Jun 2014 02:51:31 UTC (515 KB)
[v2] Sat, 18 Apr 2015 09:18:19 UTC (1,512 KB)
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.