Hyperspherical Prototype Networks

Mettes, Pascal; van der Pol, Elise; Snoek, Cees G. M.

Computer Science > Machine Learning

arXiv:1901.10514 (cs)

[Submitted on 29 Jan 2019 (v1), last revised 25 Oct 2019 (this version, v3)]

Title:Hyperspherical Prototype Networks

Authors:Pascal Mettes, Elise van der Pol, Cees G. M. Snoek

View PDF

Abstract:This paper introduces hyperspherical prototype networks, which unify classification and regression with prototypes on hyperspherical output spaces. For classification, a common approach is to define prototypes as the mean output vector over training examples per class. Here, we propose to use hyperspheres as output spaces, with class prototypes defined a priori with large margin separation. We position prototypes through data-independent optimization, with an extension to incorporate priors from class semantics. By doing so, we do not require any prototype updating, we can handle any training size, and the output dimensionality is no longer constrained to the number of classes. Furthermore, we generalize to regression, by optimizing outputs as an interpolation between two prototypes on the hypersphere. Since both tasks are now defined by the same loss function, they can be jointly trained for multi-task problems. Experimentally, we show the benefit of hyperspherical prototype networks for classification, regression, and their combination over other prototype methods, softmax cross-entropy, and mean squared error approaches.

Comments:	NeurIPS 2019
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1901.10514 [cs.LG]
	(or arXiv:1901.10514v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.10514

Submission history

From: Pascal Mettes [view email]
[v1] Tue, 29 Jan 2019 20:05:23 UTC (1,514 KB)
[v2] Fri, 28 Jun 2019 10:42:30 UTC (1,180 KB)
[v3] Fri, 25 Oct 2019 09:17:20 UTC (1,172 KB)

Computer Science > Machine Learning

Title:Hyperspherical Prototype Networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hyperspherical Prototype Networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators