Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Kim, Jungtaek; Yoon, Jeongbeen; Cho, Minsu

Computer Science > Machine Learning

arXiv:2310.07174 (cs)

[Submitted on 11 Oct 2023 (v1), last revised 14 Mar 2024 (this version, v2)]

Title:Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Authors:Jungtaek Kim, Jeongbeen Yoon, Minsu Cho

View PDF HTML (experimental)

Abstract:Sorting is a fundamental operation of all computer systems, having been a long-standing significant research topic. Beyond the problem formulation of traditional sorting algorithms, we consider sorting problems for more abstract yet expressive inputs, e.g., multi-digit images and image fragments, through a neural sorting network. To learn a map** from a high-dimensional input to an ordinal variable, the differentiability of sorting networks needs to be guaranteed. In this paper we define a softening error by a differentiable swap function, and develop an error-free swap function that holds a non-decreasing condition and differentiability. Furthermore, a permutation-equivariant Transformer network with multi-head attention is adopted to capture dependency between given inputs and also leverage its model capacity with self-attention. Experiments on diverse sorting benchmarks show that our methods perform better than or comparable to baseline methods.

Comments:	Accepted at the 12th International Conference on Learning Representations (ICLR 2024)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2310.07174 [cs.LG]
	(or arXiv:2310.07174v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.07174

Submission history

From: Jungtaek Kim [view email]
[v1] Wed, 11 Oct 2023 03:47:34 UTC (1,079 KB)
[v2] Thu, 14 Mar 2024 00:39:43 UTC (1,147 KB)

Computer Science > Machine Learning

Title:Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators