Weight Sharing is Crucial to Succesful Optimization

Shalev-Shwartz, Shai; Shamir, Ohad; Shammah, Shaked

Computer Science > Machine Learning

arXiv:1706.00687 (cs)

[Submitted on 2 Jun 2017]

Title:Weight Sharing is Crucial to Succesful Optimization

Authors:Shai Shalev-Shwartz, Ohad Shamir, Shaked Shammah

View PDF

Abstract:Exploiting the great expressive power of Deep Neural Network architectures, relies on the ability to train them. While current theoretical work provides, mostly, results showing the hardness of this task, empirical evidence usually differs from this line, with success stories in abundance. A strong position among empirically successful architectures is captured by networks where extensive weight sharing is used, either by Convolutional or Recurrent layers. Additionally, characterizing specific aspects of different tasks, making them "harder" or "easier", is an interesting direction explored both theoretically and empirically. We consider a family of ConvNet architectures, and prove that weight sharing can be crucial, from an optimization point of view. We explore different notions of the frequency, of the target function, proving necessity of the target function having some low frequency components. This necessity is not sufficient - only with weight sharing can it be exploited, thus theoretically separating architectures using it, from others which do not. Our theoretical results are aligned with empirical experiments in an even more general setting, suggesting viability of examination of the role played by interleaving those aspects in broader families of tasks.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1706.00687 [cs.LG]
	(or arXiv:1706.00687v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1706.00687

Submission history

From: Shaked Shammah [view email]
[v1] Fri, 2 Jun 2017 13:56:59 UTC (39 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2017-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Shai Shalev-Shwartz
Ohad Shamir
Shaked Shammah

export BibTeX citation

Computer Science > Machine Learning

Title:Weight Sharing is Crucial to Succesful Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Weight Sharing is Crucial to Succesful Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators