Instilling Inductive Biases with Subnetworks

Zhang, Enyan; Lepori, Michael A.; Pavlick, Ellie

arXiv:2310.10899 (cs)

[Submitted on 17 Oct 2023 (v1), last revised 1 Feb 2024 (this version, v2)]

Abstract: Despite the recent success of artificial neural networks on a variety of tasks, we have little knowledge or control over the exact solutions these models implement. Instilling inductive biases -- preferences for some solutions over others -- into these models is one promising path toward understanding and controlling their behavior. Much work has been done to study the inherent inductive biases of models and instill different inductive biases through hand-designed architectures or carefully curated training regimens. In this work, we explore a more mechanistic approach: Subtask Induction. Our method discovers a functional subnetwork that implements a particular subtask within a trained model and uses it to instill inductive biases towards solutions utilizing that subtask. Subtask Induction is flexible and efficient, and we demonstrate its effectiveness with two experiments. First, we show that Subtask Induction significantly reduces the amount of training data required for a model to adopt a specific, generalizable solution to a modular arithmetic task. Second, we demonstrate that Subtask Induction successfully induces a human-like shape bias while increasing data efficiency for convolutional and transformer-based image classification models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.10899 [cs.LG]
	(or arXiv:2310.10899v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.10899

Submission history

From: Enyan Zhang
[v1] Tue, 17 Oct 2023 00:12:19 UTC (2,018 KB)
[v2] Thu, 1 Feb 2024 00:05:51 UTC (2,447 KB)
