Structure from Local Optima: Learning Subspace Juntas via Higher Order PCA

Vempala, Santosh S.; Xiao, Ying

arXiv:1108.3329 (cs)

[Submitted on 16 Aug 2011 (v1), last revised 14 Apr 2012 (this version, v3)]

Abstract: We present a generalization of the well-known problem of learning k-juntas in R^n, and a novel tensor algorithm for unraveling the structure of high-dimensional distributions. Our algorithm can be viewed as a higher-order extension of Principal Component Analysis (PCA).
Our motivating problem is learning a labeling function in R^n, which is determined by an unknown k-dimensional subspace. This problem of learning a k-subspace junta is a common generalization of learning a k-junta (a function of k coordinates in R^n) and learning intersections of k halfspaces. In this context, we introduce an irrelevant noisy attributes model where the distribution over the "relevant" k-dimensional subspace is independent of the distribution over the (n-k)-dimensional "irrelevant" subspace orthogonal to it.
We give a spectral tensor algorithm which identifies the relevant subspace, and thereby learns k-subspace juntas under some additional assumptions. We do this by exploiting the structure of local optima of higher moment tensors over the unit sphere; for comparison, PCA finds the global optima of the second moment tensor (the covariance matrix). Our main result is that when the distribution in the irrelevant (n-k)-dimensional subspace is any Gaussian, the complexity of our algorithm is T(k, \epsilon) + poly(n), where T is the complexity of learning the concept in k dimensions, and the polynomial is a function of the k-dimensional concept class being learned. This substantially generalizes existing results on learning low-dimensional concepts.
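The idea of reading structure off the local optima of a higher moment can be illustrated with a toy sketch. This is not the algorithm of the paper; it is a minimal example under assumptions chosen for illustration: n = 3, k = 1, a hypothetical hidden unit direction v, a uniform +/-1 coordinate along v, and a standard Gaussian on the subspace orthogonal to v. The covariance is then the identity, so PCA cannot single out v, while the population fourth moment E[(x.u)^4] = 3 - 2 (u.v)^4 over unit vectors u is minimized exactly at u = +/-v. Plain projected gradient descent with restarts from the coordinate axes is swapped in as the optimizer, and all function names are illustrative.

```python
import math
import random


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def normalize(a):
    norm = math.sqrt(dot(a, a))
    return [ai / norm for ai in a]


def sample(v, rng):
    """One draw: a +/-1 coin along v, standard Gaussian orthogonal to v."""
    g = [rng.gauss(0.0, 1.0) for _ in v]
    s = rng.choice((-1.0, 1.0))
    shift = s - dot(g, v)  # replace the component of g along v by s
    return [gi + shift * vi for gi, vi in zip(g, v)]


def fourth_moment(xs, u):
    """Empirical fourth moment of the data along direction u."""
    return sum(dot(x, u) ** 4 for x in xs) / len(xs)


def descend(xs, u, step=0.05, iters=60):
    """Projected gradient descent on u -> mean((x.u)^4) over the unit sphere."""
    n = len(u)
    for _ in range(iters):
        grad = [0.0] * n
        for x in xs:
            c = 4.0 * dot(x, u) ** 3
            for i in range(n):
                grad[i] += c * x[i]
        grad = [gi / len(xs) for gi in grad]
        radial = dot(grad, u)
        # Keep only the part of the gradient tangent to the sphere at u.
        tangent = [gi - radial * ui for gi, ui in zip(grad, u)]
        u = normalize([ui - step * ti for ui, ti in zip(u, tangent)])
    return u


rng = random.Random(0)
v = [2 / 7, 3 / 7, 6 / 7]  # hypothetical hidden relevant direction (unit norm)
xs = [sample(v, rng) for _ in range(2000)]

# Restart from each coordinate axis and keep the lowest fourth moment found.
starts = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
best = min((descend(xs, u0) for u0 in starts),
           key=lambda u: fourth_moment(xs, u))
alignment = abs(dot(best, v))
print(f"fourth moment {fourth_moment(xs, best):.2f}, |<u, v>| = {alignment:.3f}")
```

In this toy setting the recovered direction should align closely with v up to sign. Restarts that begin nearly inside the Gaussian subspace may stall there, which is why the sketch keeps the candidate with the smallest fourth moment rather than trusting a single run.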

Subjects:	Computational Complexity (cs.CC); Optimization and Control (math.OC); Probability (math.PR)
MSC classes:	68Q32, 15A69, 90C26
ACM classes:	F.2; G.3
Cite as:	arXiv:1108.3329 [cs.CC]
	(or arXiv:1108.3329v3 [cs.CC] for this version)
	https://doi.org/10.48550/arXiv.1108.3329

Submission history

From: Santosh Vempala
[v1] Tue, 16 Aug 2011 19:50:06 UTC (48 KB)
[v2] Sat, 5 Nov 2011 13:22:39 UTC (42 KB)
[v3] Sat, 14 Apr 2012 02:33:56 UTC (44 KB)
