Reliable Clustering of Bernoulli Mixture Models

Najafi, Amir; Motahari, Abolfazl; Rabiee, Hamid R.

Computer Science > Machine Learning

arXiv:1710.02101v2 (cs)

[Submitted on 5 Oct 2017 (v1), revised 16 Dec 2018 (this version, v2), latest version 16 Jun 2019 (v3)]

Abstract: A Bernoulli Mixture Model (BMM) is a finite mixture of random binary vectors with independent Bernoulli dimensions. The problem of clustering BMM data arises in a variety of real-world applications, ranging from population genetics to activity analysis in social networks. In this paper, we analyze the information-theoretic PAC-learnability of BMMs when the number of clusters is unknown. In particular, we stipulate conditions on both the sample complexity and the dimension of the model that guarantee the Probably Approximately Correct (PAC)-clusterability of a given dataset. To the best of our knowledge, these findings are the first non-asymptotic (PAC) bounds on the sample complexity of learning BMMs.
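To make the BMM definition in the abstract concrete, the following is a minimal sketch of how data from such a model could be generated: a latent cluster is drawn from the mixing proportions, and each binary dimension is then drawn independently from that cluster's Bernoulli parameter. The function name `sample_bmm`, the mixing weights, and the per-dimension probabilities are illustrative assumptions and are not taken from the paper.

```python
import random


def sample_bmm(weights, probs, n, seed=0):
    """Draw n binary vectors from a Bernoulli mixture (illustrative sketch).

    weights: mixing proportions, one per cluster.
    probs:   probs[k][j] is P(x_j = 1 | cluster k).
    Returns (samples, labels), where labels are the latent cluster indices.
    """
    rng = random.Random(seed)
    clusters = list(range(len(weights)))
    samples, labels = [], []
    for _ in range(n):
        # Pick the latent cluster according to the mixing proportions.
        k = rng.choices(clusters, weights=weights)[0]
        # Given the cluster, every dimension is an independent Bernoulli draw.
        x = [1 if rng.random() < p else 0 for p in probs[k]]
        samples.append(x)
        labels.append(k)
    return samples, labels


# Hypothetical two-cluster model over four binary dimensions.
weights = [0.5, 0.5]
probs = [[0.9, 0.9, 0.1, 0.1],
         [0.1, 0.1, 0.9, 0.9]]
samples, labels = sample_bmm(weights, probs, n=1000)
```

A clustering algorithm sees only `samples`; the `labels` are what it tries to recover, and the abstract's question is how many samples and dimensions are needed for that recovery to be reliable.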

Comments: 24 pages
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as: arXiv:1710.02101 [cs.LG] (or arXiv:1710.02101v2 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.1710.02101

Submission history

From: Amir Najafi
[v1] Thu, 5 Oct 2017 16:22:27 UTC (95 KB)
[v2] Sun, 16 Dec 2018 19:35:33 UTC (147 KB)
[v3] Sun, 16 Jun 2019 04:55:27 UTC (92 KB)
