Skip to main content

Showing 1–1 of 1 results for author: Tsalamanis, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:1901.09108  [pdf, ps, other

    stat.ML cs.LG

    Subspace Clustering of Very Sparse High-Dimensional Data

    Authors: Hankui Peng, Nicos Pavlidis, Idris Eckley, Ioannis Tsalamanis

    Abstract: In this paper we consider the problem of clustering collections of very short texts using subspace clustering. This problem arises in many applications such as product categorisation, fraud detection, and sentiment analysis. The main challenge lies in the fact that the vectorial representation of short texts is both high-dimensional, due to the large number of unique terms in the corpus, and extre… ▽ More

    Submitted 25 January, 2019; originally announced January 2019.

    Comments: 2018 IEEE International Conference on Big Data