Skip to main content

Showing 1–5 of 5 results for author: Veroneze, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2003.04726  [pdf, other

    cs.LG stat.ML

    New advances in enumerative biclustering algorithms with online partitioning

    Authors: Rosana Veroneze, Fernando J. Von Zuben

    Abstract: This paper further extends RIn-Close_CVC, a biclustering algorithm capable of performing an efficient, complete, correct and non-redundant enumeration of maximal biclusters with constant values on columns in numerical datasets. By avoiding a priori partitioning and itemization of the dataset, RIn-Close_CVC implements an online partitioning, which is demonstrated here to guide to more informative b… ▽ More

    Submitted 7 March, 2020; originally announced March 2020.

    Comments: This report unifies the proposals of two previous reports ('Efficient mining of maximal biclusters in mixed-attribute datasets' and 'RIn-Close_CVC2: an even more efficient enumerative algorithm for biclustering of numerical datasets') and brings some new novelties too. arXiv admin note: substantial text overlap with arXiv:1810.07725

  2. arXiv:1810.07725  [pdf, other

    cs.LG stat.ML

    RIn-Close_CVC2: an even more efficient enumerative algorithm for biclustering of numerical datasets

    Authors: Rosana Veroneze, Fernando J. Von Zuben

    Abstract: RIn-Close_CVC is an efficient (take polynomial time per bicluster), complete (find all maximal biclusters), correct (all biclusters attend the user-defined level of consistency) and non-redundant (all the obtained biclusters are maximal and the same bicluster is not enumerated more than once) enumerative algorithm for mining maximal biclusters with constant values on columns in numerical datasets.… ▽ More

    Submitted 17 October, 2018; originally announced October 2018.

  3. arXiv:1710.03289  [pdf, other

    cs.DB

    Efficient mining of maximal biclusters in mixed-attribute datasets

    Authors: Rosana Veroneze, Fernando J. Von Zuben

    Abstract: This paper presents a novel enumerative biclustering algorithm to directly mine all maximal biclusters in mixed-attribute datasets (containing both numerical and categorical attributes), with or without missing values. The proposal is an extension of RIn-Close_CVC, which was originally conceived to mine perfect or perturbed biclusters with constant values on columns solely from numerical datasets,… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

  4. arXiv:1506.01077  [pdf, other

    cs.LG

    On bicluster aggregation and its benefits for enumerative solutions

    Authors: Saullo Haniell Galvão de Oliveira, Rosana Veroneze, Fernando José Von Zuben

    Abstract: Biclustering involves the simultaneous clustering of objects and their attributes, thus defining local two-way clustering models. Recently, efficient algorithms were conceived to enumerate all biclusters in real-valued datasets. In this case, the solution composes a complete set of maximal and non-redundant biclusters. However, the ability to enumerate biclusters revealed a challenging scenario: i… ▽ More

    Submitted 2 June, 2015; originally announced June 2015.

    Comments: 15 pages, will be published by Springer Verlag in the LNAI Series in the book Advances in Data Mining

  5. arXiv:1403.3562  [pdf, other

    cs.DM

    Enumerating all maximal biclusters in numerical datasets

    Authors: Rosana Veroneze, Arindam Banerjee, Fernando J. Von Zuben

    Abstract: Biclustering has proved to be a powerful data analysis technique due to its wide success in various application domains. However, the existing literature presents efficient solutions only for enumerating maximal biclusters with constant values, or heuristic-based approaches which can not find all biclusters or even support the maximality of the obtained biclusters. Here, we present a general famil… ▽ More

    Submitted 23 July, 2015; v1 submitted 14 March, 2014; originally announced March 2014.