Skip to main content

Showing 1–8 of 8 results for author: Gomes, H M

Searching in archive cs. Search in all archives.
.
  1. Look At Me, No Replay! SurpriseNet: Anomaly Detection Inspired Class Incremental Learning

    Authors: Anton Lee, Yaqian Zhang, Heitor Murilo Gomes, Albert Bifet, Bernhard Pfahringer

    Abstract: Continual learning aims to create artificial neural networks capable of accumulating knowledge and skills through incremental training on a sequence of tasks. The main challenge of continual learning is catastrophic interference, wherein new knowledge overrides or interferes with past knowledge, leading to forgetting. An associated issue is the problem of learning "cross-task knowledge," where mod… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 32nd ACM international conference on information and knowledge management, CIKM 2023, birmingham, united kingdom, october 21-25, 2023

  2. Advances on Concept Drift Detection in Regression Tasks using Social Networks Theory

    Authors: Jean Paul Barddal, Heitor Murilo Gomes, Fabrício Enembreck

    Abstract: Mining data streams is one of the main studies in machine learning area due to its application in many knowledge areas. One of the major challenges on mining data streams is concept drift, which requires the learner to discard the current concept and adapt to a new one. Ensemble-based drift detection algorithms have been used successfully to the classification task but usually maintain a fixed siz… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  3. Fast & Furious: Modelling Malware Detection as Evolving Data Streams

    Authors: Fabrício Ceschin, Marcus Botacin, Heitor Murilo Gomes, Felipe Pinagé, Luiz S. Oliveira, André Grégio

    Abstract: Malware is a major threat to computer systems and imposes many challenges to cyber security. Targeted threats, such as ransomware, cause millions of dollars in losses every year. The constant increase of malware infections has been motivating popular antiviruses (AVs) to develop dedicated detection strategies, which include meticulously crafted machine learning (ML) pipelines. However, malware dev… ▽ More

    Submitted 15 August, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  4. arXiv:2106.09170  [pdf, other

    cs.LG

    A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams

    Authors: Heitor Murilo Gomes, Maciej Grzenda, Rodrigo Mello, Jesse Read, Minh Huong Le Nguyen, Albert Bifet

    Abstract: Unlabelled data appear in many domains and are particularly relevant to streaming applications, where even though data is abundant, labelled data is rare. To address the learning problems associated with such data, one can ignore the unlabelled data and focus only on the labelled data (supervised learning); use the labelled data and attempt to leverage the unlabelled data (semi-supervised learning… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  5. arXiv:2103.00903  [pdf, other

    cs.LG stat.ML

    STUDD: A Student-Teacher Method for Unsupervised Concept Drift Detection

    Authors: Vitor Cerqueira, Heitor Murilo Gomes, Albert Bifet, Luis Torgo

    Abstract: Concept drift detection is a crucial task in data stream evolving environments. Most of state of the art approaches designed to tackle this problem monitor the loss of predictive models. However, this approach falls short in many real-world scenarios, where the true labels are not readily available to compute the loss. In this context, there is increasing attention to approaches that perform conce… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: 23 pages, single column

  6. arXiv:2012.04740  [pdf, ps, other

    cs.LG cs.AI cs.MS

    River: machine learning for streaming data in Python

    Authors: Jacob Montiel, Max Halford, Saulo Martiello Mastelini, Geoffrey Bolmier, Raphael Sourty, Robin Vaysse, Adil Zouitine, Heitor Murilo Gomes, Jesse Read, Talel Abdessalem, Albert Bifet

    Abstract: River is a machine learning library for dynamic data streams and continual learning. It provides multiple state-of-the-art learning methods, data generators/transformers, performance metrics and evaluators for different stream learning problems. It is the result from the merger of the two most popular packages for stream learning in Python: Creme and scikit-multiflow. River introduces a revamped a… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: Submitted to JMLR MLOSS

    MSC Class: 68-04 ACM Class: I.2; I.2.5

  7. arXiv:2010.16045  [pdf, other

    cs.CR cs.LG

    Machine Learning (In) Security: A Stream of Problems

    Authors: Fabrício Ceschin, Marcus Botacin, Albert Bifet, Bernhard Pfahringer, Luiz S. Oliveira, Heitor Murilo Gomes, André Grégio

    Abstract: Machine Learning (ML) has been widely applied to cybersecurity and is considered state-of-the-art for solving many of the open issues in that field. However, it is very difficult to evaluate how good the produced solutions are, since the challenges faced in security may not appear in other areas. One of these challenges is the concept drift, which increases the existing arms race between attackers… ▽ More

    Submitted 4 September, 2023; v1 submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Digital Threats 2023

  8. arXiv:2010.10935  [pdf, other

    cs.LG stat.ML

    An Eager Splitting Strategy for Online Decision Trees

    Authors: Chaitanya Manapragada, Heitor M Gomes, Mahsa Salehi, Albert Bifet, Geoffrey I Webb

    Abstract: Decision tree ensembles are widely used in practice. In this work, we study in ensemble settings the effectiveness of replacing the split strategy for the state-of-the-art online tree learner, Hoeffding Tree, with a rigorous but more eager splitting strategy that we had previously published as Hoeffding AnyTime Tree. Hoeffding AnyTime Tree (HATT), uses the Hoeffding Test to determine whether the c… ▽ More

    Submitted 31 July, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: text overlap with arXiv:2010.08199