Showing 1–27 of 27 results for author: Pfahringer, B

  1. Look At Me, No Replay! SurpriseNet: Anomaly Detection Inspired Class Incremental Learning

    Authors: Anton Lee, Yaqian Zhang, Heitor Murilo Gomes, Albert Bifet, Bernhard Pfahringer

    Abstract: Continual learning aims to create artificial neural networks capable of accumulating knowledge and skills through incremental training on a sequence of tasks. The main challenge of continual learning is catastrophic interference, wherein new knowledge overrides or interferes with past knowledge, leading to forgetting. An associated issue is the problem of learning "cross-task knowledge," where mod…

    Submitted 30 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023

  2. arXiv:2209.13917  [pdf, other]

    cs.LG cs.AI

    A simple but strong baseline for online continual learning: Repeated Augmented Rehearsal

    Authors: Yaqian Zhang, Bernhard Pfahringer, Eibe Frank, Albert Bifet, Nick Jin Sean Lim, Yunzhe Jia

    Abstract: Online continual learning (OCL) aims to train neural networks incrementally from a non-stationary data stream with a single pass through data. Rehearsal-based methods attempt to approximate the observed input distributions over time with a small memory and revisit them later to avoid forgetting. Despite its strong empirical performance, rehearsal methods still suffer from a poor approximation of t…

    Submitted 13 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  3. arXiv:2205.05831  [pdf, other]

    cs.CV cs.LG

    Feature Extractor Stacking for Cross-domain Few-shot Learning

    Authors: Hongyu Wang, Eibe Frank, Bernhard Pfahringer, Michael Mayo, Geoffrey Holmes

    Abstract: Cross-domain few-shot learning (CDFSL) addresses learning problems where knowledge needs to be transferred from one or more source domains into an instance-scarce target domain with an explicitly different distribution. Recently published CDFSL methods generally construct a universal model that combines knowledge of multiple source domains into one feature extractor. This enables efficient inferen…

    Submitted 24 October, 2023; v1 submitted 11 May, 2022; originally announced May 2022.

  4. arXiv:2201.06205  [pdf, other]

    cs.LG

    Balancing Performance and Energy Consumption of Bagging Ensembles for the Classification of Data Streams in Edge Computing

    Authors: Guilherme Cassales, Heitor Gomes, Albert Bifet, Bernhard Pfahringer, Hermes Senger

    Abstract: In recent years, the Edge Computing (EC) paradigm has emerged as an enabling factor for developing technologies like the Internet of Things (IoT) and 5G networks, bridging the gap between Cloud Computing services and end-users, supporting low latency, mobility, and location awareness to delay-sensitive applications. Most solutions in EC employ machine learning (ML) methods to perform data classifi…

    Submitted 16 January, 2022; originally announced January 2022.

    Comments: 18 pages. arXiv admin note: text overlap with arXiv:2112.09834

  5. Improving the performance of bagging ensembles for data streams through mini-batching

    Authors: Guilherme Cassales, Heitor Gomes, Albert Bifet, Bernhard Pfahringer, Hermes Senger

    Abstract: Often, machine learning applications have to cope with dynamic environments where data are collected in the form of continuous data streams with potentially infinite length and transient behavior. Compared to traditional (batch) data mining, stream processing algorithms have additional requirements regarding computational resources and adaptability to data evolution. They must process instances in…

    Submitted 17 December, 2021; originally announced December 2021.

    Journal ref: Information Sciences, Volume 580, 2021, Pages 260-282

  6. arXiv:2112.01718  [pdf, other]

    cs.LG cs.AI

    Improving Predictions of Tail-end Labels using Concatenated BioMed-Transformers for Long Medical Documents

    Authors: Vithya Yogarajan, Bernhard Pfahringer, Tony Smith, Jacob Montiel

    Abstract: Multi-label learning predicts a subset of labels from a given label set for an unseen instance while considering label correlations. A known challenge with multi-label classification is the long-tailed distribution of labels. Many studies focus on improving the overall predictions of the model and thus do not prioritise tail-end labels. Improving the tail-end label predictions in multi-label class…

    Submitted 3 December, 2021; originally announced December 2021.

  7. Predicting COVID-19 Patient Shielding: A Comprehensive Study

    Authors: Vithya Yogarajan, Jacob Montiel, Tony Smith, Bernhard Pfahringer

    Abstract: There are many ways machine learning and big data analytics are used in the fight against the COVID-19 pandemic, including predictions, risk management, diagnostics, and prevention. This study focuses on predicting COVID-19 patient shielding -- identifying and protecting patients who are clinically extremely vulnerable from coronavirus. This study focuses on techniques used for the multi-label cla…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: Accepted in AJCAI 2021

    Journal ref: The 2021 Australasian Joint Conference on Artificial Intelligence (AJCAI 2021)

  8. Semi-Supervised Learning using Siamese Networks

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Neural networks have been successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples. These models, however, are more difficult to train successfully for semi-supervised problems where small amounts of labeled instances are available along with a large number of unlabeled instances. This work explores a new training method for sem…

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link of GitHub repository

    Journal ref: In: AI 2019: Advances in Artificial Intelligence. AI 2019. Lecture Notes in Computer Science, vol 11919. Springer, Cham

  9. Transfer of Pretrained Model Weights Substantially Improves Semi-Supervised Image Classification

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Deep neural networks produce state-of-the-art results when trained on a large number of labeled examples but tend to overfit when small amounts of labeled examples are used for training. Creating a large number of labeled examples requires considerable resources, time, and effort. If labeling new data is not feasible, so-called semi-supervised learning can achieve better generalisation than purely…

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link to code and data repo

    Journal ref: In: AI 2020: Advances in Artificial Intelligence. AI 2020. Lecture Notes in Computer Science, vol 12576. Springer, Cham

  10. arXiv:2109.00778  [pdf, other]

    cs.CV

    Better Self-training for Image Classification through Self-supervision

    Authors: Attaullah Sahito, Eibe Frank, Bernhard Pfahringer

    Abstract: Self-training is a simple semi-supervised learning approach: Unlabelled examples that attract high-confidence predictions are labelled with their predictions and added to the training set, with this process being repeated multiple times. Recently, self-supervision -- learning without manual supervision by solving an automatically-generated pretext task -- has gained prominence in deep learning. Th…

    Submitted 9 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: added link to code repository

  11. arXiv:2101.10448  [pdf, other]

    cs.CL

    PolyLM: Learning about Polysemy through Language Modeling

    Authors: Alan Ansell, Felipe Bravo-Marquez, Bernhard Pfahringer

    Abstract: To avoid the "meaning conflation deficiency" of word embeddings, a number of models have aimed to embed individual word senses. These methods at one time performed well on tasks such as word sense induction (WSI), but they have since been overtaken by task-specific techniques which exploit contextualized embeddings. However, sense embeddings and contextualization need not be mutually exclusive. We…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: EACL 2021

  12. arXiv:2010.16045  [pdf, other]

    cs.CR cs.LG

    Machine Learning (In)Security: A Stream of Problems

    Authors: Fabrício Ceschin, Marcus Botacin, Albert Bifet, Bernhard Pfahringer, Luiz S. Oliveira, Heitor Murilo Gomes, André Grégio

    Abstract: Machine Learning (ML) has been widely applied to cybersecurity and is considered state-of-the-art for solving many of the open issues in that field. However, it is very difficult to evaluate how good the produced solutions are, since the challenges faced in security may not appear in other areas. One of these challenges is the concept drift, which increases the existing arms race between attackers…

    Submitted 4 September, 2023; v1 submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Digital Threats 2023

  13. arXiv:2005.07353  [pdf, ps, other]

    cs.LG stat.ML

    Adaptive XGBoost for Evolving Data Streams

    Authors: Jacob Montiel, Rory Mitchell, Eibe Frank, Bernhard Pfahringer, Talel Abdessalem, Albert Bifet

    Abstract: Boosting is an ensemble method that combines base models in a sequential manner to achieve high predictive accuracy. A popular learning algorithm based on this ensemble method is eXtreme Gradient Boosting (XGB). We present an adaptation of XGB for classification of evolving data streams. In this setting, new data arrives over time and the relationship between the class and the features may change…

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: To be published in Proceedings of the International Joint Conference on Neural Networks (IJCNN) 2020, 8 pages

  14. arXiv:2004.00430  [pdf, other]

    cs.IR cs.LG stat.ML

    Seeing The Whole Patient: Using Multi-Label Medical Text Classification Techniques to Enhance Predictions of Medical Codes

    Authors: Vithya Yogarajan, Jacob Montiel, Tony Smith, Bernhard Pfahringer

    Abstract: Machine learning-based multi-label medical text classifications can be used to enhance the understanding of the human body and aid the need for patient care. We present a broad study on clinical natural language processing techniques to maximise a feature representing text when predicting medical codes on patients with multi-morbidity. We present results of multi-label medical text classification…

    Submitted 28 March, 2020; originally announced April 2020.

  15. arXiv:1912.13405  [pdf, other]

    cs.LG cs.AI stat.ML

    Classifier Chains: A Review and Perspectives

    Authors: Jesse Read, Bernhard Pfahringer, Geoff Holmes, Eibe Frank

    Abstract: The family of methods collectively known as classifier chains has become a popular approach to multi-label learning problems. This approach involves linking together off-the-shelf binary classifiers in a chain structure, such that class label predictions become features for other classifiers. Such methods have proved flexible and effective and have obtained state-of-the-art empirical performance a…

    Submitted 15 April, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Journal ref: Journal of Artificial Intelligence Research 70 (2021) 683-718

  16. Automatic end-to-end De-identification: Is high accuracy the only metric?

    Authors: Vithya Yogarajan, Bernhard Pfahringer, Michael Mayo

    Abstract: De-identification of electronic health records (EHR) is a vital step towards advancing health informatics research and maximising the use of available data. It is a two-step process where step one is the identification of protected health information (PHI), and step two is replacing such PHI with surrogates. Despite the recent advances in automatic de-identification of EHR, significant obstacles r…

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: 17 pages, 1 figure, 7 tables, review journal paper

    Report number: 04-Feb-2020

    Journal ref: Applied Artificial Intelligence, 2020

  17. arXiv:1901.07777  [pdf, ps, other]

    stat.ML cs.LG

    Stochastic Gradient Trees

    Authors: Henry Gouk, Bernhard Pfahringer, Eibe Frank

    Abstract: We present an algorithm for learning decision trees using stochastic gradient information as the source of supervision. In contrast to previous approaches to gradient-based tree learning, our method operates in the incremental learning setting rather than the batch learning setting, and does not make use of soft splits or require the construction of a new tree for every update. We demonstrate how…

    Submitted 24 September, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: Accepted at ACML 2019

  18. arXiv:1810.06765  [pdf, other]

    cs.AI cs.CL

    A survey of automatic de-identification of longitudinal clinical narratives

    Authors: Vithya Yogarajan, Michael Mayo, Bernhard Pfahringer

    Abstract: Use of medical data, also known as electronic health records, in research helps develop and advance medical science. However, protecting patient confidentiality and identity while using medical data for analysis is crucial. Medical data can be in the form of tabular structures (i.e. tables), free-form narratives, and images. This study focuses on medical data in the free form longitudinal text. De…

    Submitted 15 October, 2018; originally announced October 2018.

  19. arXiv:1809.02744  [pdf, ps, other]

    cs.LG stat.ML

    On the Calibration of Nested Dichotomies for Large Multiclass Tasks

    Authors: Tim Leathart, Eibe Frank, Bernhard Pfahringer, Geoffrey Holmes

    Abstract: Nested dichotomies are used as a method of transforming a multiclass classification problem into a series of binary problems. A tree structure is induced that recursively splits the set of classes into subsets, and a binary classification model learns to discriminate between the two subsets of classes at each node. In this paper, we demonstrate that these nested dichotomies typically exhibit poor…

    Submitted 2 October, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

  20. arXiv:1809.02740  [pdf, other]

    cs.LG stat.ML

    Ensembles of Nested Dichotomies with Multiple Subset Evaluation

    Authors: Tim Leathart, Eibe Frank, Bernhard Pfahringer, Geoffrey Holmes

    Abstract: A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively applies binary splits to divide the set of classes into two subsets, and trains a binary classifier for each split. Many methods have been proposed to perform this split, each with various advantages and disadvantages. In this paper, we present a simple, g…

    Submitted 10 September, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

  21. arXiv:1808.00111  [pdf, other]

    cs.LG stat.ML

    Probability Calibration Trees

    Authors: Tim Leathart, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer

    Abstract: Obtaining accurate and well calibrated probability estimates from classifiers is useful in many applications, for example, when minimising the expected cost of classifications. Existing methods of calibrating probability estimates are applied globally, ignoring the potential for improvements by applying a more fine-grained model. We propose probability calibration trees, a modification of logistic…

    Submitted 14 September, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Proceedings of the 9th Asian Conference on Machine Learning

    Journal ref: Leathart, T., Frank, E., Holmes, G., & Pfahringer, B. (2017). Probability calibration trees. In Proceedings of the 9th Asian Conference on Machine Learning (pp. 145-160)

  22. arXiv:1804.05965  [pdf, ps, other]

    stat.ML cs.LG

    MaxGain: Regularisation of Neural Networks by Constraining Activation Magnitudes

    Authors: Henry Gouk, Bernhard Pfahringer, Eibe Frank, Michael Cree

    Abstract: Effective regularisation of neural networks is essential to combat overfitting due to the large number of parameters involved. We present an empirical analogue to the Lipschitz constant of a feed-forward neural network, which we refer to as the maximum gain. We hypothesise that constraining the gain of a network will have a regularising effect, similar to how constraining the Lipschitz constant of…

    Submitted 1 July, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: Accepted at ECML-PKDD 2018

  23. arXiv:1804.04368  [pdf, ps, other]

    stat.ML cs.LG

    Regularisation of Neural Networks by Enforcing Lipschitz Continuity

    Authors: Henry Gouk, Eibe Frank, Bernhard Pfahringer, Michael J. Cree

    Abstract: We investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant---for multiple $p$-norms---of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounde…

    Submitted 9 August, 2020; v1 submitted 12 April, 2018; originally announced April 2018.

  24. arXiv:1604.01854  [pdf, other]

    stat.ML cs.LG

    Building Ensembles of Adaptive Nested Dichotomies with Random-Pair Selection

    Authors: Tim Leathart, Bernhard Pfahringer, Eibe Frank

    Abstract: A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively splits the set of classes into two subsets, and trains a binary classifier to distinguish between each subset. Even though ensembles of nested dichotomies with random structure have been shown to perform well in practice, using a more sophisticated class s…

    Submitted 5 July, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: ECMLPKDD 2016

  25. arXiv:1511.06442  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Fast Metric Learning For Deep Neural Networks

    Authors: Henry Gouk, Bernhard Pfahringer, Michael Cree

    Abstract: Similarity metrics are a core component of many information retrieval and machine learning systems. In this work we propose a method capable of learning a similarity metric from data equipped with a binary relation. By considering only the similarity constraints, and initially ignoring the features, we are able to learn target vectors for each instance using one of several appropriately designed l…

    Submitted 5 April, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

  26. arXiv:1504.06366  [pdf, other]

    cs.AI cs.LG

    Use of Ensembles of Fourier Spectra in Capturing Recurrent Concepts in Data Streams

    Authors: Sripirakas Sakthithasan, Russel Pears, Albert Bifet, Bernhard Pfahringer

    Abstract: In this research, we apply ensembles of Fourier encoded spectra to capture and mine recurring concepts in a data stream environment. Previous research showed that compact versions of Decision Trees can be obtained by applying the Discrete Fourier Transform to accurately capture recurrent concepts in a data stream. However, in highly volatile environments where new concepts emerge often, the approa…

    Submitted 23 April, 2015; originally announced April 2015.

    Comments: This paper has been accepted for the IJCNN 2015 conference, Ireland

  27. arXiv:1212.2487  [pdf]

    cs.LG stat.ML

    Locally Weighted Naive Bayes

    Authors: Eibe Frank, Mark Hall, Bernhard Pfahringer

    Abstract: Despite its simplicity, the naive Bayes classifier has surprised machine learning researchers by exhibiting good performance on a variety of learning problems. Encouraged by these results, researchers have looked to overcome naive Bayes' primary weakness - attribute independence - and improve the performance of the algorithm. This paper presents a locally weighted version of nai…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-249-256