Showing 1–13 of 13 results for author: De Carvalho, M

  1. arXiv:2406.04825  [pdf, other]

    cs.LG cs.AI

    Graph Mining under Data scarcity

    Authors: Appan Rakaraddi, Lam Siew-Kei, Mahardhika Pratama, Marcus de Carvalho

    Abstract: Multitude of deep learning models have been proposed for node classification in graphs. However, they tend to perform poorly under labeled-data scarcity. Although Few-shot learning for graphs has been introduced to overcome this problem, the existing models are not easily adaptable for generic graph learning frameworks like Graph Neural Networks (GNNs). Our work proposes an Uncertainty Estimator f…

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 7 pages, 2 figures

  2. arXiv:2404.08480  [pdf, other]

    cs.LG cs.CL stat.CO

    Decoding AI: The inside story of data analysis in ChatGPT

    Authors: Ozan Evkaya, Miguel de Carvalho

    Abstract: As a result of recent advancements in generative AI, the field of Data Science is prone to various changes. This review critically examines the Data Analysis (DA) capabilities of ChatGPT assessing its performance across a wide range of tasks. While DA provides researchers and practitioners with unprecedented analytical capabilities, it is far from being perfect, and it is important to recognize an…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 15 pages with figures and appendix

  3. arXiv:2402.12490  [pdf, other]

    cs.LG cs.AI cs.CV

    Towards Cross-Domain Continual Learning

    Authors: Marcus de Carvalho, Mahardhika Pratama, Jie Zhang, Chua Haoyan, Edward Yapp

    Abstract: Continual learning is a process that involves training learning agents to sequentially master a stream of tasks or classes without revisiting past data. The challenge lies in leveraging previously acquired knowledge to learn new tasks efficiently, while avoiding catastrophic forgetting. Existing methods primarily focus on single domains, restricting their applicability to specific problems. In t…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 Figures, 4 Tables. To be published at the IEEE International Conference on Data Engineering (ICDE) 2024

  4. arXiv:2212.04009  [pdf, other]

    stat.ML cs.LG stat.ME

    A parallelizable model-based approach for marginal and multivariate clustering

    Authors: Miguel de Carvalho, Gabriel Martos Venturini, Andrej Svetlošák

    Abstract: This paper develops a clustering method that takes advantage of the sturdiness of model-based clustering, while attempting to mitigate some of its pitfalls. First, we note that standard model-based clustering likely leads to the same number of clusters per margin, which seems a rather artificial assumption for a variety of datasets. We tackle this issue by specifying a finite mixture model per mar…

    Submitted 7 December, 2022; originally announced December 2022.

  5. arXiv:2209.02112  [pdf, other]

    cs.LG cs.AI

    Class-Incremental Learning via Knowledge Amalgamation

    Authors: Marcus de Carvalho, Mahardhika Pratama, Jie Zhang, Yajuan San

    Abstract: Catastrophic forgetting has been a significant problem hindering the deployment of deep learning algorithms in the continual learning setting. Numerous methods have been proposed to address the catastrophic forgetting problem where an agent loses its generalization power of old tasks while learning new tasks. We put forward an alternative strategy to handle the catastrophic forgetting with knowled…

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Paper accepted at ECML PKDD 2022

  6. arXiv:2209.01556  [pdf, other]

    cs.LG cs.AI

    Reinforced Continual Learning for Graphs

    Authors: Appan Rakaraddi, Siew Kei Lam, Mahardhika Pratama, Marcus De Carvalho

    Abstract: Graph Neural Networks (GNNs) have become the backbone for a myriad of tasks pertaining to graphs and similar topological data structures. While many works have been established in domains related to node and graph classification/regression tasks, they mostly deal with a single task. Continual learning on graphs is largely unexplored and existing graph continual learning approaches are limited to t…

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: has been accepted for publication as a long paper at 31st ACM International Conference on Information and Knowledge Management (CIKM 22)

  7. Autonomous Cross Domain Adaptation under Extreme Label Scarcity

    Authors: Weiwei Weng, Mahardhika Pratama, Choiru Za'in, Marcus De Carvalho, Rakaraddi Appan, Andri Ashfahani, Edward Yapp Kien Yee

    Abstract: A cross domain multistream classification is a challenging problem calling for fast domain adaptations to handle different but related streams in never-ending and rapidly changing environments. Notwithstanding that existing multistream classifiers assume no labelled samples in the target stream, they still incur expensive labelling cost since they require fully labelled samples of the source strea…

    Submitted 4 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  8. arXiv:2110.01326  [pdf, other]

    cs.LG

    ACDC: Online Unsupervised Cross-Domain Adaptation

    Authors: Marcus de Carvalho, Mahardhika Pratama, Jie Zhang, Edward Yapp

    Abstract: We consider the problem of online unsupervised cross-domain adaptation, where two independent but related data streams with different feature spaces -- a fully labeled source stream and an unlabeled target stream -- are learned together. Unique characteristics and challenges such as covariate shift, asynchronous concept drifts, and contrasting data throughput arises. We propose ACDC, an adversaria…

    Submitted 4 October, 2021; originally announced October 2021.

  9. Arc Flow Formulations Based on Dynamic Programming: Theoretical Foundations and Applications

    Authors: Vinícius L. de Lima, Cláudio Alves, François Clautiaux, Manuel Iori, José M. Valério de Carvalho

    Abstract: Network flow formulations are among the most successful tools to solve optimization problems. Such formulations correspond to determining an optimal flow in a network. One particular class of network flow formulations is the arc flow, where variables represent flows on individual arcs of the network. For $\mathcal{NP}$-hard problems, polynomial-sized arc flow models typically provide weak linear r…

    Submitted 15 April, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  10. arXiv:1910.03434  [pdf, other]

    cs.LG stat.ML

    ATL: Autonomous Knowledge Transfer from Many Streaming Processes

    Authors: Mahardhika Pratama, Marcus de Carvalho, Renchunzi Xie, Edwin Lughofer, Jie Lu

    Abstract: Transferring knowledge across many streaming processes remains an uncharted territory in the existing literature and features unique characteristics: no labelled instance of the target domain, covariate shift of source and target domain, different period of drifts in the source and target domains. Autonomous transfer learning (ATL) is proposed in this paper as a flexible deep learning approach for…

    Submitted 19 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: This paper has been accepted for publication in CIKM 2019

  11. arXiv:1907.13070  [pdf, other]

    cs.LG stat.ML

    Predicting assisted ventilation in Amyotrophic Lateral Sclerosis using a mixture of experts and conformal predictors

    Authors: Telma Pereira, Sofia Pires, Marta Gromicho, Susana Pinto, Mamede de Carvalho, Sara C. Madeira

    Abstract: Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease characterized by a rapid motor decline, leading to respiratory failure and subsequently to death. In this context, researchers have sought for models to automatically predict disease progression to assisted ventilation in ALS patients. However, the clinical translation of such models is limited by the lack of insight 1) on the risk…

    Submitted 30 July, 2019; originally announced July 2019.

    Journal ref: KDD 2019 Workshop on Applied Data Science for Healthcare

  12. arXiv:1806.08247  [pdf, other]

    cs.AI

    Log Skeletons: A Classification Approach to Process Discovery

    Authors: H. M. W. Verbeek, R. Medeiros de Carvalho

    Abstract: To test the effectiveness of process discovery algorithms, a Process Discovery Contest (PDC) has been set up. This PDC uses a classification approach to measure this effectiveness: The better the discovered model can classify whether or not a new trace conforms to the event log, the better the discovery algorithm is supposed to be. Unfortunately, even the state-of-the-art fully-automated discovery…

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: 16 pages with 9 figures, followed by an appendix of 14 pages with 17 figures

    MSC Class: 62H30; 93C65

    ACM Class: I.5.3; H.3.3; J.1

  13. arXiv:cs/0603116  [pdf, ps, other]

    cs.CV

    Fourier Analysis and Holographic Representations of 1D and 2D Signals

    Authors: G. A. Giraldi, B. F. Moutinho, D. M. L. de Carvalho, J. C. de Oliveira

    Abstract: In this paper, we focus on Fourier analysis and holographic transforms for signal representation. For instance, in the case of image processing, the holographic representation has the property that an arbitrary portion of the transformed image enables reconstruction of the whole image with details missing. We focus on holographic representation defined through the Fourier Transforms. Thus, We fi…

    Submitted 3 April, 2006; v1 submitted 29 March, 2006; originally announced March 2006.

    Comments: 13 pages

    ACM Class: I.4.10