Showing 1–24 of 24 results for author: Birke, R

  1. arXiv:2405.14961

    cs.CV cs.LG

    SFDDM: Single-fold Distillation for Diffusion models

    Authors: Chi Hong, Jiyue Huang, Robert Birke, Dick Epema, Stefanie Roos, Lydia Y. Chen

    Abstract: While diffusion models effectively generate remarkable synthetic images, a key limitation is the inference inefficiency, requiring numerous sampling steps. To accelerate inference and maintain high-quality synthesis, teacher-student distillation is applied to compress the diffusion models in a progressive and binary manner by retraining, e.g., reducing the 1024-step model to a 128-step model in 3…

    Submitted 23 May, 2024; originally announced May 2024.

  2. DALLMi: Domain Adaption for LLM-based Multi-label Classifier

    Authors: Miruna Beţianu, Abele Mălan, Marco Aldinucci, Robert Birke, Lydia Chen

    Abstract: Large language models (LLMs) increasingly serve as the backbone for classifying text associated with distinct domains and simultaneously several labels (classes). When encountering domain shifts, e.g., classifier of movie reviews from IMDb to Rotten Tomatoes, adapting such an LLM-based multi-label classifier is challenging due to incomplete label sets at the target domain and daunting training ove…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2310.12746

    cs.LG

    TabuLa: Harnessing Language Models for Tabular Data Synthesis

    Authors: Zilong Zhao, Robert Birke, Lydia Chen

    Abstract: Given the ubiquitous use of tabular data in industries and the growing concerns in data privacy and security, tabular data synthesis emerges as a critical research area. The recent state-of-the-art methods show that large language models (LLMs) can be adopted to generate realistic tabular data. As LLMs pre-process tabular data as full text, they have the advantage of avoiding the curse of dimensio…

    Submitted 19 October, 2023; originally announced October 2023.

  4. arXiv:2309.06046

    cs.LG cs.AI cs.CV cs.NE

    BatMan-CLR: Making Few-shots Meta-Learners Resilient Against Label Noise

    Authors: Jeroen M. Galjaard, Robert Birke, Juan Perez, Lydia Y. Chen

    Abstract: The negative impact of label noise is well studied in classical supervised learning yet remains an open research question in meta-learning. Meta-learners aim to adapt to unseen learning tasks by learning a good initial model in meta-training and consecutively fine-tuning it according to new tasks during meta-testing. In this paper, we present the first extensive analysis of the impact of varying l…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 10 pages, 3 figures

  5. arXiv:2306.15552

    cs.AR cs.ET cs.LG

    A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms

    Authors: Cristina Silvano, Daniele Ielmini, Fabrizio Ferrandi, Leandro Fiorin, Serena Curzel, Luca Benini, Francesco Conti, Angelo Garofalo, Cristian Zambelli, Enrico Calore, Sebastiano Fabio Schifano, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Nicola Petra, Davide De Caro, Luciano Lavagno, Teodoro Urso, Valeria Cardellini, Gian Carlo Cardarilli, Robert Birke, Stefania Perri

    Abstract: Recent trends in deep learning (DL) imposed hardware accelerators as the most viable solution for several classes of high-performance computing (HPC) applications such as image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent advances in designing DL accelerators suitable to reach the performance requirements of HPC applications. In par…

    Submitted 12 July, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Preprint version of our manuscript submitted to the journal @ ACM CSUR (58 pages including Appendix) on June 22nd, 2023. Major revision submitted on July 12th, 2024

  6. Model-Agnostic Federated Learning

    Authors: Gianluca Mittone, Walter Riviera, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

    Abstract: Since its debut in 2016, Federated Learning (FL) has been tied to the inner workings of Deep Neural Networks (DNNs). On the one hand, this allowed its development and widespread use as DNNs proliferated. On the other hand, it neglected all those scenarios in which using DNNs is not possible or advantageous. The fact that most current FL frameworks only allow training DNNs reinforces this problem.…

    Submitted 18 October, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at the EuroPar'23 conference, Limassol, Cyprus

    Journal ref: In Euro-Par 2023: Parallel Processing. Euro-Par 2023. Lecture Notes in Computer Science, vol 14100. Springer, Cham

  7. Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning

    Authors: Gianluca Mittone, Nicolò Tonci, Robert Birke, Iacopo Colonnelli, Doriana Medić, Andrea Bartolini, Roberto Esposito, Emanuele Parisi, Francesco Beneventi, Mirko Polato, Massimo Torquati, Luca Benini, Marco Aldinucci

    Abstract: Decentralised Machine Learning (DML) enables collaborative machine learning without centralised input data. Federated Learning (FL) and Edge Inference are examples of DML. While tools for DML (especially FL) are starting to flourish, many are not flexible and portable enough to experiment with novel processors (e.g., RISC-V), non-fully connected network topologies, and asynchronous collaboration s…

    Submitted 18 October, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: This paper is the accepted version of ACM copyrighted material presented at the CF'23 conference in Bologna, Italy

    Journal ref: In Proceedings of the 20th ACM International Conference on Computing Frontiers 2023 (CF '23), ACM, New York, NY, USA, 73-83

  8. arXiv:2211.09286

    cs.LG

    Permutation-Invariant Tabular Data Synthesis

    Authors: Yujin Zhu, Zilong Zhao, Robert Birke, Lydia Y. Chen

    Abstract: Tabular data synthesis is an emerging approach to circumvent strict regulations on data privacy while discovering knowledge through big data. Although state-of-the-art AI-based tabular data synthesizers, e.g., table-GAN, CTGAN, TVAE, and CTAB-GAN, are effective at generating synthetic tabular data, their training is sensitive to column permutations of input data. In this paper, we first conduct an…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Paper is accepted in 2022 IEEE International Conference Big Data in Special Session Privacy and Security of Big Data (PSBD)

  9. arXiv:2210.06239

    cs.LG

    FCT-GAN: Enhancing Table Synthesis via Fourier Transform

    Authors: Zilong Zhao, Robert Birke, Lydia Y. Chen

    Abstract: Synthetic tabular data emerges as an alternative for sharing knowledge while adhering to restrictive data access regulations, e.g., European General Data Protection Regulation (GDPR). Mainstream state-of-the-art tabular data synthesizers draw methodologies from Generative Adversarial Networks (GANs), which are composed of a generator and a discriminator. While convolution neural networks are shown…

    Submitted 12 October, 2022; originally announced October 2022.

  10. arXiv:2204.00401

    cs.LG

    CTAB-GAN+: Enhancing Tabular Data Synthesis

    Authors: Zilong Zhao, Aditya Kunar, Robert Birke, Lydia Y. Chen

    Abstract: While data sharing is crucial for knowledge development, privacy concerns and strict regulation (e.g., European General Data Protection Regulation (GDPR)) limit its full effectiveness. Synthetic tabular data emerges as an alternative to enable data sharing while fulfilling regulatory and privacy constraints. State-of-the-art tabular data synthesizers draw methodologies from Generative Adversarial Net…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.08369, arXiv:2108.10064

  11. arXiv:2108.07927

    cs.LG

    Fed-TGAN: Federated Learning Framework for Synthesizing Tabular Data

    Authors: Zilong Zhao, Robert Birke, Aditya Kunar, Lydia Y. Chen

    Abstract: Generative Adversarial Networks (GANs) are typically trained to synthesize data, from images and more recently tabular data, under the assumption of directly accessible training data. Recently, federated learning (FL) is an emerging paradigm that features decentralized learning on client's local data with a privacy-preserving capability. And, while learning GANs to synthesize images on FL systems…

    Submitted 17 August, 2021; originally announced August 2021.

  12. arXiv:2108.02032

    cs.CV cs.AI cs.LG

    Multi-Label Gold Asymmetric Loss Correction with Single-Label Regulators

    Authors: Cosmin Octavian Pene, Amirmasoud Ghiassi, Taraneh Younesian, Robert Birke, Lydia Y. Chen

    Abstract: Multi-label learning is an emerging extension of the multi-class classification where an image contains multiple labels. Not only acquiring a clean and fully labeled dataset in multi-label learning is extremely expensive, but also many of the actual labels are corrupted or missing due to the automated or non-expert annotation techniques. Noisy label data decrease the prediction performance drastic…

    Submitted 4 August, 2021; originally announced August 2021.

  13. arXiv:2107.02521

    cs.LG

    DTGAN: Differential Private Training for Tabular GANs

    Authors: Aditya Kunar, Robert Birke, Zilong Zhao, Lydia Chen

    Abstract: Tabular generative adversarial networks (TGAN) have recently emerged to cater to the need of synthesizing tabular data -- the most widely used data format. While synthetic tabular data offers the advantage of complying with privacy regulations, there still exists a risk of privacy leakage via inference attacks due to interpolating the properties of real data during training. Differential private (…

    Submitted 2 August, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 16 pages, 4 figures and 5 tables, submitted to the ACML 2021 conference

  14. Enhancing Robustness of On-line Learning Models on Highly Noisy Data

    Authors: Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

    Abstract: Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper,…

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Transactions on Dependable and Secure Computing. arXiv admin note: substantial text overlap with arXiv:1911.04383

  15. arXiv:2102.08369

    cs.LG

    CTAB-GAN: Effective Table Data Synthesizing

    Authors: Zilong Zhao, Aditya Kunar, Hiek Van der Scheer, Robert Birke, Lydia Y. Chen

    Abstract: While data sharing is crucial for knowledge development, privacy concerns and strict regulation (e.g., European General Data Protection Regulation (GDPR)) unfortunately limit its full effectiveness. Synthetic tabular data emerges as an alternative to enable data sharing while fulfilling regulatory and privacy constraints. The state-of-the-art tabular data synthesizers draw methodologies from gener…

    Submitted 31 May, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: This paper consists of 11 pages which contain 8 figures, 5 tables and an appendix with a user manual for our software application

    ACM Class: I.2.m

  16. arXiv:2011.06833

    cs.LG

    End-to-End Learning from Noisy Crowd to Supervised Machine Learning Models

    Authors: Taraneh Younesian, Chi Hong, Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

    Abstract: Labeling real-world datasets is time consuming but indispensable for supervised machine learning models. A common solution is to distribute the labeling task across a large number of non-expert workers via crowd-sourcing. Due to the varying background and experience of crowd workers, the obtained labels are highly prone to errors and even detrimental to the learning models. In this paper, we advoc…

    Submitted 13 November, 2020; originally announced November 2020.

  17. arXiv:2010.00501

    cs.DC

    PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep Learning Clusters

    Authors: Isabelly Rocha, Nathaniel Morris, Lydia Y. Chen, Pascal Felber, Robert Birke, Valerio Schiavoni

    Abstract: DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of hyperparameters. Existing approaches make use of techniques such as early stopping criteria to reduce the tuning impact on learning cost. However, these strategies…

    Submitted 2 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: European Commission Project: LEGaTO - Low Energy Toolset for Heterogeneous Computing (EC-H2020-780681)

  18. arXiv:2007.06324

    cs.LG stat.ML

    TrustNet: Learning from Trusted Data Against (A)symmetric Label Noise

    Authors: Amirmasoud Ghiassi, Taraneh Younesian, Robert Birke, Lydia Y. Chen

    Abstract: Robustness to label noise is a critical property for weakly-supervised classifiers trained on massive datasets. In this paper, we first derive an analytical bound for any given noise pattern. Based on the insights, we design TrustNet that first adversely learns the pattern of noise corrupt…

    Submitted 13 July, 2020; originally announced July 2020.

  19. arXiv:2007.05305

    cs.LG stat.ML

    ExpertNet: Adversarial Learning and Recovery Against Noisy Labels

    Authors: Amirmasoud Ghiassi, Robert Birke, Rui Han, Lydia Y. Chen

    Abstract: Today's available datasets in the wild, e.g., from social media and open platforms, present tremendous opportunities and challenges for deep learning, as there is a significant portion of tagged images, but often with noisy, i.e. erroneous, labels. Recent studies improve the robustness of deep models against noisy labels without the knowledge of true labels. In this paper, we advocate to derive a…

    Submitted 13 July, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

  20. arXiv:2001.10399

    cs.LG stat.ML

    QActor: On-line Active Learning for Noisy Labeled Stream Data

    Authors: Taraneh Younesian, Zilong Zhao, Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

    Abstract: Noisy labeled data is more a norm than a rarity for self-generated content that is continuously published on the web and social media. Due to privacy concerns and governmental regulations, such a data stream can only be stored and used for learning purposes for a limited duration. To overcome the noise in this on-line scenario we propose QActor, which novelly combines: the selection of supposedly clea…

    Submitted 28 January, 2020; originally announced January 2020.

  21. arXiv:1911.04383

    cs.LG stat.ML

    RAD: On-line Anomaly Detection for Highly Unreliable Data

    Authors: Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

    Abstract: Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper,…

    Submitted 11 November, 2019; originally announced November 2019.

  22. arXiv:1909.05531

    cs.DC

    Differential Approximation and Sprinting for Multi-Priority Big Data Engines

    Authors: Robert Birke, Isabelly Rocha, Juan Perez, Valerio Schiavoni, Pascal Felber, Lydia Y. Chen

    Abstract: Today's big data clusters based on the MapReduce paradigm are capable of executing analysis jobs with multiple priorities, providing differential latency guarantees. Traces from production systems show that the latency advantage of high-priority jobs comes at the cost of severe latency degradation of low-priority jobs as well as daunting resource waste caused by repetitive eviction and re-executio…

    Submitted 16 September, 2019; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: European Commission Project: LEGaTO - Low Energy Toolset for Heterogeneous Computing (EC-H2020-780681)

  23. arXiv:1902.06160

    cs.LG stat.ML

    WiSE-ALE: Wide Sample Estimator for Approximate Latent Embedding

    Authors: Shuyu Lin, Ronald Clark, Robert Birke, Niki Trigoni, Stephen Roberts

    Abstract: Variational Auto-encoders (VAEs) have been very successful as methods for forming compressed latent representations of complex, often high-dimensional, data. In this paper, we derive an alternative variational lower bound from the one common in VAEs, which aims to minimize aggregate information loss. Using our lower bound as the objective function for an auto-encoder enables us to place a prior on…

    Submitted 18 March, 2019; v1 submitted 16 February, 2019; originally announced February 2019.

    Comments: 18 pages, appendix included

  24. arXiv:1807.07291

    cs.LG stat.ML

    Online Label Aggregation: A Variational Bayesian Approach

    Authors: Chi Hong, Amirmasoud Ghiassi, Yichi Zhou, Robert Birke, Lydia Y. Chen

    Abstract: Noisy labeled data is more a norm than a rarity for crowd sourced contents. It is effective to distill noise and infer correct labels through aggregation results from crowd workers. To ensure the time relevance and overcome slow responses of workers, online label aggregation is increasingly requested, calling for solutions that can incrementally infer true label distribution via subsets of data it…

    Submitted 15 November, 2020; v1 submitted 19 July, 2018; originally announced July 2018.