Showing 1–7 of 7 results for author: Cebere, B

  1. arXiv:2302.12749

    cs.LG

    SurvivalGAN: Generating Time-to-Event Data for Survival Analysis

    Authors: Alexander Norcliffe, Bogdan Cebere, Fergus Imrie, Pietro Lio, Mihaela van der Schaar

    Abstract: Synthetic data is becoming an increasingly promising technology, and successful applications can improve privacy, fairness, and data democratization. While there are many methods for generating synthetic tabular data, the task remains non-trivial and unexplored for specific scenarios. One such scenario is survival data. Here, the key difficulty is censoring: for some instances, we are not aware of…

    Submitted 24 February, 2023; originally announced February 2023.

  2. arXiv:2301.07573

    cs.LG cs.AI

    Synthcity: facilitating innovative use cases of synthetic data in different data modalities

    Authors: Zhaozhi Qian, Bogdan-Constantin Cebere, Mihaela van der Schaar

    Abstract: Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more. Synthcity provides the practitioners with a single access point to cutting edge research and tools in synth…

    Submitted 18 January, 2023; originally announced January 2023.

  3. AutoPrognosis 2.0: Democratizing Diagnostic and Prognostic Modeling in Healthcare with Automated Machine Learning

    Authors: Fergus Imrie, Bogdan Cebere, Eoin F. McKinney, Mihaela van der Schaar

    Abstract: Diagnostic and prognostic models are increasingly important in medicine and inform many clinical decisions. Recently, machine learning approaches have shown improvement over conventional modeling techniques by better capturing complex interactions between patient covariates in a data-driven manner. However, the use of machine learning introduces a number of technical and practical challenges that…

    Submitted 21 October, 2022; originally announced October 2022.

    Journal ref: PLOS Digital Health, 2023, 2(6): e0000276

  4. arXiv:2206.07769

    stat.ML cs.LG

    HyperImpute: Generalized Iterative Imputation with Automatic Model Selection

    Authors: Daniel Jarrett, Bogdan Cebere, Tennison Liu, Alicia Curth, Mihaela van der Schaar

    Abstract: Consider the problem of imputing missing values in a dataset. On the one hand, conventional approaches using iterative imputation benefit from the simplicity and customizability of learning conditional distributions directly, but suffer from the practical requirement for appropriate model specification of each and every variable. On the other hand, recent methods using deep generative modeling be…

    Submitted 15 June, 2022; originally announced June 2022.

    Journal ref: In Proc. 39th International Conference on Machine Learning (ICML 2022)

  5. arXiv:2104.12385

    cs.LG cs.CR

    Syft 0.5: A Platform for Universally Deployable Structured Transparency

    Authors: Adam James Hall, Madhava Jay, Tudor Cebere, Bogdan Cebere, Koen Lennart van der Veen, George Muraru, Tongye Xu, Patrick Cason, William Abramson, Ayoub Benaissa, Chinmay Shah, Alan Aboudib, Théo Ryffel, Kritika Prakash, Tom Titcombe, Varun Kumar Khare, Maddie Shang, Ionesio Junior, Animesh Gupta, Jason Paumier, Nahua Kang, Vova Manannikov, Andrew Trask

    Abstract: We present Syft 0.5, a general-purpose framework that combines a core group of privacy-enhancing technologies that facilitate a universal set of structured transparency systems. This framework is demonstrated through the design and implementation of a novel privacy-preserving inference information flow where we pass homomorphically encrypted activation signals through a split neural network for in…

    Submitted 27 April, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: ICLR 2021 Workshop on Distributed and Private Machine Learning (DPML 2021)

  6. arXiv:2104.03152

    cs.CR cs.LG

    TenSEAL: A Library for Encrypted Tensor Operations Using Homomorphic Encryption

    Authors: Ayoub Benaissa, Bilal Retiat, Bogdan Cebere, Alaa Eddine Belfedhal

    Abstract: Machine learning algorithms have achieved remarkable results and are widely applied in a variety of domains. These algorithms often rely on sensitive and private data such as medical and financial records. Therefore, it is vital to draw further attention regarding privacy threats and corresponding defensive techniques applied to machine learning models. In this paper, we present TenSEAL, an open-s…

    Submitted 28 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: ICLR 2021 Workshop on Distributed and Private Machine Learning (DPML 2021)

  7. arXiv:2011.09350

    cs.CR cs.LG

    Asymmetric Private Set Intersection with Applications to Contact Tracing and Private Vertical Federated Machine Learning

    Authors: Nick Angelou, Ayoub Benaissa, Bogdan Cebere, William Clark, Adam James Hall, Michael A. Hoeh, Daniel Liu, Pavlos Papadopoulos, Robin Roehm, Robert Sandmann, Phillipp Schoppmann, Tom Titcombe

    Abstract: We present a multi-language, cross-platform, open-source library for asymmetric private set intersection (PSI) and PSI-Cardinality (PSI-C). Our protocol combines traditional DDH-based PSI and PSI-C protocols with compression based on Bloom filters that helps reduce communication in the asymmetric setting. Currently, our library supports C++, C, Go, WebAssembly, JavaScript, Python, and Rust, and ru…

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Workshop on Privacy Preserving Machine Learning (PPML 2020)