Showing 1–4 of 4 results for author: Micheletti, N

  1. arXiv:2405.12630  [pdf, other]

    cs.CL cs.AI

    Exploration of Masked and Causal Language Modelling for Text Generation

    Authors: Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

    Abstract: Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation, Causal Language Modelling (CLM), which generates text sequentially from left to right, inherently limits the freedom of the model, which does not decide when a…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: working paper

  2. arXiv:2310.19727  [pdf, other]

    cs.CL cs.AI cs.LG

    Generating Medical Prescriptions with Conditional Transformer

    Authors: Samuel Belkadi, Nicolo Micheletti, Lifeng Han, Warren Del-Pinto, Goran Nenadic

    Abstract: Access to real-world medication prescriptions is essential for medical research and healthcare quality improvement. However, access to real medication prescriptions is often limited due to the sensitive nature of the information expressed. Additionally, manually labelling these instructions for training and fine-tuning Natural Language Processing (NLP) models can be tedious and expensive. We intro…

    Submitted 18 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to: Workshop on Synthetic Data Generation with Generative AI (SyntheticData4ML Workshop) at NeurIPS 2023

  3. arXiv:2309.13202  [pdf, other]

    cs.CL cs.AI

    Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical Abstracts

    Authors: Zihao Li, Samuel Belkadi, Nicolo Micheletti, Lifeng Han, Matthew Shardlow, Goran Nenadic

    Abstract: Biomedical literature often uses complex language and inaccessible professional terminologies. That is why simplification plays an important role in improving public health literacy. Applying Natural Language Processing (NLP) models to automate such tasks allows for quick and direct accessibility for lay readers. In this work, we investigate the ability of state-of-the-art large language models (L…

    Submitted 16 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE-ICHI 2024 https://ieeeichi2024.github.io/

  4. arXiv:2210.13958  [pdf, other]

    cs.LG cs.CY stat.ML

    Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data

    Authors: Raffaele Marchesi, Nicolo Micheletti, Giuseppe Jurman, Venet Osmani

    Abstract: Several approaches have been developed to mitigate algorithmic bias stemming from health data poverty, where minority groups are underrepresented in training datasets. Augmenting the minority class using resampling (such as SMOTE) is a widely used approach due to the simplicity of the algorithms. However, these algorithms decrease data variability and may introduce correlations between samples, gi…

    Submitted 26 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022 Workshop on Synthetic Data for Empowering ML Research (NeurIPS 2022 SyntheticData4ML)