Showing 1–50 of 51 results for author: Clifton, D

Searching in archive cs.
  1. arXiv:2406.14377  [pdf, other]

    cs.LG cs.AI

    Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection

    Authors: Rushuang Zhou, Zijun Liu, Lei Clifton, David A. Clifton, Kannie W. Y. Chan, Yuan-Ting Zhang, Yining Dong

    Abstract: The label scarcity problem is the main challenge that hinders the wide application of deep learning systems in automatic cardiovascular diseases (CVDs) detection using electrocardiography (ECG). Tuning pre-trained models alleviates this problem by transferring knowledge learned from large datasets to downstream small datasets. However, bottlenecks in computational efficiency and CVDs detection perform…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.07841  [pdf, other]

    cs.LG

    Sample Selection Bias in Machine Learning for Healthcare

    Authors: Vinod Kumar Chauhan, Lei Clifton, Achille Salaün, Huiqi Yvonne Lu, Kim Branson, Patrick Schwab, Gaurav Nigam, David A. Clifton

    Abstract: While machine learning algorithms hold promise for personalised medicine, their clinical adoption remains limited. One critical factor contributing to this restraint is sample selection bias (SSB) which refers to the study population being less representative of the target population, leading to biased and potentially harmful decisions. Despite being well-known in the literature, SSB remains scarc…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 20 pages and 11 figures (under review)

  3. arXiv:2405.00716  [pdf, other]

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll…

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  4. arXiv:2404.01620  [pdf]

    cs.SD cs.AI cs.CY eess.AS

    Voice EHR: Introducing Multimodal Audio Data for Health

    Authors: James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Yen Minh Lam, Hang Nguyen, Phuc Hong, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song, Emily Ricotta, David Clifton, et al. (3 additional authors not shown)

    Abstract: Large AI models trained on audio data may have the potential to rapidly classify patients, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets using expensive recording equipment in high-income, English-speaking countries. This challenges deployment in resource-constrained, high-volume settings where audio d…

    Submitted 1 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 19 pages, 2 figures, 7 tables

  5. arXiv:2402.10597  [pdf, other]

    cs.CL cs.AI

    Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

    Authors: Niall Taylor, Upamanyu Ghose, Omid Rohanian, Mohammadmahdi Nouriborji, Andrey Kormilitzin, David Clifton, Alejo Nevado-Holgado

    Abstract: The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models without the need for complete fine-tuning, using Parameter Efficient Fine-tuning (PEFT) methods. We present an investigation into the suitability of different PEFT…

    Submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2401.00579  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, David A. Clifton

    Abstract: Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evo…

    Submitted 31 December, 2023; originally announced January 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  7. arXiv:2311.05112  [pdf]

    cs.CL cs.AI

    A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

    Authors: Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

    Abstract: Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their…

    Submitted 15 May, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Preprint. Version 5. 6 figures; 14 tables; 41 pages

  8. arXiv:2309.00810  [pdf, other]

    cs.CV cs.AI

    RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model

    Authors: Fengxiang Bie, Yibo Yang, Zhongzhu Zhou, Adam Ghanem, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Golnari, David A. Clifton, Yuxiong He, Dacheng Tao, Shuaiwen Leon Song

    Abstract: Text-to-image generation (TTI) refers to the usage of models that could process text input and generate high fidelity images based on text descriptions. Text-to-image generation using neural networks could be traced back to the emergence of Generative Adversarial Network (GAN), followed by the autoregressive Transformer. Diffusion models are one prominent type of generative model used for the genera…

    Submitted 1 September, 2023; originally announced September 2023.

  9. arXiv:2306.10494  [pdf, other]

    eess.SP cs.AI

    Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction: A Multi-Dataset Study

    Authors: Rushuang Zhou, Lei Lu, Zijun Liu, Ting Xiang, Zhen Liang, David A. Clifton, Yining Dong, Yuan-Ting Zhang

    Abstract: Electrocardiography (ECG) is a non-invasive tool for predicting cardiovascular diseases (CVDs). Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques. However, the label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets greatly hinder the widespread application of deep learning-based…

    Submitted 18 June, 2023; originally announced June 2023.

  10. arXiv:2306.06955  [pdf, other]

    cs.LG

    A Brief Review of Hypernetworks in Deep Learning

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ping Lu, Soheila Molaei, David A. Clifton

    Abstract: Hypernetworks, or hypernets in short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility, adaptability, dynamism, faster training, information sharing, and model compression etc. Hypernets have shown promising results in a variety of deep learning problems, in…

    Submitted 10 August, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: revised categorisation, added new Section '5 When can we use Hypernets?', and other corrections (2 figures and 2 tables) (under review)

  11. arXiv:2305.15984  [pdf, other]

    cs.LG stat.ME

    Dynamic Inter-treatment Information Sharing for Individualized Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ghadeer Ghosheh, Soheila Molaei, David A. Clifton

    Abstract: Estimation of individualized treatment effects (ITE) from observational studies is a fundamental problem in causal inference and holds significant importance across domains, including healthcare. However, limited observational datasets pose challenges in reliable ITE estimation as data have to be split among treatment groups to train an ITE learner. While information sharing among treatment groups…

    Submitted 12 February, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted to The 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  12. arXiv:2305.03711  [pdf, other]

    cs.LG cs.CY

    Medical records condensation: a roadmap towards healthcare data democratisation

    Authors: Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton

    Abstract: The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data…

    Submitted 8 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  13. arXiv:2305.03710  [pdf, other]

    cs.LG cs.CR

    Data Encoding For Healthcare Data Democratisation and Information Leakage Prevention

    Authors: Anshul Thakur, Tingting Zhu, Vinayak Abrol, Jacob Armstrong, Yujiang Wang, David A. Clifton

    Abstract: The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework…

    Submitted 5 May, 2023; originally announced May 2023.

  14. arXiv:2305.03219  [pdf]

    cs.LG stat.ME

    All models are local: time to replace external validation with recurrent local validation

    Authors: Alex Youssef, Michael Pencina, Anshul Thakur, Tingting Zhu, David Clifton, Nigam H. Shah

    Abstract: External validation is often recommended to ensure the generalizability of ML models. However, it neither guarantees generalizability nor equates to a model's clinical usefulness (the ultimate goal of any clinical decision-support tool). External validation is misaligned with current healthcare ML needs. First, patient data changes across time, geography, and facilities. These changes create signi…

    Submitted 13 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  15. arXiv:2303.06458  [pdf, other]

    cs.CL cs.AI cs.CV

    ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

    Authors: Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton

    Abstract: Natural Language Generation (NLG) accepts input data in the form of images, videos, or text and generates corresponding natural language text as output. Existing NLG methods mainly adopt a supervised approach and rely heavily on coupled data-to-text pairs. However, for many targeted scenarios and for non-English languages, sufficient quantities of labeled data are often not available. To relax the…

    Submitted 3 June, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: Accepted by TPAMI (Our code and data are available at https://github.com/yangbang18/ZeroNLG)

  16. arXiv:2302.14679  [pdf, other]

    cs.LG cs.CL

    Synthesizing Mixed-type Electronic Health Records using Diffusion Models

    Authors: Taha Ceritli, Ghadeer O. Ghosheh, Vinod Kumar Chauhan, Tingting Zhu, Andrew P. Creagh, David A. Clifton

    Abstract: Electronic Health Records (EHRs) contain sensitive patient information, which presents privacy concerns when sharing such data. Synthetic data generation is a promising solution to mitigate these risks, often relying on deep generative models such as Generative Adversarial Networks (GANs). However, recent studies have shown that diffusion models offer several advantages over GANs, such as generati…

    Submitted 10 August, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Page 2, Figure 1 is updated

  17. arXiv:2302.04725  [pdf, other]

    cs.CL cs.AI cs.LG

    Lightweight Transformers for Clinical Natural Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton

    Abstract: Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT and BioClinicalBERT are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create…

    Submitted 9 February, 2023; originally announced February 2023.

    MSC Class: 68T50 ACM Class: I.2.7

  18. arXiv:2302.01735  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective

    Authors: Chenyu You, Weicheng Dai, Yifei Min, Fenglin Liu, David A. Clifton, S Kevin Zhou, Lawrence Hamilton Staib, James S Duncan

    Abstract: For medical image segmentation, contrastive learning is the dominant practice to improve the quality of visual representations by contrasting semantically similar and dissimilar pairs of samples. This is enabled by the observation that without accessing ground truth labels, negative examples with truly dissimilar anatomical features, if sampled, can significantly improve the performance. In realit…

    Submitted 23 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by Advances in Neural Information Processing Systems (NeurIPS 2023)

  19. arXiv:2211.11427  [pdf, other]

    cs.CV

    Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

    Authors: Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen

    Abstract: Most video-and-language representation learning approaches employ contrastive learning, e.g., CLIP, to project the video and text features into a common latent space according to the semantic similarities of text-video pairs. However, such learned shared latent spaces are often not optimal, and the modality gap between visual and textual representation cannot be fully eliminated. In this paper, w…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022

  20. arXiv:2210.12777  [pdf, other]

    cs.CL cs.LG

    Generating Accurate and Faithful Discharge Instructions: Task, Dataset, and Model

    Authors: Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton

    Abstract: The "Patient Instruction" (PI), known as "Discharge Instruction", which contains critical instructional information provided both to carers and to the patient at the time of discharge, is essential for the patient to manage their condition outside hospital. An accurate and easy-to-follow PI can improve the self-management of patients which can in turn reduce hospital readmission rates. However, wr…

    Submitted 10 January, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022. (Thirty-sixth Conference on Neural Information Processing Systems, https://openreview.net/forum?id=dp0zWsdOV1h)

  21. arXiv:2210.10530  [pdf, other]

    cs.LG cs.AI stat.ME

    Adversarial De-confounding in Individualised Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Soheila Molaei, Marzia Hoque Tania, Anshul Thakur, Tingting Zhu, David A. Clifton

    Abstract: Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised tr…

    Submitted 24 January, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: accepted to AISTATS 2023

  22. arXiv:2210.09440  [pdf, other]

    cs.CL cs.AI

    Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints

    Authors: Omid Rohanian, Hannah Jauncey, Mohammadmahdi Nouriborji, Vinod Kumar Chauhan, Bronner P. Gonçalves, Christiana Kartsonaki, ISARIC Clinical Characterisation Group, Laura Merson, David Clifton

    Abstract: Processing information locked within clinical health records is a challenging task that remains an active area of research in biomedical NLP. In this work, we evaluate a broad set of machine learning techniques ranging from simple RNNs to specialised transformers such as BioBERT on a dataset containing clinical notes along with a set of annotations indicating whether a sample is cancer-related or…

    Submitted 7 June, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    MSC Class: 68T50 ACM Class: I.2.7

  23. arXiv:2210.06425  [pdf, other]

    cs.CL cs.LG

    MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, Samaneh Kouchaki, David A. Clifton

    Abstract: Pre-trained Language Models (LMs) have become an integral part of Natural Language Processing (NLP) in recent years, due to their superior performance in downstream applications. In spite of this resounding success, the usability of LMs is constrained by computational and time complexity, along with their increasing size; an issue that has been referred to as 'overparameterisation'. Different stra…

    Submitted 30 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    MSC Class: 68T50 ACM Class: I.2.7

  24. arXiv:2209.13476  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Mine yOur owN Anatomy: Revisiting Medical Image Segmentation with Extremely Limited Labels

    Authors: Chenyu You, Weicheng Dai, Fenglin Liu, Yifei Min, Haoran Su, Xiaoran Zhang, Xiaoxiao Li, David A. Clifton, Lawrence Staib, James S. Duncan

    Abstract: Recent studies on contrastive learning have achieved remarkable performance solely by leveraging few labels in the context of medical image segmentation. Existing methods mainly focus on instance discrimination and invariant mapping. However, they face three common pitfalls: (1) tailness: medical image data usually follows an implicit long-tail class distribution. Blindly leveraging all pixels in…

    Submitted 16 March, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: In this version: Add theoretical analysis and correct some typos

  25. arXiv:2209.03182  [pdf, ps, other]

    cs.CL cs.LG

    On the Effectiveness of Compact Biomedical Transformers

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Samaneh Kouchaki, David A. Clifton

    Abstract: Language models pre-trained on biomedical corpora, such as BioBERT, have recently shown promising results on downstream biomedical tasks. Many existing pre-trained models, on the other hand, are resource-intensive and computationally heavy owing to factors such as embedding size, hidden dimension, and number of layers. The natural language processing (NLP) community has developed numerous strategi…

    Submitted 7 September, 2022; originally announced September 2022.

    MSC Class: 68T50

  26. COPER: Continuous Patient State Perceiver

    Authors: Vinod Kumar Chauhan, Anshul Thakur, Odhran O'Donoghue, David A. Clifton

    Abstract: In electronic health records (EHRs), irregular time-series (ITS) occur naturally due to patient health dynamics, reflected by irregular hospital visits, diseases/conditions and the necessity to measure different vital signs at each visit etc. ITS present challenges in training machine learning algorithms, which are mostly built on the assumption of a coherent fixed-dimensional feature space. In this pap…

    Submitted 24 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 2 figures; presented in IEEE International Conference on Biomedical and Health Informatics (IEEE BHI-2022)

  27. arXiv:2207.11846  [pdf, other]

    cs.LG cs.AI

    Mixture of Input-Output Hidden Markov Models for Heterogeneous Disease Progression Modeling

    Authors: Taha Ceritli, Andrew P. Creagh, David A. Clifton

    Abstract: A particular challenge for disease progression modeling is the heterogeneity of a disease and its manifestations in the patients. Existing approaches often assume the presence of a single disease progression characteristic, which is unlikely for neurodegenerative disorders such as Parkinson's disease. In this paper, we propose a hierarchical time-series model that can discover multiple disease pro…

    Submitted 24 July, 2022; originally announced July 2022.

  28. arXiv:2207.00118  [pdf, other]

    cs.LG cs.AI cs.CV

    ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, Sankha Subhra Mukherjee, David A. Clifton, Neil M. Robertson

    Abstract: There is a family of label modification approaches including self and non-self label correction (LC), and output regularisation. They are widely used for training robust deep neural networks (DNNs), but have not been mathematically and thoroughly analysed together. We study them and discover three key issues: (1) We are more interested in adopting Self LC as it leverages its own knowledge and requ…

    Submitted 6 September, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: To ease the reading, a summary of changes is put in the beginning. Our source code is available at https://github.com/XinshaoAmosWang/ProSelfLC-AT

  29. arXiv:2206.06488  [pdf, other]

    cs.CV cs.LG

    Multimodal Learning with Transformers: A Survey

    Authors: Peng Xu, Xiatian Zhu, David A. Clifton

    Abstract: Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented at multimodal data. The main contents of this survey in…

    Submitted 9 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: This paper is accepted by IEEE TPAMI

  30. arXiv:2206.02909  [pdf, other]

    eess.SP cs.AI cs.LG

    Self-supervised Learning for Human Activity Recognition Using 700,000 Person-days of Wearable Data

    Authors: Hang Yuan, Shing Chan, Andrew P. Creagh, Catherine Tong, Aidan Acquah, David A. Clifton, Aiden Doherty

    Abstract: Advances in deep learning for human activity recognition have been relatively limited due to the lack of large labelled datasets. In this study, we leverage self-supervised learning techniques on the UK-Biobank activity tracker dataset--the largest of its kind to date--containing more than 700,000 person-days of unlabelled wearable sensor data. Our resulting activity recognition model consistently…

    Submitted 20 June, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Journal ref: npj Digit. Med. 7, 91 (2024)

  31. arXiv:2205.12070  [pdf, other]

    cs.LG cs.AI

    Deep Reinforcement Learning for Multi-class Imbalanced Training

    Authors: Jenny Yang, Rasheed El-Bouri, Odhran O'Donoghue, Alexander S. Lachapelle, Andrew A. S. Soltan, David A. Clifton

    Abstract: With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority class. We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets, and extend it for use in m…

    Submitted 24 May, 2022; originally announced May 2022.

  32. arXiv:2204.00556  [pdf, other]

    cs.CL cs.AI

    Nowruz at SemEval-2022 Task 7: Tackling Cloze Tests with Transformers and Ordinal Regression

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, David Clifton

    Abstract: This paper outlines the system with which team Nowruz participated in SemEval 2022 Task 7, Identifying Plausible Clarifications of Implicit and Underspecified Phrases, for both subtasks A and B. Using a pre-trained transformer as a backbone, the model targeted the task of multi-task classification and ranking in the context of finding the best fillers for a cloze task related to instructional texts…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: SemEval 2022

    MSC Class: 68T50 ACM Class: I.2.7

  33. arXiv:2203.16921  [pdf]

    cs.LG cs.CR

    Assessing the risk of re-identification arising from an attack on anonymised data

    Authors: Anna Antoniou, Giacomo Dossena, Julia MacMillan, Steven Hamblin, David Clifton, Paula Petrone

    Abstract: Objective: The use of routinely-acquired medical data for research purposes requires the protection of patient confidentiality via data anonymisation. The objective of this work is to calculate the risk of re-identification arising from a malicious attack on an anonymised dataset, as described below. Methods: We first present an analytical means of estimating the probability of re-identification o…

    Submitted 31 March, 2022; originally announced March 2022.

  34. arXiv:2202.03670  [pdf, other]

    cs.CV cs.LG

    How to Understand Masked Autoencoders

    Authors: Shuhao Cao, Peng Xu, David A. Clifton

    Abstract: "Masked Autoencoders (MAE) Are Scalable Vision Learners" revolutionizes the self-supervised learning method in that it not only achieves the state-of-the-art for image pre-training, but is also a milestone that bridges the gap between visual and linguistic masked autoencoding (BERT-style) pre-trainings. However, to our knowledge, to date there are no theoretical perspectives to explain the powerfu…

    Submitted 9 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  35. arXiv:2201.03004  [pdf, other]

    cs.LG cs.AI cs.CR

    Privacy-aware Early Detection of COVID-19 through Adversarial Training

    Authors: Omid Rohanian, Samaneh Kouchaki, Andrew Soltan, Jenny Yang, Morteza Rohanian, Yang Yang, David Clifton

    Abstract: Early detection of COVID-19 is an ongoing area of research that can help with triage, monitoring and general health assessment of potential patients and may reduce operational strain on hospitals that cope with the coronavirus pandemic. Different machine learning techniques have been used in the literature to detect coronavirus using routine clinical data (blood tests, and vital signs). Data breac…

    Submitted 9 January, 2022; originally announced January 2022.

    ACM Class: J.3

  36. Continual learning of longitudinal health records

    Authors: J. Armstrong, D. Clifton

    Abstract: Continual learning denotes machine learning methods which can adapt to new environments while retaining and reusing knowledge gained from past experiences. Such methods address two issues encountered by models in non-stationary environments: ungeneralisability to new data, and the catastrophic forgetting of previous knowledge when retrained. This is a pervasive problem in clinical settings where p…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 15 pages, 5 figures

    Report number: 9926878

    Journal ref: 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)

  37. arXiv:2107.01707  [pdf, other]

    cs.LG cs.CR cs.DC

    Towards Scheduling Federated Deep Learning using Meta-Gradients for Inter-Hospital Learning

    Authors: Rasheed el-Bouri, Tingting Zhu, David A. Clifton

    Abstract: Given the abundance and ease of access of personal data today, individual privacy has become of paramount importance, particularly in the healthcare domain. In this work, we aim to utilise patient data extracted from multiple hospital data centres to train a machine learning model without sacrificing patient privacy. We develop a scheduling algorithm in conjunction with a student-teacher algorithm…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 11 pages, 8 figures

  38. arXiv:2106.01489  [pdf, other]

    cs.LG cs.AI cs.CV

    Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge

    Authors: Ziyun Li, Xinshao Wang, Di Hu, Neil M. Robertson, David A. Clifton, Christoph Meinel, Haojin Yang

    Abstract: Mutual knowledge distillation (MKD) improves a model by distilling knowledge from another model. However, not all knowledge is certain and correct, especially under adverse conditions. For example, label noise usually leads to less reliable models due to undesired memorization. Wrong knowledge misleads the learning rather than helps. This prob…

    Submitted 16 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2022 Workshop (Trustworthy and Socially Responsible Machine Learning) paper

  39. arXiv:2103.11011  [pdf, other]

    cs.CL cs.AI cs.LG

    Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David Clifton

    Abstract: Cardiac signals, such as the electrocardiogram, convey a significant amount of information about the health status of a patient which is typically summarized by a clinician in the form of a clinical report, a cumbersome process that is prone to errors. To streamline this routine process, we propose a deep neural network capable of captioning cardiac signals; it receives a cardiac signal as input a…

    Submitted 19 March, 2021; originally announced March 2021.

  40. arXiv:2011.14230  [pdf, other]

    eess.SP cs.LG

    CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The process of manually searching for relevant instances in, and extracting information from, clinical databases underpins a multitude of clinical tasks. Such tasks include disease diagnosis, clinical trial recruitment, and continuing medical education. This manual search-and-extract process, however, has been hampered by the growth of large-scale clinical databases and the increased prevalence of…

    Submitted 3 October, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted at Advances in Neural Information Processing Systems (NeurIPS) 2021

  41. arXiv:2011.14227  [pdf, other]

    eess.SP cs.LG

    PCPs: Patient Cardiac Prototypes

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Many clinical deep learning algorithms are population-based and difficult to interpret. Such properties limit their clinical utility as population-based findings may not generalize to individual patients and physicians are reluctant to incorporate opaque models into their clinical workflow. To overcome these obstacles, we propose to learn patient-specific embeddings, entitled patient cardiac proto…

    Submitted 28 November, 2020; originally announced November 2020.

  42. arXiv:2007.01135  [pdf, other]

    cs.LG cs.CV stat.ML

    Student-Teacher Curriculum Learning via Reinforcement Learning: Predicting Hospital Inpatient Admission Location

    Authors: Rasheed el-Bouri, David Eyre, Peter Watkinson, Tingting Zhu, David Clifton

    Abstract: Accurate and reliable prediction of hospital admission location is important due to resource-constraints and space availability in a clinical setting, particularly when dealing with patients who come from the emergency department. In this work we propose a student-teacher network via reinforcement learning to deal with this specific problem. A representation of the weights of the student network i…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 16 pages, 31 figures, In Proceedings of the 37th International Conference on Machine Learning

    MSC Class: 00Bxx ACM Class: I.5

    Journal ref: In Proceedings of the 37th International Conference on Machine Learning, 2020

  43. arXiv:2005.13249  [pdf, other]

    cs.LG eess.SP stat.ML

    CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The healthcare industry generates troves of unlabelled physiological data. This data can be exploited via contrastive learning, a self-supervised pre-training method that encourages representations of instances to be similar to one another. We propose a family of contrastive learning methods, CLOCS, that encourages representations across space, time, and patients to be similar to one anot…

    Submitted 16 May, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Accepted to ICML 2021

  44. arXiv:2005.03788  [pdf, other]

    cs.LG cs.CV stat.ML

    ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson

    Abstract: To train robust deep neural networks (DNNs), we systematically study several target modification approaches, which include output regularisation, self and non-self label correction (LC). Two key issues are discovered: (1) Self LC is the most appealing as it exploits its own knowledge and requires no extra models. However, how to automatically decide the trust degree of a learner as training goes i…

    Submitted 2 June, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: ProSelfLC is the first method to trust self knowledge progressively and adaptively. ProSelfLC redirects and promotes entropy minimisation, which is in marked contrast to recent practices of confidence penalty [42, 33, 6]

    Journal ref: CVPR 2021

  45. arXiv:2004.10468  [pdf, other]

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning in Active Learning

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Large sets of unlabelled data within the healthcare domain remain underutilized. Active learning offers a way to exploit these datasets by iteratively requesting an oracle (e.g. medical professional) to label instances. This process, which can be costly and time-consuming, is overly dependent upon an oracle. To alleviate this burden, we propose SoQal, a questioning strategy that dynamically determi…

    Submitted 22 April, 2020; originally announced April 2020.

  46. arXiv:2004.09578  [pdf, other]

    cs.LG stat.ML

    CLOPS: Continual Learning of Physiological Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Deep learning algorithms are known to experience destructive interference when instances violate the assumption of being independent and identically distributed (i.i.d.). This violation, however, is ubiquitous in clinical settings where data are streamed temporally and from a multitude of physiological sensors. To overcome this obstacle, we propose CLOPS, a replay-based continual learning strategy.…

    Submitted 28 November, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  47. arXiv:2004.09557  [pdf, other]

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Clinical settings are often characterized by abundant unlabelled data and limited labelled data. This is typically driven by the high burden placed on oracles (e.g., physicians) to provide annotations. One way to mitigate this burden is via active learning (AL) which involves the (a) acquisition and (b) annotation of informative unlabelled instances. Whereas previous work addresses either one of t…

    Submitted 18 May, 2022; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: ICML 2022

  48. arXiv:1912.05345  [pdf, other]

    eess.SP cs.CV cs.LG

    Severity Detection Tool for Patients with Infectious Disease

    Authors: Girmaw Abebe Tadesse, Tingting Zhu, Nhan Le Nguyen Thanh, Nguyen Thanh Hung, Ha Thi Hai Duong, Truong Huu Khanh, Pham Van Quang, Duc Duong Tran, Lam Minh Yen, H Rogier Van Doorn, Nguyen Van Hao, John Prince, Hamza Javed, Dani Kiyasseh, Le Van Tan, Louise Thwaites, David A. Clifton

    Abstract: Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous…

    Submitted 10 December, 2019; originally announced December 2019.

  49. arXiv:1912.00354  [pdf, ps, other]

    cs.LG cs.CR stat.ML

    Preserving Patient Privacy while Training a Predictive Model of In-hospital Mortality

    Authors: Pulkit Sharma, Farah E Shamout, David A Clifton

    Abstract: Machine learning models can be used for pattern recognition in medical data in order to improve patient outcomes, such as the prediction of in-hospital mortality. Deep learning models, in particular, require large amounts of data for model training. However, the data is often collected at different hospitals and sharing is restricted due to patient privacy concerns. In this paper, we aimed to demo…

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: AI for Social Good Workshop, NeurIPS 2019, Vancouver, Canada

  50. arXiv:1903.12141  [pdf, other]

    cs.LG cs.CV stat.ML

    IMAE for Noise-Robust Learning: Mean Absolute Error Does Not Treat Examples Equally and Gradient Magnitude's Variance Matters

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson

    Abstract: In this work, we study robust deep learning against abnormal training data from the perspective of example weighting built in empirical loss functions, i.e., gradient magnitude with respect to logits, an angle that is not thoroughly studied so far. Consequently, we have two key findings: (1) Mean Absolute Error (MAE) Does Not Treat Examples Equally. We present new observations and insightful analy…

    Submitted 1 May, 2023; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: ICLR 2023, RTML Workshop paper. The source code will be released and maintained at https://github.com/XinshaoAmosWang/DeepCriticalLearning for academic research use; please cite our work if you use it.