Skip to main content

Showing 1–29 of 29 results for author: Spathis, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02361  [pdf, other

    cs.LG

    Using Self-supervised Learning Can Improve Model Fairness

    Authors: Sofia Yfantidou, Dimitris Spathis, Marios Constantinides, Athena Vakali, Daniele Quercia, Fahim Kawsar

    Abstract: Self-supervised learning (SSL) has become the de facto training paradigm of large models, where pre-training is followed by supervised fine-tuning using domain-specific data and labels. Despite demonstrating comparable performance with supervised methods, comprehensive efforts to assess SSL's impact on machine learning fairness (i.e., performing equally on different demographic breakdowns) are lac… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.01640

  2. arXiv:2403.10561   

    cs.LG cs.AI

    A collection of the accepted papers for the Human-Centric Representation Learning workshop at AAAI 2024

    Authors: Dimitris Spathis, Aaqib Saeed, Ali Etemad, Sana Tonekaboni, Stefanos Laskaridis, Shohreh Deldari, Chi Ian Tang, Patrick Schwab, Shyam Tailor

    Abstract: This non-archival index is not complete, as some accepted papers chose to opt-out of inclusion. The list of all accepted papers is available on the workshop website.

    Submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2401.14107  [pdf, other

    cs.LG eess.SP

    Learning under Label Noise through Few-Shot Human-in-the-Loop Refinement

    Authors: Aaqib Saeed, Dimitris Spathis, Jungwoo Oh, Edward Choi, Ali Etemad

    Abstract: Wearable technologies enable continuous monitoring of various health metrics, such as physical activity, heart rate, sleep, and stress levels. A key challenge with wearable data is obtaining quality labels. Unlike modalities like video where the videos themselves can be effectively used to label objects or events, wearable data do not contain obvious cues about the physical manifestation of the us… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  4. arXiv:2401.02255  [pdf, other

    cs.LG eess.SP

    Balancing Continual Learning and Fine-tuning for Human Activity Recognition

    Authors: Chi Ian Tang, Lorena Qendro, Dimitris Spathis, Fahim Kawsar, Akhil Mathur, Cecilia Mascolo

    Abstract: Wearable-based Human Activity Recognition (HAR) is a key task in human-centric machine learning due to its fundamental understanding of human behaviours. Due to the dynamic nature of human behaviours, continual learning promises HAR systems that are tailored to users' needs. However, because of the difficulty in collecting labelled data with wearable sensors, existing approaches that focus on supe… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: AAAI 2024 HCRL (Human-Centric Representation Learning) Workshop

  5. arXiv:2401.01640  [pdf, other

    cs.LG cs.CY

    Evaluating Fairness in Self-supervised and Supervised Models for Sequential Data

    Authors: Sofia Yfantidou, Dimitris Spathis, Marios Constantinides, Athena Vakali, Daniele Quercia, Fahim Kawsar

    Abstract: Self-supervised learning (SSL) has become the de facto training paradigm of large models where pre-training is followed by supervised fine-tuning using domain-specific data and labels. Hypothesizing that SSL models would learn more generic, hence less biased, representations, this study explores the impact of pre-training and fine-tuning strategies on fairness (i.e., performing equally on differen… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

  6. arXiv:2309.12877  [pdf, ps, other

    cs.CY cs.ET cs.LG

    FairComp: Workshop on Fairness and Robustness in Machine Learning for Ubiquitous Computing

    Authors: Sofia Yfantidou, Dimitris Spathis, Marios Constantinides, Tong Xia, Niels van Berkel

    Abstract: How can we ensure that Ubiquitous Computing (UbiComp) research outcomes are both ethical and fair? While fairness in machine learning (ML) has gained traction in recent years, fairness in UbiComp remains unexplored. This workshop aims to discuss fairness in UbiComp research and its social, technical, and legal implications. From a social perspective, we will examine the relationship between fairne… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Journal ref: Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing (UbiComp/ISWC '23 Adjunct )

  7. arXiv:2309.06236  [pdf, other

    cs.LG cs.CL

    The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models

    Authors: Dimitris Spathis, Fahim Kawsar

    Abstract: Large Language Models (LLMs) have demonstrated remarkable generalization across diverse tasks, leading individuals to increasingly use them as personal assistants and universal computing engines. Nevertheless, a notable obstacle emerges when feeding numerical/temporal data into these models, such as data sourced from wearables or electronic health records. LLMs employ tokenizers in their input tha… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted at the Generative AI for Pervasive Computing Symposium (GenAI4PC) at UbiComp 2023

  8. arXiv:2307.16847  [pdf, other

    cs.LG

    CroSSL: Cross-modal Self-Supervised Learning for Time-series through Latent Masking

    Authors: Shohreh Deldari, Dimitris Spathis, Mohammad Malekzadeh, Fahim Kawsar, Flora Salim, Akhil Mathur

    Abstract: Limited availability of labeled data for machine learning on multimodal time-series extensively hampers progress in the field. Self-supervised learning (SSL) is a promising approach to learning data representations without relying on labels. However, existing SSL methods require expensive computations of negative pairs and are typically designed for single modalities, which limits their versatilit… ▽ More

    Submitted 19 February, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: Accepted in WSDM24. Short version presented in ML4MHD @ICML23

  9. arXiv:2307.16651  [pdf, other

    cs.LG

    UDAMA: Unsupervised Domain Adaptation through Multi-discriminator Adversarial Training with Noisy Labels Improves Cardio-fitness Prediction

    Authors: Yu Wu, Dimitris Spathis, Hong Jia, Ignacio Perez-Pozuelo, Tomas Gonzales, Soren Brage, Nicholas Wareham, Cecilia Mascolo

    Abstract: Deep learning models have shown great promise in various healthcare monitoring applications. However, most healthcare datasets with high-quality (gold-standard) labels are small-scale, as directly collecting ground truth is often costly and time-consuming. As a result, models developed and validated on small-scale datasets often suffer from overfitting and do not generalize well to unseen scenario… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Accepted at Machine Learning for Healthcare (MLHC) 2023

  10. The State of Algorithmic Fairness in Mobile Human-Computer Interaction

    Authors: Sofia Yfantidou, Marios Constantinides, Dimitris Spathis, Athena Vakali, Daniele Quercia, Fahim Kawsar

    Abstract: This paper explores the intersection of Artificial Intelligence and Machine Learning (AI/ML) fairness and mobile human-computer interaction (MobileHCI). Through a comprehensive analysis of MobileHCI proceedings published between 2017 and 2022, we first aim to understand the current state of algorithmic fairness in the community. By manually analyzing 90 papers, we found that only a small portion (… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.15585

    Journal ref: 25th International Conference on Mobile Human-Computer Interaction (MobileHCI '23 Companion), September 26--29, 2023, Athens, Greece

  11. arXiv:2303.17235  [pdf, other

    cs.LG

    Kaizen: Practical Self-supervised Continual Learning with Continual Fine-tuning

    Authors: Chi Ian Tang, Lorena Qendro, Dimitris Spathis, Fahim Kawsar, Cecilia Mascolo, Akhil Mathur

    Abstract: Self-supervised learning (SSL) has shown remarkable performance in computer vision tasks when trained offline. However, in a Continual Learning (CL) scenario where new data is introduced progressively, models still suffer from catastrophic forgetting. Retraining a model from scratch to adapt to newly generated data is time-consuming and inefficient. Previous approaches suggested re-purposing self-… ▽ More

    Submitted 7 February, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Presented at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024. The code for this work is available at https://github.com/dr-bell/kaizen

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 2841-2850

  12. arXiv:2303.15585  [pdf, other

    cs.CY cs.HC cs.LG

    Beyond Accuracy: A Critical Review of Fairness in Machine Learning for Mobile and Wearable Computing

    Authors: Sofia Yfantidou, Marios Constantinides, Dimitris Spathis, Athena Vakali, Daniele Quercia, Fahim Kawsar

    Abstract: The field of mobile and wearable computing is undergoing a revolutionary integration of machine learning. Devices can now diagnose diseases, predict heart irregularities, and unlock the full potential of human cognition. However, the underlying algorithms powering these predictions are not immune to biases with respect to sensitive attributes (e.g., gender, race), leading to discriminatory outcome… ▽ More

    Submitted 22 September, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  13. arXiv:2211.10475  [pdf, other

    eess.SP cs.LG

    Turning Silver into Gold: Domain Adaptation with Noisy Labels for Wearable Cardio-Respiratory Fitness Prediction

    Authors: Yu Wu, Dimitris Spathis, Hong Jia, Ignacio Perez-Pozuelo, Tomas I. Gonzales, Soren Brage, Nicholas Wareham, Cecilia Mascolo

    Abstract: Deep learning models have shown great promise in various healthcare applications. However, most models are developed and validated on small-scale datasets, as collecting high-quality (gold-standard) labels for health applications is often costly and time-consuming. As a result, these models may suffer from overfitting and not generalize well to unseen data. At the same time, an extensive amount of… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 5 pages

  14. arXiv:2205.13398  [pdf, other

    cs.LG

    Looking for Out-of-Distribution Environments in Multi-center Critical Care Data

    Authors: Dimitris Spathis, Stephanie L. Hyland

    Abstract: Clinical machine learning models show a significant performance drop when tested in settings not seen during training. Domain generalisation models promise to alleviate this problem, however, there is still scepticism about whether they improve over traditional training. In this work, we take a principled approach to identifying Out of Distribution (OoD) environments, motivated by the problem of c… ▽ More

    Submitted 11 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 17 pages

  15. Longitudinal cardio-respiratory fitness prediction through wearables in free-living environments

    Authors: Dimitris Spathis, Ignacio Perez-Pozuelo, Tomas I. Gonzales, Yu Wu, Soren Brage, Nicholas Wareham, Cecilia Mascolo

    Abstract: Cardiorespiratory fitness is an established predictor of metabolic disease and mortality. Fitness is directly measured as maximal oxygen consumption (VO$_{2}max$), or indirectly assessed using heart rate responses to standard exercise tests. However, such testing is costly and burdensome because it requires specialized equipment such as treadmills and oxygen masks, limiting its utility. Modern wea… ▽ More

    Submitted 24 October, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted in Nature Digital Medicine, 16 pages

  16. arXiv:2202.08981  [pdf, other

    cs.SD cs.LG eess.AS

    A Summary of the ComParE COVID-19 Challenges

    Authors: Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, **g Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller

    Abstract: The COVID-19 pandemic has caused massive humanitarian and economic damage. Teams of scientists from a broad range of disciplines have searched for methods to help governments and communities combat the disease. One avenue from the machine learning field which has been explored is the prospect of a digital mass test which can detect COVID-19 from infected individuals' respiratory sounds. We present… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 18 pages, 13 figures

  17. arXiv:2201.01232  [pdf

    cs.SD cs.LG eess.AS

    Exploring Longitudinal Cough, Breath, and Voice Data for COVID-19 Progression Prediction via Sequential Deep Learning: Model Development and Validation

    Authors: Ting Dang, **g Han, Tong Xia, Dimitris Spathis, Erika Bondareva, Chloë Siegele-Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Andres Floto, Pietro Cicuta, Cecilia Mascolo

    Abstract: Recent work has shown the potential of using audio data (eg, cough, breathing, and voice) in the screening for COVID-19. However, these approaches only focus on one-off detection and detect the infection given the current audio sample, but do not monitor disease progression in COVID-19. Limited exploration has been put forward to continuously monitor COVID-19 progression, especially recovery, thro… ▽ More

    Submitted 22 June, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: Updated title. Revised format according to journal requirements

  18. arXiv:2111.07089  [pdf, other

    cs.LG eess.SP

    Evaluating Contrastive Learning on Wearable Timeseries for Downstream Clinical Outcomes

    Authors: Kevalee Shah, Dimitris Spathis, Chi Ian Tang, Cecilia Mascolo

    Abstract: Vast quantities of person-generated health data (wearables) are collected but the process of annotating to feed to machine learning models is impractical. This paper discusses ways in which self-supervised approaches that use contrastive losses, such as SimCLR and BYOL, previously applied to the vision domain, can be applied to high-dimensional health signals for downstream classification tasks of… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  19. arXiv:2106.15523  [pdf, other

    cs.SD cs.LG eess.AS

    Sounds of COVID-19: exploring realistic performance of audio-based digital testing

    Authors: **g Han, Tong Xia, Dimitris Spathis, Erika Bondareva, Chloë Brown, Jagmohan Chauhan, Ting Dang, Andreas Grammenos, Apinan Hasthanasombat, Andres Floto, Pietro Cicuta, Cecilia Mascolo

    Abstract: Researchers have been battling with the question of how we can identify Coronavirus disease (COVID-19) cases efficiently, affordably and at scale. Recent work has shown how audio based approaches, which collect respiratory audio data (cough, breathing and voice) can be used for testing, however there is a lack of exploration of how biases and methodological decisions impact these tools' performanc… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  20. Anticipatory Detection of Compulsive Body-focused Repetitive Behaviors with Wearables

    Authors: Benjamin Lucas Searle, Dimitris Spathis, Marios Constantinides, Daniele Quercia, Cecilia Mascolo

    Abstract: Body-focused repetitive behaviors (BFRBs), like face-touching or skin-picking, are hand-driven behaviors which can damage one's appearance, if not identified early and treated. Technology for automatic detection is still under-explored, with few previous works being limited to wearables with single modalities (e.g., motion). Here, we propose a multi-sensory approach combining motion, orientation,… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted to ACM MobileHCI 2021 (20 pages, dataset/code: https://github.com/Bhorda/BFRBAnticipationDataset)

  21. arXiv:2102.13468  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates

    Authors: Björn W. Schuller, Anton Batliner, Christian Bergler, Cecilia Mascolo, **g Han, Iulia Lefter, Heysem Kaya, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Maurice Gerczuk, Panagiotis Tzirakis, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Leon J. M. Rothkrantz, Joeri Zwerts, Jelle Treep, Casper Kaandorp

    Abstract: The INTERSPEECH 2021 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the COVID-19 Cough and COVID-19 Speech Sub-Challenges, a binary classification on COVID-19 infection has to be made based on coughing sounds and speech; in the Escalation SubChallenge, a three-way assessment of the level of es… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: 5 pages

    MSC Class: 68 ACM Class: I.2.7; I.5.0; J.3

  22. SelfHAR: Improving Human Activity Recognition through Self-training with Unlabeled Data

    Authors: Chi Ian Tang, Ignacio Perez-Pozuelo, Dimitris Spathis, Soren Brage, Nick Wareham, Cecilia Mascolo

    Abstract: Machine learning and deep learning have shown great promise in mobile sensing applications, including Human Activity Recognition. However, the performance of such models in real-world settings largely depends on the availability of large datasets that captures diverse behaviors. Recently, studies in computer vision and natural language processing have shown that leveraging massive amounts of unlab… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 2021

  23. Exploring Automatic COVID-19 Diagnosis via voice and symptoms from Crowdsourced Data

    Authors: **g Han, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo

    Abstract: The development of fast and accurate screening tools, which could facilitate testing and prevent more costly clinical tests, is key to the current pandemic of COVID-19. In this context, some initial work shows promise in detecting diagnostic signals of COVID-19 from audio sounds. In this paper, we propose a voice-based framework to automatically detect individuals who have tested positive for COVI… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 5 pages, 3 figures, 2 tables, Accepted for publication at ICASSP 2021

  24. arXiv:2011.12121  [pdf, other

    eess.SP cs.CY cs.LG

    Self-supervised transfer learning of physiological representations from free-living wearable data

    Authors: Dimitris Spathis, Ignacio Perez-Pozuelo, Soren Brage, Nicholas J. Wareham, Cecilia Mascolo

    Abstract: Wearable devices such as smartwatches are becoming increasingly popular tools for objectively monitoring physical activity in free-living conditions. To date, research has primarily focused on the purely supervised task of human activity recognition, demonstrating limited success in inferring high-level health outcomes from low-level signals. Here, we present a novel self-supervised representation… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 9 pages, 3 figures (long version of extended abstract arXiv:2011.04601)

  25. arXiv:2011.11542  [pdf, other

    cs.LG eess.SP

    Exploring Contrastive Learning in Human Activity Recognition for Healthcare

    Authors: Chi Ian Tang, Ignacio Perez-Pozuelo, Dimitris Spathis, Cecilia Mascolo

    Abstract: Human Activity Recognition (HAR) constitutes one of the most important tasks for wearable and mobile sensing given its implications in human well-being and health monitoring. Motivated by the limitations of labeled datasets in HAR, particularly when employed in healthcare-related applications, this work explores the adoption and adaptation of SimCLR, a contrastive learning technique for visual rep… ▽ More

    Submitted 11 February, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Presented at Machine Learning for Mobile Health Workshop at NeurIPS 2020, Vancouver, Canada

  26. arXiv:2011.04601  [pdf, other

    cs.LG cs.AI cs.CY

    Learning Generalizable Physiological Representations from Large-scale Wearable Data

    Authors: Dimitris Spathis, Ignacio Perez-Pozuelo, Soren Brage, Nicholas J. Wareham, Cecilia Mascolo

    Abstract: To date, research on sensor-equipped mobile devices has primarily focused on the purely supervised task of human activity recognition (walking, running, etc), demonstrating limited success in inferring high-level health outcomes from low-level signals, such as acceleration. Here, we present a novel self-supervised representation learning method using activity and heart rate (HR) signals without se… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted to the Machine Learning for Mobile Health workshop at NeurIPS 2020

  27. arXiv:2006.05919  [pdf, other

    cs.SD cs.LG eess.AS

    Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data

    Authors: Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, **g Han, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo

    Abstract: Audio signals generated by the human body (e.g., sighs, breathing, heart, digestion, vibration sounds) have routinely been used by clinicians as indicators to diagnose disease or assess disease progression. Until recently, such signals were usually collected through manual auscultation at scheduled visits. Research has now started to use digital technology to gather bodily sounds (e.g., from digit… ▽ More

    Submitted 18 January, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures, 2 tables, Accepted for publication at KDD'20 (Health Day)

  28. Interactive dimensionality reduction using similarity projections

    Authors: Dimitris Spathis, Nikolaos Passalis, Anastasios Tefas

    Abstract: Recent advances in machine learning allow us to analyze and describe the content of high-dimensional data like text, audio, images or other signals. In order to visualize that data in 2D or 3D, usually Dimensionality Reduction (DR) techniques are employed. Most of these techniques, e.g., PCA or t-SNE, produce static projections without taking into account corrections from humans or other data expl… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: Accepted at Knowledge-Based Systems

  29. arXiv:1612.06259  [pdf, other

    cs.CV

    Photo-Quality Evaluation based on Computational Aesthetics: Review of Feature Extraction Techniques

    Authors: Dimitris Spathis

    Abstract: Researchers try to model the aesthetic quality of photographs into low and high- level features, drawing inspiration from art theory, psychology and marketing. We attempt to describe every feature extraction measure employed in the above process. The contribution of this literature review is the taxonomy of each feature by its implementation complexity, considering real-world applications and inte… ▽ More

    Submitted 19 December, 2016; originally announced December 2016.