Showing 1–17 of 17 results for author: Little, M A

Searching in archive cs.
  1. arXiv:2405.12237  [pdf, other]

    cs.LG stat.CO stat.ML

    EKM: An exact, polynomial-time algorithm for the $K$-medoids problem

    Authors: Xi He, Max A. Little

    Abstract: The $K$-medoids problem is a challenging combinatorial clustering task, widely used in data analysis applications. While numerous algorithms have been proposed to solve this problem, none of these are able to obtain an exact (globally optimal) solution for the problem in polynomial time. In this paper, we present EKM: a novel algorithm for solving this problem exactly with worst-case…

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2403.09580  [pdf, ps, other]

    cs.AI cs.LG stat.ME

    Algorithmic syntactic causal identification

    Authors: Dhurim Cakiqi, Max A. Little

    Abstract: Causal identification in causal Bayes nets (CBNs) is an important tool in causal inference allowing the derivation of interventional distributions from observational distributions where this is possible in principle. However, most existing formulations of causal identification using techniques such as d-separation and do-calculus are expressed within the mathematical language of classical probabil…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 TikZ figures

  3. arXiv:2401.05159  [pdf, other]

    cs.CV cs.AI

    Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN

    Authors: Muhammad Ali Farooq, Wang Yao, Michael Schukat, Mark A Little, Peter Corcoran

    Abstract: This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data generation plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training. In this context, we aim to incorporate enhanc…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Paper submitted to the EMBC 2024 conference

  4. arXiv:2306.12344  [pdf, other]

    cs.LG cs.DS stat.ML

    An efficient, provably exact, practical algorithm for the 0-1 loss linear classification problem

    Authors: Xi He, Waheed Ul Rahman, Max A. Little

    Abstract: Algorithms for solving the linear classification problem have a long history, dating back at least to 1936 with linear discriminant analysis. For linearly separable data, many algorithms can obtain the exact solution to the corresponding 0-1 loss classification problem efficiently, but for data which is not linearly separable, it has been shown that this problem, in full generality, is NP-hard. Al…

    Submitted 2 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 19 pages, 3 figures

  5. arXiv:2211.11294  [pdf, other]

    cs.DB

    TSDF: A simple yet comprehensive, unified data storage and exchange format standard for digital biosensor data in health applications

    Authors: Kasper Claes, Valentina Ticcinelli, Reham Badawy, Yordan P. Raykov, Luc J. W. Evers, Max A. Little

    Abstract: Digital sensors are increasingly being used to monitor the change over time of physiological processes in biological health and disease, often using wearable devices. This generates very large amounts of digital sensor data, for which, a consensus on a common storage, exchange and archival data format standard, has yet to be reached. To address this gap, we propose Time Series Data Format (TSDF):…

    Submitted 22 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

  6. arXiv:2208.04315  [pdf]

    cs.LG cs.AI

    Patient-Specific Game-Based Transfer Method for Parkinson's Disease Severity Prediction

    Authors: Zaifa Xue, Huibin Lu, Tao Zhang, Max A. Little

    Abstract: Dysphonia is one of the early symptoms of Parkinson's disease (PD). Most existing methods use feature selection methods to find the optimal subset of voice features for all PD patients. Few have considered the heterogeneity between patients, which implies the need to provide specific prediction models for different patients. However, building the specific model faces the challenge of small sample…

    Submitted 12 August, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

  7. arXiv:2107.01752  [pdf, other]

    cs.DS cs.LG math.RA

    Dynamic programming by polymorphic semiring algebraic shortcut fusion

    Authors: Max A. Little, Xi He, Ugur Kayas

    Abstract: Dynamic programming (DP) is an algorithmic design paradigm for the efficient, exact solution of otherwise intractable, combinatorial problems. However, DP algorithm design is often presented in an ad-hoc manner. It is sometimes difficult to justify algorithm correctness. To address this issue, this paper presents a rigorous algebraic formalism for systematically deriving DP algorithms, based on se…

    Submitted 4 January, 2024; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: Updated v22 with revised text

    Journal ref: Formal Aspects of Computing, May 2024

  8. arXiv:2102.03885  [pdf, other]

    cs.LG eess.SP math.ST

    Few-shot time series segmentation using prototype-defined infinite hidden Markov models

    Authors: Yazan Qarout, Yordan P. Raykov, Max A. Little

    Abstract: We propose a robust framework for interpretable, few-shot analysis of non-stationary sequential data based on flexible graphical models to express the structured distribution of sequential events, using prototype radial basis function (RBF) neural network emissions. A motivational link is demonstrated between prototypical neural network architectures for few-shot learning and the proposed RBF netw…

    Submitted 7 February, 2021; originally announced February 2021.

  9. arXiv:2009.01231  [pdf, other]

    eess.AS cs.CY cs.LG cs.SD stat.ML

    Detecting Parkinson's Disease From an Online Speech-task

    Authors: Wasifur Rahman, Sangwu Lee, Md. Saiful Islam, Victor Nikhil Antony, Harshil Ratnu, Mohammad Rafayet Ali, Abdullah Al Mamun, Ellen Wagner, Stella Jensen-Roberts, Max A. Little, Ray Dorsey, Ehsan Hoque

    Abstract: In this paper, we envision a web-based framework that can help anyone, anywhere around the world record a short speech task, and analyze the recorded data to screen for Parkinson's disease (PD). We collected data from 726 unique participants (262 PD, 38% female; 464 non-PD, 65% female; average age: 61) -- from all over the US and beyond. A small portion of the data was collected in a lab setting t…

    Submitted 15 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

  10. arXiv:2006.12369  [pdf, other]

    cs.LG math.ST stat.ML

    Controlling for sparsity in sparse factor analysis models: adaptive latent feature sharing for piecewise linear dimensionality reduction

    Authors: Adam Farooq, Yordan P. Raykov, Petar Raykov, Max A. Little

    Abstract: Ubiquitous linear Gaussian exploratory tools such as principal component analysis (PCA) and factor analysis (FA) remain widely used as tools for: exploratory analysis, pre-processing, data visualization and related tasks. However, due to their rigid assumptions including crowding of high dimensional data, they have been replaced in many settings by more flexible and still interpretable latent feat…

    Submitted 28 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Interactive demo available at https://colab.research.google.com/drive/1KrrHmAu6mV7tutZtYnpEbVibxs4GCwIo?usp=sharing

    ACM Class: I.5.1

  11. arXiv:2004.03047  [pdf, other]

    cs.HC eess.SP

    Probabilistic modelling of gait for robust passive monitoring in daily life

    Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Bastiaan Bloem, Tom M. Heskes, Marjan Meinders, Kasper Claes, Max A. Little

    Abstract: Passive monitoring in daily life may provide invaluable insights about a person's health throughout the day. Wearable sensor devices are likely to play a key role in enabling such monitoring in a non-obtrusive fashion. However, sensor data collected in daily life reflects multiple health and behavior related factors together. This creates the need for structured principled analysis to produce reli…

    Submitted 6 April, 2020; originally announced April 2020.

  12. arXiv:1910.09648  [pdf, other]

    cs.LG math.ST stat.ME

    Causal bootstrapping

    Authors: Max A. Little, Reham Badawy

    Abstract: To draw scientifically meaningful conclusions and build reliable models of quantitative phenomena, cause and effect must be taken into consideration (either implicitly or explicitly). This is particularly challenging when the measurements are not from controlled experimental (interventional) settings, since cause and effect can be obscured by spurious, indirect influences. Modern predictive techni…

    Submitted 9 December, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: 18 pages, 3 figures

  13. arXiv:1905.11785  [pdf, other]

    eess.AS cs.SD

    Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

    Authors: Amir Hossein Poorjam, Mathew Shaji Kavalekalam, Liming Shi, Yordan P. Raykov, Jesper Rindom Jensen, Max A. Little, Mads Græsbøll Christensen

    Abstract: The performance of voice-based Parkinson's disease (PD) detection systems degrades when there is an acoustic mismatch between training and operating conditions caused mainly by degradation in test signals. In this paper, we address this mismatch by considering three types of degradation commonly encountered in remote voice analysis, namely background noise, reverberation and nonlinear distortion,…

    Submitted 31 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Preprint, 12 pages, 6 figures

  14. arXiv:1905.11010  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Adaptive probabilistic principal component analysis

    Authors: Adam Farooq, Yordan P. Raykov, Luc Evers, Max A. Little

    Abstract: Using the linear Gaussian latent variable model as a starting point we relax some of the constraints it imposes by deriving a nonparametric latent feature Gaussian variable model. This model introduces additional discrete latent variables to the original structure. The Bayesian nonparametric nature of this new model allows it to adapt complexity as more data is observed and project each data point…

    Submitted 27 May, 2019; originally announced May 2019.

  15. arXiv:1905.08557  [pdf, other]

    cs.SD cs.LG eess.AS

    Bayesian Pitch Tracking Based on the Harmonic Model

    Authors: Liming Shi, Jesper Kjaer Nielsen, Jesper Rindom Jensen, Max A. Little, Mads Graesboll Christensen

    Abstract: Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model o…

    Submitted 21 May, 2019; originally announced May 2019.

  16. arXiv:1601.00960  [pdf, other]

    cs.CY

    High Frequency Remote Monitoring of Parkinson's Disease via Smartphone: Platform Overview and Medication Response Detection

    Authors: Andong Zhan, Max A. Little, Denzil A. Harris, Solomon O. Abiola, E. Ray Dorsey, Suchi Saria, Andreas Terzis

    Abstract: Objective: The aim of this study is to develop a smartphone-based high-frequency remote monitoring platform, assess its feasibility for remote monitoring of symptoms in Parkinson's disease, and demonstrate the value of data collected using the platform by detecting dopaminergic medication response. Methods: We have developed HopkinsPD, a novel smartphone-based monitoring platform, which measures s…

    Submitted 5 January, 2016; originally announced January 2016.

  17. arXiv:1304.1209  [pdf, other]

    physics.data-an cs.CV physics.bio-ph q-bio.QM stat.ML

    Highly comparative time-series analysis: The empirical structure of time series and their methods

    Authors: Ben D. Fulcher, Max A. Little, Nick S. Jones

    Abstract: The process of collecting and organizing sets of observations represents a common theme throughout the history of science. However, despite the ubiquity of scientists measuring, recording, and analyzing the dynamics of different processes, an extensive organization of scientific time-series data and analysis methods has never been performed. Addressing this, annotated collections of over 35 000 re…

    Submitted 3 April, 2013; originally announced April 2013.

    Journal ref: J. R. Soc. Interface vol. 10 no. 83 20130048 (2013)