Showing 1–50 of 67 results for author: Little, A

  1. arXiv:2405.12237  [pdf, other]

    cs.LG stat.CO stat.ML

    EKM: An exact, polynomial-time algorithm for the $K$-medoids problem

    Authors: Xi He, Max A. Little

    Abstract: The $K$-medoids problem is a challenging combinatorial clustering task, widely used in data analysis applications. While numerous algorithms have been proposed to solve this problem, none of these are able to obtain an exact (globally optimal) solution for the problem in polynomial time. In this paper, we present EKM: a novel algorithm for solving this problem exactly with worst-case…

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2403.09580  [pdf, ps, other]

    cs.AI cs.LG stat.ME

    Algorithmic syntactic causal identification

    Authors: Dhurim Cakiqi, Max A. Little

    Abstract: Causal identification in causal Bayes nets (CBNs) is an important tool in causal inference allowing the derivation of interventional distributions from observational distributions where this is possible in principle. However, most existing formulations of causal identification using techniques such as d-separation and do-calculus are expressed within the mathematical language of classical probabil…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 TikZ figures

  3. arXiv:2402.14276  [pdf, other]

    eess.SP cs.IT

    Bispectrum Unbiasing for Dilation-Invariant Multi-reference Alignment

    Authors: Liping Yin, Anna Little, Matthew Hirn

    Abstract: Motivated by modern data applications such as cryo-electron microscopy, the goal of classic multi-reference alignment (MRA) is to recover an unknown signal $f: \mathbb{R} \to \mathbb{R}$ from many observations that have been randomly translated and corrupted by additive noise. We consider a generalization of classic MRA where signals are also corrupted by a random scale change, i.e. dilation. We p…

    Submitted 21 February, 2024; originally announced February 2024.

  4. arXiv:2401.05159  [pdf, other]

    cs.CV cs.AI

    Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN

    Authors: Muhammad Ali Farooq, Wang Yao, Michael Schukat, Mark A Little, Peter Corcoran

    Abstract: This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data generation plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training. In this context, we aim to incorporate enhanc…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Paper is submitted in EMBC 2024 Conference

  5. arXiv:2307.05750  [pdf, other]

    stat.ML cs.DS cs.LG math.DG

    Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

    Authors: Nicolás García Trillos, Anna Little, Daniel McKenzie, James M. Murphy

    Abstract: We analyze the convergence properties of Fermat distances, a family of density-driven metrics defined on Riemannian manifolds with an associated probability measure. Fermat distances may be defined either on discrete samples from the underlying measure, in which case they are random, or in the continuum setting, in which they are induced by geodesics under a density-distorted Riemannian metric. We…

    Submitted 7 July, 2023; originally announced July 2023.

  6. arXiv:2306.14107  [pdf, ps, other]

    math-ph

    A Riemann-Hilbert approach to skew-orthogonal polynomials of symplectic type

    Authors: Alex Little

    Abstract: We present a representation of skew-orthogonal polynomials of symplectic type ($β= 4$) in terms of a matrix Riemann-Hilbert problem, for weights of the form $e^{-V(z)}$ where $V$ is a polynomial of even degree and positive leading coefficient. This is done by representing skew-orthogonality as a kind of multiple-orthogonality. From this we derive a $β= 4$ analogue of the Christoffel-Darboux formul…

    Submitted 29 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: 38 pages, 2 figures

    MSC Class: 35Q15 (Primary); 15A52 (Secondary)

  7. arXiv:2306.12344  [pdf, other]

    cs.LG cs.DS stat.ML

    An efficient, provably exact, practical algorithm for the 0-1 loss linear classification problem

    Authors: Xi He, Waheed Ul Rahman, Max A. Little

    Abstract: Algorithms for solving the linear classification problem have a long history, dating back at least to 1936 with linear discriminant analysis. For linearly separable data, many algorithms can obtain the exact solution to the corresponding 0-1 loss classification problem efficiently, but for data which is not linearly separable, it has been shown that this problem, in full generality, is NP-hard. Al…

    Submitted 2 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 19 pages, 3 figures

  8. arXiv:2306.03173  [pdf, other]

    cs.LG

    Linear Distance Metric Learning with Noisy Labels

    Authors: Meysam Alishahi, Anna Little, Jeff M. Phillips

    Abstract: In linear distance metric learning, we are given data in one Euclidean metric space and the goal is to find an appropriate linear map to another Euclidean metric space which respects certain distance conditions as much as possible. In this paper, we formalize a simple and elegant method which reduces to a general continuous convex loss optimization problem, and for different noise models we derive…

    Submitted 20 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 52 pages

  9. arXiv:2303.10014  [pdf, ps, other]

    physics.med-ph q-bio.TO

    Reliability of Tumour Classification from Multi-Dimensional DCE-MRI Variables using Data Transformations

    Authors: S. V. Notley, N. A. Thacker, L. Horsley, R. A. Little, Y. Watson, S. Mullamitha, G. C. Jayson, A. Jackson

    Abstract: Summary mean DCE-MRI variables show a clear dependency between signal and noise variance, which can be shown to reduce the effectiveness of difference assessments. Appropriate transformation of these variables supports statistically efficient and robust comparisons. The capabilities of DCE-MRI based descriptions of hepatic colorectal tumour classification were assessed, with regard to their potenti…

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 18 pages and 6 figures

    MSC Class: 92-08

  10. arXiv:2212.00525  [pdf, other]

    math-ph math.PR nlin.SI

    The complex elliptic Ginibre ensemble at weak non-Hermiticity: bulk spacing distributions

    Authors: Thomas Bothner, Alex Little

    Abstract: We show that the distribution of bulk spacings between pairs of adjacent eigenvalue real parts of a random matrix drawn from the complex elliptic Ginibre ensemble is asymptotically given by a generalization of the Gaudin-Mehta distribution, in the limit of weak non-Hermiticity. The same generalization is expressed in terms of an integro-differential Painlevé function and it is shown that the gener…

    Submitted 12 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 39 pages, 3 figures. Version 2 corrects typos

    MSC Class: 60B20 (Primary); 60G55; 33E17; 47B35 (Secondary)

  11. arXiv:2211.11294  [pdf, other]

    cs.DB

    TSDF: A simple yet comprehensive, unified data storage and exchange format standard for digital biosensor data in health applications

    Authors: Kasper Claes, Valentina Ticcinelli, Reham Badawy, Yordan P. Raykov, Luc J. W. Evers, Max A. Little

    Abstract: Digital sensors are increasingly being used to monitor the change over time of physiological processes in biological health and disease, often using wearable devices. This generates very large amounts of digital sensor data, for which, a consensus on a common storage, exchange and archival data format standard, has yet to be reached. To address this gap, we propose Time Series Data Format (TSDF):…

    Submitted 22 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

  12. On Generalizations of the Nonwindowed Scattering Transform

    Authors: Albert Chua, Matthew Hirn, Anna Little

    Abstract: In this paper, we generalize finite depth wavelet scattering transforms, which we formulate as $\mathbf{L}^q(\mathbb{R}^n)$ norms of a cascade of continuous wavelet transforms (or dyadic wavelet transforms) and contractive nonlinearities. We then provide norms for these operators, prove that these operators are well-defined, and are Lipschitz continuous to the action of $C^2$ diffeomorphisms in specific…

    Submitted 13 September, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: Corrected small typos throughout. Diffeomorphism stability definition in introduction was changed. The mapping Phi isn't necessarily from V to V, which is reflected in the results we found

  13. arXiv:2208.04684  [pdf, other]

    math-ph math.CA math.PR nlin.SI

    The complex elliptic Ginibre ensemble at weak non-Hermiticity: edge spacing distributions

    Authors: Thomas Bothner, Alex Little

    Abstract: The focus of this paper is on the distribution function of the rightmost eigenvalue for the complex elliptic Ginibre ensemble in the limit of weak non-Hermiticity. We show how the limiting distribution function can be expressed in terms of an integro-differential Painlevé-II function and how the same captures the non-trivial transition between Poisson and Airy point process extreme value statistic…

    Submitted 1 March, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: 62 pages, 11 figures. Version 2 corrects typos and updates literature

    MSC Class: Primary 45B05; Secondary 47B35; 35Q15; 30E25; 37J35; 70H06; 34E05

  14. arXiv:2208.04315  [pdf]

    cs.LG cs.AI

    Patient-Specific Game-Based Transfer Method for Parkinson's Disease Severity Prediction

    Authors: Zaifa Xue, Huibin Lu, Tao Zhang, Max A. Little

    Abstract: Dysphonia is one of the early symptoms of Parkinson's disease (PD). Most existing methods use feature selection methods to find the optimal subset of voice features for all PD patients. Few have considered the heterogeneity between patients, which implies the need to provide specific prediction models for different patients. However, building the specific model faces the challenge of small sample…

    Submitted 12 August, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

  15. Promotheus: An End-to-End Machine Learning Framework for Optimizing Markdown in Online Fashion E-commerce

    Authors: Eleanor Loh, Jalaj Khandelwal, Brian Regan, Duncan A. Little

    Abstract: Managing discount promotional events ("markdown") is a significant part of running an e-commerce business, and inefficiencies here can significantly hamper a retailer's profitability. Traditional approaches for tackling this problem rely heavily on price elasticity modelling. However, the partial information nature of price elasticity modelling, together with the non-negotiable responsibility for…

    Submitted 12 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: 11 pages; Accepted at KDD 2022

    MSC Class: 68T01 (Primary) 90C29; 9108 (Secondary) ACM Class: I.2.8; G.3; G.4

  16. arXiv:2206.07729  [pdf, other]

    cs.LG

    Taxonomy of Benchmarks in Graph Representation Learning

    Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Sarah McGuire, Xinyi Wang, Anna Little, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

    Abstract: Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on developing GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to w…

    Submitted 30 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: In Proceedings of the First Learning on Graphs Conference (LoG 2022)

  17. arXiv:2110.14809  [pdf, other]

    cs.LG

    Towards a Taxonomy of Graph Learning Datasets

    Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Dylan Sandfelder, Devin Kreuzer, Anna Little, Sarah McGuire, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

    Abstract: Graph neural networks (GNNs) have attracted much attention due to their ability to leverage the intrinsic geometries of the underlying data. Although many different types of GNN models have been developed, with many benchmarking procedures to demonstrate the superiority of one GNN model over the others, there is a lack of systematic understanding of the underlying benchmarking datasets, and what a…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: in Data-Centric AI Workshop at NeurIPS 2021

  18. Limit on the Electric Charge of Antihydrogen

    Authors: A. Capra, C. Amole, M. D. Ashkezari, M. Baquero-Ruiz, W. Bertsche, E. Butler, C. L. Cesar, M. Charlton, S. Eriksson, J. Fajans, T. Friesen, M. C. Fujiwara, D. R. Gill, A. Gutierrez, J. S. Hangst, W. N. Hardy, M. E. Hayden, C. A. Isaac, S. Jonsell, L. Kurchaninov, A. Little, J. T. K. McKenna, S. Menary, S. C. Napoli, P. Nolan, et al. (15 additional authors not shown)

    Abstract: The ALPHA collaboration has successfully demonstrated the production and the confinement of cold antihydrogen, $\overline{\mathrm{H}}$. An analysis of trapping data allowed a stringent limit to be placed on the electric charge of the simplest antiatom. Charge neutrality of matter is known to a very high precision, hence a neutrality limit of $\overline{\mathrm{H}}$ provides a test of CPT invarianc…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 5 pages, 3 figures

    Journal ref: Hyperfine Interact 238, 9 (2017)

  19. arXiv:2107.01752  [pdf, other]

    cs.DS cs.LG math.RA

    Dynamic programming by polymorphic semiring algebraic shortcut fusion

    Authors: Max A. Little, Xi He, Ugur Kayas

    Abstract: Dynamic programming (DP) is an algorithmic design paradigm for the efficient, exact solution of otherwise intractable, combinatorial problems. However, DP algorithm design is often presented in an ad-hoc manner. It is sometimes difficult to justify algorithm correctness. To address this issue, this paper presents a rigorous algebraic formalism for systematically deriving DP algorithms, based on se…

    Submitted 4 January, 2024; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: Updated v22 with revised text

    Journal ref: Formal Aspects of Computing, May 2024

  20. arXiv:2107.01274  [pdf, other]

    eess.SP cs.IT

    Unbiasing Procedures for Scale-invariant Multi-reference Alignment

    Authors: Matthew Hirn, Anna Little

    Abstract: This article discusses a generalization of the 1-dimensional multi-reference alignment problem. The goal is to recover a hidden signal from many noisy observations, where each noisy observation includes a random translation and random dilation of the hidden signal, as well as high additive noise. We propose a method that recovers the power spectrum of the hidden signal by applying a data-driven, n…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 12 pages, 5 figures. Code reproducing numerical results at https://bitbucket.org/annavlittle/inversion-unbiasing/src/master/

  21. arXiv:2104.04432  [pdf]

    stat.ME stat.AP

    A Case Study of Nonresponse Bias Analysis In Educational Assessment Surveys

    Authors: Yajuan Si, Roderick J. A. Little, Ya Mo, Nell Sedransk

    Abstract: Nonresponse bias is a widely prevalent problem for data on education. We develop a ten-step exemplar to guide nonresponse bias analysis (NRBA) in cross-sectional studies and apply these steps to the Early Childhood Longitudinal Study, Kindergarten Class of 2010-11. A key step is the construction of indices of nonresponse bias based on proxy pattern-mixture models for survey variables of interest.…

    Submitted 25 July, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

  22. arXiv:2102.08842  [pdf, ps, other]

    math.PR math-ph

    On the number of real eigenvalues of a product of truncated orthogonal random matrices

    Authors: Alex Little, Francesco Mezzadri, Nick Simm

    Abstract: Let $O$ be chosen uniformly at random from the group of $(N+L) \times (N+L)$ orthogonal matrices. Denote by $\tilde{O}$ the upper-left $N \times N$ corner of $O$, which we refer to as a truncation of $O$. In this paper we prove two conjectures of Forrester, Ipsen and Kumar (2020) on the number of real eigenvalues $N^{(m)}_{\mathbb{R}}$ of the product matrix $\tilde{O}_{1}\ldots \tilde{O}_{m}$, whe…

    Submitted 17 February, 2021; originally announced February 2021.

  23. arXiv:2102.03885  [pdf, other]

    cs.LG eess.SP math.ST

    Few-shot time series segmentation using prototype-defined infinite hidden Markov models

    Authors: Yazan Qarout, Yordan P. Raykov, Max A. Little

    Abstract: We propose a robust framework for interpretable, few-shot analysis of non-stationary sequential data based on flexible graphical models to express the structured distribution of sequential events, using prototype radial basis function (RBF) neural network emissions. A motivational link is demonstrated between prototypical neural network architectures for few-shot learning and the proposed RBF netw…

    Submitted 7 February, 2021; originally announced February 2021.

  24. arXiv:2012.09385  [pdf, other]

    stat.ML cs.DS cs.LG

    Balancing Geometry and Density: Path Distances on High-Dimensional Data

    Authors: Anna Little, Daniel McKenzie, James Murphy

    Abstract: New geometric and computational analyses of power-weighted shortest-path distances (PWSPDs) are presented. By illuminating the way these metrics balance density and geometry in the underlying data, we clarify their key parameters and discuss how they may be chosen in practice. Comparisons are made with related data-driven metrics, which illustrate the broader role of density in kernel-based unsupe…

    Submitted 7 June, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    MSC Class: 05C85; 05C80 ACM Class: I.5.3

  25. arXiv:2009.01231  [pdf, other]

    eess.AS cs.CY cs.LG cs.SD stat.ML

    Detecting Parkinson's Disease From an Online Speech-task

    Authors: Wasifur Rahman, Sangwu Lee, Md. Saiful Islam, Victor Nikhil Antony, Harshil Ratnu, Mohammad Rafayet Ali, Abdullah Al Mamun, Ellen Wagner, Stella Jensen-Roberts, Max A. Little, Ray Dorsey, Ehsan Hoque

    Abstract: In this paper, we envision a web-based framework that can help anyone, anywhere around the world record a short speech task, and analyze the recorded data to screen for Parkinson's disease (PD). We collected data from 726 unique participants (262 PD, 38% female; 464 non-PD, 65% female; average age: 61) -- from all over the US and beyond. A small portion of the data was collected in a lab setting t…

    Submitted 15 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

  26. Crystallography companion agent for high-throughput materials discovery

    Authors: Phillip M. Maffettone, Lars Banko, Peng Cui, Yury Lysogorskiy, Marc A. Little, Daniel Olds, Alfred Ludwig, Andrew I. Cooper

    Abstract: The discovery of new structural and functional materials is driven by phase identification, often using X-ray diffraction (XRD). Automation has accelerated the rate of XRD measurements, greatly outpacing XRD analysis techniques that remain manual, time-consuming, error-prone, and impossible to scale. With the advent of autonomous robotic scientists or self-driving labs, contemporary techniques pro…

    Submitted 17 March, 2021; v1 submitted 1 August, 2020; originally announced August 2020.

    Comments: For associated code, see https://github.com/maffettone/xca

    Journal ref: Nat. Comput. Sci. 1, 290 (2021)

  27. arXiv:2006.12369  [pdf, other]

    cs.LG math.ST stat.ML

    Controlling for sparsity in sparse factor analysis models: adaptive latent feature sharing for piecewise linear dimensionality reduction

    Authors: Adam Farooq, Yordan P. Raykov, Petar Raykov, Max A. Little

    Abstract: Ubiquitous linear Gaussian exploratory tools such as principal component analysis (PCA) and factor analysis (FA) remain widely used as tools for: exploratory analysis, pre-processing, data visualization and related tasks. However, due to their rigid assumptions including crowding of high dimensional data, they have been replaced in many settings by more flexible and still interpretable latent feat…

    Submitted 28 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Interactive demo available at https://colab.research.google.com/drive/1KrrHmAu6mV7tutZtYnpEbVibxs4GCwIo?usp=sharing

    ACM Class: I.5.1

  28. arXiv:2004.06139  [pdf]

    stat.ME stat.AP

    Assessing Selection Bias in Regression Coefficients Estimated from Non-Probability Samples, with Applications to Genetics and Demographic Surveys

    Authors: Brady T. West, Roderick J. A. Little, Rebecca R. Andridge, Philip S. Boonstra, Erin B. Ware, Anita Pandit, Fernanda Alvarado-Leiton

    Abstract: Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers, and (b) estimated differences in subgroup means in surveys of smartphone…

    Submitted 8 March, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: 29 pages, 4 figures, 2 tables, supplementary material

  29. arXiv:2004.03047  [pdf, other]

    cs.HC eess.SP

    Probabilistic modelling of gait for robust passive monitoring in daily life

    Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Bastiaan Bloem, Tom M. Heskes, Marjan Meinders, Kasper Claes, Max A. Little

    Abstract: Passive monitoring in daily life may provide invaluable insights about a person's health throughout the day. Wearable sensor devices are likely to play a key role in enabling such monitoring in a non-obtrusive fashion. However, sensor data collected in daily life reflects multiple health and behavior related factors together. This creates the need for structured principled analysis to produce reli…

    Submitted 6 April, 2020; originally announced April 2020.

  30. arXiv:1910.09648  [pdf, other]

    cs.LG math.ST stat.ME

    Causal bootstrapping

    Authors: Max A. Little, Reham Badawy

    Abstract: To draw scientifically meaningful conclusions and build reliable models of quantitative phenomena, cause and effect must be taken into consideration (either implicitly or explicitly). This is particularly challenging when the measurements are not from controlled experimental (interventional) settings, since cause and effect can be obscured by spurious, indirect influences. Modern predictive techni…

    Submitted 9 December, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: 18 pages, 3 figures

  31. arXiv:1909.11062  [pdf, other]

    eess.SP math.ST

    Wavelet invariants for statistically robust multi-reference alignment

    Authors: Matthew Hirn, Anna Little

    Abstract: We propose a nonlinear, wavelet based signal representation that is translation invariant and robust to both additive noise and random dilations. Motivated by the multi-reference alignment problem and generalizations thereof, we analyze the statistical properties of this representation given a large number of independent corruptions of a target signal. We prove the nonlinear wavelet based represen…

    Submitted 13 July, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: 59 pages, 8 figures. v3 replaces v2 and is an extensive revision. Revisions include additional background and motivation, additional context relating the approach to other methods, a discussion of stability, and improved presentation. Code reproducing all numerical results is available at https://bitbucket.org/annavlittle/code_wavelet_invariants/

    MSC Class: 62

  32. arXiv:1908.00657  [pdf]

    cond-mat.str-el cond-mat.mes-hall

    Observation of three-state nematicity in the triangular lattice antiferromagnet Fe$_{1/3}$NbS$_2$

    Authors: Arielle Little, Changmin Lee, Caolan John, Spencer Doyle, Eran Maniv, Nityan L. Nair, Wenqin Chen, Dylan Rees, Jörn W. F. Venderbos, Rafael Fernandes, James G. Analytis, Joseph Orenstein

    Abstract: Nematic order is the breaking of rotational symmetry in the presence of translational invariance. While originally defined in the context of liquid crystals, the concept of nematic order has arisen in crystalline matter with discrete rotational symmetry, most prominently in the tetragonal Fe-based superconductors where the parent state is four-fold symmetric. In this case the nematic director take…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: The main text is 16 pages, including 5 figures and references. Supplementary information is appended at the end of the article

    Journal ref: Nature Materials (2020)

  33. arXiv:1905.11785  [pdf, other]

    eess.AS cs.SD

    Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

    Authors: Amir Hossein Poorjam, Mathew Shaji Kavalekalam, Liming Shi, Yordan P. Raykov, Jesper Rindom Jensen, Max A. Little, Mads Græsbøll Christensen

    Abstract: The performance of voice-based Parkinson's disease (PD) detection systems degrades when there is an acoustic mismatch between training and operating conditions caused mainly by degradation in test signals. In this paper, we address this mismatch by considering three types of degradation commonly encountered in remote voice analysis, namely background noise, reverberation and nonlinear distortion,…

    Submitted 31 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Preprint, 12 pages, 6 figures

  34. arXiv:1905.11010  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Adaptive probabilistic principal component analysis

    Authors: Adam Farooq, Yordan P. Raykov, Luc Evers, Max A. Little

    Abstract: Using the linear Gaussian latent variable model as a starting point we relax some of the constraints it imposes by deriving a nonparametric latent feature Gaussian variable model. This model introduces additional discrete latent variables to the original structure. The Bayesian nonparametric nature of this new model allows it to adapt complexity as more data is observed and project each data point…

    Submitted 27 May, 2019; originally announced May 2019.

  35. arXiv:1905.08557  [pdf, other]

    cs.SD cs.LG eess.AS

    Bayesian Pitch Tracking Based on the Harmonic Model

    Authors: Liming Shi, Jesper Kjaer Nielsen, Jesper Rindom Jensen, Max A. Little, Mads Graesboll Christensen

    Abstract: Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model o…

    Submitted 21 May, 2019; originally announced May 2019.

  36. arXiv:1812.11954  [pdf, other]

    math.ST stat.ML

    Exact Cluster Recovery via Classical Multidimensional Scaling

    Authors: Anna Little, Yuying Xie, Qiang Sun

    Abstract: Classical multidimensional scaling is an important dimension reduction technique. Yet few theoretical results characterizing its statistical performance exist. This paper provides a theoretical framework for analyzing the quality of embedded samples produced by classical multidimensional scaling. This lays the foundation for various downstream statistical analyses, and we focus on clustering noisy…

    Submitted 7 July, 2020; v1 submitted 31 December, 2018; originally announced December 2018.

    Comments: 42 pages including appendix

  37. arXiv:1812.02585  [pdf, other]

    eess.SP math.PR

    Probabilistic modelling of gait for remote passive monitoring applications

    Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Marjan J. Faber, Bastiaan R. Bloem, Kasper Claes, Max A. Little

    Abstract: Passive and non-obtrusive health monitoring using wearables can potentially bring new insights into the user's health status throughout the day and may support clinical diagnosis and treatment. However, identifying segments of free-living data that sufficiently reflect the user's health is challenging. In this work we have studied the problem of modelling real-life gait which is a very indicative…

    Submitted 30 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/153

  38. arXiv:1810.08807  [pdf]

    stat.AP

    Investigating Voice as a Biomarker for leucine-rich repeat kinase 2-Associated Parkinson's Disease

    Authors: S. Arora, N. P. Visanji, T. A. Mestre, A. Tsanas, A. AlDakheel, B. S. Connolly, C. Gasca-Salas, D. S. Kern, J. Jain, E. J. Slow, A. Faust-Socher, A. E. Lang, M. A. Little, C. Marras

    Abstract: We investigate the potential association between leucine-rich repeat kinase 2 (LRRK2) mutations and voice. Sustained phonations ('aaah' sounds) were recorded from 7 individuals with LRRK2-associated Parkinson's disease (PD), 17 participants with idiopathic PD (iPD), 20 non-manifesting LRRK2-mutation carriers, 25 related non-carriers, and 26 controls. In distinguishing LRRK2-associated PD and iPD,…

    Submitted 20 October, 2018; originally announced October 2018.

    Comments: 27 pages including supplemental information, Journal of Parkinson's Disease, 2018

  39. arXiv:1807.11062  [pdf]

    physics.ed-ph

    Exploring Mindset's Applicability to Students' Experiences with Challenge in Transformed College Physics Courses

    Authors: Angela Little, Bridget Humphrey, Abigail Green, Abhilash Nair, Vashti Sawtelle

    Abstract: The mindset literature is a longstanding area of psychological research focused on beliefs about intelligence, response to challenge, and goals for learning (Dweck, 2000). However, the mindset literature's applicability to the context of college physics has not been widely studied. In this paper we narrow our focus toward students' descriptions of their responses to challenge in college physics. W…

    Submitted 29 July, 2018; originally announced July 2018.

  40. arXiv:1807.04098  [pdf, other]

    cs.LG cs.CY cs.IR cs.NE stat.ML

    A Recurrent Neural Network Survival Model: Predicting Web User Return Time

    Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

    Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both…

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

  41. arXiv:1806.03966  [pdf, other]

    astro-ph.GA astro-ph.SR physics.atom-ph

    Theoretical study of ArH+ dissociative recombination and electron-impact vibrational excitation

    Authors: A. Abdoulanziz, F. Colboc, D. A. Little, Y. Moulane, J. Zs. Mezei, E. Roueff, J. Tennyson, I. F. Schneider, V. Laporta

    Abstract: Cross sections are presented for dissociative recombination and electron-impact vibrational excitation of the ArH+ molecular ion at electron energies appropriate for the interstellar environment. The R-matrix method is employed to determine the molecular structure data, i.e. the position and width of the resonance states. The cross sections and the corresponding Maxwellian rate coefficients are co…

    Submitted 24 June, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

    Comments: 6 pages; 7 figures

    Journal ref: MNRAS, 479, 2415-2420 (2018)

  42. Field evolution of magnons in $α$-RuCl$_3$ by high-resolution polarized terahertz spectroscopy

    Authors: Liang Wu, Arielle Little, Erik E. Aldape, Dylan Rees, Eric Thewalt, Paula Lampen-Kelley, Arnab Banerjee, Craig A. Bridges, Jiaqiang Yan, Derrick Boone, Shreyas Patankar, David Goldhaber-Gordon, David Mandrus, Stephen E. Nagler, Ehud Altman, Joseph Orenstein

    Abstract: The Kitaev quantum spin liquid (KSL) is a theoretically predicted state of matter whose fractionalized quasiparticles are distinct from bosonic magnons, the fundamental excitation in ordered magnets. The layered honeycomb antiferromagnet $α$-RuCl$_3$ is a KSL candidate material, as it can be driven to a magnetically disordered phase by application of an in-plane magnetic field, with $H_c \sim 7$ T…

    Submitted 5 October, 2018; v1 submitted 3 June, 2018; originally announced June 2018.

    Comments: 8 pages, 5 figures in the main text. Appendices are included at the end of the article

    Journal ref: Phys. Rev. B 98, 094425 (2018)

  43. arXiv:1712.06206  [pdf, other]

    stat.ML

    Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

    Authors: Anna Little, Mauro Maggioni, James M. Murphy

    Abstract: We consider the problem of clustering with the longest-leg path distance (LLPD) metric, which is informative for elongated and irregularly shaped clusters. We prove finite-sample guarantees on the performance of clustering with respect to this metric when random samples are drawn from multiple intrinsically low-dimensional clusters in high-dimensional space, in the presence of a large number of hi…

    Submitted 6 March, 2019; v1 submitted 17 December, 2017; originally announced December 2017.

    Comments: 59 pages, 12 figures

  44. Multiple core hole formation by free-electron laser radiation in molecular nitrogen

    Authors: Henry I B Banks, Duncan A Little, Agapi Emmanouilidou

    Abstract: We investigate the formation of multiple-core-hole states of molecular nitrogen interacting with a free-electron laser pulse. We obtain bound and continuum molecular orbitals in the single-center expansion scheme and use these orbitals to calculate photo-ionization and Auger decay rates. Using these rates, we compute the atomic ion yields generated in this interaction. We track the population of a…

    Submitted 8 January, 2018; v1 submitted 17 December, 2017; originally announced December 2017.

  45. arXiv:1711.07557  [pdf, other]

    eess.SP

    A unified algorithm framework for quality control of sensor data for behavioural clinimetric testing

    Authors: Reham Badawy, Yordan P. Raykov, Max A. Little

    Abstract: The use of smartphone and wearable sensing technology for objective, non-invasive and remote clinimetric testing of symptoms has considerable potential. However, the clinimetric accuracy achievable with such technology is highly reliant on separating the useful from irrelevant or confounded sensor data. Monitoring patient symptoms using digital sensors outside of controlled, clinical lab settings…

    Submitted 23 November, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

  46. arXiv:1710.08972  [pdf]

    physics.ed-ph

    On the Importance of Engaging Students in Crafting Definitions

    Authors: Angela Little, Leslie Atkins Elliott

    Abstract: In this paper we describe an activity for engaging students in crafting definitions. We explore the strengths of this particular activity as well as the broader implications of engaging students in crafting definitions more generally.

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: 6 pages

  47. Imaging anomalous nematic order and strain in optimally doped BaFe$_2$(As,P)$_2$

    Authors: Eric Thewalt, Ian M. Hayes, James P. Hinton, Arielle Little, Shreyas Patankar, Liang Wu, Toni Helm, Camelia V. Stan, Nobumichi Tamura, James G. Analytis, Joseph Orenstein

    Abstract: We present the strain and temperature dependence of an anomalous nematic phase in optimally doped BaFe$_2$(As,P)$_2$. Polarized ultrafast optical measurements reveal broken 4-fold rotational symmetry in a temperature range above $T_c$ in which bulk probes do not detect a phase transition. Using ultrafast microscopy, we find that the magnitude and sign of this nematicity vary on a ${50{-}100}~μ$m l…

    Submitted 13 September, 2017; originally announced September 2017.

    Journal ref: Phys. Rev. Lett. 121, 027001 (2018)

  48. arXiv:1706.09865  [pdf, other]

    stat.ML cs.CY cs.LG

    Generalising Random Forest Parameter Optimisation to Include Stability and Cost

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

    Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest…

    Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear in ECML-PKDD 2017

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

  49. Antiferromagnetic resonance and terahertz continuum in $α$-RuCl$_3$

    Authors: A. Little, Liang Wu, P. Lampen-Kelley, A. Banerjee, S. Patankar, D. Rees, C. A. Bridges, J. -Q. Yan, D. Mandrus, S. E. Nagler, J. Orenstein

    Abstract: We report measurements of optical absorption in the zig-zag antiferromagnet $α$-RuCl$_3$ as a function of temperature, $T$, magnetic field, $B$, and photon energy, $\hbar ω$, in the range $\sim$ 0.3 to 8.3 meV, using time-domain terahertz spectroscopy. Polarized measurements show that 3-fold rotational symmetry is broken in the honeycomb plane from 2 K to 300 K. We find a sharp absorption peak at 2.…

    Submitted 9 October, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: 5 pages, 3 figures in the main text. To appear in Phys. Rev. Lett., magnetic field data included. Supplementary information also included

    Journal ref: Phys. Rev. Lett. 119, 227201 (2017)

  50. arXiv:1704.03444  [pdf, ps, other]

    physics.atom-ph

    Interaction of molecular nitrogen with Free-Electron-Laser radiation

    Authors: H. I. B. Banks, D. A. Little, J. Tennyson, A. Emmanouilidou

    Abstract: We compute molecular continuum orbitals in the single center expansion scheme. We then employ these orbitals to obtain molecular Auger rates and single-photon ionization cross sections to study the interaction of N2 with Free-Electron-Laser (FEL) pulses. The nuclei are kept fixed. We formulate rate equations for the energetically allowed molecular and atomic transitions and we account for dissocia…

    Submitted 11 April, 2017; originally announced April 2017.