Search | arXiv e-print repository

EKM: An exact, polynomial-time algorithm for the $K$-medoids problem

Abstract: The $K$-medoids problem is a challenging combinatorial clustering task, widely used in data analysis applications. While numerous algorithms have been proposed to solve this problem, none of these are able to obtain an exact (globally optimal) solution for the problem in polynomial time. In this paper, we present EKM: a novel algorithm for solving this problem exactly with worst-case… ▽ More The $K$-medoids problem is a challenging combinatorial clustering task, widely used in data analysis applications. While numerous algorithms have been proposed to solve this problem, none of these are able to obtain an exact (globally optimal) solution for the problem in polynomial time. In this paper, we present EKM: a novel algorithm for solving this problem exactly with worst-case $O\left(N^{K+1}\right)$ time complexity. EKM is developed according to recent advances in transformational programming and combinatorial generation, using formal program derivation steps. The derived algorithm is provably correct by construction. We demonstrate the effectiveness of our algorithm by comparing it against various approximate methods on numerous real-world datasets. We show that the wall-clock run time of our algorithm matches the worst-case time complexity analysis on synthetic datasets, clearly outperforming the exponential time complexity of benchmark branch-and-bound based MIP solvers. To our knowledge, this is the first, rigorously-proven polynomial time, practical algorithm for this ubiquitous problem. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2403.09580 [pdf, ps, other]

Algorithmic syntactic causal identification

Authors: Dhurim Cakiqi, Max A. Little

Abstract: Causal identification in causal Bayes nets (CBNs) is an important tool in causal inference allowing the derivation of interventional distributions from observational distributions where this is possible in principle. However, most existing formulations of causal identification using techniques such as d-separation and do-calculus are expressed within the mathematical language of classical probabil… ▽ More Causal identification in causal Bayes nets (CBNs) is an important tool in causal inference allowing the derivation of interventional distributions from observational distributions where this is possible in principle. However, most existing formulations of causal identification using techniques such as d-separation and do-calculus are expressed within the mathematical language of classical probability theory on CBNs. However, there are many causal settings where probability theory and hence current causal identification techniques are inapplicable such as relational databases, dataflow programs such as hardware description languages, distributed systems and most modern machine learning algorithms. We show that this restriction can be lifted by replacing the use of classical probability theory with the alternative axiomatic foundation of symmetric monoidal categories. In this alternative axiomatization, we show how an unambiguous and clean distinction can be drawn between the general syntax of causal models and any specific semantic implementation of that causal model. This allows a purely syntactic algorithmic description of general causal identification by a translation of recent formulations of the general ID algorithm through fixing. Our description is given entirely in terms of the non-parametric ADMG structure specifying a causal model and the algebraic signature of the corresponding monoidal category, to which a sequence of manipulations is then applied so as to arrive at a modified monoidal category in which the desired, purely syntactic interventional causal model, is obtained. We use this idea to derive purely syntactic analogues of classical back-door and front-door causal adjustment, and illustrate an application to a more complex causal model. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: 11 pages, 2 TikZ figures

arXiv:2402.14276 [pdf, other]

Bispectrum Unbiasing for Dilation-Invariant Multi-reference Alignment

Authors: Li** Yin, Anna Little, Matthew Hirn

Abstract: Motivated by modern data applications such as cryo-electron microscopy, the goal of classic multi-reference alignment (MRA) is to recover an unknown signal $f: \mathbb{R} \to \mathbb{R}$ from many observations that have been randomly translated and corrupted by additive noise. We consider a generalization of classic MRA where signals are also corrupted by a random scale change, i.e. dilation. We p… ▽ More Motivated by modern data applications such as cryo-electron microscopy, the goal of classic multi-reference alignment (MRA) is to recover an unknown signal $f: \mathbb{R} \to \mathbb{R}$ from many observations that have been randomly translated and corrupted by additive noise. We consider a generalization of classic MRA where signals are also corrupted by a random scale change, i.e. dilation. We propose a novel data-driven unbiasing procedure which can recover an unbiased estimator of the bispectrum of the unknown signal, given knowledge of the dilation distribution. Lastly, we invert the recovered bispectrum to achieve full signal recovery, and validate our methodology on a set of synthetic signals. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2401.05159 [pdf, other]

Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN

Authors: Muhammad Ali Farooq, Wang Yao, Michael Schukat, Mark A Little, Peter Corcoran

Abstract: This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data generation plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training. In this context, we aim to incorporate enhanc… ▽ More This study explores the utilization of Dermatoscopic synthetic data generated through stable diffusion models as a strategy for enhancing the robustness of machine learning model training. Synthetic data generation plays a pivotal role in mitigating challenges associated with limited labeled datasets, thereby facilitating more effective model training. In this context, we aim to incorporate enhanced data transformation techniques by extending the recent success of few-shot learning and a small amount of data representation in text-to-image latent diffusion models. The optimally tuned model is further used for rendering high-quality skin lesion synthetic data with diverse and realistic characteristics, providing a valuable supplement and diversity to the existing training data. We investigate the impact of incorporating newly generated synthetic data into the training pipeline of state-of-art machine learning models, assessing its effectiveness in enhancing model performance and generalization to unseen real-world data. Our experimental results demonstrate the efficacy of the synthetic data generated through stable diffusion models helps in improving the robustness and adaptability of end-to-end CNN and vision transformer models on two different real-world skin lesion datasets. △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: Paper is submitted in EMBC 2024 Conference

arXiv:2307.05750 [pdf, other]

Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

Authors: Nicolás García Trillos, Anna Little, Daniel McKenzie, James M. Murphy

Abstract: We analyze the convergence properties of Fermat distances, a family of density-driven metrics defined on Riemannian manifolds with an associated probability measure. Fermat distances may be defined either on discrete samples from the underlying measure, in which case they are random, or in the continuum setting, in which they are induced by geodesics under a density-distorted Riemannian metric. We… ▽ More We analyze the convergence properties of Fermat distances, a family of density-driven metrics defined on Riemannian manifolds with an associated probability measure. Fermat distances may be defined either on discrete samples from the underlying measure, in which case they are random, or in the continuum setting, in which they are induced by geodesics under a density-distorted Riemannian metric. We prove that discrete, sample-based Fermat distances converge to their continuum analogues in small neighborhoods with a precise rate that depends on the intrinsic dimensionality of the data and the parameter governing the extent of density weighting in Fermat distances. This is done by leveraging novel geometric and statistical arguments in percolation theory that allow for non-uniform densities and curved domains. Our results are then used to prove that discrete graph Laplacians based on discrete, sample-driven Fermat distances converge to corresponding continuum operators. In particular, we show the discrete eigenvalues and eigenvectors converge to their continuum analogues at a dimension-dependent rate, which allows us to interpret the efficacy of discrete spectral clustering using Fermat distances in terms of the resulting continuum limit. The perspective afforded by our discrete-to-continuum Fermat distance analysis leads to new clustering algorithms for data and related insights into efficient computations associated to density-driven spectral clustering. Our theoretical analysis is supported with numerical simulations and experiments on synthetic and real image data. △ Less

Submitted 7 July, 2023; originally announced July 2023.

arXiv:2306.14107 [pdf, ps, other]

A Riemann-Hilbert approach to skew-orthogonal polynomials of symplectic type

Authors: Alex Little

Abstract: We present a representation of skew-orthogonal polynomials of symplectic type ($β= 4$) in terms of a matrix Riemann-Hilbert problem, for weights of the form $e^{-V(z)}$ where $V$ is a polynomial of even degree and positive leading coefficient. This is done by representing skew-orthogonality as a kind of multiple-orthogonality. From this we derive a $β= 4$ analogue of the Christoffel-Darboux formul… ▽ More We present a representation of skew-orthogonal polynomials of symplectic type ($β= 4$) in terms of a matrix Riemann-Hilbert problem, for weights of the form $e^{-V(z)}$ where $V$ is a polynomial of even degree and positive leading coefficient. This is done by representing skew-orthogonality as a kind of multiple-orthogonality. From this we derive a $β= 4$ analogue of the Christoffel-Darboux formula. Finally, our Riemann-Hilbert representation allows us to derive a Lax pair whose compatibility condition may be viewed as a $β= 4$ analogue of the Toda lattice. △ Less

Submitted 29 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

Comments: 38 pages, 2 figures

MSC Class: 35Q15 (Primary); 15A52 (Secondary)

arXiv:2306.12344 [pdf, other]

An efficient, provably exact, practical algorithm for the 0-1 loss linear classification problem

Authors: Xi He, Waheed Ul Rahman, Max A. Little

Abstract: Algorithms for solving the linear classification problem have a long history, dating back at least to 1936 with linear discriminant analysis. For linearly separable data, many algorithms can obtain the exact solution to the corresponding 0-1 loss classification problem efficiently, but for data which is not linearly separable, it has been shown that this problem, in full generality, is NP-hard. Al… ▽ More Algorithms for solving the linear classification problem have a long history, dating back at least to 1936 with linear discriminant analysis. For linearly separable data, many algorithms can obtain the exact solution to the corresponding 0-1 loss classification problem efficiently, but for data which is not linearly separable, it has been shown that this problem, in full generality, is NP-hard. Alternative approaches all involve approximations of some kind, including the use of surrogates for the 0-1 loss (for example, the hinge or logistic loss) or approximate combinatorial search, none of which can be guaranteed to solve the problem exactly. Finding efficient algorithms to obtain an exact i.e. globally optimal solution for the 0-1 loss linear classification problem with fixed dimension, remains an open problem. In research we report here, we detail the rigorous construction of a new algorithm, incremental cell enumeration (ICE), that can solve the 0-1 loss classification problem exactly in polynomial time. We prove correctness using concepts from the theory of hyperplane arrangements and oriented matroids. We demonstrate the effectiveness of this algorithm on synthetic and real-world datasets, showing optimal accuracy both in and out-of-sample, in practical computational time. We also empirically demonstrate how the use of approximate upper bound leads to polynomial time run-time improvements to the algorithm whilst retaining exactness. To our knowledge, this is the first, rigorously-proven polynomial time, practical algorithm for this long-standing problem. △ Less

Submitted 2 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: 19 pages, 3 figures

arXiv:2306.03173 [pdf, other]

Linear Distance Metric Learning with Noisy Labels

Authors: Meysam Alishahi, Anna Little, Jeff M. Phillips

Abstract: In linear distance metric learning, we are given data in one Euclidean metric space and the goal is to find an appropriate linear map to another Euclidean metric space which respects certain distance conditions as much as possible. In this paper, we formalize a simple and elegant method which reduces to a general continuous convex loss optimization problem, and for different noise models we derive… ▽ More In linear distance metric learning, we are given data in one Euclidean metric space and the goal is to find an appropriate linear map to another Euclidean metric space which respects certain distance conditions as much as possible. In this paper, we formalize a simple and elegant method which reduces to a general continuous convex loss optimization problem, and for different noise models we derive the corresponding loss functions. We show that even if the data is noisy, the ground truth linear metric can be learned with any precision provided access to enough samples, and we provide a corresponding sample complexity bound. Moreover, we present an effective way to truncate the learned model to a low-rank model that can provably maintain the accuracy in loss function and in parameters -- the first such results of this type. Several experimental observations on synthetic and real data sets support and inform our theoretical results. △ Less

Submitted 20 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: 52 pages

arXiv:2303.10014 [pdf, ps, other]

Reliability of Tumour Classification from Multi-Dimensional DCE-MRI Variables using Data Transformations

Authors: S. V. Notley, N. A. Thacker, L. Horsley, R. A. Little, Y. Watson, S. Mullamitha, G. C. Jayson, A. Jackson

Abstract: Summary mean DCE-MRI variables show a clear dependency between signal and noise variance, which can be shown to reduce the effectiveness of difference assessments. Appropriate transformation of these variables supports statistically efficient and robust comparisons. The capabilities of DCE-MRI based descriptions of hepatic colorectal tumour classification was assessed, with regard to their potenti… ▽ More Summary mean DCE-MRI variables show a clear dependency between signal and noise variance, which can be shown to reduce the effectiveness of difference assessments. Appropriate transformation of these variables supports statistically efficient and robust comparisons. The capabilities of DCE-MRI based descriptions of hepatic colorectal tumour classification was assessed, with regard to their potential for use as imaging biomarkers. Four DCE-MRI parameters were extracted from 102 selected tumour regions. A multi-dimensional statistical distance metric was assessed for the challenging task of comparing intra- and inter- subject tumour differences. Statistical errors were estimated using bootstrap resampling. The potential for tumour classification was assessed via Monte Carlo simulation. Transformation of the variables and fusion into a single chi-squared statistic shows that inter subject variation in hepatic tumours is measurable and significantly greater than intra-subject variation at the group level. However, reliability analysis shows that, at current noise levels, individual tumour assessment is not possible. Appropriate data transforms for DCE-MRI derived parameters produce an improvement in statistical sensitivity compared to conventional approaches. Reliability analysis shows, that even with data transformation, DCI-MRI variables do not currently facilitate good tumour discrimination and a doubling of SNR is needed to support non-trivial levels of classification △ Less

Submitted 17 March, 2023; originally announced March 2023.

Comments: 18 pages and 6 figures

MSC Class: 92-08

arXiv:2212.00525 [pdf, other]

The complex elliptic Ginibre ensemble at weak non-Hermiticity: bulk spacing distributions

Authors: Thomas Bothner, Alex Little

Abstract: We show that the distribution of bulk spacings between pairs of adjacent eigenvalue real parts of a random matrix drawn from the complex elliptic Ginibre ensemble is asymptotically given by a generalization of the Gaudin-Mehta distribution, in the limit of weak non-Hermiticity. The same generalization is expressed in terms of an integro-differential Painlevé function and it is shown that the gener… ▽ More We show that the distribution of bulk spacings between pairs of adjacent eigenvalue real parts of a random matrix drawn from the complex elliptic Ginibre ensemble is asymptotically given by a generalization of the Gaudin-Mehta distribution, in the limit of weak non-Hermiticity. The same generalization is expressed in terms of an integro-differential Painlevé function and it is shown that the generalized Gaudin-Mehta distribution describes the crossover, with increasing degree of non-Hermiticity, from Gaudin-Mehta nearest-neighbor bulk statistics in the Gaussian Unitary Ensemble to Poisson gap statistics for eigenvalue real parts in the bulk of the Complex Ginibre Ensemble. △ Less

Submitted 12 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

Comments: 39 pages, 3 figures. Version 2 corrects typos

MSC Class: 60B20 (Primary); 60G55; 33E17; 47B35 (Secondary)

arXiv:2211.11294 [pdf, other]

TSDF: A simple yet comprehensive, unified data storage and exchange format standard for digital biosensor data in health applications

Authors: Kasper Claes, Valentina Ticcinelli, Reham Badawy, Yordan P. Raykov, Luc J. W. Evers, Max A. Little

Abstract: Digital sensors are increasingly being used to monitor the change over time of physiological processes in biological health and disease, often using wearable devices. This generates very large amounts of digital sensor data, for which, a consensus on a common storage, exchange and archival data format standard, has yet to be reached. To address this gap, we propose Time Series Data Format (TSDF):… ▽ More Digital sensors are increasingly being used to monitor the change over time of physiological processes in biological health and disease, often using wearable devices. This generates very large amounts of digital sensor data, for which, a consensus on a common storage, exchange and archival data format standard, has yet to be reached. To address this gap, we propose Time Series Data Format (TSDF): a unified, standardized format for storing all types of physiological sensor data, across diverse disease areas. We pose a series of format design criteria and review in detail current storage and exchange formats. When judged against these criteria, we find these current formats lacking, and propose a very simple, intuitive standard for both numerical sensor data and metadata, based on raw binary data and JSON-format text files, for sensor measurements/timestamps and metadata, respectively. By focusing on the common characteristics of diverse biosensor data, we define a set of necessary and sufficient metadata fields for storing, processing, exchanging, archiving and reliably interpreting, multi-channel biological time series data. Our aim is for this standardized format to increase the interpretability and exchangeability of data, thereby contributing to scientific reproducibility in studies where digital biosensor data forms a key evidence base. △ Less

Submitted 22 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

arXiv:2209.05038 [pdf, ps, other]

doi 10.1016/j.acha.2023.101597

On Generalizations of the Nonwindowed Scattering Transform

Authors: Albert Chua, Matthew Hirn, Anna Little

Abstract: In this paper, we generalize finite depth wavelet scattering transforms, which we formulate as $\Lb^q(\mathbb{R}^n)$ norms of a cascade of continuous wavelet transforms (or dyadic wavelet transforms) and contractive nonlinearities. We then provide norms for these operators, prove that these operators are well-defined, and are Lipschitz continuous to the action of $C^2$ diffeomorphisms in specific… ▽ More In this paper, we generalize finite depth wavelet scattering transforms, which we formulate as $\Lb^q(\mathbb{R}^n)$ norms of a cascade of continuous wavelet transforms (or dyadic wavelet transforms) and contractive nonlinearities. We then provide norms for these operators, prove that these operators are well-defined, and are Lipschitz continuous to the action of $C^2$ diffeomorphisms in specific cases. Lastly, we extend our results to formulate an operator invariant to the action of rotations $R \in \text{SO}(n)$ and an operator that is equivariant to the action of rotations of $R \in \text{SO}(n)$. △ Less

Submitted 13 September, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

Comments: Corrected small typos throughout. Diffeomorphism stability definition in introduction was changed. The map** Phi isn't necessarily from V to V, which is reflected in the results we found

arXiv:2208.04684 [pdf, other]

The complex elliptic Ginibre ensemble at weak non-Hermiticity: edge spacing distributions

Authors: Thomas Bothner, Alex Little

Abstract: The focus of this paper is on the distribution function of the rightmost eigenvalue for the complex elliptic Ginibre ensemble in the limit of weak non-Hermiticity. We show how the limiting distribution function can be expressed in terms of an integro-differential Painlevé-II function and how the same captures the non-trivial transition between Poisson and Airy point process extreme value statistic… ▽ More The focus of this paper is on the distribution function of the rightmost eigenvalue for the complex elliptic Ginibre ensemble in the limit of weak non-Hermiticity. We show how the limiting distribution function can be expressed in terms of an integro-differential Painlevé-II function and how the same captures the non-trivial transition between Poisson and Airy point process extreme value statistics as the degree of non-Hermiticity decreases. Our most explicit new results concern the tail asymptotics of the limiting distribution function. For the right tail we compute the leading order asymptotics uniformly in the degree of non-Hermiticity, for the left tail we compute it close to Hermiticity. △ Less

Submitted 1 March, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: 62 pages, 11 figures. Version 2 corrects typos and updates literature

MSC Class: Primary 45B05; Secondary 47B35; 35Q15; 30E25; 37J35; 70H06; 34E05

arXiv:2208.04315 [pdf]

Patient-Specific Game-Based Transfer Method for Parkinson's Disease Severity Prediction

Authors: Zaifa Xue, Huibin Lu, Tao Zhang, Max A. Little

Abstract: Dysphonia is one of the early symptoms of Parkinson's disease (PD). Most existing methods use feature selection methods to find the optimal subset of voice features for all PD patients. Few have considered the heterogeneity between patients, which implies the need to provide specific prediction models for different patients. However, building the specific model faces the challenge of small sample… ▽ More Dysphonia is one of the early symptoms of Parkinson's disease (PD). Most existing methods use feature selection methods to find the optimal subset of voice features for all PD patients. Few have considered the heterogeneity between patients, which implies the need to provide specific prediction models for different patients. However, building the specific model faces the challenge of small sample size, which makes it lack generalization ability. Instance transfer is an effective way to solve this problem. Therefore, this paper proposes a patient-specific game-based transfer (PSGT) method for PD severity prediction. First, a selection mechanism is used to select PD patients with similar disease trends to the target patient from the source domain, which greatly reduces the risk of negative transfer. Then, the contribution of the transferred subjects and their instances to the disease estimation of the target subject is fairly evaluated by the Shapley value, which improves the interpretability of the method. Next, the proportion of valid instances in the transferred subjects is determined, and the instances with higher contribution are transferred to further reduce the difference between the transferred instance subset and the target subject. Finally, the selected subset of instances is added to the training set of the target subject, and the extended data is fed into the random forest to improve the performance of the method. Parkinson's telemonitoring dataset is used to evaluate the feasibility and effectiveness. Experiment results show that the PSGT has better performance in both prediction error and stability over compared methods. △ Less

Submitted 12 August, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

arXiv:2207.01137 [pdf]

doi 10.1145/3534678.3539148

Promotheus: An End-to-End Machine Learning Framework for Optimizing Markdown in Online Fashion E-commerce

Authors: Eleanor Loh, Jalaj Khandelwal, Brian Regan, Duncan A. Little

Abstract: Managing discount promotional events ("markdown") is a significant part of running an e-commerce business, and inefficiencies here can significantly hamper a retailer's profitability. Traditional approaches for tackling this problem rely heavily on price elasticity modelling. However, the partial information nature of price elasticity modelling, together with the non-negotiable responsibility for… ▽ More Managing discount promotional events ("markdown") is a significant part of running an e-commerce business, and inefficiencies here can significantly hamper a retailer's profitability. Traditional approaches for tackling this problem rely heavily on price elasticity modelling. However, the partial information nature of price elasticity modelling, together with the non-negotiable responsibility for protecting profitability, mean that machine learning practitioners must often go through great lengths to define strategies for measuring offline model quality. In the face of this, many retailers fall back on rule-based methods, thus forgoing significant gains in profitability that can be captured by machine learning. In this paper, we introduce two novel end-to-end markdown management systems for optimising markdown at different stages of a retailer's journey. The first system, "Ithax", enacts a rational supply-side pricing strategy without demand estimation, and can be usefully deployed as a "cold start" solution to collect markdown data while maintaining revenue control. The second system, "Promotheus", presents a full framework for markdown optimization with price elasticity. We describe in detail the specific modelling and validation procedures that, within our experience, have been crucial to building a system that performs robustly in the real world. Both markdown systems achieve superior profitability compared to decisions made by our experienced operations teams in a controlled online test, with improvements of 86% (Promotheus) and 79% (Ithax) relative to manual strategies. These systems have been deployed to manage markdown at ASOS.com, and both systems can be fruitfully deployed for price optimization across a wide variety of retail e-commerce settings. △ Less

Submitted 12 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

Comments: 11 pages; Accepted at KDD 2022

MSC Class: 68T01 (Primary) 90C29; 9108 (Secondary) ACM Class: I.2.8; G.3; G.4

arXiv:2206.07729 [pdf, other]

Taxonomy of Benchmarks in Graph Representation Learning

Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Sarah McGuire, Xinyi Wang, Anna Little, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

Abstract: Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on develo** GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to w… ▽ More Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on develo** GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to what extent do they test the ability of a model to leverage graph structure vs. node features? Here, we develop a principled approach to taxonomize benchmarking datasets according to a $\textit{sensitivity profile}$ that is based on how much GNN performance changes due to a collection of graph perturbations. Our data-driven analysis provides a deeper understanding of which benchmarking data characteristics are leveraged by GNNs. Consequently, our taxonomy can aid in selection and development of adequate graph benchmarks, and better informed evaluation of future GNN methods. Finally, our approach and implementation in $\texttt{GTaxoGym}$ package are extendable to multiple graph prediction task types and future datasets. △ Less

Submitted 30 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: In Proceedings of the First Learning on Graphs Conference (LoG 2022)

arXiv:2110.14809 [pdf, other]

Towards a Taxonomy of Graph Learning Datasets

Authors: Renming Liu, Semih Cantürk, Frederik Wenkel, Dylan Sandfelder, Devin Kreuzer, Anna Little, Sarah McGuire, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

Abstract: Graph neural networks (GNNs) have attracted much attention due to their ability to leverage the intrinsic geometries of the underlying data. Although many different types of GNN models have been developed, with many benchmarking procedures to demonstrate the superiority of one GNN model over the others, there is a lack of systematic understanding of the underlying benchmarking datasets, and what a… ▽ More Graph neural networks (GNNs) have attracted much attention due to their ability to leverage the intrinsic geometries of the underlying data. Although many different types of GNN models have been developed, with many benchmarking procedures to demonstrate the superiority of one GNN model over the others, there is a lack of systematic understanding of the underlying benchmarking datasets, and what aspects of the model are being tested. Here, we provide a principled approach to taxonomize graph benchmarking datasets by carefully designing a collection of graph perturbations to probe the essential data characteristics that GNN models leverage to perform predictions. Our data-driven taxonomization of graph datasets provides a new understanding of critical dataset characteristics that will enable better model evaluation and the development of more specialized GNN models. △ Less

Submitted 27 October, 2021; originally announced October 2021.

Comments: in Data-Centric AI Workshop at NeurIPS 2021

arXiv:2107.08152 [pdf, other]

doi 10.1007/s10751-016-1382-6

Limit on the Electric Charge of Antihydrogen

Authors: A. Capra, C. Amole, M. D. Ashkezari, M. Baquero-Ruiz, W. Bertsche, E. Butler, C. L. Cesar, M. Charlton, S. Eriksson, J. Fajans, T. Friesen, M. C. Fujiwara, D. R. Gill, A. Gutierrez, J. S. Hangst, W. N. Hardy, M. E. Hayden, C. A. Isaac, S. Jonsell, L . Kurchaninov, A. Little, J. T. K. McKenna, S. Menary, S. C. Napoli, P. Nolan , et al. (15 additional authors not shown)

Abstract: The ALPHA collaboration has successfully demonstrated the production and the confinement of cold antihydrogen, $\overline{\mathrm{H}}$. An analysis of trap** data allowed a stringent limit to be placed on the electric charge of the simplest antiatom. Charge neutrality of matter is known to a very high precision, hence a neutrality limit of $\overline{\mathrm{H}}$ provides a test of CPT invarianc… ▽ More The ALPHA collaboration has successfully demonstrated the production and the confinement of cold antihydrogen, $\overline{\mathrm{H}}$. An analysis of trap** data allowed a stringent limit to be placed on the electric charge of the simplest antiatom. Charge neutrality of matter is known to a very high precision, hence a neutrality limit of $\overline{\mathrm{H}}$ provides a test of CPT invariance. The experimental technique is based on the measurement of the deflection of putatively charged $\overline{\mathrm{H}}$ in an electric field. The tendency for trapped $\overline{\mathrm{H}}$ atoms to be displaced by electrostatic fields is measured and compared to the results of a detailed simulation of $\overline{\mathrm{H}}$ dynamics in the trap. An extensive survey of the systematic errors is performed, with particular attention to those due to the silicon vertex detector, which is the device used to determine the $\overline{\mathrm{H}}$ annihilation position. The limit obtained on the charge of the $\overline{\mathrm{H}}$ atom is \mbox{$ Q = (-1.3\pm1.8\pm0.4)\times10^{-8}$}, representing the first precision measurement with $\overline{\mathrm{H}}$. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: 5 pages, 3 figures

Journal ref: Hyperfine Interact 238, 9 (2017)

arXiv:2107.01752 [pdf, other]

doi 10.1145/3664828

Dynamic programming by polymorphic semiring algebraic shortcut fusion

Authors: Max A. Little, Xi He, Ugur Kayas

Abstract: Dynamic programming (DP) is an algorithmic design paradigm for the efficient, exact solution of otherwise intractable, combinatorial problems. However, DP algorithm design is often presented in an ad-hoc manner. It is sometimes difficult to justify algorithm correctness. To address this issue, this paper presents a rigorous algebraic formalism for systematically deriving DP algorithms, based on se… ▽ More Dynamic programming (DP) is an algorithmic design paradigm for the efficient, exact solution of otherwise intractable, combinatorial problems. However, DP algorithm design is often presented in an ad-hoc manner. It is sometimes difficult to justify algorithm correctness. To address this issue, this paper presents a rigorous algebraic formalism for systematically deriving DP algorithms, based on semiring polymorphism. We start with a specification, construct an algorithm to compute the required solution which is self-evidently correct because it exhaustively generates and evaluates all possible solutions meeting the specification. We then derive, through the use of shortcut fusion, an implementation of this algorithm which is both efficient and correct. We also demonstrate how, with the use of semiring lifting, the specification can be augmented with combinatorial constraints, showing how these constraints can be fused with the algorithm. We furthermore demonstrate how existing DP algorithms for a given combinatorial problem can be abstracted from their original context and re-purposed. This approach can be applied to the full scope of combinatorial problems expressible in terms of semirings. This includes, for example: optimal probability and Viterbi decoding, probabilistic marginalization, logical inference, fuzzy sets, differentiable softmax, relational and provenance queries. The approach, building on ideas from the existing literature on constructive algorithmics, exploits generic properties of polymorphic functions, tupling and formal sums and algebraic simplifications arising from constraint algebras. We demonstrate the effectiveness of this formalism for some example applications arising in signal processing, bioinformatics and reliability engineering. Python software implementing these algorithms can be downloaded from: http://www.maxlittle.net/software/dppolyalg.zip. △ Less

Submitted 4 January, 2024; v1 submitted 4 July, 2021; originally announced July 2021.

Comments: Updated v22 with revised text

Journal ref: Formal Aspects of Computing, May 2024

arXiv:2107.01274 [pdf, other]

Unbiasing Procedures for Scale-invariant Multi-reference Alignment

Authors: Matthew Hirn, Anna Little

Abstract: This article discusses a generalization of the 1-dimensional multi-reference alignment problem. The goal is to recover a hidden signal from many noisy observations, where each noisy observation includes a random translation and random dilation of the hidden signal, as well as high additive noise. We propose a method that recovers the power spectrum of the hidden signal by applying a data-driven, n… ▽ More This article discusses a generalization of the 1-dimensional multi-reference alignment problem. The goal is to recover a hidden signal from many noisy observations, where each noisy observation includes a random translation and random dilation of the hidden signal, as well as high additive noise. We propose a method that recovers the power spectrum of the hidden signal by applying a data-driven, nonlinear unbiasing procedure, and thus the hidden signal is obtained up to an unknown phase. An unbiased estimator of the power spectrum is defined, whose error depends on the sample size and noise levels, and we precisely quantify the convergence rate of the proposed estimator. The unbiasing procedure relies on knowledge of the dilation distribution, and we implement an optimization procedure to learn the dilation variance when this parameter is unknown. Our theoretical work is supported by extensive numerical experiments on a wide range of signals. △ Less

Submitted 2 July, 2021; originally announced July 2021.

Comments: 12 pages, 5 figures. Code reproducing numerical results at https://bitbucket.org/annavlittle/inversion-unbiasing/src/master/

arXiv:2104.04432 [pdf]

A Case Study of Nonresponse Bias Analysis In Educational Assessment Surveys

Authors: Yajuan Si, Roderick J. A. Little, Ya Mo, Nell Sedransk

Abstract: Nonresponse bias is a widely prevalent problem for data on education. We develop a ten-step exemplar to guide nonresponse bias analysis (NRBA) in cross-sectional studies and apply these steps to the Early Childhood Longitudinal Study, Kindergarten Class of 2010-11. A key step is the construction of indices of nonresponse bias based on proxy pattern-mixture models for survey variables of interest.… ▽ More Nonresponse bias is a widely prevalent problem for data on education. We develop a ten-step exemplar to guide nonresponse bias analysis (NRBA) in cross-sectional studies and apply these steps to the Early Childhood Longitudinal Study, Kindergarten Class of 2010-11. A key step is the construction of indices of nonresponse bias based on proxy pattern-mixture models for survey variables of interest. A novel feature is to characterize the strength of evidence about nonresponse bias contained in these indices, based on the strength of the relationship between the characteristics in the nonresponse adjustment and the key survey variables. Our NRBA improves existing methods by incorporating both missing at random and missing not at random mechanisms, and all analyses can be done straightforwardly with standard statistical software. △ Less

Submitted 25 July, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

arXiv:2102.08842 [pdf, ps, other]

On the number of real eigenvalues of a product of truncated orthogonal random matrices

Authors: Alex Little, Francesco Mezzadri, Nick Simm

Abstract: Let $O$ be chosen uniformly at random from the group of $(N+L) \times (N+L)$ orthogonal matrices. Denote by $\tilde{O}$ the upper-left $N \times N$ corner of $O$, which we refer to as a truncation of $O$. In this paper we prove two conjectures of Forrester, Ipsen and Kumar (2020) on the number of real eigenvalues $N^{(m)}_{\mathbb{R}}$ of the product matrix $\tilde{O}_{1}\ldots \tilde{O}_{m}$, whe… ▽ More Let $O$ be chosen uniformly at random from the group of $(N+L) \times (N+L)$ orthogonal matrices. Denote by $\tilde{O}$ the upper-left $N \times N$ corner of $O$, which we refer to as a truncation of $O$. In this paper we prove two conjectures of Forrester, Ipsen and Kumar (2020) on the number of real eigenvalues $N^{(m)}_{\mathbb{R}}$ of the product matrix $\tilde{O}_{1}\ldots \tilde{O}_{m}$, where the matrices $\{\tilde{O}_{j}\}_{j=1}^{m}$ are independent copies of $\tilde{O}$. When $L$ grows in proportion to $N$, we prove that $$ \mathbb{E}(N^{(m)}_{\mathbb{R}}) = \sqrt{\frac{2m L}π}\,\mathrm{arctanh}\left(\sqrt{\frac{N}{N+L}}\right) + O(1), \qquad N \to \infty. $$ We also prove the conjectured form of the limiting real eigenvalue distribution of the product matrix. Finally, we consider the opposite regime where $L$ is fixed with respect to $N$, known as the regime of weak non-orthogonality. In this case each matrix in the product is very close to an orthogonal matrix. We show that $\mathbb{E}(N^{(m)}_{\mathbb{R}}) \sim c_{L,m}\,\log(N)$ as $N \to \infty$ and compute the constant $c_{L,m}$ explicitly. These results generalise the known results in the one matrix case due to Khoruzhenko, Sommers and Życzkowski (2010). △ Less

Submitted 17 February, 2021; originally announced February 2021.

arXiv:2102.03885 [pdf, other]

Few-shot time series segmentation using prototype-defined infinite hidden Markov models

Authors: Yazan Qarout, Yordan P. Raykov, Max A. Little

Abstract: We propose a robust framework for interpretable, few-shot analysis of non-stationary sequential data based on flexible graphical models to express the structured distribution of sequential events, using prototype radial basis function (RBF) neural network emissions. A motivational link is demonstrated between prototypical neural network architectures for few-shot learning and the proposed RBF netw… ▽ More We propose a robust framework for interpretable, few-shot analysis of non-stationary sequential data based on flexible graphical models to express the structured distribution of sequential events, using prototype radial basis function (RBF) neural network emissions. A motivational link is demonstrated between prototypical neural network architectures for few-shot learning and the proposed RBF network infinite hidden Markov model (RBF-iHMM). We show that RBF networks can be efficiently specified via prototypes allowing us to express complex nonstationary patterns, while hidden Markov models are used to infer principled high-level Markov dynamics. The utility of the framework is demonstrated on biomedical signal processing applications such as automated seizure detection from EEG data where RBF networks achieve state-of-the-art performance using a fraction of the data needed to train long-short-term memory variational autoencoders. △ Less

Submitted 7 February, 2021; originally announced February 2021.

arXiv:2012.09385 [pdf, other]

Balancing Geometry and Density: Path Distances on High-Dimensional Data

Authors: Anna Little, Daniel McKenzie, James Murphy

Abstract: New geometric and computational analyses of power-weighted shortest-path distances (PWSPDs) are presented. By illuminating the way these metrics balance density and geometry in the underlying data, we clarify their key parameters and discuss how they may be chosen in practice. Comparisons are made with related data-driven metrics, which illustrate the broader role of density in kernel-based unsupe… ▽ More New geometric and computational analyses of power-weighted shortest-path distances (PWSPDs) are presented. By illuminating the way these metrics balance density and geometry in the underlying data, we clarify their key parameters and discuss how they may be chosen in practice. Comparisons are made with related data-driven metrics, which illustrate the broader role of density in kernel-based unsupervised and semi-supervised machine learning. Computationally, we relate PWSPDs on complete weighted graphs to their analogues on weighted nearest neighbor graphs, providing high probability guarantees on their equivalence that are near-optimal. Connections with percolation theory are developed to establish estimates on the bias and variance of PWSPDs in the finite sample setting. The theoretical results are bolstered by illustrative experiments, demonstrating the versatility of PWSPDs for a wide range of data settings. Throughout the paper, our results require only that the underlying data is sampled from a low-dimensional manifold, and depend crucially on the intrinsic dimension of this manifold, rather than its ambient dimension. △ Less

Submitted 7 June, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

MSC Class: 05C85; 05C80 ACM Class: I.5.3

arXiv:2009.01231 [pdf, other]

Detecting Parkinson's Disease From an Online Speech-task

Authors: Wasifur Rahman, Sangwu Lee, Md. Saiful Islam, Victor Nikhil Antony, Harshil Ratnu, Mohammad Rafayet Ali, Abdullah Al Mamun, Ellen Wagner, Stella Jensen-Roberts, Max A. Little, Ray Dorsey, Ehsan Hoque

Abstract: In this paper, we envision a web-based framework that can help anyone, anywhere around the world record a short speech task, and analyze the recorded data to screen for Parkinson's disease (PD). We collected data from 726 unique participants (262 PD, 38% female; 464 non-PD, 65% female; average age: 61) -- from all over the US and beyond. A small portion of the data was collected in a lab setting t… ▽ More In this paper, we envision a web-based framework that can help anyone, anywhere around the world record a short speech task, and analyze the recorded data to screen for Parkinson's disease (PD). We collected data from 726 unique participants (262 PD, 38% female; 464 non-PD, 65% female; average age: 61) -- from all over the US and beyond. A small portion of the data was collected in a lab setting to compare quality. The participants were instructed to utter a popular pangram containing all the letters in the English alphabet "the quick brown fox jumps over the lazy dog..". We extracted both standard acoustic features (Mel Frequency Cepstral Coefficients (MFCC), jitter and shimmer variants) and deep learning based features from the speech data. Using these features, we trained several machine learning algorithms. We achieved 0.75 AUC (Area Under The Curve) performance on determining presence of self-reported Parkinson's disease by modeling the standard acoustic features through the XGBoost -- a gradient-boosted decision tree model. Further analysis reveal that the widely used MFCC features and a subset of previously validated dysphonia features designed for detecting Parkinson's from verbal phonation task (pronouncing 'ahh') contains the most distinct information. Our model performed equally well on data collected in controlled lab environment as well as 'in the wild' across different gender and age groups. Using this tool, we can collect data from almost anyone anywhere with a video/audio enabled device, contributing to equity and access in neurological care. △ Less

Submitted 15 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

arXiv:2008.00283 [pdf, other]

doi 10.1038/s43588-021-00059-2

Crystallography companion agent for high-throughput materials discovery

Authors: Phillip M. Maffettone, Lars Banko, Peng Cui, Yury Lysogorskiy, Marc A. Little, Daniel Olds, Alfred Ludwig, Andrew I. Cooper

Abstract: The discovery of new structural and functional materials is driven by phase identification, often using X-ray diffraction (XRD). Automation has accelerated the rate of XRD measurements, greatly outpacing XRD analysis techniques that remain manual, time-consuming, error-prone, and impossible to scale. With the advent of autonomous robotic scientists or self-driving labs, contemporary techniques pro… ▽ More The discovery of new structural and functional materials is driven by phase identification, often using X-ray diffraction (XRD). Automation has accelerated the rate of XRD measurements, greatly outpacing XRD analysis techniques that remain manual, time-consuming, error-prone, and impossible to scale. With the advent of autonomous robotic scientists or self-driving labs, contemporary techniques prohibit the integration of XRD. Here, we describe a computer program for the autonomous characterization of XRD data, driven by artificial intelligence (AI), for the discovery of new materials. Starting from structural databases, we train an ensemble model using a physically accurate synthetic dataset, which output probabilistic classifications -- rather than absolutes -- to overcome the overconfidence in traditional neural networks. This AI agent behaves as a companion to the researcher, improving accuracy and offering significant time savings. It was demonstrated on a diverse set of organic and inorganic materials characterization challenges. This innovation is directly applicable to inverse design approaches, robotic discovery systems, and can be immediately considered for other forms of characterization such as spectroscopy and the pair distribution function. △ Less

Submitted 17 March, 2021; v1 submitted 1 August, 2020; originally announced August 2020.

Comments: For associated code, see https://github.com/maffettone/xca

Journal ref: Nat. Comput. Sci. 1, 290 (2021)

arXiv:2006.12369 [pdf, other]

Controlling for sparsity in sparse factor analysis models: adaptive latent feature sharing for piecewise linear dimensionality reduction

Authors: Adam Farooq, Yordan P. Raykov, Petar Raykov, Max A. Little

Abstract: Ubiquitous linear Gaussian exploratory tools such as principle component analysis (PCA) and factor analysis (FA) remain widely used as tools for: exploratory analysis, pre-processing, data visualization and related tasks. However, due to their rigid assumptions including crowding of high dimensional data, they have been replaced in many settings by more flexible and still interpretable latent feat… ▽ More Ubiquitous linear Gaussian exploratory tools such as principle component analysis (PCA) and factor analysis (FA) remain widely used as tools for: exploratory analysis, pre-processing, data visualization and related tasks. However, due to their rigid assumptions including crowding of high dimensional data, they have been replaced in many settings by more flexible and still interpretable latent feature models. The Feature allocation is usually modelled using discrete latent variables assumed to follow either parametric Beta-Bernoulli distribution or Bayesian nonparametric prior. In this work we propose a simple and tractable parametric feature allocation model which can address key limitations of current latent feature decomposition techniques. The new framework allows for explicit control over the number of features used to express each point and enables a more flexible set of allocation distributions including feature allocations with different sparsity levels. This approach is used to derive a novel adaptive Factor analysis (aFA), as well as, an adaptive probabilistic principle component analysis (aPPCA) capable of flexible structure discovery and dimensionality reduction in a wide case of scenarios. We derive both standard Gibbs sampler, as well as, an expectation-maximization inference algorithms that converge orders of magnitude faster to a reasonable point estimate solution. The utility of the proposed aPPCA model is demonstrated for standard PCA tasks such as feature learning, data visualization and data whitening. We show that aPPCA and aFA can infer interpretable high level features both when applied on raw MNIST and when applied for interpreting autoencoder features. We also demonstrate an application of the aPPCA to more robust blind source separation for functional magnetic resonance imaging (fMRI). △ Less

Submitted 28 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

Comments: Interactive demo available at https://colab.research.google.com/drive/1KrrHmAu6mV7tutZtYnpEbVibxs4GCwIo?usp=sharing

ACM Class: I.5.1

arXiv:2004.06139 [pdf]

Assessing Selection Bias in Regression Coefficients Estimated from Non-Probability Samples, with Applications to Genetics and Demographic Surveys

Authors: Brady T. West, Roderick J. A. Little, Rebecca R. Andridge, Philip S. Boonstra, Erin B. Ware, Anita Pandit, Fernanda Alvarado-Leiton

Abstract: Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers, and (b) estimated differences in subgroup means in surveys of smartphone… ▽ More Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers, and (b) estimated differences in subgroup means in surveys of smartphone users, we derive novel measures of selection bias for estimates of the coefficients in linear and probit regression models fitted to non-probability samples, when aggregate-level auxiliary data are available for the selected sample and the target population. The measures arise from normal pattern-mixture models that allow analysts to examine the sensitivity of their inferences to assumptions about non-ignorable selection in these samples. We examine the effectiveness of the proposed measures in a simulation study, and then use them to quantify the selection bias in (a) estimated PGS-phenotype relationships in a large study of volunteers recruited via Facebook, and (b) estimated subgroup differences in mean past-year employment duration in a non-probability sample of low-educated smartphone users. We evaluate the performance of the measures in these applications using benchmark estimates from large probability samples. △ Less

Submitted 8 March, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: 29 pages, 4 figures, 2 tables, supplementary material

arXiv:2004.03047 [pdf, other]

Probabilistic modelling of gait for robust passive monitoring in daily life

Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Bastiaan Bloem, Tom M. Heskes, Marjan Meinders, Kasper Claes, Max A. Little

Abstract: Passive monitoring in daily life may provide invaluable insights about a person's health throughout the day. Wearable sensor devices are likely to play a key role in enabling such monitoring in a non-obtrusive fashion. However, sensor data collected in daily life reflects multiple health and behavior related factors together. This creates the need for structured principled analysis to produce reli… ▽ More Passive monitoring in daily life may provide invaluable insights about a person's health throughout the day. Wearable sensor devices are likely to play a key role in enabling such monitoring in a non-obtrusive fashion. However, sensor data collected in daily life reflects multiple health and behavior related factors together. This creates the need for structured principled analysis to produce reliable and interpretable predictions that can be used to support clinical diagnosis and treatment. In this work we develop a principled modelling approach for free-living gait (walking) analysis. Gait is a promising target for non-obtrusive monitoring because it is common and indicative of various movement disorders such as Parkinson's disease (PD), yet its analysis has largely been limited to experimentally controlled lab settings. To locate and characterize stationary gait segments in free living using accelerometers, we present an unsupervised statistical framework designed to segment signals into differing gait and non-gait patterns. Our flexible probabilistic framework combines empirical assumptions about gait into a principled graphical model with all of its merits. We demonstrate the approach on a new video-referenced dataset including unscripted daily living activities of 25 PD patients and 25 controls, in and around their own houses. We evaluate our ability to detect gait and predict medication induced fluctuations in PD patients based on modelled gait. Our evaluation includes a comparison between sensors attached at multiple body locations including wrist, ankle, trouser pocket and lower back. △ Less

Submitted 6 April, 2020; originally announced April 2020.

arXiv:1910.09648 [pdf, other]

Causal bootstrap**

Authors: Max A. Little, Reham Badawy

Abstract: To draw scientifically meaningful conclusions and build reliable models of quantitative phenomena, cause and effect must be taken into consideration (either implicitly or explicitly). This is particularly challenging when the measurements are not from controlled experimental (interventional) settings, since cause and effect can be obscured by spurious, indirect influences. Modern predictive techni… ▽ More To draw scientifically meaningful conclusions and build reliable models of quantitative phenomena, cause and effect must be taken into consideration (either implicitly or explicitly). This is particularly challenging when the measurements are not from controlled experimental (interventional) settings, since cause and effect can be obscured by spurious, indirect influences. Modern predictive techniques from machine learning are capable of capturing high-dimensional, nonlinear relationships between variables while relying on few parametric or probabilistic model assumptions. However, since these techniques are associational, applied to observational data they are prone to picking up spurious influences from non-experimental (observational) data, making their predictions unreliable. Techniques from causal inference, such as probabilistic causal diagrams and do-calculus, provide powerful (nonparametric) tools for drawing causal inferences from such observational data. However, these techniques are often incompatible with modern, nonparametric machine learning algorithms since they typically require explicit probabilistic models. Here, we develop causal bootstrap** for augmenting classical nonparametric bootstrap resampling with information on the causal relationship between variables. This makes it possible to resample observational data such that, if it is possible to identify an interventional relationship from that data, new data representing that relationship can be simulated from the original observational data. In this way, we can use modern machine learning algorithms unaltered to make statistically powerful, yet causally-robust, predictions. We develop several causal bootstrap** algorithms for drawing interventional inferences from observational data, for classification and regression problems, and demonstrate, using synthetic and real-world examples, the value of this approach. △ Less

Submitted 9 December, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

Comments: 18 pages, 3 figures

arXiv:1909.11062 [pdf, other]

Wavelet invariants for statistically robust multi-reference alignment

Authors: Matthew Hirn, Anna Little

Abstract: We propose a nonlinear, wavelet based signal representation that is translation invariant and robust to both additive noise and random dilations. Motivated by the multi-reference alignment problem and generalizations thereof, we analyze the statistical properties of this representation given a large number of independent corruptions of a target signal. We prove the nonlinear wavelet based represen… ▽ More We propose a nonlinear, wavelet based signal representation that is translation invariant and robust to both additive noise and random dilations. Motivated by the multi-reference alignment problem and generalizations thereof, we analyze the statistical properties of this representation given a large number of independent corruptions of a target signal. We prove the nonlinear wavelet based representation uniquely defines the power spectrum but allows for an unbiasing procedure that cannot be directly applied to the power spectrum. After unbiasing the representation to remove the effects of the additive noise and random dilations, we recover an approximation of the power spectrum by solving a convex optimization problem, and thus reduce to a phase retrieval problem. Extensive numerical experiments demonstrate the statistical robustness of this approximation procedure. △ Less

Submitted 13 July, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

Comments: 59 pages, 8 figures. v3 replaces v2 and is an extensive revision. Revisions include additional background and motivation, additional context relating the approach to other methods, a discussion of stability, and improved presentation. Code reproducing all numerical results is available at https://bitbucket.org/annavlittle/code_wavelet_invariants/

MSC Class: 62

arXiv:1908.00657 [pdf]

doi 10.1038/s41563-020-0681-0

Observation of three-state nematicity in the triangular lattice antiferromagnet Fe$_{1/3}$ NbS$_2$

Authors: Arielle Little, Changmin Lee, Caolan John, Spencer Doyle, Eran Maniv, Nityan L. Nair, Wenqin Chen, Dylan Rees, Jörn W. F. Venderbos, Rafael Fernandes, James G. Analytis, Joseph Orenstein

Abstract: Nematic order is the breaking of rotational symmetry in the presence of translational invariance. While originally defined in the context of liquid crystals, the concept of nematic order has arisen in crystalline matter with discrete rotational symmetry, most prominently in the tetragonal Fe-based superconductors where the parent state is four-fold symmetric. In this case the nematic director take… ▽ More Nematic order is the breaking of rotational symmetry in the presence of translational invariance. While originally defined in the context of liquid crystals, the concept of nematic order has arisen in crystalline matter with discrete rotational symmetry, most prominently in the tetragonal Fe-based superconductors where the parent state is four-fold symmetric. In this case the nematic director takes on only two directions, and the order parameter in such "Ising-nematic" systems is a simple scalar. Here, using a novel spatially-resolved optical polarimetry technique, we show that a qualitatively distinct nematic state arises in the triangular lattice antiferromagnet Fe$_{1/3}$NbS$_2$. The crucial difference is that the nematic order on the triangular lattice is a Z$_3$, or three-state Potts-nematic order parameter. As a consequence, the anisotropy axes of response functions such as the resistivity tensor can be continuously re-oriented by external perturbations. This discovery provides insight into realizing devices that exploit analogies with nematic liquid crystals. △ Less

Submitted 1 August, 2019; originally announced August 2019.

Comments: The main text is 16 pages, including 5 figures and references. Supplementary information is appended at the end of the article

Journal ref: Nature Materials (2020)

arXiv:1905.11785 [pdf, other]

Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

Authors: Amir Hossein Poorjam, Mathew Shaji Kavalekalam, Liming Shi, Yordan P. Raykov, Jesper Rindom Jensen, Max A. Little, Mads Græsbøll Christensen

Abstract: The performance of voice-based Parkinson's disease (PD) detection systems degrades when there is an acoustic mismatch between training and operating conditions caused mainly by degradation in test signals. In this paper, we address this mismatch by considering three types of degradation commonly encountered in remote voice analysis, namely background noise, reverberation and nonlinear distortion,… ▽ More The performance of voice-based Parkinson's disease (PD) detection systems degrades when there is an acoustic mismatch between training and operating conditions caused mainly by degradation in test signals. In this paper, we address this mismatch by considering three types of degradation commonly encountered in remote voice analysis, namely background noise, reverberation and nonlinear distortion, and investigate how these degradations influence the performance of a PD detection system. Given that the specific degradation is known, we explore the effectiveness of a variety of enhancement algorithms in compensating this mismatch and improving the PD detection accuracy. Then, we propose two approaches to automatically control the quality of recordings by identifying the presence and type of short-term and long-term degradations and protocol violations in voice signals. Finally, we experiment with using the proposed quality control methods to inform the choice of enhancement algorithm. Experimental results using the voice recordings of the mPower mobile PD data set under different degradation conditions show the effectiveness of the quality control approaches in selecting an appropriate enhancement method and, consequently, in improving the PD detection accuracy. This study is a step towards the development of a remote PD detection system capable of operating in unseen acoustic environments. △ Less

Submitted 31 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: Preprint, 12 pages, 6 figures

arXiv:1905.11010 [pdf, ps, other]

Adaptive probabilistic principal component analysis

Authors: Adam Farooq, Yordan P. Raykov, Luc Evers, Max A. Little

Abstract: Using the linear Gaussian latent variable model as a starting point we relax some of the constraints it imposes by deriving a nonparametric latent feature Gaussian variable model. This model introduces additional discrete latent variables to the original structure. The Bayesian nonparametric nature of this new model allows it to adapt complexity as more data is observed and project each data point… ▽ More Using the linear Gaussian latent variable model as a starting point we relax some of the constraints it imposes by deriving a nonparametric latent feature Gaussian variable model. This model introduces additional discrete latent variables to the original structure. The Bayesian nonparametric nature of this new model allows it to adapt complexity as more data is observed and project each data point onto a varying number of subspaces. The linear relationship between the continuous latent and observed variables make the proposed model straightforward to interpret, resembling a locally adaptive probabilistic PCA (A-PPCA). We propose two alternative Gibbs sampling procedures for inference in the new model and demonstrate its applicability on sensor data for passive health monitoring. △ Less

Submitted 27 May, 2019; originally announced May 2019.

arXiv:1905.08557 [pdf, other]

Bayesian Pitch Tracking Based on the Harmonic Model

Authors: Liming Shi, Jesper Kjaer Nielsen, Jesper Rindom Jensen, Max A. Little, Mads Graesboll Christensen

Abstract: Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model o… ▽ More Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model order, and the voicing into account as they process each data segment independently. In this paper, a fully Bayesian fundamental frequency tracking algorithm based on the harmonic model and a first-order Markov process model is proposed. Smoothness priors are imposed on the fundamental frequencies, model orders, and voicing using first-order Markov process models. Using these Markov models, fundamental frequency estimation and voicing detection errors can be reduced. Using the harmonic model, the proposed fundamental frequency tracker has an improved robustness to noise. An analytical form of the likelihood function, which can be computed efficiently, is derived. Compared to the state-of-the-art neural network and non-parametric approaches, the proposed fundamental frequency tracking algorithm reduces the mean absolute errors and gross errors by 15\% and 20\% on the Keele pitch database and 36\% and 26\% on sustained /a/ sounds from a database of Parkinson's disease voices under 0 dB white Gaussian noise. A MATLAB version of the proposed algorithm is made freely available for reproduction of the results\footnote{An implementation of the proposed algorithm using MATLAB may be found in \url{https://tinyurl.com/yxn4a543} △ Less

Submitted 21 May, 2019; originally announced May 2019.

arXiv:1812.11954 [pdf, other]

Exact Cluster Recovery via Classical Multidimensional Scaling

Authors: Anna Little, Yuying Xie, Qiang Sun

Abstract: Classical multidimensional scaling is an important dimension reduction technique. Yet few theoretical results characterizing its statistical performance exist. This paper provides a theoretical framework for analyzing the quality of embedded samples produced by classical multidimensional scaling. This lays the foundation for various downstream statistical analyses, and we focus on clustering noisy… ▽ More Classical multidimensional scaling is an important dimension reduction technique. Yet few theoretical results characterizing its statistical performance exist. This paper provides a theoretical framework for analyzing the quality of embedded samples produced by classical multidimensional scaling. This lays the foundation for various downstream statistical analyses, and we focus on clustering noisy data. Our results provide scaling conditions on the sample size, ambient dimensionality, between-class distance, and noise level under which classical multidimensional scaling followed by a distance-based clustering algorithm can recover the cluster labels of all samples with high probability. Numerical simulations confirm these scaling conditions are near-sharp. Applications to both human genomics data and natural language data lend strong support to the methodology and theory. △ Less

Submitted 7 July, 2020; v1 submitted 31 December, 2018; originally announced December 2018.

Comments: 42 pages in cluding appendix

arXiv:1812.02585 [pdf, other]

Probabilistic modelling of gait for remote passive monitoring applications

Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Marjan J. Faber, Bastiaan R. Bloem, Kasper Claes, Max A. Little

Abstract: Passive and non-obtrusive health monitoring using wearables can potentially bring new insights into the user's health status throughout the day and may support clinical diagnosis and treatment. However, identifying segments of free-living data that sufficiently reflect the user's health is challenging. In this work we have studied the problem of modelling real-life gait which is a very indicative… ▽ More Passive and non-obtrusive health monitoring using wearables can potentially bring new insights into the user's health status throughout the day and may support clinical diagnosis and treatment. However, identifying segments of free-living data that sufficiently reflect the user's health is challenging. In this work we have studied the problem of modelling real-life gait which is a very indicative behaviour for multiple movement disorders including Parkinson's disease (PD). We have developed a probabilistic framework for unsupervised analysis of the gait, clustering it into different types, which can be used to evaluate gait abnormalities occurring in daily life. Using a unique dataset which contains sensor and video recordings of people with and without PD in their own living environment, we show that our model driven approach achieves high accuracy gait detection and can capture clinical improvement after medication intake. △ Less

Submitted 30 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

Report number: ML4H/2018/153

arXiv:1810.08807 [pdf]

Investigating Voice as a Biomarker for leucine-rich repeat kinase 2-Associated Parkinson's Disease

Authors: S. Arora, N. P. Visanji, T. A. Mestre, A. Tsanas, A. AlDakheel, B. S. Connolly, C. Gasca-Salas, D. S. Kern, J. Jain, E. J. Slow, A. Faust-Socher, A. E. Lang, M. A. Little, C. Marras

Abstract: We investigate the potential association between leucine-rich repeat kinase 2 (LRRK2) mutations and voice. Sustained phonations ('aaah' sounds) were recorded from 7 individuals with LRRK2-associated Parkinson's disease (PD), 17 participants with idiopathic PD (iPD), 20 non-manifesting LRRK2-mutation carriers, 25 related non-carriers, and 26 controls. In distinguishing LRRK2-associated PD and iPD,… ▽ More We investigate the potential association between leucine-rich repeat kinase 2 (LRRK2) mutations and voice. Sustained phonations ('aaah' sounds) were recorded from 7 individuals with LRRK2-associated Parkinson's disease (PD), 17 participants with idiopathic PD (iPD), 20 non-manifesting LRRK2-mutation carriers, 25 related non-carriers, and 26 controls. In distinguishing LRRK2-associated PD and iPD, the mean sensitivity was 95.4% (SD 17.8%) and mean specificity was 89.6% (SD 26.5%). Voice features for non-manifesting carriers, related non-carriers, and controls were much less discriminatory. Vocal deficits in LRRK2-associated PD may be different than those in iPD. These preliminary results warrant longitudinal analyses and replication in larger cohorts △ Less

Submitted 20 October, 2018; originally announced October 2018.

Comments: 27 pages including supplemental information, Journal of Parkinson's Disease, 2018

arXiv:1807.11062 [pdf]

Exploring Mindset's Applicability to Students' Experiences with Challenge in Transformed College Physics Courses

Authors: Angela Little, Bridget Humphrey, Abigail Green, Abhilash Nair, Vashti Sawtelle

Abstract: The mindset literature is a longstanding area of psychological research focused on beliefs about intelligence, response to challenge, and goals for learning (Dweck, 2000). However, the mindset literature's applicability to the context of college physics has not been widely studied. In this paper we narrow our focus toward students' descriptions of their responses to challenge in college physics. W… ▽ More The mindset literature is a longstanding area of psychological research focused on beliefs about intelligence, response to challenge, and goals for learning (Dweck, 2000). However, the mindset literature's applicability to the context of college physics has not been widely studied. In this paper we narrow our focus toward students' descriptions of their responses to challenge in college physics. We ask the research questions, "can we see responses to challenge in college physics that resemble that of the mindset literature?" and "how do students express evidence of challenge and to what extent is such evidence reflective of challenges found in the mindset literature?" To answer these questions, we developed a novel coding scheme for interview dialogue around college physics challenge and students' responses to it. In this paper we present the development process of our coding scheme. We find that it is possible to see student descriptions of challenge that resemble the mindset literature's characterizations. However, college physics challenges are frequently different than those studied in the mindset literature. We show that, in the landscape of college physics challenges, mindset beliefs cannot always be considered to be the dominant factor in how students respond to challenge. Broadly, our coding scheme helps the field move beyond broad Likert-scale survey measures of students' mindset beliefs. △ Less

Submitted 29 July, 2018; originally announced July 2018.

arXiv:1807.04098 [pdf, other]

doi 10.1007/978-3-030-10997-4_10

A Recurrent Neural Network Survival Model: Predicting Web User Return Time

Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both… ▽ More The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both techniques are severely limited when applied to this problem. Survival models can only incorporate aggregate representations of users instead of automatically learning a representation directly from a raw time series of user actions. RNNs can automatically learn features, but can not be directly trained with examples of non-returning users who have no target value for their return time. We develop a novel RNN survival model that removes the limitations of the state of the art methods. We demonstrate that this model can successfully be applied to return time prediction on a large e-commerce dataset with a superior ability to discriminate between returning and non-returning users than either method applied in isolation. △ Less

Submitted 11 July, 2018; originally announced July 2018.

Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

arXiv:1806.03966 [pdf, other]

doi 10.1093/mnras/sty1549

Theoretical study of ArH+ dissociative recombination and electron-impact vibrational excitation

Authors: A. Abdoulanziz, F. Colboc, D. A. Little, Y. Moulane, J. Zs. Mezei, E. Roueff, J. Tennyson, I. F. Schneider, V. Laporta

Abstract: Cross sections are presented for dissociative recombination and electron-impact vibrational excitation of the ArH+ molecular ion at electron energies appropriate for the interstellar environment. The R-matrix method is employed to determine the molecular structure data, i.e. the position and width of the resonance states. The cross sections and the corresponding Maxwellian rate coefficients are co… ▽ More Cross sections are presented for dissociative recombination and electron-impact vibrational excitation of the ArH+ molecular ion at electron energies appropriate for the interstellar environment. The R-matrix method is employed to determine the molecular structure data, i.e. the position and width of the resonance states. The cross sections and the corresponding Maxwellian rate coefficients are computed using a method based on the Multichannel Quantum Defect Theory. The main result of the paper is the very low dissociative recombination rate found at temperatures below 1000K. This is in agreement with the previous upper limit measurement in merged beams and offers a realistic explanation to the presence of ArH+ in exotic interstellar conditions. △ Less

Submitted 24 June, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

Comments: 6 pages; 7 figures

Journal ref: MNRAS, 479, 2415-2420 (2018)

arXiv:1806.00855 [pdf, other]

doi 10.1103/PhysRevB.98.094425

Field evolution of magnons in $α$-RuCl$_3$ by high-resolution polarized terahertz spectroscopy

Authors: Liang Wu, Arielle Little, Erik E. Aldape, Dylan Rees, Eric Thewalt, Paula Lampen-Kelley, Arnab Banerjee, Craig A. Bridges, Jiaqiang Yan, Derrick Boone, Shreyas Patankar, David Goldhaber-Golden, David Mandrus, Stephen E. Nagler, Ehud Altman, Joseph Orenstein

Abstract: The Kitaev quantum spin liquid (KSL) is a theoretically predicted state of matter whose fractionalized quasiparticles are distinct from bosonic magnons, the fundamental excitation in ordered magnets. The layered honeycomb antiferromagnet $α$-RuCl$_3$ is a KSL candidate material, as it can be driven to a magnetically disordered phase by application of an in-plane magnetic field, with $H_c \sim 7$ T… ▽ More The Kitaev quantum spin liquid (KSL) is a theoretically predicted state of matter whose fractionalized quasiparticles are distinct from bosonic magnons, the fundamental excitation in ordered magnets. The layered honeycomb antiferromagnet $α$-RuCl$_3$ is a KSL candidate material, as it can be driven to a magnetically disordered phase by application of an in-plane magnetic field, with $H_c \sim 7$ T. Here we report a detailed characterization of the magnetic excitation spectrum of this material by high-resolution time-domain terahertz (THz) spectroscopy. We observe two sharp magnon resonances whose frequencies and amplitudes exhibit a discontinuity as a function of applied magnetic field, as well as two broader peaks at higher energy. Below the Néel temperature, we find that linear spin wave theory can account for all of these essential features of the spectra when a $C_3$-breaking distortion of the honeycomb lattice and the presence of structural domains are taken into account. △ Less

Submitted 5 October, 2018; v1 submitted 3 June, 2018; originally announced June 2018.

Comments: 8 pages, 5 figures in the main text. Appendices are included at the end of the article

Journal ref: Phys. Rev. B 98, 094425 (2018)

arXiv:1712.06206 [pdf, other]

Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

Authors: Anna Little, Mauro Maggioni, James M. Murphy

Abstract: We consider the problem of clustering with the longest-leg path distance (LLPD) metric, which is informative for elongated and irregularly shaped clusters. We prove finite-sample guarantees on the performance of clustering with respect to this metric when random samples are drawn from multiple intrinsically low-dimensional clusters in high-dimensional space, in the presence of a large number of hi… ▽ More We consider the problem of clustering with the longest-leg path distance (LLPD) metric, which is informative for elongated and irregularly shaped clusters. We prove finite-sample guarantees on the performance of clustering with respect to this metric when random samples are drawn from multiple intrinsically low-dimensional clusters in high-dimensional space, in the presence of a large number of high-dimensional outliers. By combining these results with spectral clustering with respect to LLPD, we provide conditions under which the Laplacian eigengap statistic correctly determines the number of clusters for a large class of data sets, and prove guarantees on the labeling accuracy of the proposed algorithm. Our methods are quite general and provide performance guarantees for spectral clustering with any ultrametric. We also introduce an efficient, easy to implement approximation algorithm for the LLPD based on a multiscale analysis of adjacency graphs, which allows for the runtime of LLPD spectral clustering to be quasilinear in the number of data points. △ Less

Submitted 6 March, 2019; v1 submitted 17 December, 2017; originally announced December 2017.

Comments: 59 pages, 12 figures

arXiv:1712.06092 [pdf, ps, other]

doi 10.1088/1361-6455/aab40f

Multiple core hole formation by free-electron laser radiation in molecular nitrogen

Authors: Henry I B Banks, Duncan A Little, Agapi Emmanouilidou

Abstract: We investigate the formation of multiple-core-hole states of molecular nitrogen interacting with a free-electron laser pulse. We obtain bound and continuum molecular orbitals in the single-center expansion scheme and use these orbitals to calculate photo-ionization and Auger decay rates. Using these rates, we compute the atomic ion yields generated in this interaction. We track the population of a… ▽ More We investigate the formation of multiple-core-hole states of molecular nitrogen interacting with a free-electron laser pulse. We obtain bound and continuum molecular orbitals in the single-center expansion scheme and use these orbitals to calculate photo-ionization and Auger decay rates. Using these rates, we compute the atomic ion yields generated in this interaction. We track the population of all states throughout this interaction and compute the proportion of the population which accesses different core-hole states. We also investigate the pulse parameters that favor the formation of these core-hole states for 525 eV and 1100 eV photons. △ Less

Submitted 8 January, 2018; v1 submitted 17 December, 2017; originally announced December 2017.

arXiv:1711.07557 [pdf, other]

A unified algorithm framework for quality control of sensor data for behavioural clinimetric testing

Authors: Reham Badawy, Yordan P. Raykov, Max A. Little

Abstract: The use of smartphone and wearable sensing technology for objective, non-invasive and remote clinimetric testing of symptoms has considerable potential. However, the clinimetric accuracy achievable with such technology is highly reliant on separating the useful from irrelevant or confounded sensor data. Monitoring patient symptoms using digital sensors outside of controlled, clinical lab settings… ▽ More The use of smartphone and wearable sensing technology for objective, non-invasive and remote clinimetric testing of symptoms has considerable potential. However, the clinimetric accuracy achievable with such technology is highly reliant on separating the useful from irrelevant or confounded sensor data. Monitoring patient symptoms using digital sensors outside of controlled, clinical lab settings creates a variety of practical challenges, such as unavoidable and unexpected user behaviours. These behaviours often violate the assumptions of clinimetric testing protocols, where these protocols are designed to probe for specific symptoms. Such violations are frequent outside the lab, and can affect the accuracy of the subsequent data analysis and scientific conclusions. At the same time, curating sensor data by hand after the collection process is inherently subjective, laborious and error-prone. To address these problems, we report on a unified algorithmic framework for automated sensor data quality control, which can identify those parts of the sensor data which are sufficiently reliable for further analysis. Algorithms which are special cases of this framework for different sensor data types (e.g. accelerometer, digital audio) detect the extent to which the sensor data adheres to the assumptions of the test protocol for a variety of clinimetric tests. The approach is general enough to be applied to a large set of clinimetric tests and we demonstrate its performance on walking, balance and voice smartphone-based tests, designed to monitor the symptoms of Parkinson's disease. △ Less

Submitted 23 November, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

arXiv:1710.08972 [pdf]

doi 10.1119/1.5051144

On the Importance of Engaging Students in Crafting Definitions

Authors: Angela Little, Leslie Atkins Elliott

Abstract: In this paper we describe an activity for engaging students in crafting definitions. We explore the strengths of this particular activity as well as the broader implications of engaging students in crafting definitions more generally. In this paper we describe an activity for engaging students in crafting definitions. We explore the strengths of this particular activity as well as the broader implications of engaging students in crafting definitions more generally. △ Less

Submitted 24 October, 2017; originally announced October 2017.

Comments: 6 pages

arXiv:1709.04462 [pdf, other]

doi 10.1103/PhysRevLett.121.027001

Imaging anomalous nematic order and strain in optimally doped BaFe$_2$(As,P)$_2$

Authors: Eric Thewalt, Ian M. Hayes, James P. Hinton, Arielle Little, Shreyas Patankar, Liang Wu, Toni Helm, Camelia V. Stan, Nobumichi Tamura, James G. Analytis, Joseph Orenstein

Abstract: We present the strain and temperature dependence of an anomalous nematic phase in optimally doped BaFe$_2$(As,P)$_2$. Polarized ultrafast optical measurements reveal broken 4-fold rotational symmetry in a temperature range above $T_c$ in which bulk probes do not detect a phase transition. Using ultrafast microscopy, we find that the magnitude and sign of this nematicity vary on a ${50{-}100}~μ$m l… ▽ More We present the strain and temperature dependence of an anomalous nematic phase in optimally doped BaFe$_2$(As,P)$_2$. Polarized ultrafast optical measurements reveal broken 4-fold rotational symmetry in a temperature range above $T_c$ in which bulk probes do not detect a phase transition. Using ultrafast microscopy, we find that the magnitude and sign of this nematicity vary on a ${50{-}100}~μ$m length scale, and the temperature at which it onsets ranges from 40 K near a domain boundary to 60 K deep within a domain. Scanning Laue microdiffraction maps of local strain at room temperature indicate that the nematic order appears most strongly in regions of weak, isotropic strain. These results indicate that nematic order arises in a genuine phase transition rather than by enhancement of local anisotropy by a strong nematic susceptibility. We interpret our results in the context of a proposed surface nematic phase. △ Less

Submitted 13 September, 2017; originally announced September 2017.

Journal ref: Phys. Rev. Lett. 121, 027001 (2018)

arXiv:1706.09865 [pdf, other]

doi 10.1007/978-3-319-71273-4_9

Generalising Random Forest Parameter Optimisation to Include Stability and Cost

Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest… ▽ More Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest parameters for commercial applications. We propose a novel metric that captures the stability of random forests predictions, which we argue is key for scenarios that require successive predictions. We motivate the need for multi-criteria optimization by showing that in practical applications, simply choosing the parameters that lead to the lowest error can introduce unnecessary costs and produce predictions that are not stable across independent runs. To optimize this multi-criteria trade-off, we present a new framework that efficiently finds a principled balance between these three considerations using Bayesian optimisation. The pitfalls of optimising forest parameters purely for error reduction are demonstrated using two publicly available real world datasets. We show that our framework leads to parameter settings that are markedly different from the values discovered by error reduction metrics. △ Less

Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

Comments: To appear in ECML-PKDD 2017

Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

arXiv:1704.07357 [pdf, other]

doi 10.1103/PhysRevLett.119.227201

Antiferromagnetic resonance and terahertz continuum in $α-$RuCl$_3$

Authors: A. Little, Liang Wu, P. Lampen-Kelley, A. Banerjee, S. Patankar, D. Rees, C. A. Bridges, J. -Q. Yan, D. Mandrus, S. E. Nagler, J. Orenstein

Abstract: We report measurements of optical absorption in the zig-zag antiferromagnet $α$-RuCl$_3$ as a function of temperature, $T$, magnetic field, $B$, and photon energy, $\hbarω$ in the range $\sim$ 0.3 to 8.3 meV, using time-domain terahertz spectroscopy. Polarized measurements show that 3-fold rotational symmetry is broken in the honeycomb plane from 2 K to 300 K. We find a sharp absorption peak at 2.… ▽ More We report measurements of optical absorption in the zig-zag antiferromagnet $α$-RuCl$_3$ as a function of temperature, $T$, magnetic field, $B$, and photon energy, $\hbarω$ in the range $\sim$ 0.3 to 8.3 meV, using time-domain terahertz spectroscopy. Polarized measurements show that 3-fold rotational symmetry is broken in the honeycomb plane from 2 K to 300 K. We find a sharp absorption peak at 2.56 meV upon cooling below the Néel temperature of 7 K at $B=0$ that we identify as magnetic-dipole excitation of a zero-wavevector magnon, or antiferromagnetic resonance (AFMR). With application of $B$, the AFMR broadens and shifts to lower frequency as long-range magnetic order is lost in a manner consistent with transitioning to a spin-disordered phase. From direct, internally calibrated measurement of the AFMR spectral weight, we place an upper bound on the contribution to the $dc$ susceptibility from a magnetic excitation continuum. △ Less

Submitted 9 October, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

Comments: 5 pages, 3 figures in the main text. To appear in Phys. Rev. Lett., magnetic field data included. Supplementary information also included

Journal ref: Phys. Rev. Lett. 119, 227201 (2017)

arXiv:1704.03444 [pdf, ps, other]

doi 10.1039/C7CP02345F

Interaction of molecular nitrogen with Free-Electron-Laser radiation

Authors: H. I. B. Banks, D. A. Little, J. Tennyson, A. Emmanouilidou

Abstract: We compute molecular continuum orbitals in the single center expansion scheme. We then employ these orbitals to obtain molecular Auger rates and single-photon ionization cross sections to study the interaction of N2 with Free-Electron-Laser (FEL) pulses. The nuclei are kept fixed. We formulate rate equations for the energetically allowed molecular and atomic transitions and we account for dissocia… ▽ More We compute molecular continuum orbitals in the single center expansion scheme. We then employ these orbitals to obtain molecular Auger rates and single-photon ionization cross sections to study the interaction of N2 with Free-Electron-Laser (FEL) pulses. The nuclei are kept fixed. We formulate rate equations for the energetically allowed molecular and atomic transitions and we account for dissociation through additional terms in the rate equations. Solving these equations for different parameters of the FEL pulse, allows us to identify the most efficient parameters of the FEL pulse for obtaining the highest contribution of double core hole states (DCH) in the final atomic ion fragments. Finally we identify the contribution of DCH states in the electron spectra and show that the DCH state contribution is more easily identified in the photo-ionization rather than the Auger transitions. △ Less

Submitted 11 April, 2017; originally announced April 2017.

Showing 1–50 of 67 results for author: Little, A