Search | arXiv e-print repository

arXiv:2402.16138 [pdf, other]

Integration of Conventional Surface Science Techniques with Surface-Sensitive Azimuthal and Polarization Dependent Femtosecond-Resolved Sum Frequency Generation Spectroscopy

Authors: Zhipeng Huang, Tobias Roos, Yu** Tong, R. Kramer Campen

Abstract: Experimental insight into the elementary processes underlying charge transfer across interfaces has blossomed with the wide-spread availability of ultra-high vacuum set-ups that allow the preparation and characterization of solid surfaces with well-defined molecular adsorbates over a wide ranges of temperatures. Thick layers of molecular adsorbates or heterostructures of 2D materials generally pre… ▽ More Experimental insight into the elementary processes underlying charge transfer across interfaces has blossomed with the wide-spread availability of ultra-high vacuum set-ups that allow the preparation and characterization of solid surfaces with well-defined molecular adsorbates over a wide ranges of temperatures. Thick layers of molecular adsorbates or heterostructures of 2D materials generally preclude the use of electrons or atoms as probes in such characterization. However with linear photon-in/photon-out techniques it is often challenging to assign the observed optical response to a particular portion of the interface. We and prior workers have demonstrated in work under ambient conditions that by full characterization of the symmetry of the second order nonlinear optical susceptibility, i.e. the $χ^{(2)}$, in sum frequency generation (SFG) spectroscopy, this problem can be overcome. Here we describe an ultra-high vacuum system built to allow conventional UHV sample preparation and characterization, femtosecond and polarization resolved SFG spectroscopy, the azimuthal sample rotation necessary to fully describe $χ^{(2)}$ symmetry and with sufficient stability to allow scanning SFG microscopy. We demonstrate these capabilities in proof-of-principle measurements on CO adsorbed on Pt(111) and of the clean Ag(111) surface. Because this set-up allows both full characterization of the nonlinear susceptibility and the temperature control and sample preparation/characterization of conventional UHV set-ups we expect it to be of great utility in investigation of both the basic physics and applications of solid, 2D material heterostructures. △ Less

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2402.14400 [pdf, other]

Modeling 3D Infant Kinetics Using Adaptive Graph Convolutional Networks

Authors: Daniel Holmberg, Manu Airaksinen, Viviana Marchi, Andrea Guzzetta, Anna Kivi, Leena Haataja, Sampsa Vanhatalo, Teemu Roos

Abstract: Reliable methods for the neurodevelopmental assessment of infants are essential for early detection of medical issues that may need prompt interventions. Spontaneous motor activity, or 'kinetics', is shown to provide a powerful surrogate measure of upcoming neurodevelopment. However, its assessment is by and large qualitative and subjective, focusing on visually identified, age-specific gestures.… ▽ More Reliable methods for the neurodevelopmental assessment of infants are essential for early detection of medical issues that may need prompt interventions. Spontaneous motor activity, or 'kinetics', is shown to provide a powerful surrogate measure of upcoming neurodevelopment. However, its assessment is by and large qualitative and subjective, focusing on visually identified, age-specific gestures. Here, we follow an alternative approach, predicting infants' neurodevelopmental maturation based on data-driven evaluation of individual motor patterns. We utilize 3D video recordings of infants processed with pose-estimation to extract spatio-temporal series of anatomical landmarks, and apply adaptive graph convolutional networks to predict the actual age. We show that our data-driven approach achieves improvement over traditional machine learning baselines based on manually engineered features. △ Less

Submitted 20 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: 10 pages, 3 figures. Code repository available via https://github.com/deinal/infant-aagcn

MSC Class: 68T06 ACM Class: I.2; I.4; J.3

arXiv:2402.01813 [pdf, other]

An Educational Tool for Learning about Social Media Tracking, Profiling, and Recommendation

Authors: Nicolas Pope, Juho Kahila, Jari Laru, Henriikka Vartiainen, Teemu Roos, Matti Tedre

Abstract: This paper introduces an educational tool for classroom use, based on explainable AI (XAI), designed to demystify key social media mechanisms - tracking, profiling, and content recommendation - for novice learners. The tool provides a familiar, interactive interface that resonates with learners' experiences with popular social media platforms, while also offering the means to "peek under the hood"… ▽ More This paper introduces an educational tool for classroom use, based on explainable AI (XAI), designed to demystify key social media mechanisms - tracking, profiling, and content recommendation - for novice learners. The tool provides a familiar, interactive interface that resonates with learners' experiences with popular social media platforms, while also offering the means to "peek under the hood" and exposing basic mechanisms of datafication. Learners gain first-hand experience of how even the slightest actions, such as pausing to view content, are captured and recorded in their digital footprint, and further distilled into a personal profile. The tool uses real-time visualizations and verbal explanations to create a sense of immediacy: each time the user acts, the resulting changes in their engagement history and their profile are displayed in a visually engaging and understandable manner. This paper discusses the potential of XAI and educational technology in transforming data and digital literacy education and in fostering the growth of children's privacy and security mindsets. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: 5 pages, 5 figures, submitted to ICALT 2024

arXiv:2310.06962 [pdf]

Open-source Pulseq sequences on Philips MRI scanners

Authors: Thomas H. M. Roos, Edwin Versteeg, Dennis W. J. Klomp, Jeroen C. W. Siero, Jannie P. Wijnen

Abstract: Purpose: This work aims to address the limitations faced by researchers in develo** and sharing new MRI sequences by implementing an interpreter for the open-source MRI pulse sequence format, Pulseq, on a Philips MRI scanner. Methods: The implementation involved modifying a few source code files to create a Pulseq interpreter for the Philips MRI system. Validation experiments were conducted us… ▽ More Purpose: This work aims to address the limitations faced by researchers in develo** and sharing new MRI sequences by implementing an interpreter for the open-source MRI pulse sequence format, Pulseq, on a Philips MRI scanner. Methods: The implementation involved modifying a few source code files to create a Pulseq interpreter for the Philips MRI system. Validation experiments were conducted using simulations and phantom scans performed on a 7T Achieva MRI system. The observed sequence and waveforms were compared to the intended ones, and the gradient waveforms produced by the scanner were verified using a field camera. Image reconstruction was performed using the raw k-space samples acquired from both the native vendor environment and the Pulseq interpreter. Results: The reconstructed images obtained through the Pulseq implementation were found to be comparable to those obtained through the native implementation. The performance of the Pulseq interpreter was assessed by profiling the CPU utilization of the MRI spectrometer, showing minimal resource utilization for certain sequences. Conclusion: The successful implementation of the Pulseq interpreter on the Philips MRI scanner demonstrates the feasibility of utilizing Pulseq sequences on Philips MRI scanners. This provides an open-source platform for MRI sequence development, facilitating collaboration among researchers and accelerating scientific progress in the field of MRI. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 15 pages, 4 figures and one table. Code for the MRI scanners will be made available for other research sites, through the vendor's code repository and forum, linked at http://openmr.nl/

arXiv:2307.09469 [pdf, other]

Graph Representation of the Magnetic Field Topology in High-Fidelity Plasma Simulations for Machine Learning Applications

Authors: Ioanna Bouri, Fanni Franssila, Markku Alho, Giulia Cozzani, Ivan Zaitsev, Minna Palmroth, Teemu Roos

Abstract: Topological analysis of the magnetic field in simulated plasmas allows the study of various physical phenomena in a wide range of settings. One such application is magnetic reconnection, a phenomenon related to the dynamics of the magnetic field topology, which is difficult to detect and characterize in three dimensions. We propose a scalable pipeline for topological data analysis and spatiotempor… ▽ More Topological analysis of the magnetic field in simulated plasmas allows the study of various physical phenomena in a wide range of settings. One such application is magnetic reconnection, a phenomenon related to the dynamics of the magnetic field topology, which is difficult to detect and characterize in three dimensions. We propose a scalable pipeline for topological data analysis and spatiotemporal graph representation of three-dimensional magnetic vector fields. We demonstrate our methods on simulations of the Earth's magnetosphere produced by Vlasiator, a supercomputer-scale Vlasov theory-based simulation for near-Earth space. The purpose of this work is to challenge the machine learning community to explore graph-based machine learning approaches to address a largely open scientific problem with wide-ranging potential impact. △ Less

Submitted 26 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: 6 pages, 3 figures, Accepted at the ICML 2023 Workshop on Machine Learning for Astrophysics

arXiv:2307.07015 [pdf, other]

Advertiser Learning in Direct Advertising Markets

Authors: Carl F. Mela, Jason M. T. Roos, Tulio Sousa

Abstract: Direct buy advertisers procure advertising inventory at fixed rates from publishers and ad networks. Such advertisers face the complex task of choosing ads amongst myriad new publisher sites. We offer evidence that advertisers do not excel at making these choices. Instead, they try many sites before settling on a favored set, consistent with advertiser learning. We subsequently model advertiser de… ▽ More Direct buy advertisers procure advertising inventory at fixed rates from publishers and ad networks. Such advertisers face the complex task of choosing ads amongst myriad new publisher sites. We offer evidence that advertisers do not excel at making these choices. Instead, they try many sites before settling on a favored set, consistent with advertiser learning. We subsequently model advertiser demand for publisher inventory wherein advertisers learn about advertising efficacy across publishers' sites. Results suggest that advertisers spend considerable resources advertising on sites they eventually abandon--in part because their prior beliefs about advertising efficacy on those sites are too optimistic. The median advertiser's expected CTR at a new site is 0.23%, five times higher than the true median CTR of 0.045%. We consider how an ad network's pooling of advertiser information remediates this problem. As ads with similar visual elements garner similar CTRs, the network's pooling of information enables advertisers to better predict ad performance at new sites. Counterfactual analyses indicate that gains from pooling advertiser information are substantial: over six months, we estimate a median advertiser welfare gain of \$2,756 (a 15.5% increase) and a median publisher revenue gain of \$9,618 (a 63.9% increase). △ Less

Submitted 30 April, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

arXiv:2103.12068 [pdf, other]

Transfer Learning with Ensembles of Deep Neural Networks for Skin Cancer Detection in Imbalanced Data Sets

Authors: Aqsa Saeed Qureshi, Teemu Roos

Abstract: Several machine learning techniques for accurate detection of skin cancer from medical images have been reported. Many of these techniques are based on pre-trained convolutional neural networks (CNNs), which enable training the models based on limited amounts of training data. However, the classification accuracy of these models still tends to be severely limited by the scarcity of representative… ▽ More Several machine learning techniques for accurate detection of skin cancer from medical images have been reported. Many of these techniques are based on pre-trained convolutional neural networks (CNNs), which enable training the models based on limited amounts of training data. However, the classification accuracy of these models still tends to be severely limited by the scarcity of representative images from malignant tumours. We propose a novel ensemble-based CNN architecture where multiple CNN models, some of which are pre-trained and some are trained only on the data at hand, along with auxiliary data in the form of metadata associated with the input images, are combined using a meta-learner. The proposed approach improves the model's ability to handle limited and imbalanced data. We demonstrate the benefits of the proposed technique using a dataset with 33126 dermoscopic images from 2056 patients. We evaluate the performance of the proposed technique in terms of the F1-measure, area under the ROC curve (AUC-ROC), and area under the PR-curve (AUC-PR), and compare it with that of seven different benchmark methods, including two recent CNN-based techniques. The proposed technique compares favourably in terms of all the evaluation metrics. △ Less

Submitted 17 May, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

arXiv:2004.02569 [pdf, other]

doi 10.1016/j.neunet.2020.10.002

Gradient-Based Training and Pruning of Radial Basis Function Networks with an Application in Materials Physics

Authors: Jussi Määttä, Viacheslav Bazaliy, Jyri Kimari, Flyura Djurabekova, Kai Nordlund, Teemu Roos

Abstract: Many applications, especially in physics and other sciences, call for easily interpretable and robust machine learning techniques. We propose a fully gradient-based technique for training radial basis function networks with an efficient and scalable open-source implementation. We derive novel closed-form optimization criteria for pruning the models for continuous as well as binary data which arise… ▽ More Many applications, especially in physics and other sciences, call for easily interpretable and robust machine learning techniques. We propose a fully gradient-based technique for training radial basis function networks with an efficient and scalable open-source implementation. We derive novel closed-form optimization criteria for pruning the models for continuous as well as binary data which arise in a challenging real-world material physics problem. The pruned models are optimized to provide compact and interpretable versions of larger models based on informed assumptions about the data distribution. Visualizations of the pruned models provide insight into the atomic configurations that determine atom-level migration processes in solid matter; these results may inform future research on designing more suitable descriptors for use with machine learning algorithms. △ Less

Submitted 6 April, 2020; originally announced April 2020.

Journal ref: Neural Networks 133, 123 (2021)

arXiv:1910.08322 [pdf, other]

A Multilabel Classification Framework for Approximate Nearest Neighbor Search

Authors: Ville Hyvönen, Elias Jääsaari, Teemu Roos

Abstract: Both supervised and unsupervised machine learning algorithms have been used to learn partition-based index structures for approximate nearest neighbor (ANN) search. Existing supervised algorithms formulate the learning task as finding a partition in which the nearest neighbors of a training set point belong to the same partition element as the point itself, so that the nearest neighbor candidates… ▽ More Both supervised and unsupervised machine learning algorithms have been used to learn partition-based index structures for approximate nearest neighbor (ANN) search. Existing supervised algorithms formulate the learning task as finding a partition in which the nearest neighbors of a training set point belong to the same partition element as the point itself, so that the nearest neighbor candidates can be retrieved by naive lookup or backtracking search. We formulate candidate set selection in ANN search directly as a multilabel classification problem where the labels correspond to the nearest neighbors of the query point, and interpret the partitions as partitioning classifiers for solving this task. Empirical results suggest that the natural classifier based on this interpretation leads to strictly improved performance when combined with any unsupervised or supervised partitioning strategy. We also prove a sufficient condition for consistency of a partitioning classifier for ANN search, and illustrate the result by verifying this condition for chronological $k$-d trees. △ Less

Submitted 13 October, 2022; v1 submitted 18 October, 2019; originally announced October 2019.

Comments: To appear in the proceedings of Conference on Neural Information Processing Systems (NeurIPS) 2022

ACM Class: G.3; H.3.3

arXiv:1908.08484 [pdf, ps, other]

doi 10.1142/S2661335219300018

Minimum Description Length Revisited

Authors: Peter Grünwald, Teemu Roos

Abstract: This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL was originally based on data compression ideas, this introduction can be read without any knowledge thereof. It takes into account all major developments since… ▽ More This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL was originally based on data compression ideas, this introduction can be read without any knowledge thereof. It takes into account all major developments since 2007, the last time an extensive overview was written. These include new methods for model selection and averaging and hypothesis testing, as well as the first completely general definition of {\em MDL estimators}. Incorporating these developments, MDL can be seen as a powerful extension of both penalized likelihood and Bayesian approaches, in which penalization functions and prior distributions are replaced by more general luckiness functions, average-case methodology is replaced by a more robust worst-case approach, and in which methods classically viewed as highly distinct, such as AIC vs BIC and cross-validation vs Bayes can, to a large extent, be viewed from a unified perspective. △ Less

Submitted 18 December, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

Comments: to appear in International Journal of Mathematics for Industry

arXiv:1812.07484 [pdf, other]

doi 10.1007/978-3-030-16145-3_46

Efficient Autotuning of Hyperparameters in Approximate Nearest Neighbor Search

Authors: Elias Jääsaari, Ville Hyvönen, Teemu Roos

Abstract: Approximate nearest neighbor algorithms are used to speed up nearest neighbor search in a wide array of applications. However, current indexing methods feature several hyperparameters that need to be tuned to reach an acceptable accuracy--speed trade-off. A grid search in the parameter space is often impractically slow due to a time-consuming index-building procedure. Therefore, we propose an algo… ▽ More Approximate nearest neighbor algorithms are used to speed up nearest neighbor search in a wide array of applications. However, current indexing methods feature several hyperparameters that need to be tuned to reach an acceptable accuracy--speed trade-off. A grid search in the parameter space is often impractically slow due to a time-consuming index-building procedure. Therefore, we propose an algorithm for automatically tuning the hyperparameters of indexing methods based on randomized space-partitioning trees. In particular, we present results using randomized k-d trees, random projection trees and randomized PCA trees. The tuning algorithm adds minimal overhead to the index-building process but is able to find the optimal hyperparameters accurately. We demonstrate that the algorithm is significantly faster than existing approaches, and that the indexing methods used are competitive with the state-of-the-art methods in query time while being faster to build. △ Less

Submitted 18 December, 2018; originally announced December 2018.

Comments: Accepted for the 23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2019

Journal ref: Advances in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science, vol 11440. Springer, Cham. pp. 590-602

arXiv:1811.11811 [pdf, other]

An Application of Storage-Optimal MatDot Codes for Coded Matrix Multiplication: Fast k-Nearest Neighbors Estimation

Authors: Utsav Sheth, Sanghamitra Dutta, Malhar Chaudhari, Haewon Jeong, Yaoqing Yang, Jukka Kohonen, Teemu Roos, Pulkit Grover

Abstract: We propose a novel application of coded computing to the problem of the nearest neighbor estimation using MatDot Codes [Fahim. et.al. 2017], that are known to be optimal for matrix multiplication in terms of recovery threshold under storage constraints. In approximate nearest neighbor algorithms, it is common to construct efficient in-memory indexes to improve query response time. One such strateg… ▽ More We propose a novel application of coded computing to the problem of the nearest neighbor estimation using MatDot Codes [Fahim. et.al. 2017], that are known to be optimal for matrix multiplication in terms of recovery threshold under storage constraints. In approximate nearest neighbor algorithms, it is common to construct efficient in-memory indexes to improve query response time. One such strategy is Multiple Random Projection Trees (MRPT), which reduces the set of candidate points over which Euclidean distance calculations are performed. However, this may result in a high memory footprint and possibly paging penalties for large or high-dimensional data. Here we propose two techniques to parallelize MRPT, that exploit data and model parallelism respectively, by dividing both the data storage and the computation efforts among different nodes in a distributed computing cluster. This is especially critical when a single compute node cannot hold the complete dataset in memory. We also propose a novel coded computation strategy based on MatDot codes for the model-parallel architecture that, in a straggler-prone environment, achieves the storage-optimal recovery threshold, i.e., the number of nodes that are required to serve a query. We experimentally demonstrate that, in the absence of straggling, our distributed approaches require less query time than execution on a single processing node, providing near-linear speedups with respect to the number of worker nodes. Through our experiments on real systems with simulated straggling, we also show that our strategy achieves a faster query execution than the uncoded strategy in a straggler-prone environment. △ Less

Submitted 28 November, 2018; originally announced November 2018.

Comments: Accepted for publication at the IEEE Big Data 2018

arXiv:1708.02497 [pdf, other]

Learning non-parametric Markov networks with mutual information

Authors: Janne Leppä-aho, Santeri Räisänen, Xiao Yang, Teemu Roos

Abstract: We propose a method for learning Markov network structures for continuous data without invoking any assumptions about the distribution of the variables. The method makes use of previous work on a non-parametric estimator for mutual information which is used to create a non-parametric test for multivariate conditional independence. This independence test is then combined with an efficient constrain… ▽ More We propose a method for learning Markov network structures for continuous data without invoking any assumptions about the distribution of the variables. The method makes use of previous work on a non-parametric estimator for mutual information which is used to create a non-parametric test for multivariate conditional independence. This independence test is then combined with an efficient constraint-based algorithm for learning the graph structure. The performance of the method is evaluated on several synthetic data sets and it is shown to learn considerably more accurate structures than competing methods when the dependencies between the variables involve non-linearities. △ Less

Submitted 8 August, 2017; originally announced August 2017.

arXiv:1602.07863 [pdf, ps, other]

doi 10.1016/j.ijar.2017.01.001

Learning Gaussian Graphical Models With Fractional Marginal Pseudo-likelihood

Authors: Janne Leppä-aho, Johan Pensar, Teemu Roos, Jukka Corander

Abstract: We propose a Bayesian approximate inference method for learning the dependence structure of a Gaussian graphical model. Using pseudo-likelihood, we derive an analytical expression to approximate the marginal likelihood for an arbitrary graph structure without invoking any assumptions about decomposability. The majority of the existing methods for learning Gaussian graphical models are either restr… ▽ More We propose a Bayesian approximate inference method for learning the dependence structure of a Gaussian graphical model. Using pseudo-likelihood, we derive an analytical expression to approximate the marginal likelihood for an arbitrary graph structure without invoking any assumptions about decomposability. The majority of the existing methods for learning Gaussian graphical models are either restricted to decomposable graphs or require specification of a tuning parameter that may have a substantial impact on learned structures. By combining a simple sparsity inducing prior for the graph structures with a default reference prior for the model parameters, we obtain a fast and easily applicable scoring function that works well for even high-dimensional data. We demonstrate the favourable performance of our approach by large-scale comparisons against the leading methods for learning non-decomposable Gaussian graphical models. A theoretical justification for our method is provided by showing that it yields a consistent estimator of the graph structure. △ Less

Submitted 25 February, 2016; originally announced February 2016.

arXiv:1512.09133 [pdf]

doi 10.1093/cercor/bhw285

Neurofeedback Tunes Scale-Free Dynamics in Spontaneous Brain Activity

Authors: Tomas Ros, Paul Frewen, Jean Theberge, Rosemarie Kluetsch, Andreas Mueller, Gian Candrian, Rakesh Jetly, Patrik Vuilleumier, Ruth Lanius

Abstract: Brain oscillations exhibit long-range temporal correlations (LRTCs), which reflect the regularity of their fluctuations: low values representing more random (decorrelated) while high values more persistent (correlated) dynamics. LRTCs constitute supporting evidence that the brain operates near criticality, a state where neuronal activities are balanced between order and randomness. Here, healthy a… ▽ More Brain oscillations exhibit long-range temporal correlations (LRTCs), which reflect the regularity of their fluctuations: low values representing more random (decorrelated) while high values more persistent (correlated) dynamics. LRTCs constitute supporting evidence that the brain operates near criticality, a state where neuronal activities are balanced between order and randomness. Here, healthy adults used closed-loop brain training (neurofeedback, NFB) to reduce the amplitude of alpha oscillations, producing a significant increase in spontaneous LRTCs post-training. This effect was reproduced in patients with post-traumatic stress disorder, where abnormally random dynamics were reversed by NFB, correlating with significant improvements in hyperarousal. Notably, regions manifesting abnormally low LRTCs (i.e., excessive randomness) normalized toward healthy population levels, consistent with theoretical predictions about self-organized criticality. Hence, when exposed to appropriate training, spontaneous cortical activity reveals a residual capacity for "self-tuning" its own temporal complexity, despite manifesting the abnormal dynamics seen in individuals with psychiatric disorder. Lastly, we observed an inverse-U relationship between strength of LRTC and oscillation amplitude, suggesting a breakdown of long-range dependence at high/low synchronization extremes, in line with recent computational models. Together, our findings offer a broader mechanistic framework for motivating research and clinical applications of NFB, encompassing disorders with perturbed LRTCs. △ Less

Submitted 2 February, 2018; v1 submitted 30 December, 2015; originally announced December 2015.

Journal ref: Cerebral Cortex (2016)

arXiv:1509.06957 [pdf, other]

doi 10.1109/BigData.2016.7840682

Fast k-NN search

Authors: Ville Hyvönen, Teemu Pitkänen, Sotiris Tasoulis, Elias Jääsaari, Risto Tuomainen, Liang Wang, Jukka Corander, Teemu Roos

Abstract: Efficient index structures for fast approximate nearest neighbor queries are required in many applications such as recommendation systems. In high-dimensional spaces, many conventional methods suffer from excessive usage of memory and slow response times. We propose a method where multiple random projection trees are combined by a novel voting scheme. The key idea is to exploit the redundancy in a… ▽ More Efficient index structures for fast approximate nearest neighbor queries are required in many applications such as recommendation systems. In high-dimensional spaces, many conventional methods suffer from excessive usage of memory and slow response times. We propose a method where multiple random projection trees are combined by a novel voting scheme. The key idea is to exploit the redundancy in a large number of candidate sets obtained by independently generated random projections in order to reduce the number of expensive exact distance evaluations. The method is straightforward to implement using sparse projections which leads to a reduced memory footprint and fast index construction. Furthermore, it enables grou** of the required computations into big matrix multiplications, which leads to additional savings due to cache effects and low-level parallelization. We demonstrate by extensive experiments on a wide variety of data sets that the method is faster than existing partitioning tree or hashing based approaches, making it the fastest available technique on high accuracy levels. △ Less

Submitted 19 August, 2016; v1 submitted 23 September, 2015; originally announced September 2015.

Journal ref: IEEE International Conference on Big Data 2016, p. 881-888

arXiv:1401.7116 [pdf, other]

Bayesian Properties of Normalized Maximum Likelihood and its Fast Computation

Authors: Andrew Barron, Teemu Roos, Kazuho Watanabe

Abstract: The normalized maximized likelihood (NML) provides the minimax regret solution in universal data compression, gambling, and prediction, and it plays an essential role in the minimum description length (MDL) method of statistical modeling and estimation. Here we show that the normalized maximum likelihood has a Bayes-like representation as a mixture of the component models, even in finite samples,… ▽ More The normalized maximized likelihood (NML) provides the minimax regret solution in universal data compression, gambling, and prediction, and it plays an essential role in the minimum description length (MDL) method of statistical modeling and estimation. Here we show that the normalized maximum likelihood has a Bayes-like representation as a mixture of the component models, even in finite samples, though the weights of linear combination may be both positive and negative. This representation addresses in part the relationship between MDL and Bayes modeling. This representation has the advantage of speeding the calculation of marginals and conditionals required for coding and prediction applications. △ Less

Submitted 28 January, 2014; originally announced January 2014.

Comments: Submitted to ISIT-2004 conference

arXiv:1401.0561 [pdf, other]

User-Generated Free-Form Gestures for Authentication: Security and Memorability

Authors: Michael Sherman, Gradeigh Clark, Yulong Yang, Shridatt Sugrim, Arttu Modig, Janne Lindqvist, Antti Oulasvirta, Teemu Roos

Abstract: This paper studies the security and memorability of free-form multitouch gestures for mobile authentication. Towards this end, we collected a dataset with a generate-test-retest paradigm where participants (N=63) generated free-form gestures, repeated them, and were later retested for memory. Half of the participants decided to generate one-finger gestures, and the other half generated multi-finge… ▽ More This paper studies the security and memorability of free-form multitouch gestures for mobile authentication. Towards this end, we collected a dataset with a generate-test-retest paradigm where participants (N=63) generated free-form gestures, repeated them, and were later retested for memory. Half of the participants decided to generate one-finger gestures, and the other half generated multi-finger gestures. Although there has been recent work on template-based gestures, there are yet no metrics to analyze security of either template or free-form gestures. For example, entropy-based metrics used for text-based passwords are not suitable for capturing the security and memorability of free-form gestures. Hence, we modify a recently proposed metric for analyzing information capacity of continuous full-body movements for this purpose. Our metric computed estimated mutual information in repeated sets of gestures. Surprisingly, one-finger gestures had higher average mutual information. Gestures with many hard angles and turns had the highest mutual information. The best-remembered gestures included signatures and simple angular shapes. We also implemented a multitouch recognizer to evaluate the practicality of free-form gestures in a real authentication system and how they perform against shoulder surfing attacks. We conclude the paper with strategies for generating secure and memorable free-form gestures, which present a robust method for mobile authentication. △ Less

Submitted 2 January, 2014; originally announced January 2014.

arXiv:1102.5225 [pdf, other]

Let Us Dance Just a Little Bit More --- On the Information Capacity of the Human Motor System

Authors: Teemu Roos, Antti Oulasvirta, Laura Leppänen, Arttu Modig

Abstract: Fitts' law is a fundamental tool in measuring the capacity of the human motor system. However, it is, by definition, limited to aimed movements toward spatially expanded targets. We revisit its information-theoretic basis with the goal of generalizing it into unconstrained trained movement such as dance and sports. The proposed new measure is based on a subject's ability to accurately reproduce a… ▽ More Fitts' law is a fundamental tool in measuring the capacity of the human motor system. However, it is, by definition, limited to aimed movements toward spatially expanded targets. We revisit its information-theoretic basis with the goal of generalizing it into unconstrained trained movement such as dance and sports. The proposed new measure is based on a subject's ability to accurately reproduce a complex movement pattern. We demonstrate our framework using motion-capture data from professional dance performances. △ Less

Submitted 13 February, 2012; v1 submitted 25 February, 2011; originally announced February 2011.

Comments: Presented at the 2012 Information Theory and Applications Workshop, San Diego, CA

arXiv:cs/0609138 [pdf, ps, other]

doi 10.1109/TSP.2009.2021633

MDL Denoising Revisited

Authors: Teemu Roos, Petri Myllymäki, Jorma Rissanen

Abstract: We refine and extend an earlier MDL denoising criterion for wavelet-based denoising. We start by showing that the denoising problem can be reformulated as a clustering problem, where the goal is to obtain separate clusters for informative and non-informative wavelet coefficients, respectively. This suggests two refinements, adding a code-length for the model index, and extending the model in ord… ▽ More We refine and extend an earlier MDL denoising criterion for wavelet-based denoising. We start by showing that the denoising problem can be reformulated as a clustering problem, where the goal is to obtain separate clusters for informative and non-informative wavelet coefficients, respectively. This suggests two refinements, adding a code-length for the model index, and extending the model in order to account for subband-dependent coefficient distributions. A third refinement is derivation of soft thresholding inspired by predictive universal coding with weighted mixtures. We propose a practical method incorporating all three refinements, which is shown to achieve good performance and robustness in denoising both artificial and natural signals. △ Less

Submitted 25 September, 2006; originally announced September 2006.

Comments: Submitted to IEEE Transactions on Information Theory, June 2006

arXiv:hep-ph/9705357 [pdf, ps, other]

doi 10.1103/PhysRevD.56.6484

Inflaton Decay and Heavy Particle Production with Negative Coupling

Authors: Brian R. Greene, Tomislav Prokopec, Thomas G. Roos

Abstract: We study the decay of the inflaton in a renormalizable two scalar theory. Since the dynamics of the system is dominated by states with large occupation numbers which admit a semiclassical description, the decay can be studied by solving the classical equations of motion on the lattice. Of particular interest is the case when the cross-coupling between the inflaton and the second scalar field is… ▽ More We study the decay of the inflaton in a renormalizable two scalar theory. Since the dynamics of the system is dominated by states with large occupation numbers which admit a semiclassical description, the decay can be studied by solving the classical equations of motion on the lattice. Of particular interest is the case when the cross-coupling between the inflaton and the second scalar field is negative, which is naturally allowed in many realistic models. While the inflaton decays via parametric resonance in the positive coupling case we find that for negative coupling there is a new mechanism of particle production which we call negative coupling instability. Due to this new mechanism the variances of the fields grow significantly larger before the production is shut off by the backreaction of the created particles. We also find that heavy particles are produced much more efficiently with negative coupling, which is of prime importance for GUT baryogenesis. Using a simple toy model for baryogenesis and the results of our lattice simulations we show that for natural values of the cross-coupling enough 10^{14}GeV bosons are created to produce a baryon to entropy ratio consistent with observation. For positive coupling the value of the cross-coupling required to produce such massive particles is unnaturally large. In addition to our numerical results we obtain analytical estimates for the maximum variances of the fields in an expanding universe for all cases of interest in our model. △ Less

Submitted 19 May, 1997; originally announced May 1997.

Comments: 41 pages + 12 figures, Latex with standard Revtex macros

Report number: CLNS 97/1481

Journal ref: Phys. Rev. D 56, 6484 (1997)

arXiv:hep-ph/9610400 [pdf, ps, other]

doi 10.1103/PhysRevD.55.3768

Lattice study of classical inflaton decay

Authors: Tomislav Prokopec, Thomas G. Roos

Abstract: We study numerically the decay of the inflaton by solving the full non-linear equations of motion on the lattice. We confirm that parametric resonance is effective in transferring energy from the inflaton to a scalar field as long as the self-interactions of the second field are very small. However, in the very broad resonance case (q>>1) the decay rate is limited by scatterings, which significa… ▽ More We study numerically the decay of the inflaton by solving the full non-linear equations of motion on the lattice. We confirm that parametric resonance is effective in transferring energy from the inflaton to a scalar field as long as the self-interactions of the second field are very small. However, in the very broad resonance case (q>>1) the decay rate is limited by scatterings, which significantly slows down the decay. We also find that the inflaton cannot decay via parametric resonance into a scalar field with moderate self-interactions. This means that the preheating stage may be completely absent in many natural inflationary models. △ Less

Submitted 28 August, 1997; v1 submitted 17 October, 1996; originally announced October 1996.

Comments: 12 pages LaTeX with standard RevTex macros, uses epsf.sty. 7 postscript figures (included). No changes. Replaced to correct a problem with the figures

Report number: CLNS 96/1438

Journal ref: Phys.Rev. D55 (1997) 3768-3775

arXiv:astro-ph/9601183 [pdf, ps, other]

doi 10.1086/192314

Arecibo HI Map** of a Large Sample of Dwarf Irregular Galaxies

Authors: G. L. Hoffman, E. E. Salpeter, B. Farhat, T. Roos, H. Williams, G. Helou

Abstract: Neutral hydrogen map** of 70 dwarf irregular (Sm, Im and BCD) galaxies is reported, with position-velocity contour maps presented for those resolved by the Arecibo beam. The galaxies were selected either from the Virgo Cluster Catalog, from similarly identified field galaxies, or from a distance-limited sample within the Arecibo declination range. We do not find any isolated dwarfs with a larg… ▽ More Neutral hydrogen map** of 70 dwarf irregular (Sm, Im and BCD) galaxies is reported, with position-velocity contour maps presented for those resolved by the Arecibo beam. The galaxies were selected either from the Virgo Cluster Catalog, from similarly identified field galaxies, or from a distance-limited sample within the Arecibo declination range. We do not find any isolated dwarfs with a larger HI to optical radius ratio than DDO 154; the ``protogalaxy'' HI 1225+01 (Giovanelli \& Haynes 1989) continues to be a unique object among dwarfs that have been mapped in HI. For all dwarfs with significant rotation we are able to determine the sense of the spin. For a number of better-resolved dwarfs we are able to determine rotation curves, in most cases extending well beyond the last measured point in available synthesis array maps. Correlations among the several measures of galaxy size and mass are studied; in Paper II (Salpeter \& Hoffman 1996) we combine these data with those for the set of all available mapped dwarf irregular galaxies and for mapped spirals spanning a similar range of redshifts to investigate variations in Tully-Fisher relations and in surface densities as functions of galaxy size and luminosity or mass. △ Less

Submitted 30 January, 1996; originally announced January 1996.

Comments: AASTeX, to appear in ApJS, 32 pages + 5 tables, figures available upon request to [email protected]

arXiv:hep-th/9511073 [pdf, ps, other]

doi 10.1103/PhysRevD.54.2944

Wilson Renormalization Group Study of Inverse Symmetry Breaking

Authors: Thomas G. Roos

Abstract: For a large class of field theories there exist portions of parameter space for which the loop expansion predicts increased symmetry breaking at high temperature. Even though this behavior would clearly have far reaching implications for cosmology such theories have not been fully investigated in the literature. This is at least partially due to the counter intuitive nature of the result, which… ▽ More For a large class of field theories there exist portions of parameter space for which the loop expansion predicts increased symmetry breaking at high temperature. Even though this behavior would clearly have far reaching implications for cosmology such theories have not been fully investigated in the literature. This is at least partially due to the counter intuitive nature of the result, which has led to speculations that it is merely an artifact of perturbation theory. To address this issue we study the simplest model displaying high temperature symmetry breaking using a Wilson renormalization group approach. We find that although the critical temperature is not reliably estimated by the loop expansion the total volume of parameter space which leads to the inverse phase structure is not significantly different from the perturbative prediction. We also investigate the temperature dependence of the coupling constants and find that they run approximately according to their one-loop $β$-functions at high temperature. Thus, in particular, the quartic coupling of $phi^4$ theory is shown to increase with temperature, in contrast to the behavior obtained in some previous studies. △ Less

Submitted 3 August, 1997; v1 submitted 10 November, 1995; originally announced November 1995.

Comments: 40 pages LaTeX, 13 Postscript figures. Uses epsf.sty, amssym.def, amssym.tex. No changes. Replaced to correct a problem with the figures

Report number: CLNS 95/1373

Journal ref: Phys.Rev. D54 (1996) 2944

Showing 1–24 of 24 results for author: Roos, T