Search | arXiv e-print repository

Ergodic imitation: Learning from what to do and what not to do

Authors: Aleksandra Kalinowska, Ahalya Prabhakar, Kathleen Fitzsimons, Todd Murphey

Abstract: With growing access to versatile robotics, it is beneficial for end users to be able to teach robots tasks without needing to code a control policy. One possibility is to teach the robot through successful task executions. However, near-optimal demonstrations of a task can be difficult to provide and even successful demonstrations can fail to capture task aspects key to robust skill replication. H… ▽ More With growing access to versatile robotics, it is beneficial for end users to be able to teach robots tasks without needing to code a control policy. One possibility is to teach the robot through successful task executions. However, near-optimal demonstrations of a task can be difficult to provide and even successful demonstrations can fail to capture task aspects key to robust skill replication. Here, we propose a learning from demonstration (LfD) approach that enables learning of robust task definitions without the need for near-optimal demonstrations. We present a novel algorithmic framework for learning tasks based on the ergodic metric -- a measure of information content in motion. Moreover, we make use of negative demonstrations -- demonstrations of what not to do -- and show that they can help compensate for imperfect demonstrations, reduce the number of demonstrations needed, and highlight crucial task elements improving robot performance. In a proof-of-concept example of cart-pole inversion, we show that negative demonstrations alone can be sufficient to successfully learn and recreate a skill. Through a human subject study with 24 participants, we show that consistently more information about a task can be captured from combined positive and negative (posneg) demonstrations than from the same amount of just positive demonstrations. Finally, we demonstrate our learning approach on simulated tasks of target reaching and table cleaning with a 7-DoF Franka arm. Our results point towards a future with robust, data-efficient LfD for novice users. △ Less

Submitted 31 March, 2021; originally announced March 2021.

Comments: Kalinowska and Prabhakar contributed equally to this work

Journal ref: International Conference on Robotics and Automation, 2021

arXiv:2012.05183 [pdf, other]

doi 10.1109/LRA.2018.2884091

Dynamical System Segmentation for Information Measures in Motion

Authors: Thomas A. Berrueta, Ana Pervan, Kathleen Fitzsimons, Todd D. Murphey

Abstract: Motions carry information about the underlying task being executed. Previous work in human motion analysis suggests that complex motions may result from the composition of fundamental submovements called movemes. The existence of finite structure in motion motivates information-theoretic approaches to motion analysis and robotic assistance. We define task embodiment as the amount of task informati… ▽ More Motions carry information about the underlying task being executed. Previous work in human motion analysis suggests that complex motions may result from the composition of fundamental submovements called movemes. The existence of finite structure in motion motivates information-theoretic approaches to motion analysis and robotic assistance. We define task embodiment as the amount of task information encoded in an agent's motions. By decoding task-specific information embedded in motion, we can use task embodiment to create detailed performance assessments. We extract an alphabet of behaviors comprising a motion without \textit{a priori} knowledge using a novel algorithm, which we call dynamical system segmentation. For a given task, we specify an optimal agent, and compute an alphabet of behaviors representative of the task. We identify these behaviors in data from agent executions, and compare their relative frequencies against that of the optimal agent using the Kullback-Leibler divergence. We validate this approach using a dataset of human subjects (n=53) performing a dynamic task, and under this measure find that individuals receiving assistance better embody the task. Moreover, we find that task embodiment is a better predictor of assistance than integrated mean-squared-error. △ Less

Submitted 9 December, 2020; originally announced December 2020.

Comments: 8 pages

Journal ref: IEEE Robotics and Automation Letters, vol. 4, no. 1, pp. 169-176, 2019

arXiv:2007.04778 [pdf, other]

Shoulder abduction loading affects motor coordination in individuals with chronic stroke, informing targeted rehabilitation

Authors: Aleksandra Kalinowska, Kyra Rudy, Millicent Schlafly, Kathleen Fitzsimons, Julius P Dewald, Todd D Murphey

Abstract: Individuals post stroke experience motor impairments, such as loss of independent joint control, leading to an overall reduction in arm function. Their motion becomes slower and more discoordinated, making it difficult to complete timing-sensitive tasks, such as balancing a glass of water or carrying a bowl with a ball inside it. Understanding how the stroke-induced motor impairments interact with… ▽ More Individuals post stroke experience motor impairments, such as loss of independent joint control, leading to an overall reduction in arm function. Their motion becomes slower and more discoordinated, making it difficult to complete timing-sensitive tasks, such as balancing a glass of water or carrying a bowl with a ball inside it. Understanding how the stroke-induced motor impairments interact with each other can help design assisted training regimens for improved recovery. In this study, we investigate the effects of abnormal joint coupling patterns induced by flexion synergy on timing-sensitive motor coordination in the paretic upper limb. We design a virtual ball-in-bowl task that requires fast movements for optimal performance and implement it on a robotic system, capable of providing varying levels of abduction loading at the shoulder. We recruit 12 participants (6 individuals with chronic stroke and 6 unimpaired controls) and assess their skill at the task at 3 levels of loading, defined by the vertical force applied at the robot end-effector. Our results show that, for individuals with stroke, loading has a significant effect on their ability to generate quick coordinated motion. With increases in loading, their overall task performance decreases and they are less able to compensate for ball dynamics---frequency analysis of their motion indicates that abduction loading weakens their ability to generate movements at the resonant frequency of the dynamic task. This effect is likely due to an increased reliance on lower resolution indirect motor pathways in individuals post stroke. Given the inter-dependency of loading and dynamic task performance, we can create targeted robot-aided training protocols focused on improving timing-sensitive motor control, similar to existing progressive loading therapies, which have shown efficacy for expanding reachable workspace post stroke. △ Less

Submitted 5 June, 2020; originally announced July 2020.

Journal ref: IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2020

arXiv:2004.05116 [pdf, other]

A note on blind contact tracing at scale with applications to the COVID-19 pandemic

Authors: Jack K. Fitzsimons, Atul Mantri, Robert Pisarczyk, Tom Rainforth, Zhikuan Zhao

Abstract: The current COVID-19 pandemic highlights the utility of contact tracing, when combined with case isolation and social distancing, as an important tool for mitigating the spread of a disease [1]. Contact tracing provides a mechanism of identifying individuals with a high likelihood of previous exposure to a contagious disease, allowing additional precautions to be put in place to prevent continued… ▽ More The current COVID-19 pandemic highlights the utility of contact tracing, when combined with case isolation and social distancing, as an important tool for mitigating the spread of a disease [1]. Contact tracing provides a mechanism of identifying individuals with a high likelihood of previous exposure to a contagious disease, allowing additional precautions to be put in place to prevent continued transmission. Here we consider a cryptographic approach to contact tracing based on secure two-party computation (2PC). We begin by considering the problem of comparing a set of location histories held by two parties to determine whether they have come within some threshold distance while at the same time maintaining the privacy of the location histories. We propose a solution to this problem using pre-shared keys, adapted from an equality testing protocol due to Ishai et al [2]. We discuss how this protocol can be used to maintain privacy within practical contact tracing scenarios, including both app-based approaches and approaches which leverage location history held by telecoms and internet service providers. We examine the efficiency of this approach and show that existing infrastructure is sufficient to support anonymised contact tracing at a national level. △ Less

Submitted 10 April, 2020; originally announced April 2020.

arXiv:1912.00874 [pdf, other]

Implicit Priors for Knowledge Sharing in Bayesian Neural Networks

Authors: Jack K Fitzsimons, Sebastian M Schmon, Stephen J Roberts

Abstract: Bayesian interpretations of neural network have a long history, dating back to early work in the 1990's and have recently regained attention because of their desirable properties like uncertainty estimation, model robustness and regularisation. We want to discuss here the application of Bayesian models to knowledge sharing between neural networks. Knowledge sharing comes in different facets, such… ▽ More Bayesian interpretations of neural network have a long history, dating back to early work in the 1990's and have recently regained attention because of their desirable properties like uncertainty estimation, model robustness and regularisation. We want to discuss here the application of Bayesian models to knowledge sharing between neural networks. Knowledge sharing comes in different facets, such as transfer learning, model distillation and shared embeddings. All of these tasks have in common that learned "features" ought to be shared across different networks. Theoretically rooted in the concepts of Bayesian neural networks this work has widespread application to general deep learning. △ Less

Submitted 2 December, 2019; originally announced December 2019.

Comments: 5 pages, 2 figures

Journal ref: 4th workshop on Bayesian Deep Learning (NeurIPS 2019)

arXiv:1911.07983 [pdf, other]

Task-Based Hybrid Shared Control for Training Through Forceful Interaction

Authors: Kathleen Fitzsimons, Aleksandra Kalinowska, Julius P. A. Dewald, Todd Murphey

Abstract: Despite the fact that robotic platforms can provide both consistent practice and objective assessments of users over the course of their training, there are relatively few instances where physical human robot interaction has been significantly more effective than unassisted practice or human-mediated training. This paper describes a hybrid shared control robot, which enhances task learning through… ▽ More Despite the fact that robotic platforms can provide both consistent practice and objective assessments of users over the course of their training, there are relatively few instances where physical human robot interaction has been significantly more effective than unassisted practice or human-mediated training. This paper describes a hybrid shared control robot, which enhances task learning through kinesthetic feedback. The assistance assesses user actions using a task-specific evaluation criterion and selectively accepts or rejects them at each time instant. Through two human subject studies (total n=68), we show that this hybrid approach of switching between full transparency and full rejection of user inputs leads to increased skill acquisition and short-term retention compared to unassisted practice. Moreover, we show that the shared control paradigm exhibits features previously shown to promote successful training. It avoids user passivity by only rejecting user actions and allowing failure at the task. It improves performance during assistance, providing meaningful task-specific feedback. It is sensitive to initial skill of the user and behaves as an `assist-as-needed' control scheme---adapting its engagement in real time based on the performance and needs of the user. Unlike other successful algorithms, it does not require explicit modulation of the level of impedance or error amplification during training and it is permissive to a range of strategies because of its evaluation criterion. We demonstrate that the proposed hybrid shared control paradigm with a task-based minimal intervention criterion significantly enhances task-specific training. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: 16 pages, submitted to the International Journal of Robotics Research

arXiv:1806.02425 [pdf, other]

doi 10.15607/RSS.2018.XIV.046

Online User Assessment for Minimal Intervention During Task-Based Robotic Assistance

Authors: Aleksandra Kalinowska, Kathleen Fitzsimons, Julius Dewald, Todd D Murphey

Abstract: We propose a novel criterion for evaluating user input for human-robot interfaces for known tasks. We use the mode insertion gradient (MIG)---a tool from hybrid control theory---as a filtering criterion that instantaneously assesses the impact of user actions on a dynamic system over a time window into the future. As a result, the filter is permissive to many chosen strategies, minimally engaging,… ▽ More We propose a novel criterion for evaluating user input for human-robot interfaces for known tasks. We use the mode insertion gradient (MIG)---a tool from hybrid control theory---as a filtering criterion that instantaneously assesses the impact of user actions on a dynamic system over a time window into the future. As a result, the filter is permissive to many chosen strategies, minimally engaging, and skill-sensitive---qualities desired when evaluating human actions. Through a human study with 28 healthy volunteers, we show that the criterion exhibits a low, but significant, negative correlation between skill level, as estimated from task-specific measures in unassisted trials, and the rate of controller intervention during assistance. Moreover, a MIG-based filter can be utilized to create a shared control scheme for training or assistance. In the human study, we observe a substantial training effect when using a MIG-based filter to perform cart-pendulum inversion, particularly when comparing improvement via the RMS error measure. Using simulation of a controlled spring-loaded inverted pendulum (SLIP) as a test case, we observe that the MIG criterion could be used for assistance to guarantee either task completion or safety of a joint human-robot system, while maintaining the system's flexibility with respect to user-chosen strategies. △ Less

Submitted 6 June, 2018; originally announced June 2018.

Comments: 10 pages

Journal ref: Robotics: Science and Systems (RSS), 2018

arXiv:1804.00281 [pdf, other]

Smooth input preparation for quantum and quantum-inspired machine learning

Authors: Zhikuan Zhao, Jack K. Fitzsimons, Patrick Rebentrost, Vedran Dunjko, Joseph F. Fitzsimons

Abstract: Machine learning has recently emerged as a fruitful area for finding potential quantum computational advantage. Many of the quantum enhanced machine learning algorithms critically hinge upon the ability to efficiently produce states proportional to high-dimensional data points stored in a quantum accessible memory. Even given query access to exponentially many entries stored in a database, the con… ▽ More Machine learning has recently emerged as a fruitful area for finding potential quantum computational advantage. Many of the quantum enhanced machine learning algorithms critically hinge upon the ability to efficiently produce states proportional to high-dimensional data points stored in a quantum accessible memory. Even given query access to exponentially many entries stored in a database, the construction of which is considered a one-off overhead, it has been argued that the cost of preparing such amplitude-encoded states may offset any exponential quantum advantage. Here we prove using smoothed analysis, that if the data-analysis algorithm is robust against small entry-wise input perturbation, state preparation can always be achieved with constant queries. This criterion is typically satisfied in realistic machine learning applications, where input data is subjective to moderate noise. Our results are equally applicable to the recent seminal progress in quantum-inspired algorithms, where specially constructed databases suffice for polylogarithmic classical algorithm in low-rank cases. The consequence of our finding is that for the purpose of practical machine learning, polylogarithmic processing time is possible under a general and flexible input model with quantum algorithms or quantum-inspired classical algorithms in the low-rank cases. △ Less

Submitted 2 August, 2019; v1 submitted 1 April, 2018; originally announced April 2018.

Comments: 5 pages, 1 figure

arXiv:1803.10520 [pdf, ps, other]

doi 10.1103/PhysRevA.100.012304

Quantum algorithms for training Gaussian Processes

Authors: Zhikuan Zhao, Jack K. Fitzsimons, Michael A. Osborne, Stephen J. Roberts, Joseph F. Fitzsimons

Abstract: Gaussian processes (GPs) are important models in supervised machine learning. Training in Gaussian processes refers to selecting the covariance functions and the associated parameters in order to improve the outcome of predictions, the core of which amounts to evaluating the logarithm of the marginal likelihood (LML) of a given model. LML gives a concrete measure of the quality of prediction that… ▽ More Gaussian processes (GPs) are important models in supervised machine learning. Training in Gaussian processes refers to selecting the covariance functions and the associated parameters in order to improve the outcome of predictions, the core of which amounts to evaluating the logarithm of the marginal likelihood (LML) of a given model. LML gives a concrete measure of the quality of prediction that a GP model is expected to achieve. The classical computation of LML typically carries a polynomial time overhead with respect to the input size. We propose a quantum algorithm that computes the logarithm of the determinant of a Hermitian matrix, which runs in logarithmic time for sparse matrices. This is applied in conjunction with a variant of the quantum linear system algorithm that allows for logarithmic time computation of the form $\mathbf{y}^TA^{-1}\mathbf{y}$, where $\mathbf{y}$ is a dense vector and $A$ is the covariance matrix. We hence show that quantum computing can be used to estimate the LML of a GP with exponentially improved efficiency under certain conditions. △ Less

Submitted 28 March, 2018; originally announced March 2018.

Comments: 5 pages. Comments welcome

Journal ref: Phys. Rev. A 100, 012304 (2019)

arXiv:1608.00117 [pdf, other]

Improved stochastic trace estimation using mutually unbiased bases

Authors: J. K. Fitzsimons, M. A. Osborne, S. J. Roberts, J. F. Fitzsimons

Abstract: We examine the problem of estimating the trace of a matrix $A$ when given access to an oracle which computes $x^\dagger A x$ for an input vector $x$. We make use of the basis vectors from a set of mutually unbiased bases, widely studied in the field of quantum information processing, in the selection of probing vectors $x$. This approach offers a new state of the art single shot sampling variance… ▽ More We examine the problem of estimating the trace of a matrix $A$ when given access to an oracle which computes $x^\dagger A x$ for an input vector $x$. We make use of the basis vectors from a set of mutually unbiased bases, widely studied in the field of quantum information processing, in the selection of probing vectors $x$. This approach offers a new state of the art single shot sampling variance while requiring only $O(\log(n))$ random bits to generate each vector. This significantly improves on traditional methods such as Hutchinson's and Gaussian estimators in terms of the number of random bits required and worst case sample variance. △ Less

Submitted 30 July, 2016; originally announced August 2016.

Comments: 5 pages, 1 figure, 2 tables. Comments welcome

arXiv:1512.03929 [pdf, ps, other]

doi 10.1103/PhysRevA.99.052331

Quantum assisted Gaussian process regression

Authors: Zhikuan Zhao, Jack K. Fitzsimons, Joseph F. Fitzsimons

Abstract: Gaussian processes (GP) are a widely used model for regression problems in supervised machine learning. Implementation of GP regression typically requires $O(n^3)$ logic gates. We show that the quantum linear systems algorithm [Harrow et al., Phys. Rev. Lett. 103, 150502 (2009)] can be applied to Gaussian process regression (GPR), leading to an exponential reduction in computation time in some ins… ▽ More Gaussian processes (GP) are a widely used model for regression problems in supervised machine learning. Implementation of GP regression typically requires $O(n^3)$ logic gates. We show that the quantum linear systems algorithm [Harrow et al., Phys. Rev. Lett. 103, 150502 (2009)] can be applied to Gaussian process regression (GPR), leading to an exponential reduction in computation time in some instances. We show that even in some cases not ideally suited to the quantum linear systems algorithm, a polynomial increase in efficiency still occurs. △ Less

Submitted 12 December, 2015; originally announced December 2015.

Comments: 4 pages. Comments welcome

Journal ref: Phys. Rev. A 99, 052331 (2019)

Showing 1–11 of 11 results for author: Fitzsimons, K