-
Hydrodynamic diffusion and its breakdown near AdS$_2$ quantum critical points
Authors:
Daniel Arean,
Richard A. Davison,
Blaise Goutéraux,
Kenta Suzuki
Abstract:
Hydrodynamics provides a universal description of interacting quantum field theories at sufficiently long times and wavelengths, but breaks down at scales dependent on microscopic details of the theory. In the vicinity of a quantum critical point, it is expected that some aspects of the dynamics are universal and dictated by properties of the critical point. We use gauge-gravity duality to investi…
▽ More
Hydrodynamics provides a universal description of interacting quantum field theories at sufficiently long times and wavelengths, but breaks down at scales dependent on microscopic details of the theory. In the vicinity of a quantum critical point, it is expected that some aspects of the dynamics are universal and dictated by properties of the critical point. We use gauge-gravity duality to investigate the breakdown of diffusive hydrodynamics in two low temperature states dual to black holes with AdS$_2$ horizons, which exhibit quantum critical dynamics with an emergent scaling symmetry in time. We find that the breakdown is characterized by a collision between the diffusive pole of the retarded Green's function with a pole associated to the AdS$_2$ region of the geometry, such that the local equilibration time is set by infra-red properties of the theory. The absolute values of the frequency and wavevector at the collision ($ω_{eq}$ and $k_{eq}$) provide a natural characterization of all the low temperature diffusivities $D$ of the states via $D=ω_{eq}/k_{eq}^2$ where $ω_{eq}=2πΔT$ is set by the temperature $T$ and the scaling dimension $Δ$ of an operator of the infra-red quantum critical theory. We confirm that these relations are also satisfied in an SYK chain model in the limit of strong interactions. Our work paves the way towards a deeper understanding of transport in quantum critical phases.
△ Less
Submitted 10 August, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
Rearrangement: A Challenge for Embodied AI
Authors:
Dhruv Batra,
Angel X. Chang,
Sonia Chernova,
Andrew J. Davison,
Jia Deng,
Vladlen Koltun,
Sergey Levine,
Jitendra Malik,
Igor Mordatch,
Roozbeh Mottaghi,
Manolis Savva,
Hao Su
Abstract:
We describe a framework for research and evaluation in Embodied AI. Our proposal is based on a canonical task: Rearrangement. A standard task can focus the development of new techniques and serve as a source of trained models that can be transferred to other settings. In the rearrangement task, the goal is to bring a given physical environment into a specified state. The goal state can be specifie…
▽ More
We describe a framework for research and evaluation in Embodied AI. Our proposal is based on a canonical task: Rearrangement. A standard task can focus the development of new techniques and serve as a source of trained models that can be transferred to other settings. In the rearrangement task, the goal is to bring a given physical environment into a specified state. The goal state can be specified by object poses, by images, by a description in language, or by letting the agent experience the environment in the goal state. We characterize rearrangement scenarios along different axes and describe metrics for benchmarking rearrangement performance. To facilitate research and exploration, we present experimental testbeds of rearrangement scenarios in four different simulation environments. We anticipate that other datasets will be released and new simulation platforms will be built to support training of rearrangement agents and their deployment on physical systems.
△ Less
Submitted 3 November, 2020;
originally announced November 2020.
-
Deep Probabilistic Feature-metric Tracking
Authors:
Binbin Xu,
Andrew J. Davison,
Stefan Leutenegger
Abstract:
Dense image alignment from RGB-D images remains a critical issue for real-world applications, especially under challenging lighting conditions and in a wide baseline setting. In this paper, we propose a new framework to learn a pixel-wise deep feature map and a deep feature-metric uncertainty map predicted by a Convolutional Neural Network (CNN), which together formulate a deep probabilistic featu…
▽ More
Dense image alignment from RGB-D images remains a critical issue for real-world applications, especially under challenging lighting conditions and in a wide baseline setting. In this paper, we propose a new framework to learn a pixel-wise deep feature map and a deep feature-metric uncertainty map predicted by a Convolutional Neural Network (CNN), which together formulate a deep probabilistic feature-metric residual of the two-view constraint that can be minimised using Gauss-Newton in a coarse-to-fine optimisation framework. Furthermore, our network predicts a deep initial pose for faster and more reliable convergence. The optimisation steps are differentiable and unrolled to train in an end-to-end fashion. Due to its probabilistic essence, our approach can easily couple with other residuals, where we show a combination with ICP. Experimental results demonstrate state-of-the-art performances on the TUM RGB-D dataset and the 3D rigid object tracking dataset. We further demonstrate our method's robustness and convergence qualitatively.
△ Less
Submitted 25 November, 2020; v1 submitted 31 August, 2020;
originally announced August 2020.
-
Improved inference on risk measures for univariate extremes
Authors:
Léo R. Belzile,
Anthony C. Davison
Abstract:
We discuss the use of likelihood asymptotics for inference on risk measures in univariate extreme value problems, focusing on estimation of high quantiles and similar summaries of risk for uncertainty quantification. We study whether higher-order approximation based on the tangent exponential model can provide improved inferences, and conclude that inference based on maxima is generally robust to…
▽ More
We discuss the use of likelihood asymptotics for inference on risk measures in univariate extreme value problems, focusing on estimation of high quantiles and similar summaries of risk for uncertainty quantification. We study whether higher-order approximation based on the tangent exponential model can provide improved inferences, and conclude that inference based on maxima is generally robust to mild model misspecification and that profile likelihood-based confidence intervals will often be adequate, whereas inferences based on threshold exceedances can be badly biased but may be improved by higher-order methods, at least for moderate sample sizes. We use the methods to shed light on catastrophic rainfall in Venezuela, flooding in Venice, and the lifetimes of Italian semi-supercentenarians.
△ Less
Submitted 27 January, 2021; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Next Waves in Veridical Network Embedding
Authors:
Owen G. Ward,
Zhen Huang,
Andrew Davison,
Tian Zheng
Abstract:
Embedding nodes of a large network into a metric (e.g., Euclidean) space has become an area of active research in statistical machine learning, which has found applications in natural and social sciences. Generally, a representation of a network object is learned in a Euclidean geometry and is then used for subsequent tasks regarding the nodes and/or edges of the network, such as community detecti…
▽ More
Embedding nodes of a large network into a metric (e.g., Euclidean) space has become an area of active research in statistical machine learning, which has found applications in natural and social sciences. Generally, a representation of a network object is learned in a Euclidean geometry and is then used for subsequent tasks regarding the nodes and/or edges of the network, such as community detection, node classification and link prediction. Network embedding algorithms have been proposed in multiple disciplines, often with domain-specific notations and details. In addition, different measures and tools have been adopted to evaluate and compare the methods proposed under different settings, often dependent of the downstream tasks. As a result, it is challenging to study these algorithms in the literature systematically. Motivated by the recently proposed Veridical Data Science (VDS) framework, we propose a framework for network embedding algorithms and discuss how the principles of predictability, computability and stability apply in this context. The utilization of this framework in network embedding holds the potential to motivate and point to new directions for future research.
△ Less
Submitted 12 August, 2021; v1 submitted 10 July, 2020;
originally announced July 2020.
-
An EAGLE's View of Ex-situ Galaxy Growth
Authors:
Thomas A. Davison,
Mark A. Norris,
Joel L. Pfeffer,
Jonathan J. Davies,
Robert A. Crain
Abstract:
Modern observational and analytic techniques now enable the direct measurement of star formation histories and the inference of galaxy assembly histories. However, current theoretical predictions of assembly are not ideally suited for direct comparison with such observational data. We therefore extend the work of prior examinations of the contribution of ex-situ stars to the stellar mass budget of…
▽ More
Modern observational and analytic techniques now enable the direct measurement of star formation histories and the inference of galaxy assembly histories. However, current theoretical predictions of assembly are not ideally suited for direct comparison with such observational data. We therefore extend the work of prior examinations of the contribution of ex-situ stars to the stellar mass budget of simulated galaxies. Our predictions are specifically tailored for direct testing with a new generation of observational techniques by calculating ex-situ fractions as functions of galaxy mass and morphological type, for a range of surface brightnesses. These enable comparison with results from large FoV IFU spectrographs, and increasingly accurate spectral fitting, providing a look-up method for the estimated accreted fraction. We furthermore provide predictions of ex-situ mass fractions as functions of galaxy mass, galactocentric radius and environment. Using $z=0$ snapshots from the 100cMpc$^3$ and 25cMpc$^3$ EAGLE simulations we corroborate the findings of prior studies, finding that ex-situ fraction increases with stellar mass for central and satellite galaxies in a stellar mass range of 2$\times$10$^{7}$ - 1.9$\times$10$^{12}$ M$_{\odot}$. For those galaxies of mass M$_*$>5$\times$10$^{8}$M$_{\odot}$, we find that the total ex-situ mass fraction is greater for more extended galaxies at fixed mass. When categorising satellite galaxies by their parent group/cluster halo mass we find that the ex-situ fraction decreases with increasing parent halo mass at fixed galaxy mass. This apparently counter-intuitive result may be the result of high passing velocities within large cluster halos inhibiting efficient accretion onto individual galaxies.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
NodeSLAM: Neural Object Descriptors for Multi-View Shape Reconstruction
Authors:
Edgar Sucar,
Kentaro Wada,
Andrew Davison
Abstract:
The choice of scene representation is crucial in both the shape inference algorithms it requires and the smart applications it enables. We present efficient and optimisable multi-class learned object descriptors together with a novel probabilistic and differential rendering engine, for principled full object shape inference from one or more RGB-D images. Our framework allows for accurate and robus…
▽ More
The choice of scene representation is crucial in both the shape inference algorithms it requires and the smart applications it enables. We present efficient and optimisable multi-class learned object descriptors together with a novel probabilistic and differential rendering engine, for principled full object shape inference from one or more RGB-D images. Our framework allows for accurate and robust 3D object reconstruction which enables multiple applications including robot gras** and placing, augmented reality, and the first object-level SLAM system capable of optimising object poses and shapes jointly with camera trajectory.
△ Less
Submitted 10 October, 2020; v1 submitted 9 April, 2020;
originally announced April 2020.
-
MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion
Authors:
Kentaro Wada,
Edgar Sucar,
Stephen James,
Daniel Lenton,
Andrew J. Davison
Abstract:
Robots and other smart devices need efficient object-based scene representations from their on-board vision systems to reason about contact, physics and occlusion. Recognized precise object models will play an important role alongside non-parametric reconstructions of unrecognized structures. We present a system which can estimate the accurate poses of multiple known objects in contact and occlusi…
▽ More
Robots and other smart devices need efficient object-based scene representations from their on-board vision systems to reason about contact, physics and occlusion. Recognized precise object models will play an important role alongside non-parametric reconstructions of unrecognized structures. We present a system which can estimate the accurate poses of multiple known objects in contact and occlusion from real-time, embodied multi-view vision. Our approach makes 3D object pose proposals from single RGB-D views, accumulates pose estimates and non-parametric occupancy information from multiple views as the camera moves, and performs joint optimization to estimate consistent, non-intersecting poses for multiple objects in contact.
We verify the accuracy and robustness of our approach experimentally on 2 object datasets: YCB-Video, and our own challenging Cluttered YCB-Video. We demonstrate a real-time robotics application where a robot arm precisely and orderly disassembles complicated piles of objects, using only on-board RGB-D vision.
△ Less
Submitted 8 April, 2020;
originally announced April 2020.
-
Tail risk inference via expectiles in heavy-tailed time series
Authors:
Anthony C. Davison,
Simone A. Padoan,
Gilles Stupfler
Abstract:
Expectiles define the only law-invariant, coherent and elicitable risk measure apart from the expectation. The popularity of expectile-based risk measures is steadily growing and their properties have been studied for independent data, but further results are needed to use extreme expectiles with dependent time series such as financial data. In this paper we establish a basis for inference on extr…
▽ More
Expectiles define the only law-invariant, coherent and elicitable risk measure apart from the expectation. The popularity of expectile-based risk measures is steadily growing and their properties have been studied for independent data, but further results are needed to use extreme expectiles with dependent time series such as financial data. In this paper we establish a basis for inference on extreme expectiles and expectile-based marginal expected shortfall in a general $β$-mixing context that encompasses ARMA, ARCH and GARCH models with heavy-tailed innovations. Simulations and applications to financial returns show that the new estimators and confidence intervals greatly improve on existing ones when the data are dependent.
△ Less
Submitted 12 October, 2021; v1 submitted 8 April, 2020;
originally announced April 2020.
-
Bundle Adjustment on a Graph Processor
Authors:
Joseph Ortiz,
Mark Pupilli,
Stefan Leutenegger,
Andrew J. Davison
Abstract:
Graph processors such as Graphcore's Intelligence Processing Unit (IPU) are part of the major new wave of novel computer architecture for AI, and have a general design with massively parallel computation, distributed on-chip memory and very high inter-core communication bandwidth which allows breakthrough performance for message passing algorithms on arbitrary graphs. We show for the first time th…
▽ More
Graph processors such as Graphcore's Intelligence Processing Unit (IPU) are part of the major new wave of novel computer architecture for AI, and have a general design with massively parallel computation, distributed on-chip memory and very high inter-core communication bandwidth which allows breakthrough performance for message passing algorithms on arbitrary graphs. We show for the first time that the classical computer vision problem of bundle adjustment (BA) can be solved extremely fast on a graph processor using Gaussian Belief Propagation. Our simple but fully parallel implementation uses the 1216 cores on a single IPU chip to, for instance, solve a real BA problem with 125 keyframes and 1919 points in under 40ms, compared to 1450ms for the Ceres CPU library. Further code optimisation will surely increase this difference on static problems, but we argue that the real promise of graph processing is for flexible in-place optimisation of general, dynamically changing factor graphs representing Spatial AI problems. We give indications of this with experiments showing the ability of GBP to efficiently solve incremental SLAM problems, and deal with robust cost functions and different types of factors.
△ Less
Submitted 30 March, 2020; v1 submitted 6 March, 2020;
originally announced March 2020.
-
Comparing View-Based and Map-Based Semantic Labelling in Real-Time SLAM
Authors:
Zoe Landgraf,
Fabian Falck,
Michael Bloesch,
Stefan Leutenegger,
Andrew Davison
Abstract:
Generally capable Spatial AI systems must build persistent scene representations where geometric models are combined with meaningful semantic labels. The many approaches to labelling scenes can be divided into two clear groups: view-based which estimate labels from the input view-wise data and then incrementally fuse them into the scene model as it is built; and map-based which label the generated…
▽ More
Generally capable Spatial AI systems must build persistent scene representations where geometric models are combined with meaningful semantic labels. The many approaches to labelling scenes can be divided into two clear groups: view-based which estimate labels from the input view-wise data and then incrementally fuse them into the scene model as it is built; and map-based which label the generated scene model. However, there has so far been no attempt to quantitatively compare view-based and map-based labelling. Here, we present an experimental framework and comparison which uses real-time height map fusion as an accessible platform for a fair comparison, opening up the route to further systematic research in this area.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
Functional Peaks-over-threshold Analysis
Authors:
Raphaël de Fondeville,
Anthony C. Davison
Abstract:
Peaks-over-threshold analysis using the generalized Pareto distribution is widely applied in modelling tails of univariate random variables, but much information may be lost when complex extreme events are studied using univariate results. In this paper, we extend peaks-over-threshold analysis to extremes of functional data. Threshold exceedances defined using a functional $r$ are modelled by the…
▽ More
Peaks-over-threshold analysis using the generalized Pareto distribution is widely applied in modelling tails of univariate random variables, but much information may be lost when complex extreme events are studied using univariate results. In this paper, we extend peaks-over-threshold analysis to extremes of functional data. Threshold exceedances defined using a functional $r$ are modelled by the generalized $r$-Pareto process, a functional generalization of the generalized Pareto distribution that covers the three classical regimes for the decay of tail probabilities, and that is the only possible continuous limit for $r$-exceedances of a properly rescaled process. We give construction rules, simulation algorithms and inference procedures for generalized $r$-Pareto processes, discuss model validation, and use the new methodology to study extreme European windstorms and heavy spatial rainfall.
△ Less
Submitted 13 January, 2022; v1 submitted 7 February, 2020;
originally announced February 2020.
-
DeepFactors: Real-Time Probabilistic Dense Monocular SLAM
Authors:
Jan Czarnowski,
Tristan Laidlow,
Ronald Clark,
Andrew J. Davison
Abstract:
The ability to estimate rich geometry and camera motion from monocular imagery is fundamental to future interactive robotics and augmented reality applications. Different approaches have been proposed that vary in scene geometry representation (sparse landmarks, dense maps), the consistency metric used for optimising the multi-view problem, and the use of learned priors. We present a SLAM system t…
▽ More
The ability to estimate rich geometry and camera motion from monocular imagery is fundamental to future interactive robotics and augmented reality applications. Different approaches have been proposed that vary in scene geometry representation (sparse landmarks, dense maps), the consistency metric used for optimising the multi-view problem, and the use of learned priors. We present a SLAM system that unifies these methods in a probabilistic framework while still maintaining real-time performance. This is achieved through the use of a learned compact depth map representation and reformulating three different types of errors: photometric, reprojection and geometric, which we make use of within standard factor graph software. We evaluate our system on trajectory estimation and depth reconstruction on real-world sequences and present various examples of estimated dense geometry.
△ Less
Submitted 14 January, 2020;
originally announced January 2020.
-
Human mortality at extreme age
Authors:
Léo R. Belzile,
Anthony C. Davison,
Holger Rootzén,
Dmitrii Zholud
Abstract:
We use a combination of extreme value theory, survival analysis and computer-intensive methods to analyze the mortality of Italian and French semi-supercentenarians for whom there are validated records. After accounting for the effects of the sampling frame, there appears to be a constant rate of mortality beyond age 108 years and no difference between countries and cohorts. These findings are con…
▽ More
We use a combination of extreme value theory, survival analysis and computer-intensive methods to analyze the mortality of Italian and French semi-supercentenarians for whom there are validated records. After accounting for the effects of the sampling frame, there appears to be a constant rate of mortality beyond age 108 years and no difference between countries and cohorts. These findings are consistent with previous work based on the International Database on Longevity and suggest that any physical upper bound for humans is so large that it is unlikely to be approached. There is no evidence of differences in survival between women and men after age 108 in the Italian data and the International Database on Longevity; however survival is lower for men in the French data.
△ Less
Submitted 2 October, 2020; v1 submitted 13 January, 2020;
originally announced January 2020.
-
An Unethical Optimization Principle
Authors:
Nicholas Beale,
Heather Battey,
Anthony C. Davison,
Robert S. MacKay
Abstract:
If an artificial intelligence aims to maximise risk-adjusted return, then under mild conditions it is disproportionately likely to pick an unethical strategy unless the objective function allows sufficiently for this risk. Even if the proportion $η$ of available unethical strategies is small, the probability ${p_U}$ of picking an unethical strategy can become large; indeed unless returns are fat-t…
▽ More
If an artificial intelligence aims to maximise risk-adjusted return, then under mild conditions it is disproportionately likely to pick an unethical strategy unless the objective function allows sufficiently for this risk. Even if the proportion $η$ of available unethical strategies is small, the probability ${p_U}$ of picking an unethical strategy can become large; indeed unless returns are fat-tailed ${p_U}$ tends to unity as the strategy space becomes large. We define an Unethical Odds Ratio Upsilon ($Υ$) that allows us to calculate ${p_U}$ from $η$, and we derive a simple formula for the limit of $Υ$ as the strategy space becomes large. We give an algorithm for estimating $Υ$ and ${p_U}$ in finite cases and discuss how to deal with infinite strategy spaces. We show how this principle can be used to help detect unethical strategies and to estimate $η$. Finally we sketch some policy implications of this work.
△ Less
Submitted 12 November, 2019;
originally announced November 2019.
-
Learning One-Shot Imitation from Humans without Humans
Authors:
Alessandro Bonardi,
Stephen James,
Andrew J. Davison
Abstract:
Humans can naturally learn to execute a new task by seeing it performed by other individuals once, and then reproduce it in a variety of configurations. Endowing robots with this ability of imitating humans from third person is a very immediate and natural way of teaching new tasks. Only recently, through meta-learning, there have been successful attempts to one-shot imitation learning from humans…
▽ More
Humans can naturally learn to execute a new task by seeing it performed by other individuals once, and then reproduce it in a variety of configurations. Endowing robots with this ability of imitating humans from third person is a very immediate and natural way of teaching new tasks. Only recently, through meta-learning, there have been successful attempts to one-shot imitation learning from humans; however, these approaches require a lot of human resources to collect the data in the real world to train the robot. But is there a way to remove the need for real world human demonstrations during training? We show that with Task-Embedded Control Networks, we can infer control polices by embedding human demonstrations that can condition a control policy and achieve one-shot imitation learning. Importantly, we do not use a real human arm to supply demonstrations during training, but instead leverage domain randomisation in an application that has not been seen before: sim-to-real transfer on humans. Upon evaluating our approach on pushing and placing tasks in both simulation and in the real world, we show that in comparison to a system that was trained on real-world data we are able to achieve similar results by utilising only simulation data.
△ Less
Submitted 4 November, 2019;
originally announced November 2019.
-
FutureMap** 2: Gaussian Belief Propagation for Spatial AI
Authors:
Andrew J. Davison,
Joseph Ortiz
Abstract:
We argue the case for Gaussian Belief Propagation (GBP) as a strong algorithmic framework for the distributed, generic and incremental probabilistic estimation we need in Spatial AI as we aim at high performance smart robots and devices which operate within the constraints of real products. Processor hardware is changing rapidly, and GBP has the right character to take advantage of highly distribu…
▽ More
We argue the case for Gaussian Belief Propagation (GBP) as a strong algorithmic framework for the distributed, generic and incremental probabilistic estimation we need in Spatial AI as we aim at high performance smart robots and devices which operate within the constraints of real products. Processor hardware is changing rapidly, and GBP has the right character to take advantage of highly distributed processing and storage while estimating global quantities, as well as great flexibility. We present a detailed tutorial on GBP, relating to the standard factor graph formulation used in robotics and computer vision, and give several simulation examples with code which demonstrate its properties.
△ Less
Submitted 7 November, 2022; v1 submitted 30 October, 2019;
originally announced October 2019.
-
RLBench: The Robot Learning Benchmark & Learning Environment
Authors:
Stephen James,
Zicong Ma,
David Rovick Arrojo,
Andrew J. Davison
Abstract:
We present a challenging new benchmark and learning-environment for robot learning: RLBench. The benchmark features 100 completely unique, hand-designed tasks ranging in difficulty, from simple target reaching and door opening, to longer multi-stage tasks, such as opening an oven and placing a tray in it. We provide an array of both proprioceptive observations and visual observations, which includ…
▽ More
We present a challenging new benchmark and learning-environment for robot learning: RLBench. The benchmark features 100 completely unique, hand-designed tasks ranging in difficulty, from simple target reaching and door opening, to longer multi-stage tasks, such as opening an oven and placing a tray in it. We provide an array of both proprioceptive observations and visual observations, which include rgb, depth, and segmentation masks from an over-the-shoulder stereo camera and an eye-in-hand monocular camera. Uniquely, each task comes with an infinite supply of demos through the use of motion planners operating on a series of waypoints given during task creation time; enabling an exciting flurry of demonstration-based learning. RLBench has been designed with scalability in mind; new tasks, along with their motion-planned demos, can be easily created and then verified by a series of tools, allowing users to submit their own tasks to the RLBench task repository. This large-scale benchmark aims to accelerate progress in a number of vision-guided manipulation research areas, including: reinforcement learning, imitation learning, multi-task learning, geometric computer vision, and in particular, few-shot learning. With the benchmark's breadth of tasks and demonstrations, we propose the first large-scale few-shot challenge in robotics. We hope that the scale and diversity of RLBench offers unparalleled research opportunities in the robot learning community and beyond.
△ Less
Submitted 26 September, 2019;
originally announced September 2019.
-
PyRep: Bringing V-REP to Deep Robot Learning
Authors:
Stephen James,
Marc Freese,
Andrew J. Davison
Abstract:
PyRep is a toolkit for robot learning research, built on top of the virtual robotics experimentation platform (V-REP). Through a series of modifications and additions, we have created a tailored version of V-REP built with robot learning in mind. The new PyRep toolkit offers three improvements: (1) a simple and flexible API for robot control and scene manipulation, (2) a new rendering engine, and…
▽ More
PyRep is a toolkit for robot learning research, built on top of the virtual robotics experimentation platform (V-REP). Through a series of modifications and additions, we have created a tailored version of V-REP built with robot learning in mind. The new PyRep toolkit offers three improvements: (1) a simple and flexible API for robot control and scene manipulation, (2) a new rendering engine, and (3) speed boosts upwards of 10,000x in comparison to the previous Python Remote API. With these improvements, we believe PyRep is the ideal toolkit to facilitate rapid prototy** of learning algorithms in the areas of reinforcement learning, imitation learning, state estimation, map**, and computer vision.
△ Less
Submitted 26 June, 2019;
originally announced June 2019.
-
Horizon constraints on holographic Green's functions
Authors:
Mike Blake,
Richard A. Davison,
David Vegh
Abstract:
We explore a new class of general properties of thermal holographic Green's functions that can be deduced from the near-horizon behaviour of classical perturbations in asymptotically anti-de Sitter spacetimes. We show that at negative imaginary Matsubara frequencies and appropriate complex values of the wavenumber the retarded Green's functions of generic operators are not uniquely defined, due to…
▽ More
We explore a new class of general properties of thermal holographic Green's functions that can be deduced from the near-horizon behaviour of classical perturbations in asymptotically anti-de Sitter spacetimes. We show that at negative imaginary Matsubara frequencies and appropriate complex values of the wavenumber the retarded Green's functions of generic operators are not uniquely defined, due to the lack of a unique ingoing solution for the bulk perturbations. From a boundary perspective these `pole-skip**' points correspond to locations in the complex frequency and momentum planes at which a line of poles of the retarded Green's function intersects with a line of zeroes. As a consequence the dispersion relations of collective modes in the boundary theory at energy scales $ω\sim T$ are directly constrained by the bulk dynamics near the black-brane horizon. For the case of conserved $U(1)$ current and energy-momentum tensor operators we give examples where the dispersion relations of hydrodynamic modes pass through a succession of pole-skip** points as real wavenumber is increased. We discuss implications of our results for transport, hydrodynamics and quantum chaos in holographic systems.
△ Less
Submitted 13 January, 2020; v1 submitted 29 April, 2019;
originally announced April 2019.
-
Event-based Vision: A Survey
Authors:
Guillermo Gallego,
Tobi Delbruck,
Garrick Orchard,
Chiara Bartolozzi,
Brian Taba,
Andrea Censi,
Stefan Leutenegger,
Andrew Davison,
Joerg Conradt,
Kostas Daniilidis,
Davide Scaramuzza
Abstract:
Event cameras are bio-inspired sensors that differ from conventional frame cameras: Instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of…
▽ More
Event cameras are bio-inspired sensors that differ from conventional frame cameras: Instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur. Hence, event cameras have a large potential for robotics and computer vision in challenging scenarios for traditional cameras, such as low-latency, high speed, and high dynamic range. However, novel methods are required to process the unconventional output of these sensors in order to unlock their potential. This paper provides a comprehensive overview of the emerging field of event-based vision, with a focus on the applications and the algorithms developed to unlock the outstanding properties of event cameras. We present event cameras from their working principle, the actual sensors that are available and the tasks that they have been used for, from low-level vision (feature detection and tracking, optic flow, etc.) to high-level vision (reconstruction, segmentation, recognition). We also discuss the techniques developed to process events, including learning-based techniques, as well as specialized processors for these novel sensors, such as spiking neural networks. Additionally, we highlight the challenges that remain to be tackled and the opportunities that lie ahead in the search for a more efficient, bio-inspired way for machines to perceive and interact with the world.
△ Less
Submitted 8 August, 2020; v1 submitted 17 April, 2019;
originally announced April 2019.
-
SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representations
Authors:
Shuaifeng Zhi,
Michael Bloesch,
Stefan Leutenegger,
Andrew J. Davison
Abstract:
Systems which incrementally create 3D semantic maps from image sequences must store and update representations of both geometry and semantic entities. However, while there has been much work on the correct formulation for geometrical estimation, state-of-the-art systems usually rely on simple semantic representations which store and update independent label estimates for each surface element (dept…
▽ More
Systems which incrementally create 3D semantic maps from image sequences must store and update representations of both geometry and semantic entities. However, while there has been much work on the correct formulation for geometrical estimation, state-of-the-art systems usually rely on simple semantic representations which store and update independent label estimates for each surface element (depth pixels, surfels, or voxels). Spatial correlation is discarded, and fused label maps are incoherent and noisy.
We introduce a new compact and optimisable semantic representation by training a variational auto-encoder that is conditioned on a colour image. Using this learned latent space, we can tackle semantic label fusion by jointly optimising the low-dimenional codes associated with each of a set of overlap** images, producing consistent fused label maps which preserve spatial correlation. We also show how this approach can be used within a monocular keyframe based semantic map** system where a similar code approach is used for geometry. The probabilistic formulation allows a flexible formulation where we can jointly estimate motion, geometry and semantics in a unified optimisation.
△ Less
Submitted 18 March, 2019; v1 submitted 15 March, 2019;
originally announced March 2019.
-
Overcoming Multi-Model Forgetting
Authors:
Yassine Benyahia,
Kaicheng Yu,
Kamil Bennani-Smires,
Martin Jaggi,
Anthony Davison,
Mathieu Salzmann,
Claudiu Musat
Abstract:
We identify a phenomenon, which we refer to as multi-model forgetting, that occurs when sequentially training multiple deep networks with partially-shared parameters; the performance of previously-trained models degrades as one optimizes a subsequent one, due to the overwriting of shared parameters. To overcome this, we introduce a statistically-justified weight plasticity loss that regularizes th…
▽ More
We identify a phenomenon, which we refer to as multi-model forgetting, that occurs when sequentially training multiple deep networks with partially-shared parameters; the performance of previously-trained models degrades as one optimizes a subsequent one, due to the overwriting of shared parameters. To overcome this, we introduce a statistically-justified weight plasticity loss that regularizes the learning of a model's shared parameters according to their importance for the previous models, and demonstrate its effectiveness when training two models sequentially and for neural architecture search. Adding weight plasticity in neural architecture search preserves the best models to the end of the search and yields improved results in both natural language processing and computer vision tasks.
△ Less
Submitted 2 March, 2019; v1 submitted 21 February, 2019;
originally announced February 2019.
-
Penultimate Analysis of the Conditional Multivariate Extremes Tail Model
Authors:
Thomas Lugrin,
Anthony C. Davison,
Jonathan A. Tawn
Abstract:
Models for extreme values are generally derived from limit results, which are meant to be good enough approximations when applied to finite samples. Depending on the speed of convergence of the process underlying the data, these approximations may fail to represent subasymptotic features present in the data, and thus may introduce bias. The case of univariate maxima has been widely explored in the…
▽ More
Models for extreme values are generally derived from limit results, which are meant to be good enough approximations when applied to finite samples. Depending on the speed of convergence of the process underlying the data, these approximations may fail to represent subasymptotic features present in the data, and thus may introduce bias. The case of univariate maxima has been widely explored in the literature, a prominent example being the slow convergence to their Gumbel limit of Gaussian maxima, which are better approximated by a negative Weibull distribution at finite levels. In the context of subasymptotic multivariate extremes, research has only dealt with specific cases related to componentwise maxima and multivariate regular variation. This paper explores the conditional extremes model (Heffernan and Tawn, 2004) in order to shed light on its finite-sample behaviour and to reduce the bias of extrapolations beyond the range of the available data. We identify second-order features for different types of conditional copulas, and obtain results that echo those from the univariate context. These results suggest possible extensions of the conditional tail model, which will enable it to be fitted at less extreme thresholds.
△ Less
Submitted 19 February, 2019;
originally announced February 2019.
-
Trends in the extremes of environments associated with severe US thunderstorms
Authors:
Erwan Koch,
Jonathan Koh,
Anthony C. Davison,
Chiara Lepore,
Michael K. Tippett
Abstract:
Severe thunderstorms can have devastating impacts. Concurrently high values of convective available potential energy (CAPE) and storm relative helicity (SRH) are known to be conducive to severe weather, so high values of PROD=$\sqrt{\mathrm{CAPE}} \times$SRH have been used to indicate high risk of severe thunderstorms. We consider the extreme values of these three variables for a large area of the…
▽ More
Severe thunderstorms can have devastating impacts. Concurrently high values of convective available potential energy (CAPE) and storm relative helicity (SRH) are known to be conducive to severe weather, so high values of PROD=$\sqrt{\mathrm{CAPE}} \times$SRH have been used to indicate high risk of severe thunderstorms. We consider the extreme values of these three variables for a large area of the contiguous US over the period 1979-2015, and use extreme-value theory and a multiple testing procedure to show that there is a significant time trend in the extremes for PROD maxima in April, May and August, for CAPE maxima in April, May and June, and for maxima of SRH in April and May. These observed increases in CAPE are also relevant for rainfall extremes and are expected in a warmer climate, but have not previously been reported. Moreover, we show that the El Niño-Southern Oscillation explains variation in the extremes of PROD and SRH in February. Our results suggest that the risk from severe thunderstorms in April and May is increasing in parts of the US where it was already high, and that the risk from storms in February tends to be higher over the main part of the region during La Niña years. Our results differ from those obtained in earlier studies using extreme-value techniques to analyze a quantity similar to PROD.
△ Less
Submitted 30 October, 2019; v1 submitted 30 January, 2019;
originally announced January 2019.
-
Self-Supervised Generalisation with Meta Auxiliary Learning
Authors:
Shikun Liu,
Andrew J. Davison,
Edward Johns
Abstract:
Learning with auxiliary tasks can improve the ability of a primary task to generalise. However, this comes at the cost of manually labelling auxiliary data. We propose a new method which automatically learns appropriate labels for an auxiliary task, such that any supervised learning task can be improved without requiring access to any further data. The approach is to train two neural networks: a l…
▽ More
Learning with auxiliary tasks can improve the ability of a primary task to generalise. However, this comes at the cost of manually labelling auxiliary data. We propose a new method which automatically learns appropriate labels for an auxiliary task, such that any supervised learning task can be improved without requiring access to any further data. The approach is to train two neural networks: a label-generation network to predict the auxiliary labels, and a multi-task network to train the primary task alongside the auxiliary task. The loss for the label-generation network incorporates the loss of the multi-task network, and so this interaction between the two networks can be seen as a form of meta learning with a double gradient. We show that our proposed method, Meta AuXiliary Learning (MAXL), outperforms single-task learning on 7 image datasets, without requiring any additional data. We also show that MAXL outperforms several other baselines for generating auxiliary labels, and is even competitive when compared with human-defined auxiliary labels. The self-supervised nature of our method leads to a promising new direction towards automated generalisation. Source code can be found at https://github.com/lorenmt/maxl.
△ Less
Submitted 26 November, 2019; v1 submitted 25 January, 2019;
originally announced January 2019.
-
Impact of irrelevant deformations on thermodynamics and transport in holographic quantum critical states
Authors:
Richard A. Davison,
Simon A. Gentle,
Blaise Goutéraux
Abstract:
We study thermodynamic and transport observables of quantum critical states that arise in the infra-red limit of holographic renormalisation group flows. Although these observables are expected to exhibit quantum critical scaling, there are a number of cases in which their frequency and temperature dependences are in apparent contradiction with scaling theories. We study two different classes of e…
▽ More
We study thermodynamic and transport observables of quantum critical states that arise in the infra-red limit of holographic renormalisation group flows. Although these observables are expected to exhibit quantum critical scaling, there are a number of cases in which their frequency and temperature dependences are in apparent contradiction with scaling theories. We study two different classes of examples, and show in both cases that the apparent breakdown of scaling is a consequence of the dependence of observables on an irrelevant deformation of the quantum critical state. By assigning scaling dimensions to the near-horizon observables, we formulate improved scaling theories that are completely consistent with all explicit holographic results once the dependence on the dangerously irrelevant coupling is properly accounted for. In addition to governing thermodynamic and transport phenomena in these states, we show that the dangerously irrelevant coupling also controls late-time equilibration, which occurs at a rate parametrically slower than the temperature $1/τ_{eq}\ll T$. At very late times, transport is diffusion-dominated, with a diffusivity that can be written simply in terms of $τ_{eq}$ and the butterfly velocity, $D\sim v_B^2τ_{eq}$. We conjecture that in such cases there exists a long-lived, propagating collective mode with velocity $v_s$, and in this case the relation $D=v_s^2τ_{eq}$ holds exactly in the limit $τ_{eq} T\gg1$.
△ Less
Submitted 15 October, 2019; v1 submitted 28 December, 2018;
originally announced December 2018.
-
MID-Fusion: Octree-based Object-Level Multi-Instance Dynamic SLAM
Authors:
Binbin Xu,
Wenbin Li,
Dimos Tzoumanikas,
Michael Bloesch,
Andrew Davison,
Stefan Leutenegger
Abstract:
We propose a new multi-instance dynamic RGB-D SLAM system using an object-level octree-based volumetric representation. It can provide robust camera tracking in dynamic environments and at the same time, continuously estimate geometric, semantic, and motion properties for arbitrary objects in the scene. For each incoming frame, we perform instance segmentation to detect objects and refine mask bou…
▽ More
We propose a new multi-instance dynamic RGB-D SLAM system using an object-level octree-based volumetric representation. It can provide robust camera tracking in dynamic environments and at the same time, continuously estimate geometric, semantic, and motion properties for arbitrary objects in the scene. For each incoming frame, we perform instance segmentation to detect objects and refine mask boundaries using geometric and motion information. Meanwhile, we estimate the pose of each existing moving object using an object-oriented tracking method and robustly track the camera pose against the static scene. Based on the estimated camera pose and object poses, we associate segmented masks with existing models and incrementally fuse corresponding colour, depth, semantic, and foreground object probabilities into each object model. In contrast to existing approaches, our system is the first system to generate an object-level dynamic volumetric map from a single RGB-D camera, which can be used directly for robotic tasks. Our method can run at 2-3 Hz on a CPU, excluding the instance segmentation part. We demonstrate its effectiveness by quantitatively and qualitatively testing it on both synthetic and real-world sequences.
△ Less
Submitted 21 March, 2019; v1 submitted 19 December, 2018;
originally announced December 2018.
-
A global-local approach for detecting hotspots in multiple-response regression
Authors:
Hélène Ruffieux,
Anthony C. Davison,
Jörg Hager,
Jamie Inshaw,
Benjamin P. Fairfax,
Sylvia Richardson,
Leonardo Bottolo
Abstract:
We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting hotspots, i.e., predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional…
▽ More
We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting hotspots, i.e., predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional mechanisms underlying disease endpoints. Existing hierarchical regression approaches designed to model hotspots suffer from two limitations: their discrimination of hotspots is sensitive to the choice of top-level scale parameters for the propensity of predictors to be hotspots, and they do not scale to large predictor and response vectors, e.g., of dimensions $10^3-10^5$ in genetic applications. We address these shortcomings by introducing a flexible hierarchical regression framework that is tailored to the detection of hotspots and scalable to the above dimensions. Our proposal implements a fully Bayesian model for hotspots based on the horseshoe shrinkage prior. Its global-local formulation shrinks noise globally and hence accommodates the highly sparse nature of genetic analyses, while being robust to individual signals, thus leaving the effects of hotspots unshrunk. Inference is carried out using a fast variational algorithm coupled with a novel simulated annealing procedure that allows efficient exploration of multimodal distributions.
△ Less
Submitted 15 May, 2020; v1 submitted 8 November, 2018;
originally announced November 2018.
-
Task-Embedded Control Networks for Few-Shot Imitation Learning
Authors:
Stephen James,
Michael Bloesch,
Andrew J. Davison
Abstract:
Much like humans, robots should have the ability to leverage knowledge from previously learned tasks in order to learn new tasks quickly in new and unfamiliar environments. Despite this, most robot learning approaches have focused on learning a single task, from scratch, with a limited notion of generalisation, and no way of leveraging the knowledge to learn other tasks more efficiently. One possi…
▽ More
Much like humans, robots should have the ability to leverage knowledge from previously learned tasks in order to learn new tasks quickly in new and unfamiliar environments. Despite this, most robot learning approaches have focused on learning a single task, from scratch, with a limited notion of generalisation, and no way of leveraging the knowledge to learn other tasks more efficiently. One possible solution is meta-learning, but many of the related approaches are limited in their ability to scale to a large number of tasks and to learn further tasks without forgetting previously learned ones. With this in mind, we introduce Task-Embedded Control Networks, which employ ideas from metric learning in order to create a task embedding that can be used by a robot to learn new tasks from one or more demonstrations. In the area of visually-guided manipulation, we present simulation results in which we surpass the performance of a state-of-the-art method when using only visual information from each demonstration. Additionally, we demonstrate that our approach can also be used in conjunction with domain randomisation to train our few-shot learning ability in simulation and then deploy in the real world without any additional training. Once deployed, the robot can learn new tasks from a single real-world demonstration.
△ Less
Submitted 7 October, 2018;
originally announced October 2018.
-
Fast Automatic Smoothing for Generalized Additive Models
Authors:
Yousra El-Bachir,
Anthony C. Davison
Abstract:
Multiple generalized additive models (GAMs) are a type of distributional regression wherein parameters of probability distributions depend on predictors through smooth functions, with selection of the degree of smoothness via $L_2$ regularization. Multiple GAMs allow finer statistical inference by incorporating explanatory information in any or all of the parameters of the distribution. Owing to t…
▽ More
Multiple generalized additive models (GAMs) are a type of distributional regression wherein parameters of probability distributions depend on predictors through smooth functions, with selection of the degree of smoothness via $L_2$ regularization. Multiple GAMs allow finer statistical inference by incorporating explanatory information in any or all of the parameters of the distribution. Owing to their nonlinearity, flexibility and interpretability, GAMs are widely used, but reliable and fast methods for automatic smoothing in large datasets are still lacking, despite recent advances. We develop a general methodology for automatically learning the optimal degree of $L_2$ regularization for multiple GAMs using an empirical Bayes approach. The smooth functions are penalized by different amounts, which are learned simultaneously by maximization of a marginal likelihood through an approximate expectation-maximization algorithm that involves a double Laplace approximation at the E-step, and leads to an efficient M-step. Empirical analysis shows that the resulting algorithm is numerically stable, faster than all existing methods and achieves state-of-the-art accuracy. For illustration, we apply it to an important and challenging problem in the analysis of extremal data.
△ Less
Submitted 25 September, 2018;
originally announced September 2018.
-
LS-Net: Learning to Solve Nonlinear Least Squares for Monocular Stereo
Authors:
Ronald Clark,
Michael Bloesch,
Jan Czarnowski,
Stefan Leutenegger,
Andrew J. Davison
Abstract:
Sum-of-squares objective functions are very popular in computer vision algorithms. However, these objective functions are not always easy to optimize. The underlying assumptions made by solvers are often not satisfied and many problems are inherently ill-posed. In this paper, we propose LS-Net, a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost…
▽ More
Sum-of-squares objective functions are very popular in computer vision algorithms. However, these objective functions are not always easy to optimize. The underlying assumptions made by solvers are often not satisfied and many problems are inherently ill-posed. In this paper, we propose LS-Net, a neural nonlinear least squares optimization algorithm which learns to effectively optimize these cost functions even in the presence of adversities. Unlike traditional approaches, the proposed solver requires no hand-crafted regularizers or priors as these are implicitly learned from the data. We apply our method to the problem of motion stereo ie. jointly estimating the motion and scene geometry from pairs of images of a monocular sequence. We show that our learned optimizer is able to efficiently and effectively solve this challenging optimization problem.
△ Less
Submitted 9 September, 2018;
originally announced September 2018.
-
Many-body chaos and energy dynamics in holography
Authors:
Mike Blake,
Richard A. Davison,
Sašo Grozdanov,
Hong Liu
Abstract:
Recent developments have indicated that in addition to out-of-time ordered correlation functions (OTOCs), quantum chaos also has a sharp manifestation in the thermal energy density two-point functions, at least for maximally chaotic systems. The manifestation, referred to as pole-skip**, concerns the analytic behaviour of energy density two-point functions around a special point $ω= i λ$,…
▽ More
Recent developments have indicated that in addition to out-of-time ordered correlation functions (OTOCs), quantum chaos also has a sharp manifestation in the thermal energy density two-point functions, at least for maximally chaotic systems. The manifestation, referred to as pole-skip**, concerns the analytic behaviour of energy density two-point functions around a special point $ω= i λ$, $k = i λ/v_B$ in the complex frequency and momentum plane. Here $λ$ and $v_B$ are the Lyapunov exponent and butterfly velocity characterising quantum chaos. In this paper we provide an argument that the phenomenon of pole-skip** is universal for general finite temperature systems dual to Einstein gravity coupled to matter. In doing so we uncover a surprising universal feature of the linearised Einstein equations around a static black hole geometry. We also study analytically a holographic axion model where all of the features of our general argument as well as the pole-skip** phenomenon can be verified in detail.
△ Less
Submitted 10 October, 2018; v1 submitted 4 September, 2018;
originally announced September 2018.
-
Fusion++: Volumetric Object-Level SLAM
Authors:
John McCormac,
Ronald Clark,
Michael Bloesch,
Andrew J. Davison,
Stefan Leutenegger
Abstract:
We propose an online object-level SLAM system which builds a persistent and accurate 3D graph map of arbitrary reconstructed objects. As an RGB-D camera browses a cluttered indoor scene, Mask-RCNN instance segmentations are used to initialise compact per-object Truncated Signed Distance Function (TSDF) reconstructions with object size-dependent resolutions and a novel 3D foreground mask. Reconstru…
▽ More
We propose an online object-level SLAM system which builds a persistent and accurate 3D graph map of arbitrary reconstructed objects. As an RGB-D camera browses a cluttered indoor scene, Mask-RCNN instance segmentations are used to initialise compact per-object Truncated Signed Distance Function (TSDF) reconstructions with object size-dependent resolutions and a novel 3D foreground mask. Reconstructed objects are stored in an optimisable 6DoF pose graph which is our only persistent map representation. Objects are incrementally refined via depth fusion, and are used for tracking, relocalisation and loop closure detection. Loop closures cause adjustments in the relative pose estimates of object instances, but no intra-object war**. Each object also carries semantic information which is refined over time and an existence probability to account for spurious instance predictions. We demonstrate our approach on a hand-held RGB-D sequence from a cluttered office scene with a large number and variety of object instances, highlighting how the system closes loops and makes good use of existing objects on repeated loops. We quantitatively evaluate the trajectory error of our system against a baseline approach on the RGB-D SLAM benchmark, and qualitatively compare reconstruction quality of discovered objects on the YCB video dataset. Performance evaluation shows our approach is highly memory efficient and runs online at 4-8Hz (excluding relocalisation) despite not being optimised at the software level.
△ Less
Submitted 28 August, 2018; v1 submitted 25 August, 2018;
originally announced August 2018.
-
SLAMBench2: Multi-Objective Head-to-Head Benchmarking for Visual SLAM
Authors:
Bruno Bodin,
Harry Wagstaff,
Sajad Saeedi,
Luigi Nardi,
Emanuele Vespa,
John H Mayer,
Andy Nisbet,
Mikel Luján,
Steve Furber,
Andrew J Davison,
Paul H. J. Kelly,
Michael O'Boyle
Abstract:
SLAM is becoming a key component of robotics and augmented reality (AR) systems. While a large number of SLAM algorithms have been presented, there has been little effort to unify the interface of such algorithms, or to perform a holistic comparison of their capabilities. This is a problem since different SLAM applications can have different functional and non-functional requirements. For example,…
▽ More
SLAM is becoming a key component of robotics and augmented reality (AR) systems. While a large number of SLAM algorithms have been presented, there has been little effort to unify the interface of such algorithms, or to perform a holistic comparison of their capabilities. This is a problem since different SLAM applications can have different functional and non-functional requirements. For example, a mobile phonebased AR application has a tight energy budget, while a UAV navigation system usually requires high accuracy. SLAMBench2 is a benchmarking framework to evaluate existing and future SLAM systems, both open and close source, over an extensible list of datasets, while using a comparable and clearly specified list of performance metrics. A wide variety of existing SLAM algorithms and datasets is supported, e.g. ElasticFusion, InfiniTAM, ORB-SLAM2, OKVIS, and integrating new ones is straightforward and clearly specified by the framework. SLAMBench2 is a publicly-available software framework which represents a starting point for quantitative, comparable and validatable experimental research to investigate trade-offs across SLAM systems.
△ Less
Submitted 21 August, 2018;
originally announced August 2018.
-
Navigating the Landscape for Real-time Localisation and Map** for Robotics and Virtual and Augmented Reality
Authors:
Sajad Saeedi,
Bruno Bodin,
Harry Wagstaff,
Andy Nisbet,
Luigi Nardi,
John Mawer,
Nicolas Melot,
Oscar Palomar,
Emanuele Vespa,
Tom Spink,
Cosmin Gorgovan,
Andrew Webb,
James Clarkson,
Erik Tomusk,
Thomas Debrunner,
Kuba Kaszyk,
Pablo Gonzalez-de-Aledo,
Andrey Rodchenko,
Graham Riley,
Christos Kotselidis,
Björn Franke,
Michael F. P. O'Boyle,
Andrew J. Davison,
Paul H. J. Kelly,
Mikel Luján
, et al. (1 additional authors not shown)
Abstract:
Visual understanding of 3D environments in real-time, at low power, is a huge computational challenge. Often referred to as SLAM (Simultaneous Localisation and Map**), it is central to applications spanning domestic and industrial robotics, autonomous vehicles, virtual and augmented reality. This paper describes the results of a major research effort to assemble the algorithms, architectures, to…
▽ More
Visual understanding of 3D environments in real-time, at low power, is a huge computational challenge. Often referred to as SLAM (Simultaneous Localisation and Map**), it is central to applications spanning domestic and industrial robotics, autonomous vehicles, virtual and augmented reality. This paper describes the results of a major research effort to assemble the algorithms, architectures, tools, and systems software needed to enable delivery of SLAM, by supporting applications specialists in selecting and configuring the appropriate algorithm and the appropriate hardware, and compilation pathway, to meet their performance, accuracy, and energy consumption goals. The major contributions we present are (1) tools and methodology for systematic quantitative evaluation of SLAM algorithms, (2) automated, machine-learning-guided exploration of the algorithmic and implementation design space with respect to multiple objectives, (3) end-to-end simulation tools to enable optimisation of heterogeneous, accelerated architectures for the specific algorithmic requirements of the various SLAM algorithmic approaches, and (4) tools for delivering, where appropriate, accelerated, adaptive SLAM solutions in a managed, JIT-compiled, adaptive runtime context.
△ Less
Submitted 20 August, 2018;
originally announced August 2018.
-
Slow relaxation and diffusion in holographic quantum critical phases
Authors:
Richard A. Davison,
Simon A. Gentle,
Blaise Goutéraux
Abstract:
The dissipative dynamics of strongly interacting systems are often characterised by the timescale set by the inverse temperature $τ_P\sim\hbar/(k_BT)$. We show that near a class of strongly interacting quantum critical points that arise in the infra-red limit of translationally invariant holographic theories, there is a collective excitation (a quasinormal mode of the dual black hole spacetime) wh…
▽ More
The dissipative dynamics of strongly interacting systems are often characterised by the timescale set by the inverse temperature $τ_P\sim\hbar/(k_BT)$. We show that near a class of strongly interacting quantum critical points that arise in the infra-red limit of translationally invariant holographic theories, there is a collective excitation (a quasinormal mode of the dual black hole spacetime) whose lifetime $τ_{eq}$ is parametrically longer than $τ_P$: $τ_{eq}\gg T^{-1}$. The lifetime is enhanced due to its dependence on a dangerously irrelevant coupling that breaks the particle-hole symmetry and the invariance under Lorentz boosts of the quantum critical point. The thermal diffusivity (in units of the butterfly velocity) is anomalously large near the quantum critical point and is governed by $τ_{eq}$ rather than $τ_P$. We conjecture that there exists a long-lived, propagating collective mode with velocity $v_s$, and in this case the relation $D=v_s^2τ_{eq}$ holds exactly in the limit $Tτ_{eq}\gg1$. While scale invariance is broken, a generalised scaling theory still holds provided that the dependence of observables on the dangerously irrelevant coupling is incorporated. Our work further underlines the connection between dangerously irrelevant deformations and slow equilibration.
△ Less
Submitted 29 June, 2021; v1 submitted 16 August, 2018;
originally announced August 2018.
-
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
Authors:
Jan Matas,
Stephen James,
Andrew J. Davison
Abstract:
We have seen much recent progress in rigid object manipulation, but interaction with deformable objects has notably lagged behind. Due to the large configuration space of deformable objects, solutions using traditional modelling approaches require significant engineering work. Perhaps then, bypassing the need for explicit modelling and instead learning the control in an end-to-end manner serves as…
▽ More
We have seen much recent progress in rigid object manipulation, but interaction with deformable objects has notably lagged behind. Due to the large configuration space of deformable objects, solutions using traditional modelling approaches require significant engineering work. Perhaps then, bypassing the need for explicit modelling and instead learning the control in an end-to-end manner serves as a better approach? Despite the growing interest in the use of end-to-end robot learning approaches, only a small amount of work has focused on their applicability to deformable object manipulation. Moreover, due to the large amount of data needed to learn these end-to-end solutions, an emerging trend is to learn control policies in simulation and then transfer them over to the real world. To-date, no work has explored whether it is possible to learn and transfer deformable object policies. We believe that if sim-to-real methods are to be employed further, then it should be possible to learn to interact with a wide variety of objects, and not only rigid objects. In this work, we use a combination of state-of-the-art deep reinforcement learning algorithms to solve the problem of manipulating deformable objects (specifically cloth). We evaluate our approach on three tasks --- folding a towel up to a mark, folding a face towel diagonally, and dra** a piece of cloth over a hanger. Our agents are fully trained in simulation with domain randomisation, and then successfully deployed in the real world without having seen any real deformable objects.
△ Less
Submitted 7 October, 2018; v1 submitted 20 June, 2018;
originally announced June 2018.
-
A Review on Facial Micro-Expressions Analysis: Datasets, Features and Metrics
Authors:
Walied Merghani,
Adrian K. Davison,
Moi Hoon Yap
Abstract:
Facial micro-expressions are very brief, spontaneous facial expressions that appear on the face of humans when they either deliberately or unconsciously conceal an emotion. Micro-expression has shorter duration than macro-expression, which makes it more challenging for human and machine. Over the past ten years, automatic micro-expressions recognition has attracted increasing attention from resear…
▽ More
Facial micro-expressions are very brief, spontaneous facial expressions that appear on the face of humans when they either deliberately or unconsciously conceal an emotion. Micro-expression has shorter duration than macro-expression, which makes it more challenging for human and machine. Over the past ten years, automatic micro-expressions recognition has attracted increasing attention from researchers in psychology, computer science, security, neuroscience and other related disciplines. The aim of this paper is to provide the insights of automatic micro-expressions and recommendations for future research. There has been a lot of datasets released over the last decade that facilitated the rapid growth in this field. However, comparison across different datasets is difficult due to the inconsistency in experiment protocol, features used and evaluation methods. To address these issues, we review the datasets, features and the performance metrics deployed in the literature. Relevant challenges such as the spatial temporal settings during data collection, emotional classes versus objective classes in data labelling, face regions in data analysis, standardisation of metrics and the requirements for real-world implementation are discussed. We conclude by proposing some promising future directions to advancing micro-expressions research.
△ Less
Submitted 7 May, 2018;
originally announced May 2018.
-
CodeSLAM - Learning a Compact, Optimisable Representation for Dense Visual SLAM
Authors:
Michael Bloesch,
Jan Czarnowski,
Ronald Clark,
Stefan Leutenegger,
Andrew J. Davison
Abstract:
The representation of geometry in real-time 3D perception systems continues to be a critical research issue. Dense maps capture complete surface shape and can be augmented with semantic labels, but their high dimensionality makes them computationally costly to store and process, and unsuitable for rigorous probabilistic inference. Sparse feature-based representations avoid these problems, but capt…
▽ More
The representation of geometry in real-time 3D perception systems continues to be a critical research issue. Dense maps capture complete surface shape and can be augmented with semantic labels, but their high dimensionality makes them computationally costly to store and process, and unsuitable for rigorous probabilistic inference. Sparse feature-based representations avoid these problems, but capture only partial scene information and are mainly useful for localisation only.
We present a new compact but dense representation of scene geometry which is conditioned on the intensity data from a single image and generated from a code consisting of a small number of parameters. We are inspired by work both on learned depth from images, and auto-encoders. Our approach is suitable for use in a keyframe-based monocular dense SLAM system: While each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlap** keyframes to attain global consistency. Conditioning the depth map on the image allows the code to only represent aspects of the local geometry which cannot directly be predicted from the image. We explain how to learn our code representation, and demonstrate its advantageous properties in monocular SLAM.
△ Less
Submitted 14 April, 2019; v1 submitted 3 April, 2018;
originally announced April 2018.
-
FutureMap**: The Computational Structure of Spatial AI Systems
Authors:
Andrew J. Davison
Abstract:
We discuss and predict the evolution of Simultaneous Localisation and Map** (SLAM) into a general geometric and semantic `Spatial AI' perception capability for intelligent embodied devices. A big gap remains between the visual perception performance that devices such as augmented reality eyewear or comsumer robots will require and what is possible within the constraints imposed by real products.…
▽ More
We discuss and predict the evolution of Simultaneous Localisation and Map** (SLAM) into a general geometric and semantic `Spatial AI' perception capability for intelligent embodied devices. A big gap remains between the visual perception performance that devices such as augmented reality eyewear or comsumer robots will require and what is possible within the constraints imposed by real products. Co-design of algorithms, processors and sensors will be needed. We explore the computational structure of current and future Spatial AI algorithms and consider this within the landscape of ongoing hardware developments.
△ Less
Submitted 29 March, 2018;
originally announced March 2018.
-
End-to-End Multi-Task Learning with Attention
Authors:
Shikun Liu,
Edward Johns,
Andrew J. Davison
Abstract:
We propose a novel multi-task learning architecture, which allows learning of task-specific feature-level attention. Our design, the Multi-Task Attention Network (MTAN), consists of a single shared network containing a global feature pool, together with a soft-attention module for each task. These modules allow for learning of task-specific features from the global features, whilst simultaneously…
▽ More
We propose a novel multi-task learning architecture, which allows learning of task-specific feature-level attention. Our design, the Multi-Task Attention Network (MTAN), consists of a single shared network containing a global feature pool, together with a soft-attention module for each task. These modules allow for learning of task-specific features from the global features, whilst simultaneously allowing for features to be shared across different tasks. The architecture can be trained end-to-end and can be built upon any feed-forward neural network, is simple to implement, and is parameter efficient. We evaluate our approach on a variety of datasets, across both image-to-image predictions and image classification tasks. We show that our architecture is state-of-the-art in multi-task learning compared to existing methods, and is also less sensitive to various weighting schemes in the multi-task loss function. Code is available at https://github.com/lorenmt/mtan.
△ Less
Submitted 5 April, 2019; v1 submitted 28 March, 2018;
originally announced March 2018.
-
Parameter estimation for discretely-observed linear birth-and-death processes
Authors:
Anthony C. Davison,
Sophie Hautphenne,
Andrea Kraus
Abstract:
Birth-and-death processes are widely used to model the development of biological populations. Although they are relatively simple models, their parameters can be challenging to estimate, because the likelihood can become numerically unstable when data arise from the most common sampling schemes, such as annual population censuses. Simple estimators may be based on an embedded Galton-Watson process…
▽ More
Birth-and-death processes are widely used to model the development of biological populations. Although they are relatively simple models, their parameters can be challenging to estimate, because the likelihood can become numerically unstable when data arise from the most common sampling schemes, such as annual population censuses. Simple estimators may be based on an embedded Galton-Watson process, but this presupposes that the observation times are equi-spaced. We estimate the birth, death, and growth rates of a linear birth-and-death process whose population size is periodically observed via an embedded Galton-Watson process, and by maximizing a saddlepoint approximation to the likelihood. We show that a Gaussian approximation to the saddlepoint-based likelihood connects the two approaches, we establish consistency and asymptotic normality of quasi-likelihood estimators, compare our estimators on some numerical examples, and apply our results to census data for two endangered bird populations and the H1N1 influenza pandemic.
△ Less
Submitted 9 October, 2018; v1 submitted 14 February, 2018;
originally announced February 2018.
-
DFUNet: Convolutional Neural Networks for Diabetic Foot Ulcer Classification
Authors:
Manu Goyal,
Neil D. Reeves,
Adrian K. Davison,
Satyan Rajbhandari,
Jennifer Spragg,
Moi Hoon Yap
Abstract:
Globally, in 2016, one out of eleven adults suffered from Diabetes Mellitus. Diabetic Foot Ulcers (DFU) are a major complication of this disease, which if not managed properly can lead to amputation. Current clinical approaches to DFU treatment rely on patient and clinician vigilance, which has significant limitations such as the high cost involved in the diagnosis, treatment and lengthy care of t…
▽ More
Globally, in 2016, one out of eleven adults suffered from Diabetes Mellitus. Diabetic Foot Ulcers (DFU) are a major complication of this disease, which if not managed properly can lead to amputation. Current clinical approaches to DFU treatment rely on patient and clinician vigilance, which has significant limitations such as the high cost involved in the diagnosis, treatment and lengthy care of the DFU. We collected an extensive dataset of foot images, which contain DFU from different patients. In this paper, we have proposed the use of traditional computer vision features for detecting foot ulcers among diabetic patients, which represent a cost-effective, remote and convenient healthcare solution. Furthermore, we used Convolutional Neural Networks (CNNs) for the first time in DFU classification. We have proposed a novel convolutional neural network architecture, DFUNet, with better feature extraction to identify the feature differences between healthy skin and the DFU. Using 10-fold cross-validation, DFUNet achieved an AUC score of 0.962. This outperformed both the machine learning and deep learning classifiers we have tested. Here we present the development of a novel and highly sensitive DFUNet for objectively detecting the presence of DFUs. This novel approach has the potential to deliver a paradigm shift in diabetic foot care.
△ Less
Submitted 10 December, 2017; v1 submitted 28 November, 2017;
originally announced November 2017.
-
Semantic Texture for Robust Dense Tracking
Authors:
Jan Czarnowski,
Stefan Leutenegger,
Andrew Davison
Abstract:
We argue that robust dense SLAM systems can make valuable use of the layers of features coming from a standard CNN as a pyramid of `semantic texture' which is suitable for dense alignment while being much more robust to nuisance factors such as lighting than raw RGB values. We use a straightforward Lucas-Kanade formulation of image alignment, with a schedule of iterations over the coarse-to-fine l…
▽ More
We argue that robust dense SLAM systems can make valuable use of the layers of features coming from a standard CNN as a pyramid of `semantic texture' which is suitable for dense alignment while being much more robust to nuisance factors such as lighting than raw RGB values. We use a straightforward Lucas-Kanade formulation of image alignment, with a schedule of iterations over the coarse-to-fine levels of a pyramid, and simply replace the usual image pyramid by the hierarchy of convolutional feature maps from a pre-trained CNN. The resulting dense alignment performance is much more robust to lighting and other variations, as we show by camera rotation tracking experiments on time-lapse sequences captured over many hours. Looking towards the future of scene representation for real-time visual SLAM, we further demonstrate that a selection using simple criteria of a small number of the total set of features output by a CNN gives just as accurate but much more efficient tracking performance.
△ Less
Submitted 29 August, 2017;
originally announced August 2017.
-
Objective Classes for Micro-Facial Expression Recognition
Authors:
Adrian K. Davison,
Walied Merghani,
Moi Hoon Yap
Abstract:
Micro-expressions are brief spontaneous facial expressions that appear on a face when a person conceals an emotion, making them different to normal facial expressions in subtlety and duration. Currently, emotion classes within the CASME II dataset are based on Action Units and self-reports, creating conflicts during machine learning training. We will show that classifying expressions using Action…
▽ More
Micro-expressions are brief spontaneous facial expressions that appear on a face when a person conceals an emotion, making them different to normal facial expressions in subtlety and duration. Currently, emotion classes within the CASME II dataset are based on Action Units and self-reports, creating conflicts during machine learning training. We will show that classifying expressions using Action Units, instead of predicted emotion, removes the potential bias of human reporting. The proposed classes are tested using LBP-TOP, HOOF and HOG 3D feature descriptors. The experiments are evaluated on two benchmark FACS coded datasets: CASME II and SAMM. The best result achieves 86.35\% accuracy when classifying the proposed 5 classes on CASME II using HOG 3D, outperforming the result of the state-of-the-art 5-class emotional-based classification in CASME II. Results indicate that classification based on Action Units provides an objective method to improve micro-expression recognition.
△ Less
Submitted 3 December, 2017; v1 submitted 24 August, 2017;
originally announced August 2017.
-
Sustainable computational science: the ReScience initiative
Authors:
Nicolas P. Rougier,
Konrad Hinsen,
Frédéric Alexandre,
Thomas Arildsen,
Lorena Barba,
Fabien C. Y. Benureau,
C. Titus Brown,
Pierre de Buyl,
Ozan Caglayan,
Andrew P. Davison,
Marc André Delsuc,
Georgios Detorakis,
Alexandra K. Diem,
Damien Drix,
Pierre Enel,
Benoît Girard,
Olivia Guest,
Matt G. Hall,
Rafael Neto Henriques,
Xavier Hinaut,
Kamil S Jaron,
Mehdi Khamassi,
Almar Klein,
Tiina Manninen,
Pietro Marchesi
, et al. (20 additional authors not shown)
Abstract:
Computer science offers a large set of tools for prototy**, writing, running, testing, validating, sharing and reproducing results, however computational science lags behind. In the best case, authors may provide their source code as a compressed archive and they may feel confident their research is reproducible. But this is not exactly true. James Buckheit and David Donoho proposed more than tw…
▽ More
Computer science offers a large set of tools for prototy**, writing, running, testing, validating, sharing and reproducing results, however computational science lags behind. In the best case, authors may provide their source code as a compressed archive and they may feel confident their research is reproducible. But this is not exactly true. James Buckheit and David Donoho proposed more than two decades ago that an article about computational results is advertising, not scholarship. The actual scholarship is the full software environment, code, and data that produced the result. This implies new workflows, in particular in peer-reviews. Existing journals have been slow to adapt: source codes are rarely requested, hardly ever actually executed to check that they produce the results advertised in the article. ReScience is a peer-reviewed journal that targets computational research and encourages the explicit replication of already published research, promoting new and open-source implementations in order to ensure that the original research can be replicated from its description. To achieve this goal, the whole publishing chain is radically different from other traditional scientific journals. ReScience resides on GitHub where each new implementation of a computational study is made available together with comments, explanations, and software tests.
△ Less
Submitted 11 November, 2017; v1 submitted 14 July, 2017;
originally announced July 2017.
-
Transferring End-to-End Visuomotor Control from Simulation to Real World for a Multi-Stage Task
Authors:
Stephen James,
Andrew J. Davison,
Edward Johns
Abstract:
End-to-end control for robot manipulation and gras** is emerging as an attractive alternative to traditional pipelined approaches. However, end-to-end methods tend to either be slow to train, exhibit little or no generalisability, or lack the ability to accomplish long-horizon or multi-stage tasks. In this paper, we show how two simple techniques can lead to end-to-end (image to velocity) execut…
▽ More
End-to-end control for robot manipulation and gras** is emerging as an attractive alternative to traditional pipelined approaches. However, end-to-end methods tend to either be slow to train, exhibit little or no generalisability, or lack the ability to accomplish long-horizon or multi-stage tasks. In this paper, we show how two simple techniques can lead to end-to-end (image to velocity) execution of a multi-stage task, which is analogous to a simple tidying routine, without having seen a single real image. This involves locating, reaching for, and gras** a cube, then locating a basket and drop** the cube inside. To achieve this, robot trajectories are computed in a simulator, to collect a series of control velocities which accomplish the task. Then, a CNN is trained to map observed images to velocities, using domain randomisation to enable generalisation to real world images. Results show that we are able to successfully accomplish the task in the real world with the ability to generalise to novel environments, including those with dynamic lighting conditions, distractor objects, and moving objects, including the basket itself. We believe our approach to be simple, highly scalable, and capable of learning long-horizon tasks that have until now not been shown with the state-of-the-art in end-to-end robot control.
△ Less
Submitted 17 October, 2017; v1 submitted 7 July, 2017;
originally announced July 2017.
-
Hydrodynamic flows of non-Fermi liquids: magnetotransport and bilayer drag
Authors:
Aavishkar A. Patel,
Richard A. Davison,
Alex Levchenko
Abstract:
We consider a hydrodynamic description of transport for generic two dimensional electron systems that lack Galilean invariance and do not fall into the category of Fermi liquids. We study magnetoresistance and show that it is governed only by the electronic viscosity provided that the wavelength of the underlying disorder potential is large compared to the microscopic equilibration length. We also…
▽ More
We consider a hydrodynamic description of transport for generic two dimensional electron systems that lack Galilean invariance and do not fall into the category of Fermi liquids. We study magnetoresistance and show that it is governed only by the electronic viscosity provided that the wavelength of the underlying disorder potential is large compared to the microscopic equilibration length. We also derive the Coulomb drag transresistance for double-layer non-Fermi liquid systems in the hydrodynamic regime. As an example, we consider frictional drag between two quantum Hall states with half-filled lowest Landau levels, each described by a Fermi surface of composite fermions coupled to a $U(1)$ gauge field. We contrast our results to prior calculations of drag of Chern-Simons composite particles and place our findings in the context of available experimental data.
△ Less
Submitted 9 November, 2017; v1 submitted 12 June, 2017;
originally announced June 2017.
-
Thermal diffusivity and chaos in metals without quasiparticles
Authors:
Mike Blake,
Richard A. Davison,
Subir Sachdev
Abstract:
We study the thermal diffusivity $D_T$ in models of metals without quasiparticle excitations (`strange metals'). The many-body quantum chaos and transport properties of such metals can be efficiently described by a holographic representation in a gravitational theory in an emergent curved spacetime with an additional spatial dimension. We find that at generic infra-red fixed points $D_T$ is always…
▽ More
We study the thermal diffusivity $D_T$ in models of metals without quasiparticle excitations (`strange metals'). The many-body quantum chaos and transport properties of such metals can be efficiently described by a holographic representation in a gravitational theory in an emergent curved spacetime with an additional spatial dimension. We find that at generic infra-red fixed points $D_T$ is always related to parameters characterizing many-body quantum chaos: the butterfly velocity $v_B$, and Lyapunov time $τ_L$ through $D_T \sim v_B^2 τ_L$. The relationship holds independently of the charge density, periodic potential strength or magnetic field at the fixed point. The generality of this result follows from the observation that the thermal conductivity of strange metals depends only on the metric near the horizon of a black hole in the emergent spacetime, and is otherwise insensitive to the profile of any matter fields.
△ Less
Submitted 22 May, 2017;
originally announced May 2017.