
Showing 1–50 of 132 results for author: Roberts, S

Searching in archive cs. Search in all archives.
  1. arXiv:2403.15755  [pdf, other]

    stat.ME cs.MA cs.SI stat.AP

    Optimized Model Selection for Estimating Treatment Effects from Costly Simulations of the US Opioid Epidemic

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Agent-based simulation with a synthetic population can help us compare different treatment conditions while keeping everything else constant within the same population (i.e., as digital twins). Such population-scale simulations require large computational power (i.e., CPU resources) to get accurate estimates for treatment effects. We can use meta models of the simulation results to circumvent the…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: To be presented in 2024 Annual Simulation Conference (ANNSIM'24)

  2. arXiv:2310.10500  [pdf, other]

    q-fin.TR cs.LG q-fin.PM

    Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies

    Authors: Kieran Wood, Samuel Kessler, Stephen J. Roberts, Stefan Zohren

    Abstract: Forecasting models for systematic trading strategies do not adapt quickly when financial market conditions rapidly change, as was seen in the advent of the COVID-19 pandemic in 2020, causing many forecasting models to take loss-making positions. To deal with such situations, we propose a novel time-series trend-following forecaster that can quickly adapt to new market conditions, referred to as re…

    Submitted 28 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: minor edits

  3. arXiv:2309.10215  [pdf]

    cs.CY cs.HC cs.SI

    In Consideration of Indigenous Data Sovereignty: Data Mining as a Colonial Practice

    Authors: Jennafer Shae Roberts, Laura N Montoya

    Abstract: Data mining reproduces colonialism, and Indigenous voices are being left out of the development of technology that relies on data, such as artificial intelligence. This research stresses the need for the inclusion of Indigenous Data Sovereignty and centers on the importance of Indigenous rights over their own data. Inclusion is necessary in order to integrate Indigenous knowledge into the design,…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 12 pages, 1 Figure, Future Technologies Conference (FTC) 2023. arXiv admin note: substantial text overlap with arXiv:2208.04700

  4. arXiv:2309.08776  [pdf, other]

    cs.LG cs.AI cs.RO

    Projected Task-Specific Layers for Multi-Task Reinforcement Learning

    Authors: Josselin Somerville Roberts, Julia Di

    Abstract: Multi-task reinforcement learning could enable robots to scale across a wide variety of manipulation tasks in homes and workplaces. However, generalizing from one task to another and mitigating negative task interference still remains a challenge. Addressing this challenge by successfully sharing information across tasks will depend on how well the structure underlying the tasks is captured. In th…

    Submitted 6 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Journal ref: ICRA 2024

  5. arXiv:2309.05139  [pdf, other]

    cs.CV cs.RO

    A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application

    Authors: Josselin Somerville Roberts, Paul-Emile Giacomelli, Yoni Gozlan, Julia Di

    Abstract: Conventional wheeled robots are unable to traverse scientifically interesting, but dangerous, cave environments. Multi-limbed climbing robot designs, such as ReachBot, are able to grasp irregular surface features and execute climbing motions to overcome obstacles, given suitable grasp locations. To support grasp site identification, we present a method for detecting rock cracks and edges, the SKel…

    Submitted 6 November, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

    Journal ref: IEEE IRC 2023

  6. arXiv:2308.13040  [pdf, other]

    cs.MA cs.SI stat.AP

    Estimating Treatment Effects Using Costly Simulation Samples from a Population-Scale Model of Opioid Use Disorder

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Large-scale models require substantial computational resources for analysis and studying treatment conditions. Specifically, estimating treatment effects using simulations may require a lot of infeasible resources to allocate at every treatment condition. Therefore, it is essential to develop efficient methods to allocate computational resources for estimating treatment effects. Agent-based simula…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: To be presented in IEEE International Conference on Biomedical and Health Informatics 2023, repository link: https://github.com/abdulrahmanfci/intervention-estimation

  7. arXiv:2308.12212  [pdf, other]

    q-fin.PM cs.AI cs.LG q-fin.TR stat.ML

    Learning to Learn Financial Networks for Optimising Momentum Strategies

    Authors: Xingyue Pu, Stefan Zohren, Stephen Roberts, Xiaowen Dong

    Abstract: Network momentum provides a novel type of risk premium, which exploits the interconnections among assets in a financial network to predict future returns. However, the current process of constructing financial networks relies heavily on expensive databases and financial expertise, limiting accessibility for small-sized and academic institutions. Furthermore, the traditional approach treats network…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 9 pages

  8. arXiv:2308.11294  [pdf, other]

    q-fin.PM cs.LG eess.SP q-fin.TR

    Network Momentum across Asset Classes

    Authors: Xingyue Pu, Stephen Roberts, Xiaowen Dong, Stefan Zohren

    Abstract: We investigate the concept of network momentum, a novel trading signal derived from momentum spillover across assets. Initially observed within the confines of pairwise economic and fundamental ties, such as the stock-bond connection of the same company and stocks linked through supply-demand chains, momentum spillover implies a propagation of momentum risk premium from one asset to another. The s…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 27 pages

  9. arXiv:2308.02399  [pdf, ps, other]

    cs.CY cs.SI

    The Glamorisation of Unpaid Labour: AI and its Influencers

    Authors: Nana Mgbechikwere Nwachukwu, Jennafer Shae Roberts, Laura N Montoya

    Abstract: To harness the true potential of Artificial Intelligence (AI) for societal betterment, we need to move away from prioritising corporate interests which exploit Global South workers in the digital age. The unpaid labour and societal harms which are generated by Digital Value Networks (DVNs) disproportionately affect workers in Africa, Latin America, and India and need to be regulated. In this resea…

    Submitted 15 September, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: 4 pages, 2 pages of references, Deep Learning Indaba 2023 Short Paper

  10. arXiv:2307.12186  [pdf, other]

    cs.MA cs.SI stat.AP

    Inferring epidemic dynamics using Gaussian process emulation of agent-based simulations

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Computational models help decision makers understand epidemic dynamics to optimize public health interventions. Agent-based simulation of disease spread in synthetic populations allows us to compare and contrast different effects across identical populations or to investigate the effect of interventions keeping every other factor constant between "digital twins". FRED (A Framework for Reconstruc…

    Submitted 11 September, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: To be presented in Winter Simulation Conference 2023, repository link: https://github.com/abdulrahmanfci/gpr-abm

  11. arXiv:2307.11948  [pdf, other]

    cs.LG

    The instabilities of large learning rate training: a loss landscape view

    Authors: Lawrence Wang, Stephen Roberts

    Abstract: Modern neural networks are undeniably successful. Numerous works study how the curvature of loss landscapes can affect the quality of solutions. In this work we study the loss landscape by considering the Hessian matrix during network training with large learning rates - an attractive regime that is (in)famously unstable. We characterise the instabilities of gradient descent, and we observe the st…

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.18490

  12. arXiv:2306.13914  [pdf, other]

    stat.ML cs.LG

    G-TRACER: Expected Sharpness Optimization

    Authors: John Williams, Stephen Roberts

    Abstract: We propose a new regularization scheme for the optimization of deep learning architectures, G-TRACER ("Geometric TRACE Ratio"), which promotes generalization by seeking flat minima, and has a sound theoretical basis as an approximation to a natural-gradient descent based optimization of a generalized Bayes objective. By augmenting the loss function with a TRACER, curvature-regularized optimizers (…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 16 pages, 2 figures

    MSC Class: 62-08

  13. arXiv:2306.01936  [pdf, other]

    cs.CV eess.IV

    Sub-Meter Tree Height Mapping of California using Aerial Images and LiDAR-Informed U-Net Model

    Authors: Fabien H Wagner, Sophia Roberts, Alison L Ritz, Griffin Carter, Ricardo Dalagnol, Samuel Favrichon, Mayumi CM Hirye, Martin Brandt, Philipe Ciais, Sassan Saatchi

    Abstract: Tree canopy height is one of the most important indicators of forest biomass, productivity, and species diversity, but it is challenging to measure accurately from the ground and from space. Here, we used a U-Net model adapted for regression to map the canopy height of all trees in the state of California with very high-resolution aerial imagery (60 cm) from the USDA-NAIP program. The U-Net model…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 29 pages, 9 figures, submitted to Remote Sensing in Ecology and Conservation (RSEC)

    MSC Class: 92-08 ACM Class: I.4.9; I.5.4

  14. arXiv:2305.18490  [pdf, other]

    cs.LG

    SANE: The phases of gradient descent through Sharpness Adjusted Number of Effective parameters

    Authors: Lawrence Wang, Stephen J. Roberts

    Abstract: Modern neural networks are undeniably successful. Numerous studies have investigated how the curvature of loss landscapes can affect the quality of solutions. In this work we consider the Hessian matrix during network training. We reiterate the connection between the number of "well-determined" or "effective" parameters and the generalisation performance of neural nets, and we demonstrate its use…

    Submitted 29 May, 2023; originally announced May 2023.

  15. arXiv:2305.14737  [pdf, other]

    physics.soc-ph cs.SI

    The Rhythms of Transient Relationships: Allocating time between weekdays and weekends

    Authors: Valentín Vergara Hidd, Mailun Zhang, Simone Centellegher, Sam G. B. Roberts, Bruno Lepri, Eduardo López

    Abstract: A fundamental question of any new relationship is, will it last? Transient relationships, recently defined by the authors, are an ideal type of social tie to explore this question: these relationships are characterized by distinguishable starting and ending temporal points, linking the question of tie longevity to relationship finite lifetime. In this study, we use mobile phone data sets from the…

    Submitted 28 August, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 15 pages, 4 figures. Submitted for review at Royal Society Open Science R1

  16. arXiv:2302.10175  [pdf, other]

    q-fin.PM cs.LG q-fin.TR stat.ML

    Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies

    Authors: Wee Ling Tan, Stephen Roberts, Stefan Zohren

    Abstract: We introduce Spatio-Temporal Momentum strategies, a class of models that unify both time-series and cross-sectional momentum strategies by trading assets based on their cross-sectional momentum features over time. While both time-series and cross-sectional momentum strategies are designed to systematically capture momentum risk premia, these strategies are regarded as distinct implementations and…

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: The Journal of Financial Data Science, Summer 2023

  17. On Sequential Bayesian Inference for Continual Learning

    Authors: Samuel Kessler, Adam Cobb, Tim G. J. Rudner, Stefan Zohren, Stephen J. Roberts

    Abstract: Sequential Bayesian inference can be used for continual learning to prevent catastrophic forgetting of past tasks and provide an informative prior when learning new tasks. We revisit sequential Bayesian inference and test whether having access to the true posterior is guaranteed to prevent catastrophic forgetting in Bayesian neural networks. To do this we perform sequential Bayesian inference usin…

    Submitted 9 July, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Published in Entropy, 24 pages, 14 figures

  18. arXiv:2212.08571  [pdf, other]

    cs.SD cs.LG eess.AS stat.AP

    Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

    Authors: Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris Holmes, David Hurley, Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts, Björn W. Schuller, Ana Tendero-Cañadas, Tracey Thornley, Alexander Titcomb

    Abstract: Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously ass…

    Submitted 27 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  19. arXiv:2212.08570  [pdf, other]

    cs.SD cs.LG eess.AS

    Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

    Authors: Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven Gilmour, Stephen Roberts, Chris Holmes

    Abstract: Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK government's pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata…

    Submitted 2 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  20. arXiv:2212.07738  [pdf]

    cs.SD cs.LG eess.AS

    A large-scale and PCR-referenced vocal audio dataset for COVID-19

    Authors: Jobie Budd, Kieran Baker, Emma Karoune, Harry Coppock, Selina Patel, Ana Tendero Cañadas, Alexander Titcomb, Richard Payne, David Hurley, Sabrina Egglestone, Lorraine Butler, Jonathon Mellor, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Radka Jersakova, Rachel A. McKendry, Peter Diggle, Sylvia Richardson, Björn W. Schuller, Steven Gilmour, Davide Pigoli, Stephen Roberts, Josef Packham, Tracey Thornley, et al. (1 additional author not shown)

    Abstract: The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmi…

    Submitted 3 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 39 pages, 4 figures

  21. arXiv:2211.15944  [pdf, other]

    cs.LG cs.AI

    The Effectiveness of World Models for Continual Reinforcement Learning

    Authors: Samuel Kessler, Mateusz Ostaszewski, Michał Bortkiewicz, Mateusz Żarski, Maciej Wołczyk, Jack Parker-Holder, Stephen J. Roberts, Piotr Miłoś

    Abstract: World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning - a situation when the agent faces changing environments. World models typically employ a replay buffer for training, which can be naturally extended to continual learning. We systematically study how different selective experience replay meth…

    Submitted 12 July, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted at CoLLAs 2023, 21 pages, 15 figures

  22. arXiv:2210.12719  [pdf, other]

    cs.LG cs.AI

    Learning General World Models in a Handful of Reward-Free Deployments

    Authors: Yingchen Xu, Jack Parker-Holder, Aldo Pacchiano, Philip J. Ball, Oleh Rybkin, Stephen J. Roberts, Tim Rocktäschel, Edward Grefenstette

    Abstract: Building generally capable agents is a grand challenge for deep reinforcement learning (RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate generalization, exploration should be task agnostic; 2) to facilitate scalability, exploration policies should collect large quantities of data without costly centralized retraining. Combining these two properties, we i…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: To be published at NeurIPS 2022. Code and videos available at https://ycxuyingchen.github.io/cascade/

  23. arXiv:2208.09968  [pdf, other]

    q-fin.TR cs.IR cs.LG q-fin.PM

    Transfer Ranking in Finance: Applications to Cross-Sectional Momentum with Data Scarcity

    Authors: Daniel Poh, Stephen Roberts, Stefan Zohren

    Abstract: Cross-sectional strategies are a classical and popular trading style, with recent high performing variants incorporating sophisticated neural architectures. While these strategies have been applied successfully to data-rich settings involving mature assets with long histories, deploying them on instruments with limited samples generally produce over-fitted models with degraded performance. In this…

    Submitted 21 February, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: 18 pages, 12 figures

  24. arXiv:2208.04700  [pdf, other]

    cs.CY

    Decolonisation, Global Data Law, and Indigenous Data Sovereignty

    Authors: Jennafer Shae Roberts, Laura N Montoya

    Abstract: This research examines the impact of digital neo-colonialism on the Global South and encourages the development of legal and economic incentives to protect Indigenous cultures globally. Data governance is discussed in an evolutionary context while focusing on data sharing and data mining. Case studies that exemplify the need to steer global data law towards protecting the earth, while addressing i…

    Submitted 28 July, 2022; originally announced August 2022.

    Comments: 16 pages, 1 table

  25. arXiv:2207.00986  [pdf, other]

    cs.LG cs.AI cs.CV

    Stabilizing Off-Policy Deep Reinforcement Learning from Pixels

    Authors: Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan

    Abstract: Off-policy reinforcement learning (RL) from pixel observations is notoriously unstable. As a result, many successful algorithms must combine different domain-specific practices and auxiliary losses to learn meaningful behaviors in complex environments. In this work, we provide novel analysis demonstrating that these instabilities arise from performing temporal-difference learning with a convolutio…

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: Short presentation at ICML 2022

  26. arXiv:2205.06799  [pdf, other]

    cs.SD cs.LG eess.AS

    The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes

    Authors: Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen Roberts

    Abstract: The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the Vocalisations and Stuttering Sub-Challenges, a classification on human non-verbal vocalisations and speech has to be made; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch senso…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: 5 pages, part of the ACM Multimedia 2022 Grand Challenge "The ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE 2022)"

    MSC Class: 68 ACM Class: I.2.7; I.5.0; J.3

  27. arXiv:2204.07612  [pdf, ps, other]

    cs.AI cs.CY cs.GL cs.HC cs.NE

    Contextualizing Artificially Intelligent Morality: A Meta-Ethnography of Top-Down, Bottom-Up, and Hybrid Models for Theoretical and Applied Ethics in Artificial Intelligence

    Authors: Jennafer S. Roberts, Laura N. Montoya

    Abstract: In this meta-ethnography, we explore three different angles of ethical artificial intelligence (AI) design implementation including the philosophical ethical viewpoint, the technical perspective, and framing through a political lens. Our qualitative research includes a literature review that highlights the cross-referencing of these angles by discussing the value and drawbacks of contrastive top-d…

    Submitted 8 September, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: 22 pages, 4 tables, accepted for publication in the Future of Information and Communication Conference (FICC) 2023 proceedings will be published in Springer series "Lecture Notes in Networks and Systems" and submitted for consideration to Web of Science, SCOPUS, INSPEC, WTI Frankfurt eG, zbMATH and SCImago

  28. arXiv:2203.08015  [pdf, other]

    cs.LG cs.AI cs.GT stat.ML

    On-the-fly Strategy Adaptation for ad-hoc Agent Coordination

    Authors: Jaleh Zand, Jack Parker-Holder, Stephen J. Roberts

    Abstract: Training agents in cooperative settings offers the promise of AI agents able to interact effectively with humans (and other agents) in the real world. Multi-agent reinforcement learning (MARL) has the potential to achieve this goal, demonstrating success in a series of challenging problems. However, whilst these advances are significant, the vast majority of focus has been on the self-play paradig…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Extended abstract published in AAMAS 2022

  29. arXiv:2112.08534  [pdf, other]

    cs.LG q-fin.TR stat.ML

    Trading with the Momentum Transformer: An Intelligent and Interpretable Architecture

    Authors: Kieran Wood, Sven Giegerich, Stephen Roberts, Stefan Zohren

    Abstract: We introduce the Momentum Transformer, an attention-based deep-learning architecture, which outperforms benchmark time-series momentum and mean-reversion trading strategies. Unlike state-of-the-art Long Short-Term Memory (LSTM) architectures, which are sequential in nature and tailored to local processing, an attention mechanism provides our architecture with a direct connection to all previous ti…

    Submitted 22 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: included motivation for attention mechanism and additional architecture details

  30. arXiv:2110.11286  [pdf, other]

    cs.LG physics.comp-ph

    One-Shot Transfer Learning of Physics-Informed Neural Networks

    Authors: Shaan Desai, Marios Mattheakis, Hayden Joy, Pavlos Protopapas, Stephen Roberts

    Abstract: Solving differential equations efficiently and accurately sits at the heart of progress in many areas of scientific research, from classical dynamical systems to quantum mechanics. There is a surge of interest in using Physics-Informed Neural Networks (PINNs) to tackle such problems as they provide numerous benefits over traditional numerical approaches. Despite their potential benefits for solvin…

    Submitted 5 July, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: ICML AI4Science Workshop 2022

  31. arXiv:2110.07607  [pdf, other]

    cs.SD cs.CV eess.AS

    HumBugDB: A Large-scale Acoustic Mosquito Dataset

    Authors: Ivan Kiskin, Marianne Sinka, Adam D. Cobb, Waqas Rafique, Lawrence Wang, Davide Zilli, Benjamin Gutteridge, Rinita Dam, Theodoros Marinos, Yunpeng Li, Dickson Msaky, Emmanuel Kaindoa, Gerard Killeen, Eva Herreros-Moya, Kathy J. Willis, Stephen J. Roberts

    Abstract: This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and y…

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. 10 pages main, 39 pages including appendix. This paper accompanies the dataset found at https://zenodo.org/record/4904800 with corresponding code at https://github.com/HumBug-Mosquito/HumBugDB

    ACM Class: E.0; I.2.1; J.3

  32. arXiv:2110.05167  [pdf, other]

    stat.ML cs.LG

    Robust and Scalable SDE Learning: A Functional Perspective

    Authors: Scott Cameron, Tyron Cameron, Arnu Pretorius, Stephen Roberts

    Abstract: Stochastic differential equations provide a rich class of flexible generative models, capable of describing a wide range of spatio-temporal processes. A host of recent work looks to learn data-representing SDEs, using neural networks and other flexible function approximators. Despite these advances, learning remains computationally expensive due to the sequential nature of SDE integrators. In this…

    Submitted 11 October, 2021; originally announced October 2021.

  33. arXiv:2110.04135  [pdf, other]

    cs.LG cs.AI

    Revisiting Design Choices in Offline Model-Based Reinforcement Learning

    Authors: Cong Lu, Philip J. Ball, Jack Parker-Holder, Michael A. Osborne, Stephen J. Roberts

    Abstract: Offline reinforcement learning enables agents to leverage large pre-collected datasets of environment transitions to learn control policies, circumventing the need for potentially expensive or unsafe online data collection. Significant progress has been made recently in offline model-based reinforcement learning, approaches which leverage a learned dynamics model. This typically involves construct…

    Submitted 16 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Spotlight @ ICLR 2022; Spotlight @ RL4RealLife Workshop ICML2021

  34. arXiv:2107.08872  [pdf]

    cs.SI

    Proximity in face-to-face interaction is associated with mobile phone communication

    Authors: Tobias Bornakke, Talayeh Aledavood, Jari Saramäki, Sam G. B. Roberts

    Abstract: The frequency of mobile communication is often used as an indicator of the strength of a tie between two individuals, but how mobile communication relates to other forms of behaving close in social relationships is poorly understood. We used a unique multi-channel 10-month dataset from 510 participants to examine how the frequency of mobile communication was related to the frequency of face-to-fac…

    Submitted 20 July, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: 31 pages, 4 Tables, 1 Figure

  35. arXiv:2107.08024  [pdf, other]

    cs.LG nlin.CD physics.comp-ph

    Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems

    Authors: Shaan Desai, Marios Mattheakis, David Sondak, Pavlos Protopapas, Stephen Roberts

    Abstract: Accurately learning the temporal behavior of dynamical systems requires models with well-chosen learning biases. Recent innovations embed the Hamiltonian and Lagrangian formalisms into neural networks and demonstrate a significant improvement over other approaches in predicting trajectories of physical systems. These methods generally tackle autonomous systems that depend implicitly on time or sys…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: [under review]

    Journal ref: Phys. Rev. E 104, 034312 (2021)

  36. arXiv:2106.15883  [pdf, other]

    cs.LG

    Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL

    Authors: Jack Parker-Holder, Vu Nguyen, Shaan Desai, Stephen Roberts

    Abstract: Despite a series of recent successes in reinforcement learning (RL), many RL algorithms remain sensitive to hyperparameters. As such, there has recently been interest in the field of AutoRL, which seeks to automate design decisions to create more general algorithms. Recent work suggests that population based approaches may be effective AutoRL algorithms, by learning hyperparameter schedules on the…

    Submitted 30 June, 2021; originally announced June 2021.

  37. arXiv:2106.07452  [pdf, other]

    stat.ML cs.LG

    Marginalising over Stationary Kernels with Bayesian Quadrature

    Authors: Saad Hamid, Sebastian Schulze, Michael A. Osborne, Stephen J. Roberts

    Abstract: Marginalising over families of Gaussian Process kernels produces flexible model classes with well-calibrated uncertainty estimates. Existing approaches require likelihood evaluations of many kernels, rendering them prohibitively expensive for larger datasets. We propose a Bayesian Quadrature scheme to make this marginalisation more efficient and thereby more practical. Through use of the maximum m…

    Submitted 15 March, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  38. arXiv:2106.02940  [pdf, other]

    cs.LG cs.AI

    Same State, Different Task: Continual Reinforcement Learning without Interference

    Authors: Samuel Kessler, Jack Parker-Holder, Philip Ball, Stefan Zohren, Stephen J. Roberts

    Abstract: Continual Learning (CL) considers the problem of training an agent sequentially on a set of tasks while seeking to retain performance on all previous tasks. A key challenge in CL is catastrophic forgetting, which arises when performance on a previously mastered task is reduced when learning a new task. While a variety of methods exist to combat forgetting, in some cases tasks are fundamentally inc…

    Submitted 15 March, 2022; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: Accepted as an oral at AAAI 2022. 17 pages and 12 figures

  39. arXiv:2106.02469  [pdf, other]

    cs.LG stat.ML

    Can convolutional ResNets approximately preserve input distances? A frequency analysis perspective

    Authors: Lewis Smith, Joost van Amersfoort, Haiwen Huang, Stephen Roberts, Yarin Gal

    Abstract: ResNets constrained to be bi-Lipschitz, that is, approximately distance preserving, have been a crucial component of recently proposed techniques for deterministic uncertainty quantification in neural models. We show that theoretical justifications for recent regularisation schemes trying to enforce such a constraint suffer from a crucial flaw -- the theoretical link between the regularisation sch…

    Submitted 17 June, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Main paper 10 pages including references, appendix 10 pages. 7 figures and 6 tables including appendix

  40. arXiv:2105.13727  [pdf, other]

    stat.ML cs.LG q-fin.TR

    Slow Momentum with Fast Reversion: A Trading Strategy Using Deep Learning and Changepoint Detection

    Authors: Kieran Wood, Stephen Roberts, Stefan Zohren

    Abstract: Momentum strategies are an important part of alternative investments and are at the heart of commodity trading advisors (CTAs). These strategies have, however, been found to have difficulties adjusting to rapid changes in market conditions, such as during the 2020 market crash. In particular, immediately after momentum turning points, where a trend reverses from an uptrend (downtrend) to a downtre…

    Submitted 20 December, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: minor changes made to methodology to match implementation

    Journal ref: The Journal of Financial Data Science Winter 2022, jfds.2021.1.081

  41. arXiv:2105.10019  [pdf, other]

    q-fin.PM cs.IR cs.LG q-fin.TR

    Enhancing Cross-Sectional Currency Strategies by Context-Aware Learning to Rank with Self-Attention

    Authors: Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts

    Abstract: The performance of a cross-sectional currency strategy depends crucially on accurately ranking instruments prior to portfolio construction. While this ranking step is traditionally performed using heuristics, or by sorting the outputs produced by pointwise regression or classification techniques, strategies using Learning to Rank algorithms have recently presented themselves as competitive and via…

    Submitted 27 January, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 10 pages, 4 figures

  42. arXiv:2104.13554  [pdf, other]

    cs.CE

    Mesoscale simulation of woven composite design decisions

    Authors: Lincoln N. Collins, Scott A. Roberts

    Abstract: Characterizing the connection between material design decisions/parameters and their effective properties allows for accelerated materials development and optimization. We present a global sensitivity analysis of woven composite thermophysical properties, including density, volume fraction, thermal conductivity, specific heat, moduli, permeability, and tortuosity, predicted using mesoscale finite…

    Submitted 27 April, 2021; originally announced April 2021.

  43. arXiv:2104.05632  [pdf, other]

    cs.LG cs.AI

    Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment

    Authors: Philip J. Ball, Cong Lu, Jack Parker-Holder, Stephen Roberts

    Abstract: Reinforcement learning from large-scale offline datasets provides us with the ability to learn policies without potentially unsafe or impractical exploration. Significant progress has been made in the past few years in dealing with the challenge of correcting for differing behavior between the data collection and learned policies. However, little attention has been paid to potentially changing dyn…

    Submitted 3 August, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted @ ICML 2021; Spotlight @ ICLR 2021 "Self-Supervision for Reinforcement Learning Workshop"

  44. arXiv:2104.03180  [pdf, other]

    cs.LG stat.ML

    Adversarial Robustness Guarantees for Gaussian Processes

    Authors: Andrea Patane, Arno Blaas, Luca Laurenti, Luca Cardelli, Stephen Roberts, Marta Kwiatkowska

    Abstract: Gaussian processes (GPs) enable principled computation of model uncertainty, making them attractive for safety-critical applications. Such scenarios demand that GP decisions are not only accurate, but also robust to perturbations. In this paper we present a framework to analyse adversarial robustness of GPs, defined as invariance of the model's decision to bounded perturbations. Given a compact su…

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Submitted for publication

  45. arXiv:2101.11331  [pdf, other]

    cs.LG math.OC

    OffCon$^3$: What is state of the art anyway?

    Authors: Philip J. Ball, Stephen J. Roberts

    Abstract: Two popular approaches to model-free continuous control tasks are SAC and TD3. At first glance these approaches seem rather different; SAC aims to solve the entropy-augmented MDP by minimising the KL-divergence between a stochastic proposal policy and a hypothetical energy-based soft Q-function policy, whereas TD3 is derived from DPG, which uses a deterministic policy to perform policy gradient asce…

    Submitted 14 March, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  46. arXiv:2101.02689  [pdf, ps, other]

    stat.ML cs.LG

    The Effect of Prior Lipschitz Continuity on the Adversarial Robustness of Bayesian Neural Networks

    Authors: Arno Blaas, Stephen J. Roberts

    Abstract: It is desirable, and often a necessity, for machine learning models to be robust against adversarial attacks. This is particularly true for Bayesian models, as they are well-suited for safety-critical applications, in which adversarial attacks can have catastrophic outcomes. In this work, we take a deeper look at the adversarial robustness of Bayesian Neural Networks (BNNs). In particular, we cons…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 4 pages, 2 tables, AAAI 2021 Workshop Towards Robust, Secure and Efficient Machine Learning

  47. Quantifying the unknown impact of segmentation uncertainty on image-based simulations

    Authors: Michael C. Krygier, Tyler LaBonte, Carianne Martinez, Chance Norris, Krish Sharma, Lincoln N. Collins, Partha P. Mukherjee, Scott A. Roberts

    Abstract: Image-based simulation, the use of 3D images to calculate physical quantities, fundamentally relies on image segmentation to create the computational geometry. However, this process introduces image segmentation uncertainty because there is a variety of different segmentation tools (both manual and machine-learning-based) that will each produce a unique and valid segmentation. First, we demonstrat…

    Submitted 9 September, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Journal ref: Nature Communications 12, 5414 (2021)

  48. arXiv:2012.07149  [pdf, other]

    q-fin.TR cs.IR cs.LG q-fin.PM

    Building Cross-Sectional Systematic Strategies By Learning to Rank

    Authors: Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts

    Abstract: The success of a cross-sectional systematic strategy depends critically on accurately ranking assets prior to portfolio construction. Contemporary techniques perform this ranking step either with simple heuristics or by sorting outputs from standard regression or classification models, which have been demonstrated to be sub-optimal for ranking in other domains (e.g. information retrieval). To addr…

    Submitted 13 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures

  49. Towards real-time finite-strain anisotropic thermo-visco-elastodynamic analysis of soft tissues for thermal ablative therapy

    Authors: Jinao Zhang, Remi Jacob Lay, Stuart K. Roberts, Sunita Chauhan

    Abstract: Accurate and efficient prediction of soft tissue temperatures is essential to computer-assisted treatment systems for thermal ablation. It can be used to predict tissue temperatures and ablation volumes for personalised treatment planning and image-guided intervention. Numerically, it requires full nonlinear modelling of the coupled computational bioheat transfer and biomechanics, and efficient so…

    Submitted 31 December, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: Submitted to Computer Methods and Programs in Biomedicine

    Journal ref: Computer Methods and Programs in Biomedicine, vol. 198, pp. 105789, 2021

  50. arXiv:2008.03273  [pdf, other]

    cs.LG eess.SY stat.ML

    SafePILCO: a software tool for safe and data-efficient policy synthesis

    Authors: Kyriakos Polymenakos, Nikitas Rontsis, Alessandro Abate, Stephen Roberts

    Abstract: SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known PILCO algorithm, originally written in MATLAB, to support safe learning. We provide a Python implementation and leverage existing libraries that allow the codebase to remain short and modular, which is appropriate for wider use by the verification, reinforcement learning, and co…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: Shorter Version published as a software tool demonstration at QEST 2020