Showing 1–50 of 85 results for author: Hanna, J

  1. arXiv:2406.17168  [pdf, other]

    cs.LG cs.AI cs.RO

    Reinforcement Learning via Auxiliary Task Distillation

    Authors: Abhinav Narayan Harish, Larry Heck, Josiah P. Hanna, Zsolt Kira, Andrew Szot

    Abstract: We present Reinforcement Learning via Auxiliary Task Distillation (AuxDistill), a new method that enables reinforcement learning (RL) to perform long-horizon robot control problems by distilling behaviors from auxiliary RL tasks. AuxDistill achieves this by concurrently carrying out multi-task RL with auxiliary tasks, which are easier to learn and relevant to the main task. A weighted distillation…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.05064  [pdf, other]

    cs.LG

    Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Qiaomin Xie, Robert Nowak

    Abstract: In this paper, we study the multi-task structured bandit problem where the goal is to learn a near-optimal algorithm that minimizes cumulative regret. The tasks share a common structure and the algorithm exploits the shared structure to minimize the cumulative regret for an unseen but related test task. We use a transformer as a decision-making algorithm to learn this shared structure so as to general…

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.02165  [pdf, other]

    cs.LG

    SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak

    Abstract: In this paper, we study safe data collection for the purpose of policy evaluation in tabular Markov decision processes (MDPs). In policy evaluation, we are given a \textit{target} policy and asked to estimate the expected cumulative reward it will obtain. Policy evaluation requires data and we are interested in the question of what \textit{behavior} policy should collect the data for the most accu…

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2405.07838  [pdf, other]

    cs.LG cs.AI

    Adaptive Exploration for Data-Efficient General Value Function Evaluations

    Authors: Arushi Jain, Josiah P. Hanna, Doina Precup

    Abstract: General Value Functions (GVFs) (Sutton et al, 2011) are an established way to represent predictive knowledge in reinforcement learning. Each GVF computes the expected return for a given policy, based on a unique pseudo-reward. Multiple GVFs can be estimated in parallel using off-policy learning from a single stream of data, often sourced from a fixed behavior policy or pre-collected dataset. This…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 20 pages, 9 figures, Under Review

  5. arXiv:2405.06638  [pdf, ps, other]

    cond-mat.soft

    Assorted remarks on bending measures and energies for plates and shells, and their invariance properties

    Authors: E. Vitral, J. A. Hanna

    Abstract: In this note, we address several issues, including some raised in recent works and commentary, related to bending measures and energies for plates and shells, and certain of their invariance properties. We discuss the distinction between definitions and results in our and others' approaches, correct an error and citation oversights in our work, and provide additional brief observations regarding t…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: an extended comment on our and others' works, including a correction

  6. arXiv:2403.15356  [pdf, other]

    cs.CV

    Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation

    Authors: Zhitong Xiong, Yi Wang, Fahong Zhang, Adam J. Stewart, Joëlle Hanna, Damian Borth, Ioannis Papoutsis, Bertrand Le Saux, Gustau Camps-Valls, Xiao Xiang Zhu

    Abstract: The development of foundation models has revolutionized our ability to interpret the Earth's surface using satellite observational data. Traditional models have been siloed, tailored to specific sensors or data types like optical, radar, and hyperspectral, each with its own unique characteristics. This specialization hinders the potential for a holistic analysis that could benefit from the combine…

    Submitted 7 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 36 pages, 7 figures

  7. arXiv:2402.07102  [pdf, other]

    cs.LG cs.AI

    Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments

    Authors: Jeongyeol Kwon, Liu Yang, Robert Nowak, Josiah Hanna

    Abstract: Learning a good history representation is one of the core challenges of reinforcement learning (RL) in partially observable environments. Recent works have shown the advantages of various auxiliary tasks for facilitating representation learning. However, the effectiveness of such auxiliary tasks has not been fully convincing, especially in partially observable environments that require long-term m…

    Submitted 10 February, 2024; originally announced February 2024.

  8. arXiv:2311.08290  [pdf, other]

    cs.LG

    On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling

    Authors: Nicholas E. Corrado, Josiah P. Hanna

    Abstract: On-policy reinforcement learning (RL) algorithms perform policy updates using i.i.d. trajectories collected by the current policy. However, after observing only a finite number of trajectories, on-policy sampling may produce data that fails to match the expected on-policy data distribution. This sampling error leads to noisy updates and data inefficient on-policy learning. Recent work in the polic…

    Submitted 14 November, 2023; originally announced November 2023.

  9. arXiv:2311.00327  [pdf, other]

    cs.LG

    Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

    Authors: Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak

    Abstract: We study multi-task representation learning for the problem of pure exploration in bilinear bandits. In bilinear bandits, an action takes the form of a pair of arms from two different entity types and the reward is a bilinear function of the known feature vectors of the arms. In the \textit{multi-task bilinear bandit problem}, we aim to find optimal actions for multiple tasks that share a common l…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  10. arXiv:2310.18409  [pdf, other]

    cs.LG

    State-Action Similarity-Based Representations for Off-Policy Evaluation

    Authors: Brahma S. Pavse, Josiah P. Hanna

    Abstract: In reinforcement learning, off-policy evaluation (OPE) is the problem of estimating the expected return of an evaluation policy given a fixed dataset that was collected by running one or more different policies. One of the more empirically successful algorithms for OPE has been the fitted q-evaluation (FQE) algorithm that uses temporal difference updates to learn an action-value function, which is…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023

  11. arXiv:2310.18247  [pdf, other]

    cs.LG cs.RO

    Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

    Authors: Nicholas E. Corrado, Yuxiao Qu, John U. Balis, Adam Labiosa, Josiah P. Hanna

    Abstract: In offline reinforcement learning (RL), an RL agent learns to solve a task using only a fixed dataset of previously collected data. While offline RL has been successful in learning real-world robot control policies, it typically requires large amounts of expert-quality data to learn effective policies that generalize to out-of-distribution states. Unfortunately, such data is often difficult and ex…

    Submitted 16 March, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  12. arXiv:2310.17786  [pdf, other]

    cs.LG

    Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates

    Authors: Nicholas E. Corrado, Josiah P. Hanna

    Abstract: Recently, data augmentation (DA) has emerged as a method for leveraging domain knowledge to inexpensively generate additional data in reinforcement learning (RL) tasks, often yielding substantial improvements in data efficiency. While prior work has demonstrated the utility of incorporating augmented data directly into model-free RL updates, it is not well-understood when a particular DA strategy…

    Submitted 16 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  13. arXiv:2310.17478  [pdf, other]

    cond-mat.soft

    Buckling mediated by mobile localized elastic excitations

    Authors: R. S. Hutton, E. Vitral, E. Hamm, J. A. Hanna

    Abstract: Experiments reveal that structural transitions in thin sheets are mediated by the passage of transient and stable mobile localized elastic excitations. These ``crumples'' or ``d-cones'' nucleate, propagate, interact, annihilate, and escape. Much of the dynamics occurs on millisecond time scales. Nucleation sites correspond to regions where generators of the ideal unstretched surface converge. Addi…

    Submitted 26 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: expanded data set, changes to text including rearrangements to suit journal format. contains link to videos

  14. arXiv:2307.01741  [pdf, other]

    cs.CV

    Ben-ge: Extending BigEarthNet with Geographical and Environmental Data

    Authors: Michael Mommert, Nicolas Kesseli, Joëlle Hanna, Linus Scheibenreif, Damian Borth, Begüm Demir

    Abstract: Deep learning methods have proven to be a powerful tool in the analysis of large amounts of complex Earth observation data. However, while Earth observation data are multi-modal in most cases, only single or few modalities are typically considered. In this work, we present the ben-ge dataset, which supplements the BigEarthNet-MM dataset by compiling freely and globally available geographical and e…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted for presentation at the IEEE International Geoscience and Remote Sensing Symposium 2023

  15. arXiv:2306.01896  [pdf, other]

    cs.LG

    Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

    Authors: Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, Josiah P. Hanna

    Abstract: In many reinforcement learning (RL) applications, we want policies that reach desired states and then keep the controlled system within an acceptable region around the desired states over an indefinite period of time. This latter objective is called stability and is especially important when the state space is unbounded, such that the states can be arbitrarily far from each other and the agent can…

    Submitted 26 May, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2024

  16. arXiv:2305.14133  [pdf, other]

    cs.LG

    Conditional Mutual Information for Disentangled Representations in Reinforcement Learning

    Authors: Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht

    Abstract: Reinforcement Learning (RL) environments can produce training data with spurious correlations between features due to the amount of training data or its limited feature coverage. This can lead to RL agents encoding these misleading correlations in their latent representation, preventing the agent from generalising if the correlation changes within the environment or when deployed in the real world…

    Submitted 12 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Conference on Neural Information Processing Systems (NeurIPS), 2023

  17. arXiv:2303.12729  [pdf, other]

    physics.class-ph physics.ed-ph

    Comment on the elastica section in Thorne and Blandford "Modern Classical Physics", the shape of things, and the aspect ratio of reality

    Authors: J. A. Hanna

    Abstract: I point out and diagnose an error in a figure in a textbook on classical physics. The error helps to illustrate a pitfall encountered when dealing with the shapes of objects, and perhaps also reflects general cultural attitudes in physics. Another, less interesting, error is noted in passing.

    Submitted 22 March, 2023; originally announced March 2023.

  18. arXiv:2301.12357  [pdf, other]

    stat.ML cs.LG

    SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits

    Authors: Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert Nowak

    Abstract: In this paper, we study the problem of optimal data collection for policy evaluation in linear bandits. In policy evaluation, we are given a target policy and asked to estimate the expected reward it will obtain when executed in a multi-armed bandit environment. Our work is the first work that focuses on such optimal data collection strategy for policy evaluation involving heteroscedastic reward n…

    Submitted 29 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

  19. arXiv:2301.02428  [pdf, other]

    math.NA

    Sensitivity analysis using Physics-informed neural networks

    Authors: John M. Hanna, José V. Aguado, Sebastien Comas-Cardona, Ramzi Askri, Domenico Borzacchiello

    Abstract: The goal of this paper is to provide a simple approach to perform local sensitivity analysis using Physics-informed neural networks (PINN). The main idea lies in adding a new term in the loss function that regularizes the solution in a small neighborhood near the nominal value of the parameter of interest. The added term represents the derivative of the loss function with respect to the parameter…

    Submitted 6 June, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 22 pages, 11 figures

  20. arXiv:2212.08302  [pdf, other]

    cs.LG cs.AI

    Safe Evaluation For Offline Learning: Are We Ready To Deploy?

    Authors: Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor

    Abstract: The world currently offers an abundance of data in multiple domains, from which we can learn reinforcement learning (RL) policies without further interaction with the environment. RL agents learning offline from such data is possible but deploying them while learning might be dangerous in domains where safety is critical. Therefore, it is essential to find a way to estimate how a newly-learned age…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2021 Workshop on Deployable Decision Making in Embodied Systems [Spotlight]

  21. arXiv:2212.07486  [pdf, other]

    cs.LG cs.AI

    Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction

    Authors: Brahma S. Pavse, Josiah P. Hanna

    Abstract: We consider the problem of off-policy evaluation (OPE) in reinforcement learning (RL), where the goal is to estimate the performance of an evaluation policy, $π_e$, using a fixed dataset, $\mathcal{D}$, collected by one or more policies that may be different from $π_e$. Current OPE algorithms may produce poor OPE estimates under policy distribution shift i.e., when the probability of a particular…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI 2023

  22. arXiv:2211.06129  [pdf]

    cond-mat.mtrl-sci physics.chem-ph

    The Impact of Inter-grain Phases on the Ionic Conductivity of LAGP Solid Electrolyte Prepared by Spark Plasma Sintering

    Authors: Sorina Cretu, David G. Bradley, Omer Ulas Kudu, Li Patrick Wen Feng, Linh Lan Nguyen, Tuan Tu Nguyen, Arash Jamali, Jean-Noel Chotard, Vincent Seznec, John V. Hanna, Arnaud Demortière, Martial Duchamp

    Abstract: Li1.5Al0.5Ge1.5(PO4)3 (LAGP) is a promising oxide solid electrolyte for all-solid-state batteries due to its excellent air stability, wide electrochemical stability window and cost-effective precursor materials. However, further improvement in their ionic conductivity performance is hindered by the presence of inter-grain phases leading to a major obstacle to the advanced design of oxide based sol…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 31 pages, 5 figures (SI, 13 pages, 12 figures)

  23. arXiv:2209.09446  [pdf, other]

    cs.LG cs.AI

    A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret

    Authors: Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna

    Abstract: In various control task domains, existing controllers provide a baseline level of performance that -- though possibly suboptimal -- should be maintained. Reinforcement learning (RL) algorithms that rely on extensive exploration of the state and action space can be used to optimize a control policy. However, fully exploratory RL algorithms may decrease performance below a baseline level during trai…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

  24. arXiv:2208.05657  [pdf, other]

    physics.flu-dyn math-ph physics.class-ph

    Momentum and pseudomomentum in a shallow water equation

    Authors: J. A. Hanna

    Abstract: A basic shallow water system with variable topography is analyzed from the point of view of a Lagrangian derivation of momentum, energy, and pseudomomentum balances. A two-dimensional action and associated momentum equation are derived. The latter is further manipulated to derive additional equations for energy and pseudomomentum. This revealed structure emphasizes broken symmetries in space and a…

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: at the beach

  25. arXiv:2207.05480  [pdf, other]

    cs.LG

    Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

    Authors: Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht

    Abstract: Reinforcement Learning (RL) agents are often unable to generalise well to environment variations in the state space that were not observed during training. This issue is especially problematic for image-based RL, where a change in just one variable, such as the background colour, can change many pixels in the image. The changed pixels can lead to drastic changes in the agent's latent representatio…

    Submitted 27 February, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: International Conference on Learning Representations (ICLR), 2023

  26. arXiv:2206.07864  [pdf, other]

    physics.class-ph cond-mat.soft

    Anomalous curvature evolution and geometric regularization of energy focusing in the snapping dynamics of a flexible body

    Authors: A. R. Dehadrai, J. A. Hanna

    Abstract: We examine the focusing of kinetic energy and the amplification of various quantities during the snapping motion of the free end of a flexible structure. This brief but violent event appears to be a regularized finite-time singularity, with remarkably large spikes in velocity, acceleration, and tension easily induced by generic initial and boundary conditions. A numerical scheme for the inextensib…

    Submitted 7 April, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: revised text and figures

  27. Avoiding localization instabilities in rotary pleating

    Authors: Tian Yu, J. A. Hanna

    Abstract: Rotary pleating is a widely used process for making filters out of nonwoven fabric sheets. This involves indirect elastic-plastic bending of pre-weakened creases by continuously injecting material into an accordion-shaped pack. This step can fail through a localization instability that creates a kink in a pleat facet instead of in the desired crease location. In the present work, we consider the e…

    Submitted 20 March, 2023; v1 submitted 4 June, 2022; originally announced June 2022.

  28. arXiv:2205.14323  [pdf, other]

    cs.DB cs.LG

    Multi-agent Databases via Independent Learning

    Authors: Chi Zhang, Olga Papaemmanouil, Josiah P. Hanna, Aditya Akella

    Abstract: Machine learning is rapidly being used in database research to improve the effectiveness of numerous tasks including but not limited to query optimization, workload scheduling, physical design, etc. Currently, the research focus has been on replacing a single database component responsible for one task by its learning-based counterpart. However, query performance is not simply determined by the per…

    Submitted 5 August, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Journal ref: AIDB@VLDB 2022 Proceedings of 4th International Workshop on Applied AI for Database Systems and Applications

  29. arXiv:2205.01009  [pdf, other]

    cs.CC cs.DM cs.DS math.CO

    Token sliding on graphs of girth five

    Authors: Valentin Bartier, Nicolas Bousquet, Jihad Hanna, Amer E. Mouawad, Sebastian Siebertz

    Abstract: In the Token Sliding problem we are given a graph $G$ and two independent sets $I_s$ and $I_t$ in $G$ of size $k \geq 1$. The goal is to decide whether there exists a sequence $\langle I_1, I_2, \ldots, I_\ell \rangle$ of independent sets such that for all $i \in \{1,\ldots, \ell\}$ the set $I_i$ is an independent set of size $k$, $I_1 = I_s$, $I_\ell = I_t$ and…

    Submitted 2 May, 2022; originally announced May 2022.

  30. arXiv:2203.04510  [pdf, other]

    cs.LG

    ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak

    Abstract: This paper studies the problem of data collection for policy evaluation in Markov decision processes (MDPs). In policy evaluation, we are given a target policy and asked to estimate the expected cumulative reward it will obtain in an environment formalized as an MDP. We develop theory for optimal data collection within the class of tree-structured MDPs by first deriving an oracle data collection s…

    Submitted 17 June, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted for the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022)

  31. Energies for elastic plates and shells from quadratic-stretch elasticity

    Authors: E. Vitral, J. A. Hanna

    Abstract: We derive stretching and bending energies for isotropic elastic plates and shells. Through the dimensional reduction of a bulk elastic energy quadratic in Biot strains, we obtain two-dimensional bending energies quadratic in bending measures featuring a bilinear coupling of stretches and geometric curvatures. For plates, the bending measure is invariant under spatial dilations and naturally extend…

    Submitted 18 June, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: final minor changes

  32. arXiv:2111.14552  [pdf, other]

    cs.LG

    Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning

    Authors: Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah P. Hanna

    Abstract: Reinforcement learning (RL) algorithms are often categorized as either on-policy or off-policy depending on whether they use data from a target policy of interest or from a different behavior policy. In this paper, we study a subtle distinction between on-policy data and on-policy sampling in the context of the RL sub-problem of policy evaluation. We observe that on-policy sampling may fail to mat…

    Submitted 10 October, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Published in 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  33. Dilation-invariant bending of elastic plates, and broken symmetry in shells

    Authors: E. Vitral, J. A. Hanna

    Abstract: We propose bending energies for isotropic elastic plates and shells. For a plate, we define and employ a surface tensor that symmetrically couples stretch and curvature such that any elastic energy density constructed from its invariants is invariant under spatial dilations. This kinematic measure and its corresponding isotropic quadratic energy resolve outstanding issues in thin structure elastic…

    Submitted 18 June, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: final minor changes

  34. arXiv:2109.14290  [pdf, other]

    math.NA physics.flu-dyn

    Residual-based adaptivity for two-phase flow simulation in porous media using Physics-informed Neural Networks

    Authors: John M. Hanna, Jose V. Aguado, Sebastien Comas-Cardona, Ramzi Askri, Domenico Borzacchiello

    Abstract: This paper aims to provide a machine learning framework to simulate two-phase flow in porous media. The proposed algorithm is based on Physics-informed neural networks (PINN). A novel residual-based adaptive PINN is developed and compared with the residual-based adaptive refinement (RAR) method and with PINN with fixed collocation points. The proposed algorithm is expected to have great potential…

    Submitted 10 February, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

  35. arXiv:2108.02530  [pdf, other]

    cs.RO

    Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles

    Authors: Josiah P. Hanna, Arrasy Rahman, Elliot Fosong, Francisco Eiras, Mihai Dobre, John Redford, Subramanian Ramamoorthy, Stefano V. Albrecht

    Abstract: Recognising the goals or intentions of observed vehicles is a key step towards predicting the long-term future behaviour of other agents in an autonomous driving scenario. When there are unseen obstacles or occluded vehicles in a scenario, goal recognition may be confounded by the effects of these unseen entities on the behaviour of observed vehicles. Existing prediction algorithms that assume rat…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)

  36. arXiv:2108.01137  [pdf, other]

    physics.flu-dyn

    Selective energy and enstrophy modification of two-dimensional decaying turbulence

    Authors: Aditya G. Nair, James Hanna, Matteo Aureli

    Abstract: In two-dimensional decaying homogeneous isotropic turbulence, kinetic energy and enstrophy are respectively transferred to larger and smaller scales. In such spatiotemporally complex dynamics, it is challenging to identify the important flow structures that govern this behavior. We propose and numerically employ two flow modification strategies that leverage the inviscid global conservation of ene…

    Submitted 6 December, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 15 pages, 11 figures

    MSC Class: 76F05

  37. arXiv:2107.10894  [pdf, other]

    cs.CV eess.IV

    Power Plant Classification from Remote Imaging with Deep Learning

    Authors: Michael Mommert, Linus Scheibenreif, Joëlle Hanna, Damian Borth

    Abstract: Satellite remote imaging enables the detailed study of land use patterns on a global scale. We investigate the possibility to improve the information content of traditional land use classification by identifying the nature of industrial sites from medium-resolution remote sensing images. In this work, we focus on classifying different types of power plants from Sentinel-2 imaging data. Using a Res…

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Presented at the 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

  38. arXiv:2107.08966  [pdf, other]

    cs.LG cs.AI

    Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration

    Authors: Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht

    Abstract: Intrinsic rewards can improve exploration in reinforcement learning, but the exploration process may suffer from instability caused by non-stationary reward shaping and strong dependency on hyperparameters. In this work, we introduce Decoupled RL (DeRL) as a general framework which trains separate policies for intrinsically-motivated exploration and exploitation. Such decoupling allows DeRL to lev…

    Submitted 9 February, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Published at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) 2022

  39. Quadratic-stretch elasticity

    Authors: E. Vitral, J. A. Hanna

    Abstract: A nonlinear small-strain elastic theory is constructed from a systematic expansion in Biot strains, truncated at quadratic order. The primary motivation is the desire for a clean separation between stretching and bending energies for shells, which appears to arise only from reduction of a bulk energy of this type. An approximation of isotropic invariants, bypassing the solution of a quartic equati…

    Submitted 8 July, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Minor edits

  40. arXiv:2104.09649  [pdf]

    cond-mat.mtrl-sci

    Insight into the partitioning and clustering mechanism of rare-earth cations in alkali aluminoborosilicate glasses

    Authors: Hrishikesh Kamat, Fu Wang, Kristian Barnsley, John V. Hanna, Alexei M. Tyryshkin, Ashutosh Goel

    Abstract: Rare-earth (RE) containing alkali aluminoborosilicate glasses find increasingly broad technological applications, with their further development only impeded by yet-poor understanding of coordination environment and structural role of RE ions in glasses. In this work we combine free induction decay (FID)-detected electron paramagnetic resonance (EPR), electron spin echo envelope modulation (ESEEM)…

    Submitted 19 April, 2021; originally announced April 2021.

  41. arXiv:2103.15173  [pdf, ps, other]

    physics.class-ph nlin.SI

    Exterior dissipation, proportional decay, and integrals of motion

    Authors: M. Aureli, J. A. Hanna

    Abstract: Given a dynamical system with $m$ independent conserved quantities, we construct a multi-parameter family of new systems in which these quantities evolve monotonically and proportionally, and are replaced by $m-1$ conserved linear combinations of themselves, with any of the original quantities as limiting cases. The modification of the dynamics employs an exterior product of gradients of the origi…

    Submitted 3 August, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: minor changes

    Journal ref: Phys. Rev. Lett. 127, 134101 (2021)

  42. Cutting holes in bistable folds

    Authors: T. Yu, I. Andrade-Silva, M. A. Dias, J. A. Hanna

    Abstract: A folded disk is bistable, as it can be popped through to an inverted state with elastic energy localized in a small, highly-deformed region on the fold. Cutting out this singularity relaxes the surrounding material and leads to a loss of bistability when the hole dimensions reach a critical size. These dimensions are strongly anisotropic and feature a surprising re-entrant behavior, such that rem…

    Submitted 24 May, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: whither the singularity?

  43. arXiv:2008.06738  [pdf, other]

    cs.LG cs.AI stat.ML

    Reducing Sampling Error in Batch Temporal Difference Learning

    Authors: Brahma Pavse, Ishan Durugkar, Josiah Hanna, Peter Stone

    Abstract: Temporal difference (TD) learning is one of the main foundations of modern reinforcement learning. This paper studies the use of TD(0), a canonical TD algorithm, to estimate the value function of a given policy from a batch of data. In this batch setting, we show that TD(0) may converge to an inaccurate value function because the update following an action is weighted according to the number of ti…

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2020

  44. arXiv:2008.06166  [pdf, other]

    cond-mat.soft

    Anisotropic swelling of anisotropic elastic panels

    Authors: H. G. Wood, J. A. Hanna

    Abstract: While isotropic in-plane swelling problems for thin elastic sheets have been studied extensively in recent years, many shape-programmable materials, including nematic solids and 3D-printed structures, are anisotropic, as are most industrial sheet materials. In this theoretical work, we consider central swelling and shrinkage of plates of aspect ratio and material properties relevant to the manufac…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: uzwellinga

  45. arXiv:2008.01594  [pdf, other]

    cs.AI cs.LG

    An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

    Authors: Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone

    Abstract: We examine the problem of transferring a policy learned in a source environment to a target environment with different dynamics, particularly in the case where it is critical to reduce the amount of interaction with the target environment during learning. This problem is particularly important in sim-to-real transfer because simulators inevitably model real-world dynamics imperfectly. In this pape…

    Submitted 16 November, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Journal ref: Neural Information Processing Systems (NeurIPS 2020)

  46. arXiv:2008.01281  [pdf, other]

    cs.RO

    Stochastic Grounded Action Transformation for Robot Learning in Simulation

    Authors: Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, Peter Stone

    Abstract: Robot control policies learned in simulation do not often transfer well to the real world. Many existing solutions to this sim-to-real problem, such as the Grounded Action Transformation (GAT) algorithm, seek to correct for or ground these differences by matching the simulator to the real world. However, the efficacy of these approaches is limited if they do not explicitly account for stochasticit…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted at 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020

  47. arXiv:2008.01279  [pdf, other]

    cs.RO

    Reinforced Grounded Action Transformation for Sim-to-Real Transfer

    Authors: Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, Peter Stone

    Abstract: Robots can learn to do complex tasks in simulation, but often, learned behaviors fail to transfer well to the real world due to simulator imperfections (the reality gap). Some existing solutions to this sim-to-real problem, such as Grounded Action Transformation (GAT), use a small amount of real-world experience to minimize the reality gap by grounding the simulator. While very effective in certai…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted at International Conference on Intelligent Robots and Systems (IROS) 2020

  48. arXiv:2007.09327  [pdf, ps, other]

    cs.CR cs.LG cs.MA

    Towards Quantum-Secure Authentication and Key Agreement via Abstract Multi-Agent Interaction

    Authors: Ibrahim H. Ahmed, Josiah P. Hanna, Elliot Fosong, Stefano V. Albrecht

    Abstract: Current methods for authentication and key agreement based on public-key cryptography are vulnerable to quantum computing. We propose a novel approach based on artificial intelligence research in which communicating parties are viewed as autonomous agents which interact repeatedly using their private decision models. Authentication and key agreement are decided based on the agents' observed behavi…

    Submitted 9 July, 2021; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: Published at the 19th International Conference on Practical Applications of Agents and Multi-Agent Systems (PAAMS 2021)

  49. arXiv:2007.06023  [pdf, other]

    physics.class-ph cond-mat.soft math-ph physics.flu-dyn

    Pseudomomentum: origins and consequences

    Authors: H. Singh, J. A. Hanna

    Abstract: The balance of pseudomomentum is discussed and applied to simple elasticity, ideal fluids, and the mechanics of inextensible rods and sheets. A general framework is presented in which the simultaneous variation of an action with respect to position, time, and material labels yields bulk balance laws and jump conditions for momentum, energy, and pseudomomentum. The example of simple elasticity of s…

    Submitted 11 August, 2022; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: incorporates correction (DOIs are 10.1007/s00033-021-01507-9 and 10.1007/s00033-022-01819-4)

    Journal ref: Z. Angew. Math. Phys. 72, 122 (2021) and Z. Angew. Math. Phys. 73, 189 (2022)

  50. arXiv:2007.00707  [pdf, other]

    physics.class-ph nlin.SI

    An integrable family of torqued, damped, rigid rotors

    Authors: J. A. Hanna

    Abstract: Expositions of the Euler equations for the rotation of a rigid body often invoke the idea of a specially damped system whose energy dissipates while its angular momentum magnitude is conserved in the body frame. An attempt to explicitly construct such a damping function leads to a more general, but still integrable, system of cubic equations whose trajectories are confined to nested sets of quadri…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: staying amused during quarantine