-
Sequential Maximal Updated Density Parameter Estimation for Dynamical Systems with Parameter Drift
Authors:
Carlos del-Castillo-Negrete,
Rylan Spence,
Troy Butler,
Clint Dawson
Abstract:
We present a novel method for generating sequential parameter estimates and quantifying epistemic uncertainty in dynamical systems within a data-consistent (DC) framework. The DC framework differs from traditional Bayesian approaches due to the incorporation of the push-forward of an initial density, which performs selective regularization in parameter directions not informed by the data in the resulting updated density. This extends a previous study that included the linear Gaussian theory within the DC framework and introduced the maximal updated density (MUD) estimate as an alternative to both least squares and maximum a posteriori (MAP) estimates. In this work, we introduce algorithms for operational settings of MUD estimation in real or near-real time where spatio-temporal datasets arrive in packets to provide updated estimates of parameters and identify potential parameter drift. Computational diagnostics within the DC framework prove critical for evaluating (1) the quality of the DC update and MUD estimate and (2) the detection of parameter value drift. The algorithms are applied to estimate (1) wind drag parameters in a high-fidelity storm surge model, (2) thermal diffusivity field for a heat conductivity problem, and (3) changing infection and incubation rates of an epidemiological model.
Submitted 14 May, 2024;
originally announced May 2024.
-
RADIUM: Predicting and Repairing End-to-End Robot Failures using Gradient-Accelerated Sampling
Authors:
Charles Dawson,
Anjali Parashar,
Chuchu Fan
Abstract:
Before autonomous systems can be deployed in safety-critical applications, we must be able to understand and verify the safety of these systems. For cases where the risk or cost of real-world testing is prohibitive, we propose a simulation-based framework for a) predicting ways in which an autonomous system is likely to fail and b) automatically adjusting the system's design and control policy to preemptively mitigate those failures. Existing tools for failure prediction struggle to search over high-dimensional environmental parameters, cannot efficiently handle end-to-end testing for systems with vision in the loop, and provide little guidance on how to mitigate failures once they are discovered. We approach this problem through the lens of approximate Bayesian inference and use differentiable simulation and rendering for efficient failure case prediction and repair. For cases where a differentiable simulator is not available, we provide a gradient-free version of our algorithm, and we include a theoretical and empirical evaluation of the trade-offs between gradient-based and gradient-free methods. We apply our approach on a range of robotics and control problems, including optimizing search patterns for robot swarms, UAV formation control, and robust network control. Compared to optimization-based falsification methods, our method predicts a more diverse, representative set of failure modes, and we find that our use of differentiable simulation yields solutions that have up to 10x lower cost and requires up to 2x fewer iterations to converge relative to gradient-free techniques. In hardware experiments, we find that repairing control policies using our method leads to a 5x robustness improvement. Accompanying code and video can be found at https://mit-realm.github.io/radium/
Submitted 4 April, 2024;
originally announced April 2024.
-
Storm Surge Modeling in the AI ERA: Using LSTM-based Machine Learning for Enhancing Forecasting Accuracy
Authors:
Stefanos Giaremis,
Noujoud Nader,
Clint Dawson,
Hartmut Kaiser,
Carola Kaiser,
Efstratios Nikidis
Abstract:
Physics simulation results of natural processes usually do not fully capture the real world. This is caused, for instance, by limits in what physical processes are simulated and to what accuracy. In this work we propose and analyze the use of an LSTM-based deep learning network as a machine learning (ML) architecture for capturing and predicting the behavior of the systemic error for storm surge forecast models with respect to real-world water height observations from gauge stations during hurricane events. The overall goal of this work is to predict the systemic error of the physics model and use it to improve the accuracy of the simulation results post factum. We trained our proposed ML model on a dataset of 61 historical storms in the coastal regions of the U.S. and we tested its performance in bias correcting modeled water level data predictions from Hurricane Ian (2022). We show that our model can consistently improve the forecasting accuracy for Hurricane Ian -- unknown to the ML model -- at all gauge station coordinates used for the initial data. Moreover, by examining the impact of using different subsets of the initial training dataset, containing a number of relatively similar or different hurricanes in terms of hurricane track, we found that we can obtain similar quality of bias correction by only using a subset of six hurricanes. This is an important result that implies the possibility to apply a pre-trained ML model to real-time hurricane forecasting results with the goal of bias correcting and improving the produced simulation accuracy. The presented work is an important first step in creating a bias correction system for real-time storm surge forecasting applicable to the full simulation area. It also presents a highly transferable and operationally applicable methodology for improving the accuracy in a wide range of physics simulation scenarios beyond storm surge forecasting.
Submitted 7 March, 2024;
originally announced March 2024.
-
Dichotomous Dynamics of Magnetic Monopole Fluids
Authors:
Chun-Chih Hsu,
Hiroto Takahashi,
Fabian Jerzembeck,
Jahnatta Dasini,
Chaia Carroll,
Ritika Dusad,
Jonathan Ward,
Catherine Dawson,
Sudarshan Sharma,
Graeme Luke,
Stephen J. Blundell,
Claudio Castelnovo,
Jonathan N. Hallén,
Roderich Moessner,
J. C. Séamus Davis
Abstract:
A recent advance in the study of emergent magnetic monopoles was the discovery that monopole motion is restricted to dynamical fractal trajectories (J. Hallén et al., Science 378, 1218 (2022)) thus explaining the characteristics of magnetic monopole noise spectra (Dusad, R. et al. Nature 571, 234 (2019); Samarakoon, A. M. et al. Proc. Natl. Acad. Sci. 119, e2117453119 (2022)). Here we apply this new theory to explore the dynamics of field-driven monopole currents, finding them comprised of two quite distinct transport processes: initially swift fractal rearrangements of local monopole configurations followed by conventional monopole diffusion. This theory also predicts a characteristic frequency dependence of the dissipative loss-angle for AC-field-driven currents. To explore these novel perspectives on monopole transport, we introduce simultaneous monopole current control and measurement techniques using SQUID-based monopole current sensors. For the canonical material Dy2Ti2O7, we measure $Φ(t)$, the time-dependence of magnetic flux threading the sample when a net monopole current $J(t) = \dot{Φ}(t)/μ_0$ is generated by applying an external magnetic field $B_0(t)$. These experiments find a sharp dichotomy of monopole currents, separated by their distinct relaxation time-constants before and after $t \approx 600 μs$ from monopole current initiation. Application of sinusoidal magnetic fields $B_0(t) = B\cos(ωt)$ generates oscillating monopole currents whose loss angle $θ(f)$ exhibits a characteristic transition at frequency $f \approx 1.8$ kHz over the same temperature range. Finally, the magnetic noise power is also dichotomic, diminishing sharply after $t \approx 600 μs$. This complex phenomenology represents a new form of heterogeneous dynamics generated by the interplay of fractionalization and local spin configurational symmetry.
Submitted 9 April, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning
Authors:
Allen M. Wang,
Oswin So,
Charles Dawson,
Darren T. Garnier,
Cristina Rea,
Chuchu Fan
Abstract:
The tokamak offers a promising path to fusion energy, but plasma disruptions pose a major economic risk, motivating considerable advances in disruption avoidance. This work develops a reinforcement learning approach to this problem by training a policy to safely ramp down the plasma current while avoiding limits on a number of quantities correlated with disruptions. The policy training environment is a hybrid physics and machine learning model trained on simulations of the SPARC primary reference discharge (PRD) ramp-down, an upcoming burning plasma scenario which we use as a testbed. To address physics uncertainty and model inaccuracies, the simulation environment is massively parallelized on GPU with randomized physics parameters during policy training. The trained policy is then transferred to a higher fidelity simulator, where it successfully ramps down the plasma while avoiding user-specified disruptive limits. We also address the crucial issue of safety criticality by demonstrating that a constraint-conditioned policy can be used as a trajectory design assistant to design a library of feed-forward trajectories to handle different physics conditions and user settings. As a library of trajectories is more interpretable and verifiable offline, we argue such an approach is a promising path for leveraging the capabilities of reinforcement learning in the safety-critical context of burning plasma tokamaks. Finally, we demonstrate how the training environment can be a useful platform for other feed-forward optimization approaches by using an evolutionary algorithm to perform optimization of feed-forward trajectories that are robust to physics uncertainty.
Submitted 14 February, 2024;
originally announced February 2024.
-
Learning Safe Control for Multi-Robot Systems: Methods, Verification, and Open Challenges
Authors:
Kunal Garg,
Songyuan Zhang,
Oswin So,
Charles Dawson,
Chuchu Fan
Abstract:
In this survey, we review the recent advances in control design methods for robotic multi-agent systems (MAS), focussing on learning-based methods with safety considerations. We start by reviewing various notions of safety and liveness properties, and modeling frameworks used for problem formulation of MAS. Then we provide a comprehensive review of learning-based methods for safe control design for multi-robot systems. We start with various types of shielding-based methods, such as safety certificates, predictive filters, and reachability tools. Then, we review the current state of control barrier certificate learning in both a centralized and distributed manner, followed by a comprehensive review of multi-agent reinforcement learning with a particular focus on safety. Next, we discuss the state-of-the-art verification tools for the correctness of learning-based methods. Based on the capabilities and the limitations of the state of the art methods in learning and verification for MAS, we identify various broad themes for open challenges: how to design methods that can achieve good performance along with safety guarantees; how to decompose single-agent based centralized methods for MAS; how to account for communication-related practical issues; and how to assess transfer of theoretical guarantees to practice.
Submitted 22 November, 2023;
originally announced November 2023.
-
Adversarial optimization leads to over-optimistic security-constrained dispatch, but sampling can help
Authors:
Charles Dawson,
Chuchu Fan
Abstract:
To ensure safe, reliable operation of the electrical grid, we must be able to predict and mitigate likely failures. This need motivates the classic security-constrained AC optimal power flow (SCOPF) problem. SCOPF is commonly solved using adversarial optimization, where the dispatcher and an adversary take turns optimizing a robust dispatch and adversarial attack, respectively. We show that adversarial optimization is liable to severely overestimate the robustness of the optimized dispatch (when the adversary encounters a local minimum), leading the operator to falsely believe that their dispatch is secure.
To prevent this overconfidence, we develop a novel adversarial sampling approach that prioritizes diversity in the predicted attacks. We find that our method not only substantially improves the robustness of the optimized dispatch but also avoids overconfidence, accurately characterizing the likelihood of voltage collapse under a given threat model. We demonstrate a proof-of-concept on small-scale transmission systems with 14 and 57 nodes.
Submitted 10 October, 2023;
originally announced October 2023.
-
Efficacy of reduced order source terms for a coupled wave-circulation model in the Gulf of Mexico
Authors:
Mark Loveland,
Jessica Meixner,
Eirik Valseth,
Clint Dawson
Abstract:
During hurricanes, coupled wave-circulation models are critical tools for public safety. The standard approach is to use a high fidelity circulation model coupled with a wave model which uses the most advanced source terms. As a result, the models can be highly computationally expensive, so this study investigates the potential consequences of using highly simplified (reduced order) source terms within the wave model component of the coupled wave-circulation model. The trade-off between run time and accuracy with respect to observations is quantified for a set of two storms that impacted the Gulf of Mexico, Hurricane Ike and Hurricane Ida. Water surface elevations as well as wave statistics (significant wave height, peak period, and mean wave direction) are compared to observations. The usage of the reduced order source terms yielded significant savings in computational cost. Additionally, the simulations with reduced order source terms showed relatively low amounts of additional error with respect to observations. However, large changes in global model outputs of the wave statistics were observed based on the choice of source terms, particularly near the track of each hurricane.
Submitted 16 May, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Evaluating Adversarial Robustness with Expected Viable Performance
Authors:
Ryan McCoppin,
Colin Dawson,
Sean M. Kennedy,
Leslie M. Blaha
Abstract:
We introduce a metric for evaluating the robustness of a classifier, with particular attention to adversarial perturbations, in terms of expected functionality with respect to possible adversarial perturbations. A classifier is assumed to be non-functional (that is, has a functionality of zero) with respect to a perturbation bound if a conventional measure of performance, such as classification accuracy, is less than a minimally viable threshold when the classifier is tested on examples from that perturbation bound. Defining robustness in terms of an expected value is motivated by a domain general approach to robustness quantification.
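The thresholded, expected-value idea described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' definition: the function name, the weighting over perturbation bounds, and the choice that functionality equals accuracy at or above the threshold are all assumptions made here for illustration. Only the rule that functionality is zero below the minimally viable threshold comes from the abstract.

```python
from typing import Sequence


def expected_viable_performance(
    accuracies: Sequence[float],
    weights: Sequence[float],
    viability_threshold: float,
) -> float:
    """Weighted mean of thresholded accuracy over perturbation bounds."""
    if len(accuracies) != len(weights):
        raise ValueError("accuracies and weights must have the same length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")

    expected = 0.0
    for accuracy, weight in zip(accuracies, weights):
        # Below the minimally viable threshold the classifier counts as
        # non-functional, i.e. its functionality is zero (from the abstract).
        # Assumption: at or above the threshold, functionality equals accuracy.
        functionality = accuracy if accuracy >= viability_threshold else 0.0
        expected += weight * functionality
    return expected / total_weight


# Hypothetical accuracies measured at three increasing perturbation bounds,
# weighted equally, with a viability threshold of 0.5.
score = expected_viable_performance([0.9, 0.8, 0.4], [1.0, 1.0, 1.0], 0.5)
```

In this made-up example the third bound falls below the threshold, so it contributes zero rather than 0.4 to the average.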
Submitted 18 September, 2023;
originally announced September 2023.
-
A Bayesian approach to breaking things: efficiently predicting and repairing failure modes via sampling
Authors:
Charles Dawson,
Chuchu Fan
Abstract:
Before autonomous systems can be deployed in safety-critical applications, we must be able to understand and verify the safety of these systems. For cases where the risk or cost of real-world testing is prohibitive, we propose a simulation-based framework for a) predicting ways in which an autonomous system is likely to fail and b) automatically adjusting the system's design to preemptively mitigate those failures. We frame this problem through the lens of approximate Bayesian inference and use differentiable simulation for efficient failure case prediction and repair. We apply our approach on a range of robotics and control problems, including optimizing search patterns for robot swarms and reducing the severity of outages in power transmission networks. Compared to optimization-based falsification techniques, our method predicts a more diverse, representative set of failure modes, and we also find that our use of differentiable simulation yields solutions that have up to 10x lower cost and requires up to 2x fewer iterations to converge relative to gradient-free techniques. Code and videos can be found at https://mit-realm.github.io/breaking-things/
Submitted 14 September, 2023;
originally announced September 2023.
-
Fault-tolerant complexes
Authors:
Hector Bombin,
Chris Dawson,
Terry Farrelly,
Yehua Liu,
Naomi Nickerson,
Mihir Pant,
Fernando Pastawski,
Sam Roberts
Abstract:
Fault-tolerant complexes describe surface-code fault-tolerant protocols from a single geometric object. We first introduce fusion complexes that define a general family of fusion-based quantum computing (FBQC) fault-tolerant quantum protocols based on surface codes. We show that any 3-dimensional cell complex where each edge has four incident faces gives a valid fusion complex. This construction enables an automated search for fault tolerance schemes, allowing us to identify 627 examples within a moderate search time. We implement this using the open-source software tool Gavrog and present threshold results for a variety of schemes, finding fusion networks with higher erasure and Pauli thresholds than those existing in the literature. We then define more general structures we call fault-tolerant complexes that provide a homological description of fault tolerance from a large family of low-level error models, which include circuit-based computation, floquet-based computation, and FBQC with multi-qubit measurements. This extends the applicability of homological descriptions of fault tolerance, and enables the generation of many new schemes which have not been previously identified. We also define families of fault-tolerant complexes for color codes and 3d single-shot subsystem codes, which enables similar constructive methods, and we present several new examples of each.
Submitted 15 August, 2023;
originally announced August 2023.
-
WAVEx: Stabilized Finite Elements for Spectral Wind Wave Models Using FEniCSx
Authors:
Mark Loveland,
Eirik Valseth,
Jessica Meixner,
Clint Dawson
Abstract:
The prediction of the wind wave spectrum of the ocean using numerical models is an important tool for researchers, engineers, and communities living in coastal areas. The governing equation of the wind wave models, the Wave Action Balance Equation, presents unique challenges for implementing reliable numerical models because it is highly advective, highly nonlinear and high dimensional. Historically, most operational models have utilized finite difference methods and others have used finite volume methods, but there have been relatively few attempts at using finite element methods. In this work, we seek to fill this gap by investigating several different finite element discretizations of the Wave Action Balance Equation. The methods, which include streamline upwind Petrov-Galerkin (SUPG), least squares, and discontinuous Galerkin, are implemented and convergence properties are examined for some simplified 2-D test cases. Then, a new spectral wind wave model, WAVEx, is formulated and implemented for the full problem setting. WAVEx uses continuous finite elements along with SUPG stabilization in geographic/spectral space that allows for fully unstructured triangular meshes in both geographic and spectral space. For propagation in time, a second order fully implicit finite difference method is used. When source terms are active, a second order operator splitting scheme is used to linearize the problem. In the splitting scheme, propagation is solved using the implicit method and the nonlinear source terms are treated explicitly. Several test cases, including analytic tests and laboratory experiments, are demonstrated and results are compared to analytic solutions, observations, as well as output from another model that is used operationally.
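The splitting idea in this abstract (implicit propagation, explicit source terms, second order overall) can be sketched on a scalar toy problem. The following Python example is an illustration only and does not reproduce WAVEx: it assumes Strang splitting, a trapezoidal (Crank-Nicolson) implicit step for a linear decay term standing in for propagation, and Heun's method for the source term. All function names and the test problem are hypothetical.

```python
import math


def heun_step(u, dt, rhs):
    """Explicit second-order (Heun) step for du/dt = rhs(u)."""
    predictor = u + dt * rhs(u)
    return u + 0.5 * dt * (rhs(u) + rhs(predictor))


def strang_step(u, dt, decay_rate, source):
    """One Strang-split step for du/dt = -decay_rate * u + source(u)."""
    # Half step of the source term, treated explicitly.
    u = heun_step(u, 0.5 * dt, source)
    # Full step of the linear part with the implicit trapezoidal rule,
    # solved in closed form because the toy problem is scalar.
    u = u * (1.0 - 0.5 * dt * decay_rate) / (1.0 + 0.5 * dt * decay_rate)
    # Second half step of the source term.
    return heun_step(u, 0.5 * dt, source)


def integrate(u0, t_end, n_steps, decay_rate, source):
    """Advance the toy problem from t = 0 to t_end with n_steps equal steps."""
    u, dt = u0, t_end / n_steps
    for _ in range(n_steps):
        u = strang_step(u, dt, decay_rate, source)
    return u


# Toy problem du/dt = -u + 0.5 with u(0) = 1, compared to its exact solution.
approx = integrate(1.0, 1.0, 100, decay_rate=1.0, source=lambda u: 0.5)
exact = 0.5 + (1.0 - 0.5) * math.exp(-1.0)
error = abs(approx - exact)
```

Halving the step size should reduce `error` by roughly a factor of four, which is the behavior expected of a second-order scheme.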
Submitted 6 November, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Rapid Flood Inundation Forecast Using Fourier Neural Operator
Authors:
Alexander Y. Sun,
Zhi Li,
Wonhyun Lee,
Qixing Huang,
Bridget R. Scanlon,
Clint Dawson
Abstract:
Flood inundation forecast provides critical information for emergency planning before and during flood events. Real time flood inundation forecast tools are still lacking. High-resolution hydrodynamic modeling has become more accessible in recent years; however, predicting flood extents at the street and building levels in real-time is still computationally demanding. Here we present a hybrid process-based and data-driven machine learning (ML) approach for flood extent and inundation depth prediction. We used the Fourier neural operator (FNO), a highly efficient ML method, for surrogate modeling. The FNO model is demonstrated over an urban area in Houston (Texas, U.S.) by training using simulated water depths (in 15-min intervals) from six historical storm events and then tested over two holdout events. Results show FNO outperforms the baseline U-Net model. It maintains high predictability at all lead times tested (up to 3 hrs) and performs well when applied to new sites, suggesting strong generalization skill.
Submitted 29 July, 2023;
originally announced July 2023.
-
A Discontinuous Galerkin Finite Element Model for Compound Flood Simulations
Authors:
Chayanon Wichitrnithed,
Eirik Valseth,
Ethan J. Kubatko,
Younghun Kang,
Mackenzie Hudson,
Clint Dawson
Abstract:
Recent tropical cyclones, e.g., Hurricane Harvey (2017), have led to significant rainfall and resulting runoff with accompanying flooding. When the runoff interacts with storm surge, the resulting floods can be greatly amplified and lead to effects that cannot be modeled by simple superposition of their distinct sources. In an effort to develop accurate numerical simulations of runoff, surge, and compounding floods, we develop a local discontinuous Galerkin method for modified shallow water equations. In this modification, nonzero sources to the continuity equation are included to incorporate rainfall into the model using parametric rainfall models from literature as well as hindcast data. The discontinuous Galerkin spatial discretization is accompanied by a strong stability preserving explicit Runge-Kutta time integrator. Hence, temporal stability is ensured through the CFL condition and we exploit the embarrassingly parallel nature of the developed method using MPI parallelization. We demonstrate the capabilities of the developed method through a sequence of physically relevant numerical tests, including small scale test cases based on laboratory measurements and large scale experiments with Hurricane Harvey in the Gulf of Mexico. The results highlight the conservation properties and robustness of the developed method and show the potential of compound flood modeling using our approach.
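As a rough illustration of how a CFL condition limits the step size of an explicit time integrator for shallow water flow, the Python sketch below computes a stable time step on a 1-D grid. It is an assumption-laden example, not the paper's implementation: the function name, the default CFL number, and the commonly used heuristic of dividing by 2p + 1 for polynomial degree p are chosen here for illustration.

```python
import math

GRAVITY = 9.81  # gravitational acceleration in m/s^2


def cfl_time_step(depths, velocities, cell_width, cfl_number=0.5, poly_degree=1):
    """Largest explicit time step allowed by a CFL-type bound (1-D sketch)."""
    # Fastest characteristic speed of the shallow water equations:
    # |u| + sqrt(g * h), taken over all cells.
    max_speed = max(
        abs(u) + math.sqrt(GRAVITY * h) for h, u in zip(depths, velocities)
    )
    # Assumption: the common DG heuristic of tightening the bound by
    # a factor 1 / (2p + 1) for polynomial degree p.
    return cfl_number * cell_width / ((2 * poly_degree + 1) * max_speed)


# Hypothetical two-cell example: depths in metres, velocities in m/s,
# and a cell width of 100 m.
dt = cfl_time_step(depths=[2.0, 4.0], velocities=[0.5, -1.0], cell_width=100.0)
```

The deeper, faster cell controls the result, so refining the mesh or simulating deeper water forces a proportionally smaller time step.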
Submitted 17 December, 2023; v1 submitted 26 July, 2023;
originally announced July 2023.
-
AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers
Authors:
Yongchao Chen,
Jacob Arkin,
Charles Dawson,
Yang Zhang,
Nicholas Roy,
Chuchu Fan
Abstract:
For effective human-robot interaction, robots need to understand, plan, and execute complex, long-horizon tasks described by natural language. Recent advances in large language models (LLMs) have shown promise for translating natural language into robot action sequences for complex tasks. However, existing approaches either translate the natural language directly into robot trajectories or factor the inference process by decomposing language into task sub-goals and relying on a motion planner to execute each sub-goal. When complex environmental and temporal constraints are involved, inference over planning tasks must be performed jointly with motion plans using traditional task-and-motion planning (TAMP) algorithms, making factorization into subgoals untenable. Rather than using LLMs to directly plan task sub-goals, we instead perform few-shot translation from natural language task descriptions to an intermediate task representation that can then be consumed by a TAMP algorithm to jointly solve the task and motion plan. To improve translation, we automatically detect and correct both syntactic and semantic errors via autoregressive re-prompting, resulting in significant improvements in task completion. We show that our approach outperforms several methods using LLMs as planners in complex task domains. See our project website https://yongchao98.github.io/MIT-REALM-AutoTAMP/ for prompts, videos, and code.
Submitted 21 March, 2024; v1 submitted 10 June, 2023;
originally announced June 2023.
-
Increasing error tolerance in quantum computers with dynamic bias arrangement
Authors:
Hector Bombín,
Chris Dawson,
Naomi Nickerson,
Mihir Pant,
Jordan Sullivan
Abstract:
Many quantum operations are expected to exhibit bias in the structure of their errors. Recent works have shown that a fixed bias can be exploited to improve error tolerance by statically arranging the errors in beneficial configurations. In some cases an error bias can be dynamically reconfigurable, an example being linear optical fusion where the basis of a fusion failure can be chosen before the measurement is made. Here we introduce methods for increasing error tolerance in this setting by using classical decision-making to adaptively choose the bias in measurements as a fault tolerance protocol proceeds. We study this technique in the setting of linear optical fusion based quantum computing (FBQC). We provide examples demonstrating that by dynamically arranging erasures, the loss tolerance can be tripled when compared to a static arrangement of biased errors while using the same quantum resources: we show that for the best FBQC architecture of Bartolucci et al. (2023) the threshold increases from $2.7\%$ to $7.5\%$ per photon with the same resource state by using dynamic biasing. Our method does not require any specific code structure beyond having a syndrome graph representation. We have chosen to illustrate these techniques using an architecture which is otherwise identical to that in Bartolucci et al. (2023), but deployed together with other techniques, such as different fusion networks, higher loss thresholds are possible.
Submitted 28 March, 2023;
originally announced March 2023.
-
Modular decoding: parallelizable real-time decoding for quantum computers
Authors:
Héctor Bombín,
Chris Dawson,
Ye-Hua Liu,
Naomi Nickerson,
Fernando Pastawski,
Sam Roberts
Abstract:
Universal fault-tolerant quantum computation will require real-time decoding algorithms capable of quickly extracting logical outcomes from the stream of data generated by noisy quantum hardware. We propose modular decoding, an approach capable of addressing this challenge with minimal additional communication and without sacrificing decoding accuracy. We introduce the edge-vertex decomposition, a concrete instance of modular decoding for lattice-surgery style fault-tolerant blocks which is remarkably effective. This decomposition of the global decoding problem into sub-tasks mirrors the logical-block-network structure of a fault-tolerant quantum circuit. We identify the buffering condition as a key requirement controlling decoder quality; it demands a sufficiently large separation (buffer) between a correction committed by a decoding sub-task and the data unavailable to it. We prove that the fault distance of the protocol is preserved if the buffering condition is satisfied. Finally, we implement edge-vertex modular decoding and apply it on a variety of quantum circuits, including the Clifford component of the 15-to-1 magic-state distillation protocol. Monte Carlo simulations on a range of buffer sizes provide quantitative evidence that buffers are both necessary and sufficient to guarantee decoder accuracy. Our results show that modular decoding meets all the practical requirements necessary to support real-world fault-tolerant quantum computers.
Submitted 8 March, 2023;
originally announced March 2023.
-
Shield Model Predictive Path Integral: A Computationally Efficient Robust MPC Approach Using Control Barrier Functions
Authors:
Ji Yin,
Charles Dawson,
Chuchu Fan,
Panagiotis Tsiotras
Abstract:
Model Predictive Path Integral (MPPI) control is a type of sampling-based model predictive control that simulates thousands of trajectories and uses these trajectories to synthesize optimal controls on-the-fly. In practice, however, MPPI encounters problems limiting its application. For instance, it has been observed that MPPI tends to make poor decisions if unmodeled dynamics or environmental disturbances exist, preventing its use in safety-critical applications. Moreover, the multi-threaded simulations used by MPPI require significant onboard computational resources, making the algorithm inaccessible to robots without modern GPUs. To alleviate these issues, we propose a novel (Shield-MPPI) algorithm that provides robustness against unpredicted disturbances and achieves real-time planning using a much smaller number of parallel simulations on regular CPUs. The novel Shield-MPPI algorithm is tested on an aggressive autonomous racing platform both in simulation and using experiments. The results show that the proposed controller greatly reduces the number of constraint violations compared to state-of-the-art robust MPPI variants and stochastic MPC methods.
Submitted 22 February, 2023;
originally announced February 2023.
-
Chance-Constrained Trajectory Optimization for High-DOF Robots in Uncertain Environments
Authors:
Charles Dawson,
Ashkan Jasour,
Andreas Hofmann,
Brian Williams
Abstract:
Many practical applications of robotics require systems that can operate safely despite uncertainty. In the context of motion planning, two types of uncertainty are particularly important when planning safe robot trajectories. The first is environmental uncertainty -- uncertainty in the locations of nearby obstacles, stemming from sensor noise or (in the case of obstacles' future locations) prediction error. The second class of uncertainty is uncertainty in the robot's own state, typically caused by tracking or estimation error. To achieve high levels of safety, it is necessary for robots to consider both of these sources of uncertainty. In this paper, we propose a risk-bounded trajectory optimization algorithm, known as Sequential Convex Optimization with Risk Optimization (SCORA), to solve chance-constrained motion planning problems despite both environmental uncertainty and tracking error. Through experiments in simulation, we demonstrate that SCORA significantly outperforms state-of-the-art risk-aware motion planners both in planning time and in the safety of the resulting trajectories.
Submitted 31 January, 2023;
originally announced February 2023.
-
Parameter Estimation with Maximal Updated Densities
Authors:
Michael Pilosov,
Carlos del-Castillo-Negrete,
Tian Yu Yen,
Troy Butler,
Clint Dawson
Abstract:
A recently developed measure-theoretic framework solves a stochastic inverse problem (SIP) for models where uncertainties in model output data are predominantly due to aleatoric (i.e., irreducible) uncertainties in model inputs (i.e., parameters). The subsequent inferential target is a distribution on parameters. Another type of inverse problem is to quantify uncertainties in estimates of "true" parameter values under the assumption that such uncertainties should be reduced as more data are incorporated into the problem, i.e., the uncertainty is considered epistemic. A major contribution of this work is the formulation and solution of such a parameter identification problem (PIP) within the measure-theoretic framework developed for the SIP. The approach is novel in that it utilizes a solution to a stochastic forward problem (SFP) to update an initial density only in the parameter directions informed by the model output data. In other words, this method performs "selective regularization" only in the parameter directions not informed by data. The solution is defined by a maximal updated density (MUD) point where the updated density defines the measure-theoretic solution to the PIP. Another significant contribution of this work is the full theory of existence and uniqueness of MUD points for linear maps with Gaussian distributions. Data-constructed Quantity of Interest (QoI) maps are also presented and analyzed for solving the PIP within this measure-theoretic framework as a means of reducing uncertainties in the MUD estimate. We conclude with a demonstration of the general applicability of the method on two problems involving either spatial or temporal data for estimating uncertain model parameters.
Submitted 19 January, 2023; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Barrier functions enable safety-conscious force-feedback control
Authors:
Charles Dawson,
Austin Garrett,
Falk Pollok,
Yang Zhang,
Chuchu Fan
Abstract:
In order to be effective partners for humans, robots must become increasingly comfortable with making contact with their environment. Unfortunately, it is hard for robots to distinguish between ``just enough'' and ``too much'' force: some force is required to accomplish the task but too much might damage equipment or injure humans. Traditional approaches to designing compliant force-feedback controllers, such as stiffness control, require difficult hand-tuning of control parameters and make it difficult to build safe, effective robot collaborators. In this paper, we propose a novel yet easy-to-implement force feedback controller that uses control barrier functions (CBFs) to derive a compliant controller directly from users' specifications of the maximum allowable forces and torques. We compare our approach to traditional stiffness control to demonstrate potential advantages of our control architecture, and we demonstrate the effectiveness of our controller on an example human-robot collaboration task: cooperative manipulation of a bulky object.
Submitted 25 September, 2022;
originally announced September 2022.
-
Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields
Authors:
Mukun Tong,
Charles Dawson,
Chuchu Fan
Abstract:
To navigate complex environments, robots must increasingly use high-dimensional visual feedback (e.g. images) for control. However, relying on high-dimensional image data to make control decisions raises important questions; particularly, how might we prove the safety of a visual-feedback controller? Control barrier functions (CBFs) are powerful tools for certifying the safety of feedback controllers in the state-feedback setting, but CBFs have traditionally been poorly-suited to visual feedback control due to the need to predict future observations in order to evaluate the barrier function. In this work, we solve this issue by leveraging recent advances in neural radiance fields (NeRFs), which learn implicit representations of 3D scenes and can render images from previously-unseen camera perspectives, to provide single-step visual foresight for a CBF-based controller. This novel combination is able to filter out unsafe actions and intervene to preserve safety. We demonstrate the effect of our controller in real-time simulation experiments where it successfully prevents the robot from taking dangerous actions.
Submitted 28 February, 2023; v1 submitted 25 September, 2022;
originally announced September 2022.
-
Cross-mode Stabilized Stochastic Shallow Water Systems Using Stochastic Finite Element Methods
Authors:
Chen Chen,
Clint Dawson,
Eirik Valseth
Abstract:
The development of surrogate models to study uncertainties in hydrologic systems requires significant effort in the development of sampling strategies and forward model simulations. Furthermore, in applications where prediction time is critical, such as prediction of hurricane storm surge, the predictions of system response and uncertainties can be required within short time frames. Here, we develop an efficient stochastic shallow water model to address these issues. To discretize the physical and probability spaces we use a Stochastic Galerkin method and an Incremental Pressure Correction scheme to advance the solution in time. To overcome discrete stability issues, we propose cross-mode stabilization methods that employ existing stabilization methods in the probability space by adding stabilization terms to every stochastic mode in a modes-coupled way. We extensively verify the developed method for both idealized shallow water test cases and hindcasting of past hurricanes. We subsequently use the developed and verified method to perform a comprehensive statistical analysis of the established shallow water surrogate models. Finally, we propose a predictor for hurricane storm surge under uncertain wind drag coefficients and demonstrate its effectiveness for Hurricanes Ike and Harvey.
Submitted 25 December, 2022; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Projected Sensitivity of DMRadio-m$^3$: A Search for the QCD Axion Below $1\,μ$eV
Authors:
DMRadio Collaboration,
L. Brouwer,
S. Chaudhuri,
H. -M. Cho,
J. Corbin,
W. Craddock,
C. S. Dawson,
A. Droster,
J. W. Foster,
J. T. Fry,
P. W. Graham,
R. Henning,
K. D. Irwin,
F. Kadribasic,
Y. Kahn,
A. Keller,
R. Kolevatov,
S. Kuenstner,
A. F. Leder,
D. Li,
J. L. Ouellet,
K. Pappas,
A. Phipps,
N. M. Rapidis,
B. R. Safdi
, et al. (9 additional authors not shown)
Abstract:
The QCD axion is one of the most compelling candidates to explain the dark matter abundance of the universe. With its extremely small mass ($\ll 1\,\mathrm{eV}/c^2$), axion dark matter interacts as a classical field rather than a particle. Its coupling to photons leads to a modification of Maxwell's equations that can be measured with extremely sensitive readout circuits. DMRadio-m$^3$ is a next-generation search for axion dark matter below $1\,μ$eV using a $>4$ T static magnetic field, a coaxial inductive pickup, a tunable LC resonator, and a DC-SQUID readout. It is designed to search for QCD axion dark matter over the range $20\,\mathrm{neV}\lesssim m_ac^2\lesssim 800\,\mathrm{neV}$ ($5\,\mathrm{MHz}<ν<200\,\mathrm{MHz}$). The primary science goal aims to achieve DFSZ sensitivity above $m_ac^2\approx 120$ neV (30 MHz), with a secondary science goal of probing KSVZ axions down to $m_ac^2\approx40\,\mathrm{neV}$ (10 MHz).
Submitted 8 December, 2022; v1 submitted 28 April, 2022;
originally announced April 2022.
-
A Framework for Flexible Peak Storm Surge Prediction
Authors:
Benjamin Pachev,
Prateek Arora,
Carlos del-Castillo-Negrete,
Eirik Valseth,
Clint Dawson
Abstract:
Storm surge is a major natural hazard in coastal regions, responsible both for significant property damage and loss of life. Accurate, efficient models of storm surge are needed both to assess long-term risk and to guide emergency management decisions. While high-fidelity regional- and global-ocean circulation models such as the ADvanced CIRCulation (ADCIRC) model can accurately predict storm surge, they are very computationally expensive. Here we develop a novel surrogate model for peak storm surge prediction based on a multi-stage approach. In the first stage, points are classified as inundated or not. In the second, the level of inundation is predicted. Additionally, we propose a new formulation of the surrogate problem in which storm surge is predicted independently for each point. This allows for predictions to be made directly for locations not present in the training data, and significantly reduces the number of model parameters. We demonstrate our modeling framework on two study areas: the Texas coast and the northern portion of the Alaskan coast. For Texas, the model is trained with a database of 446 synthetic hurricanes. The model is able to accurately match ADCIRC predictions on a test set of synthetic storms. We further present a test of the model on Hurricanes Ike (2008) and Harvey (2017). For Alaska, the model is trained on a dataset of 109 historical surge events. We test the surrogate model on actual surge events, including the recent Typhoon Merbok (2022), that take place after the events in the training data. For both datasets, the surrogate model achieves similar performance to ADCIRC on real events when compared to observational data. In both cases, the surrogate models are many orders of magnitude faster than ADCIRC.
Submitted 31 March, 2023; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Certifiable Robot Design Optimization using Differentiable Programming
Authors:
Charles Dawson,
Chuchu Fan
Abstract:
There is a growing need for computational tools to automatically design and verify autonomous systems, especially complex robotic systems involving perception, planning, control, and hardware in the autonomy stack. Differentiable programming has recently emerged as a powerful tool for modeling and optimization. However, very few studies have been done to understand how differentiable programming can be used for robust, certifiable end-to-end design optimization. In this paper, we fill this gap by combining differentiable programming for robot design optimization with a novel statistical framework for certifying the robustness of optimized designs. Our framework can conduct end-to-end optimization and robustness certification for robotics systems, enabling simultaneous optimization of navigation, perception, planning, control, and hardware subsystems.
Using simulation and hardware experiments, we show how our tool can be used to solve practical problems in robotics. First, we optimize sensor placements for robot navigation (a design with 5 subsystems and 6 tunable parameters) in under 5 minutes to achieve an 8.4x performance improvement compared to the initial design. Second, we solve a multi-agent collaborative manipulation task (3 subsystems and 454 parameters) in under an hour to achieve a 44% performance improvement over the initial design. We find that differentiable programming enables much faster (32% and 20x, respectively for each example) optimization than approximate gradient methods. We certify the robustness of each design and successfully deploy the optimized designs in hardware. An open-source implementation is available at https://github.com/MIT-REALM/architect
Submitted 22 April, 2022;
originally announced April 2022.
-
Application of a dispersive wave hydro-sediment-morphodynamic model in the Ria Formosa lagoon
Authors:
Kazbek Kazhyken,
Eirik Valseth,
Juha Videman,
Clint Dawson
Abstract:
Results of an application of a dispersive wave hydro-sediment-morphodynamic model in the western circulation cell of the Ria Formosa lagoon, located in the Algarve region of southern Portugal, are presented. This area of interest has a couple of features that complicate the application of the dispersive wave model: (1) the area has a complex irregular geometry with a number of barrier islands that separate the lagoon from the Atlantic Ocean, artificial and naturally occurring tidal inlets, and a number of curling channels inside the lagoon that interconnect the inlets and serve as waterways between the lagoon settlements; (2) the tidal range in the area can reach up to 3.5 m; therefore, the terrain inside the lagoon is characterized by vast salt marshes and tidal flats, and the wetting-drying process is a key component of any hydrodynamic simulation in this area. A model representation of the area has been developed by generating an unstructured finite element mesh of the circulation cell, and collecting data on parameters that characterize the tidal waves in the area, and bottom friction and sediment transport models used in the simulations. The results of the simulations indicate that the dispersive wave model can be applied in coastal areas with nontrivial underlying physical processes and complex irregular geometries. Moreover, the dispersive term of the model is capable of capturing additional flow characteristics that are otherwise not present in hydrodynamic simulations that involve the nonlinear shallow water equations; these additional flow features can, in turn, affect the resulting sediment transport and bed morphodynamic process simulations.
Submitted 11 August, 2023; v1 submitted 3 April, 2022;
originally announced April 2022.
-
Introducing DMRadio-GUT, a search for GUT-scale QCD axions
Authors:
L. Brouwer,
S. Chaudhuri,
H. -M. Cho,
J. Corbin,
C. S. Dawson,
A. Droster,
J. W. Foster,
J. T. Fry,
P. W. Graham,
R. Henning,
K. D. Irwin,
F. Kadribasic,
Y. Kahn,
A. Keller,
R. Kolevatov,
S. Kuenstner,
A. F. Leder,
D. Li,
J. L. Ouellet,
K. M. W. Pappas,
A. Phipps,
N. M. Rapidis,
B. R. Safdi,
C. P. Salemi,
M. Simanovskaia
, et al. (7 additional authors not shown)
Abstract:
The QCD axion is a leading dark matter candidate that emerges as part of the solution to the strong CP problem in the Standard Model. The coupling of the axion to photons is the most common experimental probe, but much parameter space remains unexplored. The coupling of the QCD axion to the Standard Model scales linearly with the axion mass; therefore, the highly-motivated region 0.4-120 neV, corresponding to a GUT-scale axion, is particularly difficult to reach. This paper presents the design requirements for a definitive search for GUT-scale axions and reviews the technological advances needed to enable this program.
Submitted 21 March, 2022;
originally announced March 2022.
-
Robust Counterexample-guided Optimization for Planning from Differentiable Temporal Logic
Authors:
Charles Dawson,
Chuchu Fan
Abstract:
Signal temporal logic (STL) provides a powerful, flexible framework for specifying complex autonomy tasks; however, existing methods for planning based on STL specifications have difficulty scaling to long-horizon tasks and are not robust to external disturbances. In this paper, we present an algorithm for finding robust plans that satisfy STL specifications. Our method alternates between local optimization and local falsification, using automatically differentiable temporal logic to iteratively optimize its plan in response to counterexamples found during the falsification process. We benchmark our counterexample-guided planning method against state-of-the-art planning methods on two long-horizon satellite rendezvous missions, showing that our method finds high-quality plans that satisfy STL specifications despite adversarial disturbances. We find that our method consistently finds plans that are robust to adversarial disturbances and requires less than half the time of competing methods. We provide an implementation of our planner at https://github.com/MIT-REALM/architect.
Submitted 3 March, 2022;
originally announced March 2022.
-
Safe Control with Learned Certificates: A Survey of Neural Lyapunov, Barrier, and Contraction methods
Authors:
Charles Dawson,
Sicun Gao,
Chuchu Fan
Abstract:
Learning-enabled control systems have demonstrated impressive empirical performance on challenging control problems in robotics, but this performance comes at the cost of reduced transparency and lack of guarantees on the safety or stability of the learned controllers. In recent years, new techniques have emerged to provide these guarantees by learning certificates alongside control policies -- these certificates provide concise, data-driven proofs that guarantee the safety and stability of the learned control system. These methods not only allow the user to verify the safety of a learned controller but also provide supervision during training, allowing safety and stability requirements to influence the training process itself. In this paper, we provide a comprehensive survey of this rapidly developing field of certificate learning. We hope that this paper will serve as an accessible introduction to the theory and practice of certificate learning, both to those who wish to apply these tools to practical robotics problems and to those who wish to dive more deeply into the theory of learning for control.
Submitted 20 December, 2022; v1 submitted 23 February, 2022;
originally announced February 2022.
-
Extending FEniCS to Work in Higher Dimensions Using Tensor Product Finite Elements
Authors:
Mark Loveland,
Eirik Valseth,
Matt Lukac,
Clint Dawson
Abstract:
We present a method to extend the finite element library FEniCS to solve problems with domains in dimensions above three by constructing tensor product finite elements. This methodology only requires that the high dimensional domain is structured as a Cartesian product of two lower dimensional subdomains. In this study we consider Dirichlet problems for scalar linear partial differential equations, though the methodology can be extended to non-linear problems. The utilization of tensor product finite elements allows us to construct a global system of linear algebraic equations that only relies on the finite element infrastructure of the lower dimensional subdomains contained in FEniCS. We demonstrate the effectiveness of our methodology in four distinct test cases. The first test case is a Poisson equation posed in a four dimensional domain which is a Cartesian product of two unit squares solved using the classical Galerkin finite element method. The second test case is the wave equation in space-time, where the computational domain is a Cartesian product of a two dimensional space grid and a one dimensional time interval. In this second case we also employ the Galerkin method. The third test case is an advection dominated advection-diffusion equation where the global domain is a Cartesian product of two one dimensional intervals in which the streamline upwind Petrov-Galerkin method is applied to ensure discrete stability. The final test case uses the Galerkin approach to solve a Poisson problem on a Cartesian product of two intervals with a spatially varying, non-separable diffusivity term. In all cases, a $p=1$ basis is used and optimal $L^2$ convergence rates of order $h^{p+1}$ of the errors are achieved with respect to $h$ refinement.
Submitted 25 May, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Learning Safe, Generalizable Perception-based Hybrid Control with Certificates
Authors:
Charles Dawson,
Bethany Lowenkamp,
Dylan Goff,
Chuchu Fan
Abstract:
Many robotic tasks require high-dimensional sensors such as cameras and Lidar to navigate complex environments, but developing certifiably safe feedback controllers around these sensors remains a challenging open problem, particularly when learning is involved. Previous works have proved the safety of perception-feedback controllers by separating the perception and control subsystems and making strong assumptions on the abilities of the perception subsystem. In this work, we introduce a novel learning-enabled perception-feedback hybrid controller, where we use Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) to show the safety and liveness of a full-stack perception-feedback controller. We use neural networks to learn a CBF and CLF for the full-stack system directly in the observation space of the robot, without the need to assume a separate perception-based state estimator. Our hybrid controller, called LOCUS (Learning-enabled Observation-feedback Control Using Switching), can safely navigate unknown environments, consistently reach its goal, and generalizes safely to environments outside of the training dataset. We demonstrate LOCUS in experiments both in simulation and in hardware, where it successfully navigates a changing environment using feedback from a Lidar sensor.
Submitted 3 January, 2022;
originally announced January 2022.
-
Logical blocks for fault-tolerant topological quantum computation
Authors:
Hector Bombin,
Chris Dawson,
Ryan V. Mishmash,
Naomi Nickerson,
Fernando Pastawski,
Sam Roberts
Abstract:
Logical gates constitute the building blocks of fault-tolerant quantum computation. While quantum error-corrected memories have been extensively studied in the literature, explicit constructions and detailed analyses of thresholds and resource overheads of universal logical gate sets have so far been limited. In this paper, we present a comprehensive framework for universal fault-tolerant logic motivated by the combined need for platform-independent logical gate definitions, flexible and scalable tools for numerical analysis, and exploration of novel schemes for universal logic that improve resource overheads. Central to our framework is the description of logical gates holistically in a way which treats space and time on a similar footing. Focusing on schemes based on surface codes, we introduce explicit, but platform-independent representations of topological logic gates -- called logical blocks -- and generate new, overhead-efficient methods for universal quantum computation. As a specific example, we propose fault-tolerant schemes based on surface codes concatenated with more general low-density parity check (LDPC) codes. The logical blocks framework enables a convenient mapping from an abstract description of the logical gate to a precise set of physical instructions for both circuit-based and fusion-based quantum computation (FBQC). Using this, we numerically simulate a surface-code-based universal gate set implemented with FBQC, and verify that their thresholds are consistent with the bulk memory threshold. We find that boundaries, defects, and twists can significantly impact the logical error rate scaling, with periodic boundary conditions potentially halving the resource requirements. Motivated by the favorable logical error rates for boundaryless computation, we introduce a novel computational scheme based on the teleportation of twists that may offer further resource reductions.
Submitted 13 April, 2023; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions
Authors:
Charles Dawson,
Zengyi Qin,
Sicun Gao,
Chuchu Fan
Abstract:
Safety and stability are common requirements for robotic control systems; however, designing safe, stable controllers remains difficult for nonlinear and uncertain models. We develop a model-based learning approach to synthesize robust feedback controllers with safety and stability guarantees. We take inspiration from robust convex optimization and Lyapunov theory to define robust control Lyapunov barrier functions that generalize despite model uncertainty. We demonstrate our approach in simulation on problems including car trajectory tracking, nonlinear control with obstacle avoidance, satellite rendezvous with safety constraints, and flight control with a learned ground effect model. Simulation results show that our approach yields controllers that match or exceed the capabilities of robust MPC while reducing computational costs by an order of magnitude.
Submitted 6 October, 2021; v1 submitted 14 September, 2021;
originally announced September 2021.
-
SCA-Net: A Self-Correcting Two-Layer Autoencoder for Hyper-spectral Unmixing
Authors:
Gurpreet Singh,
Soumyajit Gupta,
Clint Dawson
Abstract:
Hyperspectral unmixing involves separating a pixel as a weighted combination of its constituent endmembers and corresponding fractional abundances, with the current state-of-the-art results achieved by neural models on benchmark datasets. However, these networks are severely over-parameterized and consequently, the invariant endmember spectra extracted as decoder weights have a high variance over multiple runs. These approaches perform substantial post-processing while requiring an exact specification of the number of endmembers and specialized initialization of weights from other algorithms like VCA. We show for the first time that a two-layer autoencoder (SCA), with $2FK$ parameters ($F$ features, $K$ endmembers), achieves error metrics that are scales apart ($10^{-5}$) from previously reported values ($10^{-2}$). SCA converges to this low error solution starting from a random initialization of weights. We also show that SCA, based upon a bi-orthogonal representation, performs a self-correction when the number of endmembers is over-specified. Numerical experiments on Samson, Jasper, and Urban datasets demonstrate that SCA outperforms previously reported error metrics for all the cases while being robust to noise and outliers.
Submitted 28 October, 2021; v1 submitted 10 February, 2021;
originally announced February 2021.
-
A Hybrid 2-stage Neural Optimization for Pareto Front Extraction
Authors:
Gurpreet Singh,
Soumyajit Gupta,
Matthew Lease,
Clint Dawson
Abstract:
Classification, recommendation, and ranking problems often involve competing goals with additional constraints (e.g., to satisfy fairness or diversity criteria). Such optimization problems are quite challenging, often involving non-convex functions along with considerations of user preferences in balancing trade-offs. Pareto solutions represent optimal frontiers for jointly optimizing multiple competing objectives. A major obstacle for frequently used linear-scalarization strategies is that the resulting optimization problem might not always converge to a global optimum. Furthermore, such methods only return one solution point per run. A Pareto solution set is a subset of all such global optima over multiple runs for different trade-off choices. Therefore, a Pareto front can only be guaranteed with multiple runs of the linear-scalarization problem, where all runs converge to their respective global optima. Consequently, extracting a Pareto front for practical problems is computationally intractable with substantial computational overheads, limited scalability, and reduced accuracy. We propose a robust, low cost, two-stage, hybrid neural Pareto optimization approach that is accurate and scales (compute space and time) with data dimensions, as well as number of functions and constraints. The first stage (neural network) efficiently extracts a weak Pareto front, using Fritz-John conditions as the discriminator, with no assumptions of convexity on the objectives or constraints. The second stage (efficient Pareto filter) extracts the strong Pareto optimal subset given the weak front from stage 1. Fritz-John conditions provide us with theoretical bounds on approximation error between the true and network extracted weak Pareto front. Numerical experiments demonstrate the accuracy and efficiency on a canonical set of benchmark problems and a fairness optimization task from prior works.
Submitted 13 February, 2021; v1 submitted 27 January, 2021;
originally announced January 2021.
-
Fusion-based quantum computation
Authors:
Sara Bartolucci,
Patrick Birchall,
Hector Bombin,
Hugo Cable,
Chris Dawson,
Mercedes Gimeno-Segovia,
Eric Johnston,
Konrad Kieling,
Naomi Nickerson,
Mihir Pant,
Fernando Pastawski,
Terry Rudolph,
Chris Sparrow
Abstract:
We introduce fusion-based quantum computing (FBQC) - a model of universal quantum computation in which entangling measurements, called fusions, are performed on the qubits of small constant-sized entangled resource states. We introduce a stabilizer formalism for analyzing fault tolerance and computation in these schemes. This framework naturally captures the error structure that arises in certain physical systems for quantum computing, such as photonics. FBQC can offer significant architectural simplifications, enabling hardware made up of many identical modules, requiring an extremely low depth of operations on each physical qubit and reducing classical processing requirements. We present two pedagogical examples of fault-tolerant schemes constructed in this framework and numerically evaluate their threshold under a hardware agnostic fusion error model including both erasure and Pauli error. We also study an error model of linear optical quantum computing with probabilistic fusion and photon loss. In FBQC the non-determinism of fusion is directly dealt with by the quantum error correction protocol, along with other errors. We find that tailoring the fault-tolerance framework to the physical system allows the scheme to have a higher threshold than schemes reported in the literature. We present a ballistic scheme which can tolerate a 10.4% probability of suffering photon loss in each fusion.
Submitted 22 January, 2021;
originally announced January 2021.
-
A Stable Mixed FE Method for Nearly Incompressible Linear Elastostatics
Authors:
Eirik Valseth,
Albert Romkes,
Austin R. Kaul,
Clint Dawson
Abstract:
We present a new, stable, mixed finite element (FE) method for linear elastostatics of nearly incompressible solids. The method is the automatic variationally stable FE (AVS-FE) method of Calo, Romkes and Valseth, in which we consider a Petrov-Galerkin weak formulation where the stress and displacement variables are in the space H(div)xH1, respectively. This allows us to employ a fully conforming FE discretization for any elastic solid using classical FE subspaces of H(div) and H1. Hence, the resulting FE approximation yields both continuous stresses and displacements.
To ensure stability of the method, we employ the philosophy of the discontinuous Petrov-Galerkin (DPG) method of Demkowicz and Gopalakrishnan and use optimal test spaces. Thus, the resulting FE discretization is stable even as the Poisson ratio approaches 0.5, and the system of linear algebraic equations is symmetric and positive definite. Our method also comes with a built-in a posteriori error estimator as well as indicators which are used to drive mesh adaptive refinements. We present several numerical verifications of our method including comparisons to existing FE technologies.
Submitted 7 May, 2021; v1 submitted 15 January, 2021;
originally announced January 2021.
-
An Adaptive Stable Space-Time FE Method for the Shallow Water Equations
Authors:
Eirik Valseth,
Clint Dawson
Abstract:
We consider the finite element (FE) approximation of the shallow water equations (SWE) by considering discretizations in which both space and time are established using an unconditionally stable FE method. Particularly, we consider the automatic variationally stable FE (AVS-FE) method, a type of discontinuous Petrov-Galerkin (DPG) method. The philosophy of the DPG method allows us to break the test space and achieve unconditionally stable FE approximations as well as accurate a posteriori error estimators upon solution of a saddle point system of equations. The resulting error indicators allow us to employ mesh adaptive strategies and perform space-time mesh refinements, i.e., local time stepping.
We derive a priori error estimates for the AVS-FE method and linearized SWE and perform numerical verifications to confirm corresponding asymptotic convergence behavior. In an effort to keep the computational cost low, we consider an alternative space-time approach in which the space-time domain is partitioned into finite sized space-time slices. Hence, we can perform adaptivity on each individual slice to preset error tolerances as needed for a particular application. Numerical verifications comparing the two alternatives indicate the space-time slices are superior for simulations over long times, whereas the solutions are indistinguishable for short times. Multiple numerical verifications show the adaptive mesh refinement capabilities of the AVS-FE method, as well as the application of the method to commonly applied benchmarks for the SWE.
Submitted 9 November, 2020;
originally announced November 2020.
-
Range-Net: A High Precision Streaming SVD for Big Data Applications
Authors:
Gurpreet Singh,
Soumyajit Gupta,
Matthew Lease,
Clint Dawson
Abstract:
In a Big Data setting, computing the dominant SVD factors is restrictive due to the main memory requirements. Recently introduced streaming Randomized SVD schemes work under the restrictive assumption that the singular value spectrum of the data has exponential decay. This is seldom true for any practical data. Although these methods are claimed to be applicable to scientific computations due to associated tail-energy error bounds, the approximation errors in the singular vectors and values are high when the aforementioned assumption does not hold. Furthermore, from a practical perspective, oversampling can still be memory intensive or, worse, can exceed the feature dimension of the data. To address these issues, we present Range-Net as an alternative to randomized SVD that satisfies the tail-energy lower bound given by the Eckart-Young-Mirsky (EYM) theorem. Range-Net is a deterministic two-stage neural optimization approach with random initialization, where the main memory requirement depends explicitly on the feature dimension and desired rank, independent of the sample dimension. The data samples are read in a streaming setting with the network minimization problem converging to the desired rank-r approximation. Range-Net is fully interpretable where all the network outputs and weights have a specific meaning. We provide theoretical guarantees that Range-Net extracted SVD factors satisfy the EYM tail-energy lower bound at machine precision. Our numerical experiments on real data at various scales confirm this bound. A comparison against the state of the art streaming Randomized SVD shows that Range-Net accuracy is better by six orders of magnitude while being memory efficient.
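For readers unfamiliar with the tail-energy bound mentioned above, the following minimal sketch computes it with a full in-memory SVD. It is not Range-Net and does not stream the data; it only illustrates the Eckart-Young-Mirsky statement that the squared Frobenius error of the best rank-r approximation equals the sum of the squared discarded singular values. The function name and the random test matrix are assumptions made for this example.

```python
import numpy as np

def rank_r_approximation(X, r):
    """Best rank-r approximation of X (Frobenius norm) via a truncated SVD.

    Returns the approximation and the tail energy, i.e. the sum of the
    squared singular values that were discarded.
    """
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Scale the first r left singular vectors by their singular values.
    X_r = (U[:, :r] * s[:r]) @ Vt[:r, :]
    tail_energy = float(np.sum(s[r:] ** 2))
    return X_r, tail_energy

# Small synthetic example: 50 samples, 8 features, target rank 3.
rng = np.random.default_rng(0)
X = rng.standard_normal((50, 8))
X_r, tail_energy = rank_r_approximation(X, 3)
squared_error = float(np.linalg.norm(X - X_r, "fro") ** 2)
```

Here squared_error and tail_energy should agree up to floating point round-off, which is the bound that any rank-r method can at best attain.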
Submitted 18 March, 2021; v1 submitted 27 October, 2020;
originally announced October 2020.
-
Discontinuous Galerkin methods for a dispersive wave hydro-sediment-morphodynamic model
Authors:
Kazbek Kazhyken,
Juha Videman,
Clint Dawson
Abstract:
A dispersive wave hydro-sediment-morphodynamic model developed by complementing the shallow water hydro-sediment-morphodynamic (SHSM) equations with the dispersive term from the Green-Naghdi equations is presented. A numerical solution algorithm for the model based on the second-order Strang operator splitting is presented. The model is partitioned into two parts, (1) the SHSM equations and (2) the dispersive correction part, which are discretized using discontinuous Galerkin finite element methods. This splitting technique provides a facility to dynamically select regions of a problem domain where the dispersive term is not applied, e.g. wave breaking regions where the dispersive wave model is no longer valid. Algorithms that can handle wetting-drying and detect wave breaking are provided and a number of numerical examples are presented to validate the developed numerical solution algorithm. The results of the simulations indicate that the model is capable of predicting sediment transport and bed morphodynamic processes correctly provided that the empirical models for the suspended and bed load transport are properly calibrated. Moreover, the developed model is able to accurately capture hydrodynamics and wave dispersion effects up to swash zones, and its application is justified for simulations where dispersive wave effects are prevalent.
Submitted 11 October, 2020;
originally announced October 2020.
-
Automatic Variationally Stable Analysis for Finite Element Computations: Transient Convection-Diffusion Problems
Authors:
Eirik Valseth,
Pouria Behnoudfar,
Clint Dawson,
Albert Romkes
Abstract:
We establish stable finite element (FE) approximations of convection-diffusion initial boundary value problems using the automatic variationally stable finite element (AVS-FE) method. The transient convection-diffusion problem leads to issues in classical FE methods as the differential operator can be considered a singular perturbation in both space and time. The unconditional stability of the AVS-FE method, regardless of the underlying differential operator, allows us significant flexibility in the construction of FE approximations. We take two distinct approaches to the FE discretization of the convection-diffusion problem: i) a space-time approach in which the temporal discretization is established using finite elements, and ii) a method of lines approach in which we employ the AVS-FE method in space whereas the temporal domain is discretized using the generalized-alpha method. In the generalized-alpha method, we discretize the temporal domain into finite sized time-steps and adopt the generalized-alpha method as time integrator. Then, we derive a corresponding norm for the obtained operator to guarantee the temporal stability of the method.
We present numerical verifications for both approaches, including numerical asymptotic convergence studies highlighting optimal convergence properties. Furthermore, in the spirit of the discontinuous Petrov-Galerkin method by Demkowicz and Gopalakrishnan, the AVS-FE method also leads to readily available a posteriori error estimates through a Riesz representer of the residual of the AVS-FE approximations. Hence, the norms of the resulting local restrictions of these estimates serve as error indicators in both space and time, for which we present multiple numerical verifications of adaptive strategies.
Submitted 30 March, 2023; v1 submitted 30 September, 2020;
originally announced October 2020.
-
Learning Quantities of Interest from Dynamical Systems for Observation-Consistent Inversion
Authors:
Steven Mattis,
Kyle Robert Steffen,
Troy Butler,
Clint N. Dawson,
Donald Estep
Abstract:
Dynamical systems arise in a wide variety of mathematical models from science and engineering. A common challenge is to quantify uncertainties on model inputs (parameters) that correspond to a quantitative characterization of uncertainties on observable Quantities of Interest (QoI). To this end, we consider a stochastic inverse problem (SIP) with a solution described by a pullback probability measure. We call this an observation-consistent solution, as its subsequent push-forward through the QoI map matches the observed probability distribution on model outputs. A distinction is made between QoI useful for solving the SIP and arbitrary model output data. In dynamical systems, model output data are often given as a series of state variable responses recorded over a particular time window. Consequently, the dimension of output data can easily exceed $\mathcal{O}(1E4)$ or more due to the frequency of observations, and the correct choice or construction of a QoI from this data is not self-evident. We present a new framework, Learning Uncertain Quantities (LUQ), that facilitates the tractable solution of SIPs for dynamical systems. Given ensembles of predicted (simulated) time series and (noisy) observed data, LUQ provides routines for filtering data, unsupervised learning of the underlying dynamics, classifying observations, and feature extraction to learn the QoI map. Subsequently, time series data are transformed into samples of the underlying predicted and observed distributions associated with the QoI so that solutions to the SIP are computable. Following the introduction and demonstration of LUQ, numerical results from several SIPs are presented for a variety of dynamical systems arising in the life and physical sciences. For scientific reproducibility, we provide links to our Python implementation of LUQ and to all data and scripts required to reproduce the results in this manuscript.
Submitted 16 July, 2021; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Measuring the Credibility of Student Attendance Data in Higher Education for Data Mining
Authors:
Mohammed Alsuwaiket,
Christian Dawson,
Firat Batmaz
Abstract:
Educational Data Mining (EDM) is a developing discipline, concerned with expanding the classical Data Mining (DM) methods and developing new methods for discovering the data that originate from educational systems. Student attendance in higher education has always been dealt with in a classical way; educators rely on counting the occurrence of attendance or absence, building their knowledge about students as well as modules based on this count. This method is neither credible nor does it necessarily provide a real indication of a student's performance. This study tries to formulate the extracted knowledge in a way that guarantees achieving accurate and credible results. Student attendance data, gathered from the educational system, were first cleaned in order to remove any randomness and noise, then various attributes were studied so as to highlight the most significant ones that affect the real attendance of students. The next step was to derive an equation that measures the Student Attendance Credibility (SAC) considering the attributes chosen in the previous step. The reliability of the newly developed measure was then evaluated in order to examine its consistency. Finally, the J48 DM classification technique was utilized in order to classify modules based on the strength of their SAC values. Results of this study were promising, and credibility values achieved using the newly derived formula gave accurate, credible, and real indicators of student attendance, as well as accurate classification of modules based on the credibility of student attendance on those modules.
Submitted 1 September, 2020;
originally announced September 2020.
-
Prevention is Better than Cure: Handling Basis Collapse and Transparency in Dense Networks
Authors:
Gurpreet Singh,
Soumyajit Gupta,
Clint N. Dawson
Abstract:
Dense nets are an integral part of any classification and regression problem. Recently, these networks have found a new application as solvers for known representations in various domains. However, one crucial issue with dense nets is their feature interpretation and lack of reproducibility over multiple training runs. In this work, we identify a basis collapse issue as a primary cause and propose a modified loss function that circumvents this problem. We also provide a few general guidelines relating the choice of activations to loss surface roughness and appropriate scaling for designing low-weight dense nets. We demonstrate through carefully chosen numerical experiments that the basis collapse issue leads to the design of massively redundant networks. Our approach results in substantially concise nets, having $100 \times$ fewer parameters, while achieving a much lower $(10\times)$ MSE loss at scale than reported in prior works. Further, we show that the width of a dense net is acutely dependent on the feature complexity. This is in contrast to the dimension dependent width choice reported in prior theoretical works. To the best of our knowledge, this is the first time these issues and contradictions have been reported and experimentally verified. With our design guidelines we render transparency in terms of a low-weight network design. We share our codes for full reproducibility available at https://github.com/smjtgupta/Dense_Net_Regress.
Submitted 22 August, 2020;
originally announced August 2020.
-
An Unconditionally Stable Space-Time FE Method for the Korteweg-de Vries Equation
Authors:
Eirik Valseth,
Clint Dawson
Abstract:
We introduce an unconditionally stable finite element (FE) method, the automatic variationally stable FE (AVS-FE) method for the numerical analysis of the Korteweg-de Vries (KdV) equation. The AVS-FE method is a Petrov-Galerkin method which employs the concept of optimal discontinuous test functions of the discontinuous Petrov-Galerkin (DPG) method. However, since the AVS-FE method is a minimum residual method, we establish a global saddle point system instead of computing optimal test functions element-by-element. This system allows us to seek both the approximate solution of the KdV initial boundary value problem (IBVP) and a Riesz representer of the approximation error. The AVS-FE method distinguishes itself from other minimum residual methods by using globally continuous Hilbert spaces, such as H1, while at the same time using broken Hilbert spaces for the test. Consequently, the AVS-FE approximations are classical C0 continuous FE solutions. The unconditional stability of this method allows us to solve the KdV equation in space and time without having to satisfy a CFL condition. We present several numerical verifications for both linear and nonlinear versions of the KdV equation leading to optimal convergence behavior. Finally, we present a numerical verification of adaptive mesh refinements in both space and time for the nonlinear KdV equation.
Submitted 11 June, 2020;
originally announced June 2020.
-
Discontinuous Galerkin methods for a dispersive wave hydro-morphodynamic model with bed-load transport
Authors:
Kazbek Kazhyken,
Juha Videman,
Clint Dawson
Abstract:
A dispersive wave hydro-morphodynamic model coupling the Green-Naghdi equations (the hydrodynamic part) with the sediment continuity Exner equation (the morphodynamic part) is presented. Numerical solution algorithms based on discontinuous Galerkin finite element discretizations of the model are proposed. The algorithms include both coupled and decoupled approaches for solving the hydrodynamic and morphodynamic parts simultaneously and separately from each other, respectively. The Strang operator splitting technique is employed to treat the dispersive terms separately, and it provides the ability to ignore the dispersive terms in specified regions, such as surf zones. Algorithms that can handle wetting-drying and detect wave breaking are presented. The numerical solution algorithms are validated with numerical experiments to demonstrate the ability of the algorithms to accurately resolve hydrodynamics of solitary and regular waves, and morphodynamic changes induced by such waves. The results indicate that the model has the potential to be used in studies of coastal morphodynamics driven by dispersive water waves, given that the hydrodynamic part resolves the water motion and dispersive wave effects with sufficient accuracy up to swash zones, and the morphodynamic model can capture the major features of bed erosion and deposition.
Submitted 4 October, 2020; v1 submitted 2 May, 2020;
originally announced May 2020.
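As a rough illustration of the Strang operator splitting mentioned in this abstract, the sketch below applies it to a toy scalar ODE, u' = -u - u^2, split into a linear part and a quadratic part that can each be solved exactly. The toy problem and all function names are assumptions made for illustration; this is not the paper's Green-Naghdi solver.

```python
import math

def flow_linear(u, t):
    """Exact flow of u' = -u over time t."""
    return u * math.exp(-t)

def flow_quadratic(u, t):
    """Exact flow of u' = -u**2 over time t (valid while 1 + u*t > 0)."""
    return u / (1.0 + u * t)

def strang_step(u, dt):
    """One Strang step: half linear, full quadratic, half linear."""
    u = flow_linear(u, 0.5 * dt)
    u = flow_quadratic(u, dt)
    return flow_linear(u, 0.5 * dt)

def exact(u0, t):
    """Closed-form solution of the unsplit problem u' = -u - u**2."""
    e = math.exp(-t)
    return u0 * e / (1.0 + u0 * (1.0 - e))

def strang_error(n_steps, u0=1.0, t_end=1.0):
    """Absolute error at t_end when using n_steps Strang steps."""
    dt = t_end / n_steps
    u = u0
    for _ in range(n_steps):
        u = strang_step(u, dt)
    return abs(u - exact(u0, t_end))

# Halving the step size should cut the error by roughly a factor of 4,
# which is the signature of a second-order method.
ratio = strang_error(10) / strang_error(20)
print(ratio)
```

Because each sub-flow is applied on its own, one of them can simply be skipped in part of the domain, which is the property the abstract relies on to ignore dispersive terms in regions such as surf zones.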
-
Adaptive Total Variation Stable Local Timestepping for Conservation Laws
Authors:
Maximilian Bremer,
John Bachan,
Cy Chan,
Clint Dawson
Abstract:
This paper proposes a first-order total variation diminishing (TVD) treatment for coarsening and refining of local timestep size in response to dynamic local variations in wave speeds for nonlinear conservation laws. The algorithm is accompanied by a proof of formal correctness showing that, given a sufficiently small minimum timestep, the algorithm will produce a TVD solution for nonlinear scalar conservation laws. A key feature of the algorithm is its formulation as a discrete event simulation, which allows for easy and efficient parallelization using existing software. Numerical results demonstrate the stability and adaptivity of the method for the shallow water equations. We also introduce a performance model to load balance and explain the observed performance gains. Performance results are presented for a single node on Stampede2's Skylake partition using an optimistic parallel discrete event simulator. Results show the proposed algorithm recovering 59%-77% of the theoretically achievable speed-up, with the discrepancies being attributed to the cost of computing the CFL condition and load imbalance.
Submitted 19 March, 2020;
originally announced March 2020.
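To make the TVD property in this abstract concrete, the following minimal sketch checks it for a much simpler setting: a first-order upwind scheme with a single global timestep for linear advection on a periodic grid. The scheme, grid size, and CFL number are assumptions chosen for illustration; the paper's local timestepping algorithm is not reproduced here.

```python
def total_variation(u):
    """Discrete total variation of u on a periodic grid."""
    n = len(u)
    return sum(abs(u[(i + 1) % n] - u[i]) for i in range(n))

def upwind_step(u, cfl):
    """First-order upwind update for u_t + a*u_x = 0 with a > 0.

    cfl = a*dt/dx. The update is a convex combination of neighbouring
    values when 0 <= cfl <= 1, which is what makes it TVD.
    u[i - 1] wraps around at i = 0, giving periodic boundaries.
    """
    return [u[i] - cfl * (u[i] - u[i - 1]) for i in range(len(u))]

# Square pulse as initial data; its total variation is exactly 2.
n_cells = 100
u = [1.0 if 30 <= i < 60 else 0.0 for i in range(n_cells)]

tv_history = [total_variation(u)]
for _ in range(50):
    u = upwind_step(u, cfl=0.8)
    tv_history.append(total_variation(u))

# Total variation should never grow from one step to the next.
is_tvd = all(b <= a + 1e-12 for a, b in zip(tv_history, tv_history[1:]))
print(is_tvd)
```

Rerunning with a CFL number above 1 is a quick way to see why the abstract singles out the cost of computing the CFL condition: the convex-combination argument no longer holds and the total variation is free to grow.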
-
Provably Safe Trajectory Optimization in the Presence of Uncertain Convex Obstacles
Authors:
Charles Dawson,
Ashkan Jasour,
Andreas Hofmann,
Brian Williams
Abstract:
Real-world environments are inherently uncertain, and to operate safely in these environments robots must be able to plan around this uncertainty. In the context of motion planning, we desire systems that can maintain an acceptable level of safety as the robot moves, even when the exact locations of nearby obstacles are not known. In this paper, we solve this chance-constrained motion planning problem using a sequential convex optimization framework. To constrain the risk of collision incurred by planned movements, we employ geometric objects called $ε$-shadows to compute upper bounds on the risk of collision between the robot and uncertain obstacles. We use these $ε$-shadow-based estimates as constraints in a nonlinear trajectory optimization problem, which we then solve by iteratively linearizing the non-convex risk constraints. This sequential optimization approach quickly finds trajectories that accomplish the desired motion while maintaining a user-specified limit on collision risk. Our method can be applied to robots and environments with arbitrary convex geometry; even in complex environments, it runs in less than a second and provides provable guarantees on the safety of planned trajectories, enabling fast, reactive, and safe robot motion in realistic environments.
Submitted 17 March, 2020;
originally announced March 2020.
-
Fast Certification of Collision Probability Bounds with Uncertain Convex Obstacles
Authors:
Charles Dawson,
Andreas Hofmann,
Brian Williams
Abstract:
To operate reactively in uncertain environments, robots need to be able to quickly estimate the risk that they will collide with their environment. This ability is important for both planning (to ensure that plans maintain acceptable levels of safety) and execution (to provide real-time warnings when risk exceeds some threshold). Existing methods for estimating this risk are often limited to models with simplified geometry (e.g., point robots); others handle complex geometry but are too slow for many applications. In this paper, we present two algorithms for quickly computing upper bounds on the risk of collision between a robot and uncertain obstacles by searching for certificate regions that capture collision probability mass while avoiding the robot. These algorithms come with strong theoretical guarantees that the true risk does not exceed the estimated value, support arbitrary geometry via convex decomposition, and provide fast query times ($<200\,\mu$s) in representative scenarios. We characterize the performance of these algorithms in environments of varying complexity, demonstrating at least an order of magnitude speedup over existing techniques.
Submitted 17 March, 2020;
originally announced March 2020.