Search | arXiv e-print repository

E(n) Equivariant Topological Neural Networks

Authors: Claudio Battiloro, Ege Karaismailoğlu, Mauricio Tec, George Dasoulas, Michelle Audirac, Francesca Dominici

Abstract: Graph neural networks excel at modeling pairwise interactions, but they cannot flexibly accommodate higher-order interactions and features. Topological deep learning (TDL) has emerged recently as a promising tool for addressing this issue. TDL enables the principled modeling of arbitrary multi-way, hierarchical higher-order interactions by operating on combinatorial topological spaces, such as sim… ▽ More Graph neural networks excel at modeling pairwise interactions, but they cannot flexibly accommodate higher-order interactions and features. Topological deep learning (TDL) has emerged recently as a promising tool for addressing this issue. TDL enables the principled modeling of arbitrary multi-way, hierarchical higher-order interactions by operating on combinatorial topological spaces, such as simplicial or cell complexes, instead of graphs. However, little is known about how to leverage geometric features such as positions and velocities for TDL. This paper introduces E(n)-Equivariant Topological Neural Networks (ETNNs), which are E(n)-equivariant message-passing networks operating on combinatorial complexes, formal objects unifying graphs, hypergraphs, simplicial, path, and cell complexes. ETNNs incorporate geometric node features while respecting rotation and translation equivariance. Moreover, ETNNs are natively ready for settings with heterogeneous interactions. We provide a theoretical analysis to show the improved expressiveness of ETNNs over architectures for geometric graphs. We also show how several E(n) equivariant variants of TDL models can be directly derived from our framework. The broad applicability of ETNNs is demonstrated through two tasks of vastly different nature: i) molecular property prediction on the QM9 benchmark and ii) land-use regression for hyper-local estimation of air pollution with multi-resolution irregular geospatial data. The experiment results indicate that ETNNs are an effective tool for learning from diverse types of richly structured data, highlighting the benefits of principled geometric inductive bias. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: 32 pages, 11 figures, 8 tables

arXiv:2312.14196 [pdf, other]

Optimizing Heat Alert Issuance with Reinforcement Learning

Authors: Ellen M. Considine, Rachel C. Nethery, Gregory A. Wellenius, Francesca Dominici, Mauricio Tec

Abstract: A key strategy in societal adaptation to climate change is the use of alert systems to reduce the adverse health impacts of extreme heat events by prompting preventative action. In this work, we investigate reinforcement learning (RL) as a tool to optimize the effectiveness of such systems. Our contributions are threefold. First, we introduce a novel RL environment enabling the evaluation of the e… ▽ More A key strategy in societal adaptation to climate change is the use of alert systems to reduce the adverse health impacts of extreme heat events by prompting preventative action. In this work, we investigate reinforcement learning (RL) as a tool to optimize the effectiveness of such systems. Our contributions are threefold. First, we introduce a novel RL environment enabling the evaluation of the effectiveness of heat alert policies to reduce heat-related hospitalizations. The rewards model is trained from a comprehensive dataset of historical weather, Medicare health records, and socioeconomic/geographic features. We use variational Bayesian techniques to address low-signal effects and spatial heterogeneity, which are commonly encountered in climate & health settings. The transition model incorporates real historical weather patterns enriched by a data augmentation mechanism based on climate region similarity. Second, we use this environment to evaluate standard RL algorithms in the context of heat alert issuance. Our analysis shows that policy constraints are needed to improve the initially poor performance of RL. Lastly, a post hoc contrastive analysis provides insight into scenarios where our modified heat alert-RL policies yield significant gains/losses over the current National Weather Service alert policy in the United States. △ Less

Submitted 10 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: Main text has 21 pages with 3 tables and 7 figures

arXiv:2312.00710 [pdf, other]

SpaCE: The Spatial Confounding Environment

Authors: Mauricio Tec, Ana Trisovic, Michelle Audirac, Sophie Woodward, Jie Kate Hu, Naeem Khoshnevis, Francesca Dominici

Abstract: Spatial confounding poses a significant challenge in scientific studies involving spatial data, where unobserved spatial variables can influence both treatment and outcome, possibly leading to spurious associations. To address this problem, we introduce SpaCE: The Spatial Confounding Environment, the first toolkit to provide realistic benchmark datasets and tools for systematically evaluating caus… ▽ More Spatial confounding poses a significant challenge in scientific studies involving spatial data, where unobserved spatial variables can influence both treatment and outcome, possibly leading to spurious associations. To address this problem, we introduce SpaCE: The Spatial Confounding Environment, the first toolkit to provide realistic benchmark datasets and tools for systematically evaluating causal inference methods designed to alleviate spatial confounding. Each dataset includes training data, true counterfactuals, a spatial graph with coordinates, and smoothness and confounding scores characterizing the effect of a missing spatial confounder. It also includes realistic semi-synthetic outcomes and counterfactuals, generated using state-of-the-art machine learning ensembles, following best practices for causal inference benchmarks. The datasets cover real treatment and covariates from diverse domains, including climate, health and social sciences. SpaCE facilitates an automated end-to-end pipeline, simplifying data loading, experimental setup, and evaluating machine learning and causal inference models. The SpaCE project provides several dozens of datasets of diverse sizes and spatial complexity. It is publicly available as a Python package, encouraging community feedback and contributions. △ Less

Submitted 5 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

arXiv:2302.02560 [pdf, other]

Causal Estimation of Exposure Shifts with Neural Networks: Evaluating the Health Benefits of Stricter Air Quality Standards in the US

Authors: Mauricio Tec, Oladimeji Mudele, Kevin Josey, Francesca Dominici

Abstract: In policy research, one of the most critical analytic tasks is to estimate the causal effect of a policy-relevant shift to the distribution of a continuous exposure/treatment on an outcome of interest. We call this problem shift-response function (SRF) estimation. Existing neural network methods involving robust causal-effect estimators lack theoretical guarantees and practical implementations for… ▽ More In policy research, one of the most critical analytic tasks is to estimate the causal effect of a policy-relevant shift to the distribution of a continuous exposure/treatment on an outcome of interest. We call this problem shift-response function (SRF) estimation. Existing neural network methods involving robust causal-effect estimators lack theoretical guarantees and practical implementations for SRF estimation. Motivated by a key policy-relevant question in public health, we develop a neural network method and its theoretical underpinnings to estimate SRFs with robustness and efficiency guarantees. We then apply our method to data consisting of 68 million individuals and 27 million deaths across the U.S. to estimate the causal effect from revising the US National Ambient Air Quality Standards (NAAQS) for PM 2.5 from 12 $μg/m^3$ to 9 $μg/m^3$. This change has been recently proposed by the US Environmental Protection Agency (EPA). Our goal is to estimate, for the first time, the reduction in deaths that would result from this anticipated revision using causal methods for SRFs. Our proposed method, called {T}argeted {R}egularization for {E}xposure {S}hifts with Neural {Net}works (TRESNET), contributes to the neural network literature for causal inference in two ways: first, it proposes a targeted regularization loss with theoretical properties that ensure double robustness and achieves asymptotic efficiency specific for SRF estimation; second, it enables loss functions from the exponential family of distributions to accommodate non-continuous outcome distributions (such as hospitalization or mortality counts). We complement our application with benchmark experiments that demonstrate TRESNET's broad applicability and competitiveness. △ Less

Submitted 6 December, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

arXiv:2210.12122 [pdf, other]

Targeted active learning for probabilistic models

Authors: Christopher Tosh, Mauricio Tec, Wesley Tansey

Abstract: A fundamental task in science is to design experiments that yield valuable insights about the system under study. Mathematically, these insights can be represented as a utility or risk function that shapes the value of conducting each experiment. We present PDBAL, a targeted active learning method that adaptively designs experiments to maximize scientific utility. PDBAL takes a user-specified risk… ▽ More A fundamental task in science is to design experiments that yield valuable insights about the system under study. Mathematically, these insights can be represented as a utility or risk function that shapes the value of conducting each experiment. We present PDBAL, a targeted active learning method that adaptively designs experiments to maximize scientific utility. PDBAL takes a user-specified risk function and combines it with a probabilistic model of the experimental outcomes to choose designs that rapidly converge on a high-utility model. We prove theoretical bounds on the label complexity of PDBAL and provide fast closed-form solutions for designing experiments with common exponential family likelihoods. In simulation studies, PDBAL consistently outperforms standard untargeted approaches that focus on maximizing expected information gain over the design space. Finally, we demonstrate the scientific potential of PDBAL through a study on a large cancer drug screen dataset where PDBAL quickly recovers the most efficacious drugs with a small fraction of the total number of experiments. △ Less

Submitted 21 October, 2022; originally announced October 2022.

arXiv:2209.12316 [pdf, other]

Weather2vec: Representation Learning for Causal Inference with Non-Local Confounding in Air Pollution and Climate Studies

Authors: Mauricio Tec, James Scott, Corwin Zigler

Abstract: Estimating the causal effects of a spatially-varying intervention on a spatially-varying outcome may be subject to non-local confounding (NLC), a phenomenon that can bias estimates when the treatments and outcomes of a given unit are dictated in part by the covariates of other nearby units. In particular, NLC is a challenge for evaluating the effects of environmental policies and climate events on… ▽ More Estimating the causal effects of a spatially-varying intervention on a spatially-varying outcome may be subject to non-local confounding (NLC), a phenomenon that can bias estimates when the treatments and outcomes of a given unit are dictated in part by the covariates of other nearby units. In particular, NLC is a challenge for evaluating the effects of environmental policies and climate events on health-related outcomes such as air pollution exposure. This paper first formalizes NLC using the potential outcomes framework, providing a comparison with the related phenomenon of causal interference. Then, it proposes a broadly applicable framework, termed "weather2vec", that uses the theory of balancing scores to learn representations of non-local information into a scalar or vector defined for each observational unit, which is subsequently used to adjust for confounding in conjunction with causal inference methods. The framework is evaluated in a simulation study and two case studies on air pollution where the weather is an (inherently regional) known confounder. △ Less

Submitted 11 December, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

Journal ref: AAAI 2023

arXiv:2205.04023 [pdf, other]

doi 10.1080/00031305.2022.2129787

A Comparative Tutorial of Bayesian Sequential Design and Reinforcement Learning

Authors: Mauricio Tec, Yunshan Duan, Peter Müller

Abstract: Reinforcement Learning (RL) is a computational approach to reward-driven learning in sequential decision problems. It implements the discovery of optimal actions by learning from an agent interacting with an environment rather than from supervised data. We contrast and compare RL with traditional sequential design, focusing on simulation-based Bayesian sequential design (BSD). Recently, there has… ▽ More Reinforcement Learning (RL) is a computational approach to reward-driven learning in sequential decision problems. It implements the discovery of optimal actions by learning from an agent interacting with an environment rather than from supervised data. We contrast and compare RL with traditional sequential design, focusing on simulation-based Bayesian sequential design (BSD). Recently, there has been an increasing interest in RL techniques for healthcare applications. We introduce two related applications as motivating examples. In both applications, the sequential nature of the decisions is restricted to sequential stop**. Rather than a comprehensive survey, the focus of the discussion is on solutions using standard tools for these two relatively simple sequential stop** problems. Both problems are inspired by adaptive clinical trial design. We use examples to explain the terminology and mathematical background that underlie each framework and map one to the other. The implementations and results illustrate the many similarities between RL and BSD. The results motivate the discussion of the potential strengths and limitations of each approach. △ Less

Submitted 4 October, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: The American Statistician (2022)

arXiv:2203.11798 [pdf, other]

Bayesian Nonparametric Adjustment of Confounding

Authors: Chanmin Kim, Mauricio Tec, Corwin M Zigler

Abstract: Analysis of observational studies increasingly confronts the challenge of determining which of a possibly high-dimensional set of available covariates are required to satisfy the assumption of ignorable treatment assignment for estimation of causal effects. We propose a Bayesian nonparametric approach that simultaneously 1) prioritizes inclusion of adjustment variables in accordance with existing… ▽ More Analysis of observational studies increasingly confronts the challenge of determining which of a possibly high-dimensional set of available covariates are required to satisfy the assumption of ignorable treatment assignment for estimation of causal effects. We propose a Bayesian nonparametric approach that simultaneously 1) prioritizes inclusion of adjustment variables in accordance with existing principles of confounder selection; 2) estimates causal effects in a manner that permits complex relationships among confounders, exposures, and outcomes; and 3) provides causal estimates that account for uncertainty in the nature of confounding. The proposal relies on specification of multiple Bayesian Additive Regression Trees models, linked together with a common prior distribution that accrues posterior selection probability to covariates on the basis of association with both the exposure and the outcome of interest. A set of extensive simulation studies demonstrates that the proposed method performs well relative to similarly-motivated methodologies in a variety of scenarios. We deploy the method to investigate the causal effect of emissions from coal-fired power plants on ambient air pollution concentrations, where the prospect of confounding due to local and regional meteorological factors introduces uncertainty around the confounding role of a high-dimensional set of measured variables. Ultimately, we show that the proposed method produces more efficient and more consistent results across adjacent years than alternative methods, lending strength to the evidence of the causal relationship between SO2 emissions and ambient particulate pollution. △ Less

Submitted 22 March, 2022; originally announced March 2022.

Comments: 35 pages, 5 figures

arXiv:2203.00136 [pdf, other]

doi 10.1007/978-3-030-98096-2_12

Estimating Importation Risk of Covid-19 in Hurricane Evacuations: A Prediction Framework Applied to Hurricane Laura in Texas

Authors: Michelle Audirac, Mauricio Tec, Enrique Garcia-Tejeda, Spencer Fox

Abstract: In August 2020, as Texas was coming down from a large summer COVID-19 surge, forecasts suggested that Hurricane Laura was tracking towards 6M residents along the East Texas coastline, threatening to spread COVID-19 across the state and cause pandemic resurgences. To assist local authorities facing the dual-threat, we integrated survey expectations of coastal residents and observed hurricane evacua… ▽ More In August 2020, as Texas was coming down from a large summer COVID-19 surge, forecasts suggested that Hurricane Laura was tracking towards 6M residents along the East Texas coastline, threatening to spread COVID-19 across the state and cause pandemic resurgences. To assist local authorities facing the dual-threat, we integrated survey expectations of coastal residents and observed hurricane evacuation rates in a statistical framework that combined with local pandemic conditions predicts how COVID-19 would spread in response to a hurricane. For Hurricane Laura, we estimate that 499,500 [90% Credible Interval (CI): 347,500, 624,000] people evacuated the Texan counties, that no single county accumulated more than 2.5% of hurricane evacuees, and that there were 2,900 [90% CI: 1,700, 5,800] exportations of Covid-19 across the state. In general, reception estimates were concentrated in regions with higher population densities. Nonetheless, higher importation risk is expected in small Districts, with a maximum number of importations of 10 per 10,000 residents in our case study. Overall, we present a flexible and transferable framework that captures spatial heterogeneity and incorporates geographic components for predicting population movement in the wake of a natural disaster. As hurricanes continue to increase in both frequency and strength, our framework can be deployed in response to anticipated hurricane paths to guide disaster preparedness and planning. △ Less

Submitted 28 February, 2022; originally announced March 2022.

Comments: 13 pages, 6 figures

Journal ref: iGISc 2021: Advances in Geospatial Data Science pp 163-175

arXiv:2105.13345 [pdf, other]

Adversarial Intrinsic Motivation for Reinforcement Learning

Authors: Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone

Abstract: Learning with an objective to minimize the mismatch with a reference distribution has been shown to be useful for generative modeling and imitation learning. In this paper, we investigate whether one such objective, the Wasserstein-1 distance between a policy's state visitation distribution and a target distribution, can be utilized effectively for reinforcement learning (RL) tasks. Specifically,… ▽ More Learning with an objective to minimize the mismatch with a reference distribution has been shown to be useful for generative modeling and imitation learning. In this paper, we investigate whether one such objective, the Wasserstein-1 distance between a policy's state visitation distribution and a target distribution, can be utilized effectively for reinforcement learning (RL) tasks. Specifically, this paper focuses on goal-conditioned reinforcement learning where the idealized (unachievable) target distribution has full measure at the goal. This paper introduces a quasimetric specific to Markov Decision Processes (MDPs) and uses this quasimetric to estimate the above Wasserstein-1 distance. It further shows that the policy that minimizes this Wasserstein-1 distance is the policy that reaches the goal in as few steps as possible. Our approach, termed Adversarial Intrinsic Motivation (AIM), estimates this Wasserstein-1 distance through its dual objective and uses it to compute a supplemental reward function. Our experiments show that this reward function changes smoothly with respect to transitions in the MDP and directs the agent's exploration to find the goal efficiently. Additionally, we combine AIM with Hindsight Experience Replay (HER) and show that the resulting algorithm accelerates learning significantly on several simulated robotics tasks when compared to other rewards that encourage exploration or accelerate learning. △ Less

Submitted 28 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

arXiv:1911.08106 [pdf, other]

How Likely are Ride-share Drivers to Earn a Living Wage? Large-scale Spatio-temporal Density Smoothing with the Graph-fused Elastic Net

Authors: Mauricio Tec, Natalia Zuniga-Garcia, Randy B. Machemehl, James G. Scott

Abstract: Ride-sourcing or transportation network companies (TNCs) provide on-demand transportation service for compensation, connecting drivers of personal vehicles with passengers through smartphone applications. In this study, we consider the problem of estimating a spatiotemporally varying probability distribution for the productivity of a TNC driver, using data on more than 1.2 million TNC trips in Aus… ▽ More Ride-sourcing or transportation network companies (TNCs) provide on-demand transportation service for compensation, connecting drivers of personal vehicles with passengers through smartphone applications. In this study, we consider the problem of estimating a spatiotemporally varying probability distribution for the productivity of a TNC driver, using data on more than 1.2 million TNC trips in Austin, Texas. We propose a graph-based smoothing approach that allows for distinct spatial and temporal dynamics, including different degrees of smoothness, spatio-temporal interactions, and interpolation in regions with little or no data. For such a goal, we introduce the Graph-fused Elastic Net (GFEN) and use it in combination with a dyadic tree decomposition for density estimation. In addition, we present an optimization-driven approach for fast point estimates scalable to massive graphs. Bayesian inference and uncertainty quantification with MCMC are also illustrated. The main results demonstrate that the optimization strategy is an effective exploration tool for selecting adequate regularization schemes using Bayesian optimization of the cross-validation loss. Two key empirical findings made possible by our method include: 1) the probability that a TNC driver can expect to earn a living wage in Austin exhibits high variability in space and time, from as low as 25% to as high as 85%; and 2) some drivers suffer considerable "tail risk", with the bottom 10% of the earnings distribution falling below $10 per hour -- grossly below a living wage in Austin for a single adult -- for specific times and locations. All code and data for the paper are publicly available, as a Shiny app for visualizing the results and a software package in Julia for implementing the GFEN. △ Less

Submitted 9 July, 2021; v1 submitted 19 November, 2019; originally announced November 2019.

arXiv:1810.06738 [pdf, other]

Random clique covers for graphs with local density and global sparsity

Authors: Sinead A. Williamson, Mauricio Tec

Abstract: Large real-world graphs tend to be sparse, but they often contain many densely connected subgraphs and exhibit high clustering coefficients. While recent random graph models can capture this sparsity, they ignore the local density, or vice versa. We develop a Bayesian nonparametric graph model based on random edge clique covers, and show that this model can capture power law degree distribution, g… ▽ More Large real-world graphs tend to be sparse, but they often contain many densely connected subgraphs and exhibit high clustering coefficients. While recent random graph models can capture this sparsity, they ignore the local density, or vice versa. We develop a Bayesian nonparametric graph model based on random edge clique covers, and show that this model can capture power law degree distribution, global sparsity and non-vanishing local clustering coefficient. This distribution can be used directly as a prior on observed graphs, or as part of a hierarchical Bayesian model for inferring latent graph structures. △ Less

Submitted 17 July, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

Comments: Appears in UAI 2019. This version includes appendices

arXiv:1809.10329 [pdf, other]

Evaluation of Ride-Sourcing Search Frictions and Driver Productivity: A Spatial Denoising Approach

Authors: Natalia Zuniga-Garcia, Mauricio Tec, James G. Scott, Natalia Ruiz-Juri, Randy B. Machemehl

Abstract: This paper considers the problem of measuring spatial and temporal variation in driver productivity on ride-sourcing trips. This variation is especially important from a driver's perspective: if a platform's drivers experience systematic disparities in earnings because of variation in their riders' destinations, they may perceive the pricing model as inequitable. This perception can exacerbate sea… ▽ More This paper considers the problem of measuring spatial and temporal variation in driver productivity on ride-sourcing trips. This variation is especially important from a driver's perspective: if a platform's drivers experience systematic disparities in earnings because of variation in their riders' destinations, they may perceive the pricing model as inequitable. This perception can exacerbate search frictions if it leads drivers to avoid locations where they believe they may be assigned "unlucky" fares. To characterize any such systematic disparities in productivity, we develop an analytic framework with three key components. First, we propose a productivity metric that looks two consecutive trips ahead, thus capturing the effect on expected earnings of market conditions at drivers' drop-off locations. Second, we develop a natural experiment by analyzing trips with a common origin but varying destinations, thus isolating purely spatial effects on productivity. Third, we apply a spatial denoising method that allows us to work with raw spatial information exhibiting high levels of noise and sparsity, without having to aggregate data into large, low-resolution spatial zones. By applying our framework to data on more than 1.4 million rides in Austin, Texas, we find significant spatial variation in ride-sourcing driver productivity and search frictions. Drivers at the same location experienced disparities in productivity after being dispatched on trips with different destinations, with origin-based surge pricing increasing these earnings disparities. Our results show that trip distance is the dominant factor in driver productivity: short trips yielded lower productivity, even when ending in areas with high demand. These findings suggest that new pricing strategies are required to minimize random disparities in driver earnings. △ Less

Submitted 11 October, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

Comments: 34 pages

Showing 1–13 of 13 results for author: Tec, M