-
Planetary Causal Inference: Implications for the Geography of Poverty
Authors:
Kazuki Sakamoto,
Connor T. Jerzak,
Adel Daoud
Abstract:
Earth observation data such as satellite imagery can, when combined with machine learning, can have far-reaching impacts on our understanding of the geography of poverty through the prediction of living conditions, especially where government-derived economic indicators are either unavailable or potentially untrustworthy. Recent work has progressed in using Earth Observation (EO) data not only to…
▽ More
Earth observation data such as satellite imagery can, when combined with machine learning, can have far-reaching impacts on our understanding of the geography of poverty through the prediction of living conditions, especially where government-derived economic indicators are either unavailable or potentially untrustworthy. Recent work has progressed in using Earth Observation (EO) data not only to predict spatial economic outcomes but also to explore cause and effect, an understanding which is critical for downstream policy analysis. In this review, we first document the growth of interest in using satellite images together with EO data in causal analysis. We then trace the relationship between spatial statistics and machine learning methods before discussing four ways in which EO data has been used in causal machine learning pipelines -- (1.) poverty outcome imputation for downstream causal analysis, (2.) EO image deconfounding, (3.) EO-based treatment effect heterogeneity, and (4.) EO-based transportability analysis. We conclude by providing a step-by-step workflow for how researchers can incorporate EO data in causal ML analysis going forward, outlining major choices of data, models, and evaluation metrics.
△ Less
Submitted 5 July, 2024; v1 submitted 30 May, 2024;
originally announced June 2024.
-
Can Large Language Models (or Humans) Disentangle Text?
Authors:
Nicolas Audinet de Pieuchon,
Adel Daoud,
Connor Thomas Jerzak,
Moa Johansson,
Richard Johansson
Abstract:
We investigate the potential of large language models (LLMs) to disentangle text variables--to remove the textual traces of an undesired forbidden variable in a task sometimes known as text distillation and closely related to the fairness in AI and causal inference literature. We employ a range of various LLM approaches in an attempt to disentangle text by identifying and removing information abou…
▽ More
We investigate the potential of large language models (LLMs) to disentangle text variables--to remove the textual traces of an undesired forbidden variable in a task sometimes known as text distillation and closely related to the fairness in AI and causal inference literature. We employ a range of various LLM approaches in an attempt to disentangle text by identifying and removing information about a target variable while preserving other relevant signals. We show that in the strong test of removing sentiment, the statistical association between the processed text and sentiment is still detectable to machine learning classifiers post-LLM-disentanglement. Furthermore, we find that human annotators also struggle to disentangle sentiment while preserving other semantic content. This suggests there may be limited separability between concept variables in some text contexts, highlighting limitations of methods relying on text-level transformations and also raising questions about the robustness of disentanglement methods that achieve statistical independence in representation space.
△ Less
Submitted 3 May, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Application of 2D Homography for High Resolution Traffic Data Collection using CCTV Cameras
Authors:
Linlin Zhang,
Xiang Yu,
Abdulateef Daud,
Abdul Rashid Mussah,
Yaw Adu-Gyamfi
Abstract:
Traffic cameras remain the primary source data for surveillance activities such as congestion and incident monitoring. To date, State agencies continue to rely on manual effort to extract data from networked cameras due to limitations of the current automatic vision systems including requirements for complex camera calibration and inability to generate high resolution data. This study implements a…
▽ More
Traffic cameras remain the primary source data for surveillance activities such as congestion and incident monitoring. To date, State agencies continue to rely on manual effort to extract data from networked cameras due to limitations of the current automatic vision systems including requirements for complex camera calibration and inability to generate high resolution data. This study implements a three-stage video analytics framework for extracting high-resolution traffic data such vehicle counts, speed, and acceleration from infrastructure-mounted CCTV cameras. The key components of the framework include object recognition, perspective transformation, and vehicle trajectory reconstruction for traffic data collection. First, a state-of-the-art vehicle recognition model is implemented to detect and classify vehicles. Next, to correct for camera distortion and reduce partial occlusion, an algorithm inspired by two-point linear perspective is utilized to extracts the region of interest (ROI) automatically, while a 2D homography technique transforms the CCTV view to bird's-eye view (BEV). Cameras are calibrated with a two-layer matrix system to enable the extraction of speed and acceleration by converting image coordinates to real-world measurements. Individual vehicle trajectories are constructed and compared in BEV using two time-space-feature-based object trackers, namely Motpy and BYTETrack. The results of the current study showed about +/- 4.5% error rate for directional traffic counts, less than 10% MSE for speed bias between camera estimates in comparison to estimates from probe data sources. Extracting high-resolution data from traffic cameras has several implications, ranging from improvements in traffic management and identify dangerous driving behavior, high-risk areas for accidents, and other safety concerns, enabling proactive measures to reduce accidents and fatalities.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
Deep Learning With DAGs
Authors:
Sourabh Balgi,
Adel Daoud,
Jose M. Peña,
Geoffrey T. Wodtke,
Jesse Zhou
Abstract:
Social science theories often postulate causal relationships among a set of variables or events. Although directed acyclic graphs (DAGs) are increasingly used to represent these theories, their full potential has not yet been realized in practice. As non-parametric causal models, DAGs require no assumptions about the functional form of the hypothesized relationships. Nevertheless, to simplify the…
▽ More
Social science theories often postulate causal relationships among a set of variables or events. Although directed acyclic graphs (DAGs) are increasingly used to represent these theories, their full potential has not yet been realized in practice. As non-parametric causal models, DAGs require no assumptions about the functional form of the hypothesized relationships. Nevertheless, to simplify the task of empirical evaluation, researchers tend to invoke such assumptions anyway, even though they are typically arbitrary and do not reflect any theoretical content or prior knowledge. Moreover, functional form assumptions can engender bias, whenever they fail to accurately capture the complexity of the causal system under investigation. In this article, we introduce causal-graphical normalizing flows (cGNFs), a novel approach to causal inference that leverages deep neural networks to empirically evaluate theories represented as DAGs. Unlike conventional approaches, cGNFs model the full joint distribution of the data according to a DAG supplied by the analyst, without relying on stringent assumptions about functional form. In this way, the method allows for flexible, semi-parametric estimation of any causal estimand that can be identified from the DAG, including total effects, conditional effects, direct and indirect effects, and path-specific effects. We illustrate the method with a reanalysis of Blau and Duncan's (1967) model of status attainment and Zhou's (2019) model of conditional versus controlled mobility. To facilitate adoption, we provide open-source software together with a series of online tutorials for implementing cGNFs. The article concludes with a discussion of current limitations and directions for future development.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
Image2PCI -- A Multitask Learning Framework for Estimating Pavement Condition Indices Directly from Images
Authors:
Neema Jakisa Owor,
Hang Du,
Abdulateef Daud,
Armstrong Aboah,
Yaw Adu-Gyamfi
Abstract:
The Pavement Condition Index (PCI) is a widely used metric for evaluating pavement performance based on the type, extent and severity of distresses detected on a pavement surface. In recent times, significant progress has been made in utilizing deep-learning approaches to automate PCI estimation process. However, the current approaches rely on at least two separate models to estimate PCI values --…
▽ More
The Pavement Condition Index (PCI) is a widely used metric for evaluating pavement performance based on the type, extent and severity of distresses detected on a pavement surface. In recent times, significant progress has been made in utilizing deep-learning approaches to automate PCI estimation process. However, the current approaches rely on at least two separate models to estimate PCI values -- one model dedicated to determining the type and extent and another for estimating their severity. This approach presents several challenges, including complexities, high computational resource demands, and maintenance burdens that necessitate careful consideration and resolution. To overcome these challenges, the current study develops a unified multi-tasking model that predicts the PCI directly from a top-down pavement image. The proposed architecture is a multi-task model composed of one encoder for feature extraction and four decoders to handle specific tasks: two detection heads, one segmentation head and one PCI estimation head. By multitasking, we are able to extract features from the detection and segmentation heads for automatically estimating the PCI directly from the images. The model performs very well on our benchmarked and open pavement distress dataset that is annotated for multitask learning (the first of its kind). To our best knowledge, this is the first work that can estimate PCI directly from an image at real time speeds while maintaining excellent accuracy on all related tasks for crack detection and segmentation.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Edge Computing-Enabled Road Condition Monitoring: System Development and Evaluation
Authors:
Abdulateef Daud,
Mark Amo-Boateng,
Neema Jakisa Owor,
Armstrong Aboah,
Yaw Adu-Gyamfi
Abstract:
Real-time pavement condition monitoring provides highway agencies with timely and accurate information that could form the basis of pavement maintenance and rehabilitation policies. Existing technologies rely heavily on manual data processing, are expensive and therefore, difficult to scale for frequent, networklevel pavement condition monitoring. Additionally, these systems require sending large…
▽ More
Real-time pavement condition monitoring provides highway agencies with timely and accurate information that could form the basis of pavement maintenance and rehabilitation policies. Existing technologies rely heavily on manual data processing, are expensive and therefore, difficult to scale for frequent, networklevel pavement condition monitoring. Additionally, these systems require sending large packets of data to the cloud which requires large storage space, are computationally expensive to process, and results in high latency. The current study proposes a solution that capitalizes on the widespread availability of affordable Micro Electro-Mechanical System (MEMS) sensors, edge computing and internet connection capabilities of microcontrollers, and deployable machine learning (ML) models to (a) design an Internet of Things (IoT)-enabled device that can be mounted on axles of vehicles to stream live pavement condition data (b) reduce latency through on-device processing and analytics of pavement condition sensor data before sending to the cloud servers. In this study, three ML models including Random Forest, LightGBM and XGBoost were trained to predict International Roughness Index (IRI) at every 0.1-mile segment. XGBoost had the highest accuracy with an RMSE and MAPE of 16.89in/mi and 20.3%, respectively. In terms of the ability to classify the IRI of pavement segments based on ride quality according to MAP-21 criteria, our proposed device achieved an average accuracy of 96.76% on I-70EB and 63.15% on South Providence. Overall, our proposed device demonstrates significant potential in providing real-time pavement condition data to State Highway Agencies (SHA) and Department of Transportation (DOTs) with a satisfactory level of accuracy.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
CausalImages: An R Package for Causal Inference with Earth Observation, Bio-medical, and Social Science Images
Authors:
Connor T. Jerzak,
Adel Daoud
Abstract:
The causalimages R package enables causal inference with image and image sequence data, providing new tools for integrating novel data sources like satellite and bio-medical imagery into the study of cause and effect. One set of functions enables image-based causal inference analyses. For example, one key function decomposes treatment effect heterogeneity by images using an interpretable Bayesian…
▽ More
The causalimages R package enables causal inference with image and image sequence data, providing new tools for integrating novel data sources like satellite and bio-medical imagery into the study of cause and effect. One set of functions enables image-based causal inference analyses. For example, one key function decomposes treatment effect heterogeneity by images using an interpretable Bayesian framework. This allows for determining which types of images or image sequences are most responsive to interventions. A second modeling function allows researchers to control for confounding using images. The package also allows investigators to produce embeddings that serve as vector summaries of the image or video content. Finally, infrastructural functions are also provided, such as tools for writing large-scale image and image sequence data as sequentialized byte strings for more rapid image analysis. causalimages therefore opens new capabilities for causal inference in R, letting researchers use informative imagery in substantive analyses in a fast and accessible manner.
△ Less
Submitted 9 November, 2023; v1 submitted 29 September, 2023;
originally announced October 2023.
-
Towards Smart Education through the Internet of Things: A Review
Authors:
Afzal Badshah,
Anwar Ghani,
Ali Daud,
Ateeqa Jalal,
Muhammad Bilal,
Jon Crowcroft
Abstract:
IoT is a fundamental enabling technology for creating smart spaces, which can assist the effective face-to-face and online education systems. The transition to smart education (integrating IoT and AI into the education system) is appealing, which has a concrete impact on learners' engagement, motivation, attendance, and deep learning. Traditional education faces many challenges, including administ…
▽ More
IoT is a fundamental enabling technology for creating smart spaces, which can assist the effective face-to-face and online education systems. The transition to smart education (integrating IoT and AI into the education system) is appealing, which has a concrete impact on learners' engagement, motivation, attendance, and deep learning. Traditional education faces many challenges, including administration, pedagogy, assessment, and classroom supervision. Recent developments in ICT (e.g., IoT, AI and 5G, etc.) have yielded lots of smart solutions for various aspects of life; however, smart solutions are not well integrated into the education system. In particular, the COVID-19 pandemic situation had further emphasized the adoption of new smart solutions in education. This study reviews the related studies and addresses the (i) problems in the traditional education system with possible solutions, (ii) the transition towards smart education, and (iii) research challenges in the transition to smart education (i.e, computational and social resistance). Considering these studies, smart solutions (e.g., smart pedagogy, smart assessment, smart classroom, smart administration, etc.) are introduced to the problems of the traditional system. This exploratory study opens new trends for scholars and the market to integrate ICT, IoT, and AI into smart education.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Integrating Earth Observation Data into Causal Inference: Challenges and Opportunities
Authors:
Connor T. Jerzak,
Fredrik Johansson,
Adel Daoud
Abstract:
Observational studies require adjustment for confounding factors that are correlated with both the treatment and outcome. In the setting where the observed variables are tabular quantities such as average income in a neighborhood, tools have been developed for addressing such confounding. However, in many parts of the develo** world, features about local communities may be scarce. In this contex…
▽ More
Observational studies require adjustment for confounding factors that are correlated with both the treatment and outcome. In the setting where the observed variables are tabular quantities such as average income in a neighborhood, tools have been developed for addressing such confounding. However, in many parts of the develo** world, features about local communities may be scarce. In this context, satellite imagery can play an important role, serving as a proxy for the confounding variables otherwise unobserved. In this paper, we study confounder adjustment in this non-tabular setting, where patterns or objects found in satellite images contribute to the confounder bias. Using the evaluation of anti-poverty aid programs in Africa as our running example, we formalize the challenge of performing causal adjustment with such unstructured data -- what conditions are sufficient to identify causal effects, how to perform estimation, and how to quantify the ways in which certain aspects of the unstructured image object are most predictive of the treatment decision. Via simulation, we also explore the sensitivity of satellite image-based observational inference to image resolution and to misspecification of the image-associated confounder. Finally, we apply these tools in estimating the effect of anti-poverty interventions in African communities from satellite imagery.
△ Less
Submitted 30 January, 2023;
originally announced January 2023.
-
GC-GRU-N for Traffic Prediction using Loop Detector Data
Authors:
Maged Shoman,
Armstrong Aboah,
Abdulateef Daud,
Yaw Adu-Gyamfi
Abstract:
Because traffic characteristics display stochastic nonlinear spatiotemporal dependencies, traffic prediction is a challenging task. In this paper develop a graph convolution gated recurrent unit (GC GRU N) network to extract the essential Spatio temporal features. we use Seattle loop detector data aggregated over 15 minutes and reframe the problem through space and time. The model performance is c…
▽ More
Because traffic characteristics display stochastic nonlinear spatiotemporal dependencies, traffic prediction is a challenging task. In this paper develop a graph convolution gated recurrent unit (GC GRU N) network to extract the essential Spatio temporal features. we use Seattle loop detector data aggregated over 15 minutes and reframe the problem through space and time. The model performance is compared o benchmark models; Historical Average, Long Short Term Memory (LSTM), and Transformers. The proposed model ranked second with the fastest inference time and a very close performance to first place (Transformers). Our model also achieves a running time that is six times faster than transformers. Finally, we present a comparative study of our model and the available benchmarks using metrics such as training time, inference time, MAPE, MAE and RMSE. Spatial and temporal aspects are also analyzed for each of the trained models.
△ Less
Submitted 13 November, 2022;
originally announced November 2022.
-
$ρ$-GNF : A Novel Sensitivity Analysis Approach Under Unobserved Confounders
Authors:
Sourabh Balgi,
Jose M. Peña,
Adel Daoud
Abstract:
We propose a new sensitivity analysis model that combines copulas and normalizing flows for causal inference under unobserved confounding. We refer to the new model as $ρ$-GNF ($ρ$-Graphical Normalizing Flow), where $ρ{\in}[-1,+1]$ is a bounded sensitivity parameter representing the backdoor non-causal association due to unobserved confounding modeled using the most well studied and widely popular…
▽ More
We propose a new sensitivity analysis model that combines copulas and normalizing flows for causal inference under unobserved confounding. We refer to the new model as $ρ$-GNF ($ρ$-Graphical Normalizing Flow), where $ρ{\in}[-1,+1]$ is a bounded sensitivity parameter representing the backdoor non-causal association due to unobserved confounding modeled using the most well studied and widely popular Gaussian copula. Specifically, $ρ$-GNF enables us to estimate and analyse the frontdoor causal effect or average causal effect (ACE) as a function of $ρ$. We call this the $ρ_{curve}$. The $ρ_{curve}$ enables us to specify the confounding strength required to nullify the ACE. We call this the $ρ_{value}$. Further, the $ρ_{curve}$ also enables us to provide bounds for the ACE given an interval of $ρ$ values. We illustrate the benefits of $ρ$-GNF with experiments on simulated and real-world data in terms of our empirical ACE bounds being narrower than other popular ACE bounds.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Image-based Treatment Effect Heterogeneity
Authors:
Connor T. Jerzak,
Fredrik Johansson,
Adel Daoud
Abstract:
Randomized controlled trials (RCTs) are considered the gold standard for estimating the average treatment effect (ATE) of interventions. One use of RCTs is to study the causes of global poverty -- a subject explicitly cited in the 2019 Nobel Memorial Prize awarded to Duflo, Banerjee, and Kremer "for their experimental approach to alleviating global poverty." Because the ATE is a population summary…
▽ More
Randomized controlled trials (RCTs) are considered the gold standard for estimating the average treatment effect (ATE) of interventions. One use of RCTs is to study the causes of global poverty -- a subject explicitly cited in the 2019 Nobel Memorial Prize awarded to Duflo, Banerjee, and Kremer "for their experimental approach to alleviating global poverty." Because the ATE is a population summary, anti-poverty experiments often seek to unpack the effect variation around the ATE by conditioning (CATE) on tabular variables such as age and ethnicity that were measured during the RCT data collection. Although such variables are key to unpacking CATE, using only such variables may fail to capture historical, geographical, or neighborhood-specific contributors to effect variation, as tabular RCT data are often only observed near the time of the experiment. In global poverty research, when the location of the experiment units is approximately known, satellite imagery can provide a window into such factors important for understanding heterogeneity. However, there is no method that specifically enables applied researchers to analyze CATE from images. In this paper, using a deep probabilistic modeling framework, we develop such a method that estimates latent clusters of images by identifying images with similar treatment effects distributions. Our interpretable image CATE model also includes a sensitivity factor that quantifies the importance of image segments contributing to the effect cluster prediction. We compare the proposed methods against alternatives in simulation; also, we show how the model works in an actual RCT, estimating the effects of an anti-poverty intervention in northern Uganda and obtaining a posterior predictive distribution over effects for the rest of the country where no experimental data was collected. We make all models available in open-source software.
△ Less
Submitted 25 May, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Estimating Causal Effects Under Image Confounding Bias with an Application to Poverty in Africa
Authors:
Connor T. Jerzak,
Fredrik Johansson,
Adel Daoud
Abstract:
Observational studies of causal effects require adjustment for confounding factors. In the tabular setting, where these factors are well-defined, separate random variables, the effect of confounding is well understood. However, in public policy, ecology, and in medicine, decisions are often made in non-tabular settings, informed by patterns or objects detected in images (e.g., maps, satellite or t…
▽ More
Observational studies of causal effects require adjustment for confounding factors. In the tabular setting, where these factors are well-defined, separate random variables, the effect of confounding is well understood. However, in public policy, ecology, and in medicine, decisions are often made in non-tabular settings, informed by patterns or objects detected in images (e.g., maps, satellite or tomography imagery). Using such imagery for causal inference presents an opportunity because objects in the image may be related to the treatment and outcome of interest. In these cases, we rely on the images to adjust for confounding but observed data do not directly label the existence of the important objects. Motivated by real-world applications, we formalize this challenge, how it can be handled, and what conditions are sufficient to identify and estimate causal effects. We analyze finite-sample performance using simulation experiments, estimating effects using a propensity adjustment algorithm that employs a machine learning model to estimate the image confounding. Our experiments also examine sensitivity to misspecification of the image pattern mechanism. Finally, we use our methodology to estimate the effects of policy interventions on poverty in African communities from satellite imagery.
△ Less
Submitted 15 February, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Conceptualizing Treatment Leakage in Text-based Causal Inference
Authors:
Adel Daoud,
Connor T. Jerzak,
Richard Johansson
Abstract:
Causal inference methods that control for text-based confounders are becoming increasingly important in the social sciences and other disciplines where text is readily available. However, these methods rely on a critical assumption that there is no treatment leakage: that is, the text only contains information about the confounder and no information about treatment assignment. When this assumption…
▽ More
Causal inference methods that control for text-based confounders are becoming increasingly important in the social sciences and other disciplines where text is readily available. However, these methods rely on a critical assumption that there is no treatment leakage: that is, the text only contains information about the confounder and no information about treatment assignment. When this assumption does not hold, methods that control for text to adjust for confounders face the problem of post-treatment (collider) bias. However, the assumption that there is no treatment leakage may be unrealistic in real-world situations involving text, as human language is rich and flexible. Language appearing in a public policy document or health records may refer to the future and the past simultaneously, and thereby reveal information about the treatment assignment.
In this article, we define the treatment-leakage problem, and discuss the identification as well as the estimation challenges it raises. Second, we delineate the conditions under which leakage can be addressed by removing the treatment-related signal from the text in a pre-processing step we define as text distillation. Lastly, using simulation, we show how treatment leakage introduces a bias in estimates of the average treatment effect (ATE) and how text distillation can mitigate this bias.
△ Less
Submitted 1 May, 2022;
originally announced May 2022.
-
A Region-Based Deep Learning Approach to Automated Retail Checkout
Authors:
Maged Shoman,
Armstrong Aboah,
Alex Morehead,
Ye Duan,
Abdulateef Daud,
Yaw Adu-Gyamfi
Abstract:
Automating the product checkout process at conventional retail stores is a task poised to have large impacts on society generally speaking. Towards this end, reliable deep learning models that enable automated product counting for fast customer checkout can make this goal a reality. In this work, we propose a novel, region-based deep learning approach to automate product counting using a customize…
▽ More
Automating the product checkout process at conventional retail stores is a task poised to have large impacts on society generally speaking. Towards this end, reliable deep learning models that enable automated product counting for fast customer checkout can make this goal a reality. In this work, we propose a novel, region-based deep learning approach to automate product counting using a customized YOLOv5 object detection pipeline and the DeepSORT algorithm. Our results on challenging, real-world test videos demonstrate that our method can generalize its predictions to a sufficient level of accuracy and with a fast enough runtime to warrant deployment to real-world commercial settings. Our proposed method won 4th place in the 2022 AI City Challenge, Track 4, with an F1 score of 0.4400 on experimental validation data.
△ Less
Submitted 18 April, 2022;
originally announced April 2022.
-
Improving VANET's Performance by Incorporated Fog-Cloud Layer (FCL)
Authors:
Ghassan Samara,
Mohammed Rasmi,
Nael A Sweerky,
Essam Al Daoud,
Amer Abu Salem
Abstract:
Because of its usefulness in various fields including as safety applications, traffic control applications, and entertainment applications, VANET is an essential topic that is now being investigated intensively. VANET confronts numerous challenges in terms of reaction time, storage capacity, and reliability, particularly in real-time applications. As a result, merging cloud computing and cloud com…
▽ More
Because of its usefulness in various fields including as safety applications, traffic control applications, and entertainment applications, VANET is an essential topic that is now being investigated intensively. VANET confronts numerous challenges in terms of reaction time, storage capacity, and reliability, particularly in real-time applications. As a result, merging cloud computing and cloud computing has recently been researched. The goal of this study is to develop a system that merges the fog and cloud layers into a single layer known as the included fog-cloud layer. To lower the time it takes for real-time applications on VANETs to respond while also improving data flow management over the Internet and achieving an efficient perception service while avoiding the high cost of cloud connectivity.
△ Less
Submitted 30 March, 2022;
originally announced April 2022.
-
Counterfactual Analysis of the Impact of the IMF Program on Child Poverty in the Global-South Region using Causal-Graphical Normalizing Flows
Authors:
Sourabh Balgi,
Jose M. Peña,
Adel Daoud
Abstract:
This work demonstrates the application of a particular branch of causal inference and deep learning models: \emph{causal-Graphical Normalizing Flows (c-GNFs)}. In a recent contribution, scholars showed that normalizing flows carry certain properties, making them particularly suitable for causal and counterfactual analysis. However, c-GNFs have only been tested in a simulated data setting and no co…
▽ More
This work demonstrates the application of a particular branch of causal inference and deep learning models: \emph{causal-Graphical Normalizing Flows (c-GNFs)}. In a recent contribution, scholars showed that normalizing flows carry certain properties, making them particularly suitable for causal and counterfactual analysis. However, c-GNFs have only been tested in a simulated data setting and no contribution to date have evaluated the application of c-GNFs on large-scale real-world data. Focusing on the \emph{AI for social good}, our study provides a counterfactual analysis of the impact of the International Monetary Fund (IMF) program on child poverty using c-GNFs. The analysis relies on a large-scale real-world observational data: 1,941,734 children under the age of 18, cared for by 567,344 families residing in the 67 countries from the Global-South. While the primary objective of the IMF is to support governments in achieving economic stability, our results find that an IMF program reduces child poverty as a positive side-effect by about 1.2$\pm$0.24 degree (`0' equals no poverty and `7' is maximum poverty). Thus, our article shows how c-GNFs further the use of deep learning and causal inference in AI for social good. It shows how learning algorithms can be used for addressing the untapped potential for a significant social impact through counterfactual inference at population level (ACE), sub-population level (CACE), and individual level (ICE). In contrast to most works that model ACE or CACE but not ICE, c-GNFs enable personalization using \emph{`The First Law of Causal Inference'}.
△ Less
Submitted 17 February, 2022;
originally announced February 2022.
-
Personalized Public Policy Analysis in Social Sciences using Causal-Graphical Normalizing Flows
Authors:
Sourabh Balgi,
Jose M. Pena,
Adel Daoud
Abstract:
Structural Equation/Causal Models (SEMs/SCMs) are widely used in epidemiology and social sciences to identify and analyze the average causal effect (ACE) and conditional ACE (CACE). Traditional causal effect estimation methods such as Inverse Probability Weighting (IPW) and more recently Regression-With-Residuals (RWR) are widely used - as they avoid the challenging task of identifying the SCM par…
▽ More
Structural Equation/Causal Models (SEMs/SCMs) are widely used in epidemiology and social sciences to identify and analyze the average causal effect (ACE) and conditional ACE (CACE). Traditional causal effect estimation methods such as Inverse Probability Weighting (IPW) and more recently Regression-With-Residuals (RWR) are widely used - as they avoid the challenging task of identifying the SCM parameters - to estimate ACE and CACE. However, much work remains before traditional estimation methods can be used for counterfactual inference, and for the benefit of Personalized Public Policy Analysis (P$^3$A) in the social sciences. While doctors rely on personalized medicine to tailor treatments to patients in laboratory settings (relatively closed systems), P$^3$A draws inspiration from such tailoring but adapts it for open social systems. In this article, we develop a method for counterfactual inference that we name causal-Graphical Normalizing Flow (c-GNF), facilitating P$^3$A. First, we show how c-GNF captures the underlying SCM without making any assumption about functional forms. Second, we propose a novel dequantization trick to deal with discrete variables, which is a limitation of normalizing flows in general. Third, we demonstrate in experiments that c-GNF performs on-par with IPW and RWR in terms of bias and variance for estimating the ATE, when the true functional forms are known, and better when they are unknown. Fourth and most importantly, we conduct counterfactual inference with c-GNFs, demonstrating promising empirical performance. Because IPW and RWR, like other traditional methods, lack the capability of counterfactual inference, c-GNFs will likely play a major role in tailoring personalized treatment, facilitating P$^3$A, optimizing social interventions - in contrast to the current `one-size-fits-all' approach of existing methods.
△ Less
Submitted 30 April, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Measuring poverty in India with machine learning and remote sensing
Authors:
Adel Daoud,
Felipe Jordan,
Makkunda Sharma,
Fredrik Johansson,
Devdatt Dubhashi,
Sourabh Paul,
Subhashis Banerjee
Abstract:
In this paper, we use deep learning to estimate living conditions in India. We use both census and surveys to train the models. Our procedure achieves comparable results to those found in the literature, but for a wide range of outcomes.
In this paper, we use deep learning to estimate living conditions in India. We use both census and surveys to train the models. Our procedure achieves comparable results to those found in the literature, but for a wide range of outcomes.
△ Less
Submitted 27 October, 2022; v1 submitted 27 December, 2021;
originally announced February 2022.
-
Statistical modeling: the three cultures
Authors:
Adel Daoud,
Devdatt Dubhashi
Abstract:
Two decades ago, Leo Breiman identified two cultures for statistical modeling. The data modeling culture (DMC) refers to practices aiming to conduct statistical inference on one or several quantities of interest. The algorithmic modeling culture (AMC) refers to practices defining a machine-learning (ML) procedure that generates accurate predictions about an event of interest. Breiman argued that s…
▽ More
Two decades ago, Leo Breiman identified two cultures for statistical modeling. The data modeling culture (DMC) refers to practices aiming to conduct statistical inference on one or several quantities of interest. The algorithmic modeling culture (AMC) refers to practices defining a machine-learning (ML) procedure that generates accurate predictions about an event of interest. Breiman argued that statisticians should give more attention to AMC than to DMC, because of the strengths of ML in adapting to data. While twenty years later, DMC has lost some of its dominant role in statistics because of the data-science revolution, we observe that this culture is still the leading practice in the natural and social sciences. DMC is the modus operandi because of the influence of the established scientific method, called the hypothetico-deductive scientific method. Despite the incompatibilities of AMC with this scientific method, among some research groups, AMC and DMC cultures mix intensely. We argue that this mixing has formed a fertile spawning pool for a mutated culture that we called the hybrid modeling culture (HMC) where prediction and inference have fused into new procedures where they reinforce one another. This article identifies key characteristics of HMC, thereby facilitating the scientific endeavor and fueling the evolution of statistical cultures towards better practices. By better, we mean increasingly reliable, valid, and efficient statistical practices in analyzing causal relationships. In combining inference and prediction, the result of HMC is that the distinction between prediction and inference, taken to its limit, melts away. We qualify our melting-away argument by describing three HMC practices, where each practice captures an aspect of the scientific cycle, namely, ML for causal inference, ML for data acquisition, and ML for theory prediction.
△ Less
Submitted 8 December, 2020;
originally announced December 2020.
-
Combining distributive ethics and causal Inference to make trade-offs between austerity and population health
Authors:
Adel Daoud,
Anders Herlitz,
SV Subramanian
Abstract:
The International Monetary Fund (IMF) provides financial assistance to its member-countries in economic turmoil, but requires at the same time that these countries reform their public policies. In several contexts, these reforms are at odds with population health. While researchers have empirically analyzed the consequences of these reforms on health, no analysis exist on identifying fair tradeoff…
▽ More
The International Monetary Fund (IMF) provides financial assistance to its member-countries in economic turmoil, but requires at the same time that these countries reform their public policies. In several contexts, these reforms are at odds with population health. While researchers have empirically analyzed the consequences of these reforms on health, no analysis exist on identifying fair tradeoffs between consequences on population health and economic outcomes. Our article analyzes and identifies the principles governing these tradeoffs. First, this article reviews existing policy-evaluation studies, which show, on balance, that IMF policies frequently cause adverse effects on child health and material standards in the pursuit of macroeconmic improvement. Second, this article discusses four theories in distributive ethics (maximization, egalitarianianism, prioritarianiasm, and sufficientarianism) to identify which is the most compatible with the core mission of the IMF, that is, improved macroeconomics (Articles of Agreement) while at the same time balancing consequences on health. Using a distributive-ethics analyses of IMF polices, we argue that sufficientarianism is the most compatible theory. Third, this article offer a qualitative rearticulation of the Articles of Agreement, and formalize sufficientarian principles in the language of causal inference. We also offer a framework on how to empirically measure, from observational data, the extent that IMF policies trade off fairly between population health and economic outcomes. We conclude with policy recommendations and suggestions for future research.
△ Less
Submitted 10 August, 2020; v1 submitted 30 July, 2020;
originally announced July 2020.
-
EER: Enterprise Expert Ranking using Employee Reputation
Authors:
Saba Mahmood,
Anwar Ghani,
Ali Daud,
Syed Muhammad Saqlain
Abstract:
The emergence of online enterprises spread across continents have given rise to the need for expert identification in this domain. Scenarios that includes the intention of the employer to find tacit expertise and knowledge of an employee that is not documented or self-disclosed has been addressed in this article. The existing reputation based approaches towards expertise ranking in enterprises uti…
▽ More
The emergence of online enterprises spread across continents have given rise to the need for expert identification in this domain. Scenarios that includes the intention of the employer to find tacit expertise and knowledge of an employee that is not documented or self-disclosed has been addressed in this article. The existing reputation based approaches towards expertise ranking in enterprises utilize PageRank, normal distribution, and hidden Markov model for expertise ranking. These models suffer issue of negative referral, collusion, reputation inflation, and dynamism. The authors have however proposed a Bayesian approach utilizing beta probability distribution based reputation model for employee ranking in enterprises. The experimental results reveal improved performance compared to previous techniques in terms of Precision and Mean Average Error (MAE) with almost 7% improvement in precision on average for the three data sets. The proposed technique is able to differentiate categories of interactions in a dynamic context. The results reveal that the technique is independent of the rating pattern and density of data.
△ Less
Submitted 29 April, 2020;
originally announced April 2020.
-
Revenue Maximization Approaches in IaaS Clouds: Research Challenges and Opportunities
Authors:
Afzal Badshah,
Anwar Ghani,
Ali Daud,
Anthony Theodore Chronopoulos,
Ateeqa Jalal
Abstract:
Revenue generation is the main concern of any business, particularly in the cloud, where there is no direct interaction between the provider and the consumer. Cloud computing is an emerging core for today's businesses, however, Its complications (e.g, installation, and migration) with traditional markets are the main challenges. It earns more but needs exemplary performance and marketing skills. I…
▽ More
Revenue generation is the main concern of any business, particularly in the cloud, where there is no direct interaction between the provider and the consumer. Cloud computing is an emerging core for today's businesses, however, Its complications (e.g, installation, and migration) with traditional markets are the main challenges. It earns more but needs exemplary performance and marketing skills. In recent years, cloud computing has become a successful paradigm for providing desktop services. It is expected that more than \$ 331 billion will be invested by 2023, likewise, 51 billion devices are expected to be connected to the cloud. Infrastructure as a Service (IaaS) provides physical resources (e.g, computing, memory, storage, and network) as VM instances. In this article, the main revenue factors are categorized as SLA and penalty management, resource scalability, customer satisfaction and management, resource utilization and provision, cost and price management, and advertising and auction. These parameters are investigated in detail and new dynamics for researchers in the field of the cloud are discovered.
△ Less
Submitted 24 April, 2020;
originally announced April 2020.
-
Issues and challenges in Cloud Storage Architecture: A Survey
Authors:
Anwar Ghani,
Afzal Badshah,
Saeedullah Jan,
Abdulrahman A. Alshdadi,
Ali Daud
Abstract:
From home appliances to industrial enterprises, the Information and Communication Technology (ICT) industry is revolutionizing the world. We are witnessing the emergence of new technologies (e.g, Cloud computing, Fog computing, Internet of Things (IoT), Artificial Intelligence (AI) and Block-chain) which proves the growing use of ICT (e,g. business, education, health, and home appliances), resulti…
▽ More
From home appliances to industrial enterprises, the Information and Communication Technology (ICT) industry is revolutionizing the world. We are witnessing the emergence of new technologies (e.g, Cloud computing, Fog computing, Internet of Things (IoT), Artificial Intelligence (AI) and Block-chain) which proves the growing use of ICT (e,g. business, education, health, and home appliances), resulting in massive data generation. It is expected that more than 175 ZB data will be processed annually by 75 billion devices by 2025. The 5G technology (i.e. mobile communication technology) dramatically increases network speed, enabling users to upload ultra high definition videos in real-time, which will generate a massive stream of big data. Furthermore, smart devices, having artificial intelligence, will act like a human being (e.g, a self-driving vehicle, etc) on the network, will also generate big data. This sudden shift and massive data generation created serious challenges in storing and managing heterogeneous data at such a large scale. This article presents a state-of-the-art review of the issues and challenges involved in storing heterogeneous big data, their countermeasures (i.e, from security and management perspectives), and future opportunities of cloud storage. These challenges are reviewed in detail and new dynamics for researchers in the field of cloud storage are discovered.
△ Less
Submitted 12 July, 2020; v1 submitted 14 April, 2020;
originally announced April 2020.
-
Author Name Disambiguation in Bibliographic Databases: A Survey
Authors:
Muhammad Shoaib,
Ali Daud,
Tehmina Amjad
Abstract:
Entity resolution is a challenging and hot research area in the field of Information Systems since last decade. Author Name Disambiguation (AND) in Bibliographic Databases (BD) like DBLP , Citeseer , and Scopus is a specialized field of entity resolution. Given many citations of underlying authors, the AND task is to find which citations belong to the same author. In this survey, we start with thr…
▽ More
Entity resolution is a challenging and hot research area in the field of Information Systems since last decade. Author Name Disambiguation (AND) in Bibliographic Databases (BD) like DBLP , Citeseer , and Scopus is a specialized field of entity resolution. Given many citations of underlying authors, the AND task is to find which citations belong to the same author. In this survey, we start with three basic AND problems, followed by need for solution and challenges. A generic, five-step framework is provided for handling AND issues. These steps are; (1) Preparation of dataset (2) Selection of publication attributes (3) Selection of similarity metrics (4) Selection of models and (5) Clustering Performance evaluation. Categorization and elaboration of similarity metrics and methods are also provided. Finally, future directions and recommendations are given for this dynamic area of research.
△ Less
Submitted 14 April, 2020;
originally announced April 2020.
-
Semantic Web Environments for Multi-Agent Systems: Enabling agents to use Web of Things via semantic web
Authors:
Alaa Daoud
Abstract:
The Web is ubiquitous, increasingly populated with interconnected data, services, people, and objects. Semantic web technologies (SWT) promote uniformity of data formats, as well as modularization and reuse of specifications (e.g., ontologies), by allowing them to include and refer to information provided by other ontologies. In such a context, multi-agent system (MAS) technologies are the right a…
▽ More
The Web is ubiquitous, increasingly populated with interconnected data, services, people, and objects. Semantic web technologies (SWT) promote uniformity of data formats, as well as modularization and reuse of specifications (e.g., ontologies), by allowing them to include and refer to information provided by other ontologies. In such a context, multi-agent system (MAS) technologies are the right abstraction for develo** decentralized and open Web applications in which agents discover, reason and act on Web resources and cooperate with each other and with people. The aim of the project is to propose an approach to transform "Agent and artifact (A&A) meta-model" into a Web-readable format with ontologies in line with semantic web formats and to reuse already existing ontologies in order to provide uniform access for agents to things.
△ Less
Submitted 20 February, 2020;
originally announced March 2020.
-
Empirical Differential Privacy
Authors:
Paul Burchard,
Anthony Daoud,
Dominic Dotterrer
Abstract:
We show how to achieve differential privacy with no or reduced added noise, based on the empirical noise in the data itself. Unlike previous works on noiseless privacy, the empirical viewpoint avoids making any explicit assumptions about the random process generating the data.
We show how to achieve differential privacy with no or reduced added noise, based on the empirical noise in the data itself. Unlike previous works on noiseless privacy, the empirical viewpoint avoids making any explicit assumptions about the random process generating the data.
△ Less
Submitted 3 January, 2023; v1 submitted 28 October, 2019;
originally announced October 2019.
-
Accuracy Enhancement of Pickett Tunnelling Barrier Memristor Model
Authors:
Ahmad Daoud,
Ahmed Dessouki,
Sherif Abuelenin
Abstract:
Titanium dioxide (TiO2) memristors exhibit complex conduction mechanism. Several models of different complexity have been developed in order to mimic the experimental results for physical behaviors observed in memristor devices. Pickett's tunneling barrier model describes the TiO2 memristors, and utilizes complex derivative of tunnel barrier width. It attains a large error in the ON switching regi…
▽ More
Titanium dioxide (TiO2) memristors exhibit complex conduction mechanism. Several models of different complexity have been developed in order to mimic the experimental results for physical behaviors observed in memristor devices. Pickett's tunneling barrier model describes the TiO2 memristors, and utilizes complex derivative of tunnel barrier width. It attains a large error in the ON switching region. Variety of research consider it as the reference model for the TiO2 memristors. In this paper, we first analyze the theory of operation of the memristor and discuss Pickett's model. Then, we propose a modification to its derivative functions to provide a lower error and closer agreement with physical behavior. This modification is represented by two additional fitting parameters to damp or accelerate the tunnel width derivative. Also, we incorporate a hard limiter term to limit the tunnel width to its physical extremes 1 nm and 2 nm. We run simulations to test the model modifications and we compare the results to the experimental and original Pickett's model results. The modified model more closely resembles the experimental behavior of TiO2 memristors and potentially enables the memristor to be used as a multilevel memory.
△ Less
Submitted 25 February, 2015;
originally announced February 2015.
-
Roman Urdu Opinion Mining System (RUOMiS)
Authors:
Misbah Daud,
Rafiullah Khan,
Mohibullah,
Aitazaz Daud
Abstract:
Convincing a customer is always considered as a challenging task in every business. But when it comes to online business, this task becomes even more difficult. Online retailers try everything possible to gain the trust of the customer. One of the solutions is to provide an area for existing users to leave their comments. This service can effectively develop the trust of the customer however norma…
▽ More
Convincing a customer is always considered as a challenging task in every business. But when it comes to online business, this task becomes even more difficult. Online retailers try everything possible to gain the trust of the customer. One of the solutions is to provide an area for existing users to leave their comments. This service can effectively develop the trust of the customer however normally the customer comments about the product in their native language using Roman script. If there are hundreds of comments this makes difficulty even for the native customers to make a buying decision. This research proposes a system which extracts the comments posted in Roman Urdu, translate them, find their polarity and then gives us the rating of the product. This rating will help the native and non-native customers to make buying decision efficiently from the comments posted in Roman Urdu.
△ Less
Submitted 7 January, 2015;
originally announced January 2015.
-
An Iterated Game of Uncoordinated Sharing of Licensed Spectrum Using Zero-Determinant Strategies
Authors:
Ashraf Al Daoud,
George Kesidis,
Jörg Liebeherr
Abstract:
We consider private commons for secondary sharing of licensed spectrum bands with no access coordination provided by the primary license holder. In such environments, heterogeneity in demand patterns of the secondary users can lead to constant changes in the interference levels, and thus can be a source of volatility to the utilities of the users. In this paper, we consider secondary users to be s…
▽ More
We consider private commons for secondary sharing of licensed spectrum bands with no access coordination provided by the primary license holder. In such environments, heterogeneity in demand patterns of the secondary users can lead to constant changes in the interference levels, and thus can be a source of volatility to the utilities of the users. In this paper, we consider secondary users to be service providers that provide downlink services. We formulate the spectrum sharing problem as a non-cooperative iterated game of power control where service providers change their power levels to fix their long-term average rates at utility-maximizing values. First, we show that in any iterated 2x2 game, the structure of the single-stage game dictates the degree of control that a service provider can exert on the long-term outcome of the game. Then we show that if service providers use binary actions either to access or not to access the channel at any round of the game, then the long-term rate can be fixed regardless of the strategy of the opponent. We identify these rates and show that they can be achieved using mixed Markovian strategies that will be clearly identified in the paper.
△ Less
Submitted 14 January, 2014;
originally announced January 2014.