-
Lightweight Knowledge Representations for Automating Data Analysis
Authors:
Marko Sterbentz,
Cameron Barrie,
Donna Hooshmand,
Shubham Shahi,
Abhratanu Dutta,
Harper Pack,
Andong Li Zhao,
Andrew Paley,
Alexander Einarsson,
Kristian Hammond
Abstract:
The principal goal of data science is to derive meaningful information from data. To do this, data scientists develop a space of analytic possibilities and from it reach their information goals by using their knowledge of the domain, the available data, the operations that can be performed on those data, the algorithms/models that are fed the data, and how all of these facets interweave. In this work, we take the first steps towards automating a key aspect of the data science pipeline: data analysis. We present an extensible taxonomy of data analytic operations that scopes across domains and data, as well as a method for codifying domain-specific knowledge that links this analytics taxonomy to actual data. We validate the functionality of our analytics taxonomy by implementing a system that leverages it, alongside domain labelings for 8 distinct domains, to automatically generate a space of answerable questions and associated analytic plans. In this way, we produce information spaces over data that enable complex analyses and search over this data and pave the way for fully automated data analysis.
Submitted 15 October, 2023;
originally announced November 2023.
-
"Explanation" is Not a Technical Term: The Problem of Ambiguity in XAI
Authors:
Leilani H. Gilpin,
Andrew R. Paley,
Mohammed A. Alam,
Sarah Spurlock,
Kristian J. Hammond
Abstract:
There is broad agreement that Artificial Intelligence (AI) systems, particularly those using Machine Learning (ML), should be able to "explain" their behavior. Unfortunately, there is little agreement as to what constitutes an "explanation." This has caused a disconnect between the explanations that systems produce in service of explainable Artificial Intelligence (XAI) and those explanations that users and other audiences actually need, which should be defined by the full spectrum of functional roles, audiences, and capabilities for explanation. In this paper, we explore the features of explanations and how to use those features in evaluating their utility. We focus on the requirements for explanations defined by their functional role, the knowledge states of users who are trying to understand them, and the availability of the information needed to generate them. Further, we discuss the risk of XAI enabling trust in systems without establishing their trustworthiness and define a critical next step for the field of XAI to establish metrics to guide and ground the utility of system-generated explanations.
Submitted 27 June, 2022;
originally announced July 2022.
-
Requirements for Open Political Information: Transparency Beyond Open Data
Authors:
Andong Luis Li Zhao,
Andrew Paley,
Rachel Adler,
Harper Pack,
Sergio Servantez,
Alexander Einarsson,
Cameron Barrie,
Marko Sterbentz,
Kristian Hammond
Abstract:
A politically informed citizenry is imperative for a well-developed democracy. While the US government has pursued policies for open data, these efforts have been insufficient in achieving an open government because only people with technical and domain knowledge can access information in the data. In this work, we conduct user interviews to identify wants and needs among stakeholders. We further use this information to sketch out the foundational requirements for a functional political information technical system.
Submitted 6 December, 2021;
originally announced December 2021.
-
Optimal guidance and estimation of a 2D diffusion-advection process by a team of mobile sensors
Authors:
Sheng Cheng,
Derek A. Paley
Abstract:
This paper describes an optimization framework to design guidance for a possibly heterogeneous team of multiple mobile sensors to estimate a spatiotemporal process modeled by a 2D diffusion-advection process. Owing to the abstract linear system representation of the process, we apply the Kalman-Bucy filter for estimation, where the sensors provide linear outputs. We propose an optimization problem that minimizes the sum of the trace of the covariance operator of the Kalman-Bucy filter and a generic mobility cost of the mobile sensors, subject to the sensors' motion modeled by linear dynamics. We establish the existence of a solution to this problem. Moreover, we prove that the approximate optimal solution converges to the exact optimal solution. That is, when evaluating these two solutions using the original cost function, the difference becomes arbitrarily small as the approximation gets finer. To compute the approximate solution, we use Pontryagin's minimum principle after approximating the infinite-dimensional terms originating from the diffusion-advection process. The approximate solution is applied in simulation to analyze how a single mobile sensor's performance depends on two important parameters: sensor noise variance and mobility penalty. We also illustrate the application of the framework to multiple sensors, in particular the performance of a heterogeneous team of sensors.
Submitted 2 November, 2021; v1 submitted 1 November, 2021;
originally announced November 2021.
-
Optimal control of a 2D diffusion-advection process with a team of mobile actuators under jointly optimal guidance
Authors:
Sheng Cheng,
Derek A. Paley
Abstract:
This paper describes an optimization framework to control a distributed parameter system (DPS) using a team of mobile actuators. The framework simultaneously seeks optimal control of the DPS and optimal guidance of the mobile actuators such that a cost function associated with both the DPS and the mobile actuators is minimized subject to the dynamics of each. The cost incurred from controlling the DPS is linear-quadratic, which is transformed into an equivalent form as a quadratic term associated with an operator-valued Riccati equation. This equivalent form reduces the problem to seeking guidance only, because the optimal control can be recovered once the optimal guidance is obtained. We establish conditions for the existence of a solution to the proposed problem. Since computing an optimal solution requires approximation, we also establish conditions under which the approximate optimal solution converges to the exact optimal solution. That is, when evaluating these two solutions by the original cost function, the difference becomes arbitrarily small as the approximation gets finer. Two numerical examples demonstrate the performance of the optimal control and guidance obtained from the proposed approach.
Submitted 18 June, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.