Search | arXiv e-print repository

Lightweight Knowledge Representations for Automating Data Analysis

Authors: Marko Sterbentz, Cameron Barrie, Donna Hooshmand, Shubham Shahi, Abhratanu Dutta, Harper Pack, Andong Li Zhao, Andrew Paley, Alexander Einarsson, Kristian Hammond

Abstract: The principal goal of data science is to derive meaningful information from data. To do this, data scientists develop a space of analytic possibilities and from it reach their information goals by using their knowledge of the domain, the available data, the operations that can be performed on those data, the algorithms/models that are fed the data, and how all of these facets interweave. In this w… ▽ More The principal goal of data science is to derive meaningful information from data. To do this, data scientists develop a space of analytic possibilities and from it reach their information goals by using their knowledge of the domain, the available data, the operations that can be performed on those data, the algorithms/models that are fed the data, and how all of these facets interweave. In this work, we take the first steps towards automating a key aspect of the data science pipeline: data analysis. We present an extensible taxonomy of data analytic operations that scopes across domains and data, as well as a method for codifying domain-specific knowledge that links this analytics taxonomy to actual data. We validate the functionality of our analytics taxonomy by implementing a system that leverages it, alongside domain labelings for 8 distinct domains, to automatically generate a space of answerable questions and associated analytic plans. In this way, we produce information spaces over data that enable complex analyses and search over this data and pave the way for fully automated data analysis. △ Less

Submitted 15 October, 2023; originally announced November 2023.

arXiv:2207.00007 [pdf, other]

"Explanation" is Not a Technical Term: The Problem of Ambiguity in XAI

Authors: Leilani H. Gilpin, Andrew R. Paley, Mohammed A. Alam, Sarah Spurlock, Kristian J. Hammond

Abstract: There is broad agreement that Artificial Intelligence (AI) systems, particularly those using Machine Learning (ML), should be able to "explain" their behavior. Unfortunately, there is little agreement as to what constitutes an "explanation." This has caused a disconnect between the explanations that systems produce in service of explainable Artificial Intelligence (XAI) and those explanations that… ▽ More There is broad agreement that Artificial Intelligence (AI) systems, particularly those using Machine Learning (ML), should be able to "explain" their behavior. Unfortunately, there is little agreement as to what constitutes an "explanation." This has caused a disconnect between the explanations that systems produce in service of explainable Artificial Intelligence (XAI) and those explanations that users and other audiences actually need, which should be defined by the full spectrum of functional roles, audiences, and capabilities for explanation. In this paper, we explore the features of explanations and how to use those features in evaluating their utility. We focus on the requirements for explanations defined by their functional role, the knowledge states of users who are trying to understand them, and the availability of the information needed to generate them. Further, we discuss the risk of XAI enabling trust in systems without establishing their trustworthiness and define a critical next step for the field of XAI to establish metrics to guide and ground the utility of system-generated explanations. △ Less

Submitted 27 June, 2022; originally announced July 2022.

arXiv:2112.03119 [pdf, ps, other]

Requirements for Open Political Information: Transparency Beyond Open Data

Authors: Andong Luis Li Zhao, Andrew Paley, Rachel Adler, Harper Pack, Sergio Servantez, Alexander Einarsson, Cameron Barrie, Marko Sterbentz, Kristian Hammond

Abstract: A politically informed citizenry is imperative for a welldeveloped democracy. While the US government has pursued policies for open data, these efforts have been insufficient in achieving an open government because only people with technical and domain knowledge can access information in the data. In this work, we conduct user interviews to identify wants and needs among stakeholders. We further u… ▽ More A politically informed citizenry is imperative for a welldeveloped democracy. While the US government has pursued policies for open data, these efforts have been insufficient in achieving an open government because only people with technical and domain knowledge can access information in the data. In this work, we conduct user interviews to identify wants and needs among stakeholders. We further use this information to sketch out the foundational requirements for a functional political information technical system. △ Less

Submitted 6 December, 2021; originally announced December 2021.

Comments: Presented at AAAI FSS-21: Artificial Intelligence in Government and Public Sector, Washington, DC, USA

arXiv:2111.01343 [pdf, ps, other]

Optimal guidance and estimation of a 2D diffusion-advection process by a team of mobile sensors

Authors: Sheng Cheng, Derek A. Paley

Abstract: This paper describes an optimization framework to design guidance for a possibly heterogeneous team of multiple mobile sensors to estimate a spatiotemporal process modeled by a 2D diffusion-advection process. Owing to the abstract linear system representation of the process, we apply the Kalman-Bucy filter for estimation, where the sensors provide linear outputs. We propose an optimization problem… ▽ More This paper describes an optimization framework to design guidance for a possibly heterogeneous team of multiple mobile sensors to estimate a spatiotemporal process modeled by a 2D diffusion-advection process. Owing to the abstract linear system representation of the process, we apply the Kalman-Bucy filter for estimation, where the sensors provide linear outputs. We propose an optimization problem that minimizes the sum of the trace of the covariance operator of the Kalman-Bucy filter and a generic mobility cost of the mobile sensors, subject to the sensors' motion modeled by linear dynamics. We establish the existence of a solution to this problem. Moreover, we prove convergence to the exact optimal solution of the approximate optimal solution. That is, when evaluating these two solutions using the original cost function, the difference becomes arbitrarily small as the approximation gets finer. To compute the approximate solution, we use Pontryagin's minimum principle after approximating the infinite-dimensional terms originating from the diffusion-advection process. The approximate solution is applied in simulation to analyze how a single mobile sensor's performance depends on two important parameters: sensor noise variance and mobility penalty. We also illustrate the application of the framework to multiple sensors, in particular the performance of a heterogeneous team of sensors. △ Less

Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

Comments: Proofs for Lemmas 2.5, 2.7, and C.1 are attached in the supplement at the end. arXiv admin note: text overlap with arXiv:2106.08429

arXiv:2106.08429 [pdf, other]

doi 10.1016/j.automatica.2021.109866

Optimal control of a 2D diffusion-advection process with a team of mobile actuators under jointly optimal guidance

Authors: Sheng Cheng, Derek A. Paley

Abstract: This paper describes an optimization framework to control a distributed parameter system (DPS) using a team of mobile actuators. The framework simultaneously seeks optimal control of the DPS and optimal guidance of the mobile actuators such that a cost function associated with both the DPS and the mobile actuators is minimized subject to the dynamics of each. The cost incurred from controlling the… ▽ More This paper describes an optimization framework to control a distributed parameter system (DPS) using a team of mobile actuators. The framework simultaneously seeks optimal control of the DPS and optimal guidance of the mobile actuators such that a cost function associated with both the DPS and the mobile actuators is minimized subject to the dynamics of each. The cost incurred from controlling the DPS is linear-quadratic, which is transformed into an equivalent form as a quadratic term associated with an operator-valued Riccati equation. This equivalent form reduces the problem to seeking for guidance only because the optimal control can be recovered once the optimal guidance is obtained. We establish conditions for the existence of a solution to the proposed problem. Since computing an optimal solution requires approximation, we also establish the conditions for convergence to the exact optimal solution of the approximate optimal solution. That is, when evaluating these two solutions by the original cost function, the difference becomes arbitrarily small as the approximation gets finer. Two numerical examples demonstrate the performance of the optimal control and guidance obtained from the proposed approach. △ Less

Submitted 18 June, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

Comments: Proofs for Lemmas~2.3, 2.5, and D.1 are attached in the supplement at the end

Showing 1–5 of 5 results for author: Paley, A