Proceedings of The second international workshop on eXplainable AI for the Arts (XAIxArts)

Authors: Nick Bryan-Kinns, Corey Ford, Shuoyang Zheng, Helen Kennedy, Alan Chamberlain, Makayla Lewis, Drew Hemment, Zi** Li, Qiong Wu, Lanxi Xiao, Gus Xia, Jeba Rezwana, Michael Clemens, Gabriel Vigliensoni

Abstract: This second international workshop on explainable AI for the Arts (XAIxArts) brought together a community of researchers in HCI, Interaction Design, AI, explainable AI (XAI), and digital arts to explore the role of XAI for the Arts. Workshop held at the 16th ACM Conference on Creativity and Cognition (C&C 2024), Chicago, USA.

Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

arXiv:2405.03083 [pdf, other]

Causal K-Means Clustering

Authors: Kwangho Kim, Jisu Kim, Edward H. Kennedy

Abstract: Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify and evaluate subgroup effects than population effects. We propose a new solution to this problem: Causal k-Means Clustering, which harnesses the widely-used k-means clustering algorithm to uncover the unknown subgroup structure. Our problem differs significantly from the conventional clustering setup since the variables to be clustered are unknown counterfactual functions. We present a plug-in estimator which is simple and readily implementable using off-the-shelf algorithms, and study its rate of convergence. We also develop a new bias-corrected estimator based on nonparametric efficiency theory and double machine learning, and show that this estimator achieves fast root-n rates and asymptotic normality in large nonparametric models. Our proposed methods are especially useful for modern outcome-wide studies with multiple treatment levels. Further, our framework is extensible to clustering with generic pseudo-outcomes, such as partially observed outcomes or otherwise unknown functions. Finally, we explore finite sample properties via simulation, and illustrate the proposed methods in a study of treatment programs for adolescent substance abuse.

Submitted 29 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

arXiv:2402.00168 [pdf, other]

Continuous Treatment Effects with Surrogate Outcomes

Authors: Zhenghao Zeng, David Arbour, Avi Feller, Raghavendra Addanki, Ryan Rossi, Ritwik Sinha, Edward H. Kennedy

Abstract: In many real-world causal inference applications, the primary outcomes (labels) are often partially missing, especially if they are expensive or difficult to collect. If the missingness depends on covariates (i.e., missingness is not completely at random), analyses based on fully observed samples alone may be biased. Incorporating surrogates, which are fully observed post-treatment variables related to the primary outcome, can improve estimation in this case. In this paper, we study the role of surrogates in estimating continuous treatment effects and propose a doubly robust method to efficiently incorporate surrogates in the analysis, which uses both labeled and unlabeled data and does not suffer from the above selection bias problem. Importantly, we establish the asymptotic normality of the proposed estimator and show possible improvements on the variance compared with methods that solely use labeled data. Extensive simulations show our methods enjoy appealing empirical performance.

Submitted 21 May, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

Comments: 30 pages, 7 figures

arXiv:2310.06428

Proceedings of The first international workshop on eXplainable AI for the Arts (XAIxArts)

Authors: Nick Bryan-Kinns, Corey Ford, Alan Chamberlain, Steven David Benford, Helen Kennedy, Zi** Li, Wu Qiong, Gus G. Xia, Jeba Rezwana

Abstract: This first international workshop on explainable AI for the Arts (XAIxArts) brought together a community of researchers in HCI, Interaction Design, AI, explainable AI (XAI), and digital arts to explore the role of XAI for the Arts. Workshop held at the 15th ACM Conference on Creativity and Cognition (C&C 2023).

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2301.06199 [pdf, other]

Doubly Robust Counterfactual Classification

Authors: Kwangho Kim, Edward H. Kennedy, José R. Zubizarreta

Abstract: We study counterfactual classification as a new tool for decision-making under hypothetical (contrary to fact) scenarios. We propose a doubly-robust nonparametric estimator for a general counterfactual classifier, where we can incorporate flexible constraints by casting the classification problem as a nonlinear mathematical program involving counterfactuals. We go on to analyze the rates of convergence of the estimator and provide a closed-form expression for its asymptotic distribution. Our analysis shows that the proposed estimator is robust against nuisance model misspecification, and can attain fast $\sqrt{n}$ rates with tractable inference even when using nonparametric machine learning approaches. We study the empirical performance of our methods by simulation and apply them for recidivism risk prediction.

Submitted 15 January, 2023; originally announced January 2023.

Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2103.01802 [pdf, other]

Median Optimal Treatment Regimes

Authors: Liu Leqi, Edward H. Kennedy

Abstract: Optimal treatment regimes are personalized policies for making a treatment decision based on subject characteristics, with the policy chosen to maximize some value. It is common to aim to maximize the mean outcome in the population, via a regime assigning treatment only to those whose mean outcome is higher under treatment versus control. However, the mean can be an unstable measure of centrality, resulting in imprecise statistical procedures, as well as unrobust decisions that can be overly influenced by a small fraction of subjects. In this work, we propose a new median optimal treatment regime that instead treats individuals whose conditional median is higher under treatment. This ensures that optimal decisions for individuals from the same group are not overly influenced either by (i) a small fraction of the group (unlike the mean criterion), or (ii) unrelated subjects from different groups (unlike marginal median/quantile criteria). We introduce a new measure of value, the Average Conditional Median Effect (ACME), which summarizes across-group median treatment outcomes of a policy, and which the median optimal treatment regime maximizes. After developing key motivating examples that distinguish median optimal treatment regimes from mean and marginal median optimal treatment regimes, we give a nonparametric efficiency bound for estimating the ACME of a policy, and propose a new doubly robust-style estimator that achieves the efficiency bound under weak conditions. To construct the median optimal treatment regime, we introduce a new doubly robust-style estimator for the conditional median treatment effect. Finite-sample properties are explored via numerical simulations and the proposed algorithm is illustrated using data from a randomized clinical trial in patients with HIV.

Submitted 24 February, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

arXiv:2006.16916 [pdf, other]

Counterfactual Predictions under Runtime Confounding

Authors: Amanda Coston, Edward H. Kennedy, Alexandra Chouldechova

Abstract: Algorithms are commonly used to predict outcomes under a particular decision or intervention, such as predicting whether an offender will succeed on parole if placed under minimal supervision. Generally, to learn such counterfactual prediction models from observational data on historical decisions and corresponding outcomes, one must measure all factors that jointly affect the outcomes and the decision taken. Motivated by decision support applications, we study the counterfactual prediction task in the setting where all relevant factors are captured in the historical data, but it is either undesirable or impermissible to use some such factors in the prediction model. We refer to this setting as runtime confounding. We propose a doubly-robust procedure for learning counterfactual prediction models in this setting. Our theoretical analysis and experimental results suggest that our method often outperforms competing approaches. We also present a validation procedure for evaluating the performance of counterfactual prediction methods.

Submitted 15 April, 2021; v1 submitted 30 June, 2020; originally announced June 2020.

Journal ref: Advances in Neural Information Processing Systems Vol 33, 2020. pp. 4150--4162

arXiv:1912.07133 [pdf]

doi 10.1117/1.JEI.27.5.051219

Digital filters with vanishing moments for shape analysis

Authors: Hugh L. Kennedy

Abstract: Shape- and scale-selective digital-filters, with steerable finite/infinite impulse responses (FIR/IIRs) and non-recursive/recursive realizations, that are separable in both spatial dimensions and adequately isotropic, are derived. The filters are conveniently designed in the frequency domain via derivative constraints at dc, which guarantees orthogonality and monomial selectivity in the pixel domain (i.e. vanishing moments), unlike more commonly used FIR filters derived from Gaussian functions. A two-stage low-pass/high-pass architecture, for blur/derivative operations, is recommended. Expressions for the coefficients of a low-order IIR blur filter with repeated poles are provided, as a function of scale; discrete Butterworth (IIR), and colored Savitzky-Golay (FIR), blurs are also examined. Parallel software implementations on central processing units (CPUs) and graphics processing units (GPUs), for scale-selective blob-detection in aerial surveillance imagery, are analyzed. It is shown that recursive IIR filters are significantly faster than non-recursive FIR filters when detecting large objects at coarse scales, i.e. using filters with long impulse responses; however, the margin of outperformance decreases as the degree of parallelization increases.

Submitted 8 April, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

Comments: Fixed some cut-and-paste typos in Table V

Journal ref: SPIE Journal of Electronic Imaging, vol. 27, no. 5, 051219, May 2018

arXiv:1909.00066 [pdf, other]

Counterfactual Risk Assessments, Evaluation, and Fairness

Authors: Amanda Coston, Alan Mishler, Edward H. Kennedy, Alexandra Chouldechova

Abstract: Algorithmic risk assessments are increasingly used to help humans make decisions in high-stakes settings, such as medicine, criminal justice and education. In each of these cases, the purpose of the risk assessment tool is to inform actions, such as medical treatments or release conditions, often with the aim of reducing the likelihood of an adverse event such as hospital readmission or recidivism. Problematically, most tools are trained and evaluated on historical data in which the outcomes observed depend on the historical decision-making policy. These tools thus reflect risk under the historical policy, rather than under the different decision options that the tool is intended to inform. Even when tools are constructed to predict risk under a specific decision, they are often improperly evaluated as predictors of the target outcome. Focusing on the evaluation task, in this paper we define counterfactual analogues of common predictive performance and algorithmic fairness metrics that we argue are better suited for the decision-making context. We introduce a new method for estimating the proposed metrics using doubly robust estimation. We provide theoretical results that show that only under strong conditions can fairness according to the standard metric and the counterfactual metric simultaneously hold. Consequently, fairness-promoting methods that target parity in a standard fairness metric may --- and as we show empirically, do --- induce greater imbalance in the counterfactual analogue. We provide empirical comparisons on both synthetic data and a real world child welfare dataset to demonstrate how the proposed method improves upon standard practice.

Submitted 10 January, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

Comments: To appear in ACM FAT* 2020

arXiv:1907.12165 [pdf]

doi 10.1007/s11554-020-01040-4

On the Realization and Analysis of Circular Harmonic Transforms for Feature Detection

Authors: Hugh L Kennedy

Abstract: Circular-harmonic spectra are a compact representation of local image features in two dimensions. It is well known that the computational complexity of such transforms is greatly reduced when polar separability is exploited in steerable filter-banks. Further simplifications are possible when Cartesian separability is incorporated using the radial apodization (i.e. weight, window, or taper) described here, as a consequence of the Laguerre/Hermite correspondence over polar/Cartesian coordinates. The chosen form also mitigates undesirable discretization artefacts due to angular aliasing. The possible utility of circular-harmonic spectra for the description of simple features is illustrated using real data from an airborne electro-optic sensor. The spectrum is deployed in a test-statistic to detect and characterize corners of arbitrary angle and orientation (i.e. wedges). The test-statistic considers uncertainty due to finite sampling and clutter/noise.

Submitted 6 November, 2020; v1 submitted 28 July, 2019; originally announced July 2019.

Comments: A new section on parallel software implementation (MATLAB, C++ and CUDA) was added to this draft version. Manuscript was then accepted for publication in Journal of Real-Time Image Processing, special issue on Real-Time Statistical Image and Video Processing for Remote Sensing and Surveillance Applications

arXiv:1806.02935 [pdf, other]

Causal effects based on distributional distances

Authors: Kwangho Kim, Jisu Kim, Edward H. Kennedy

Abstract: In this paper we develop a framework for characterizing causal effects via distributional distances. In particular we define a causal effect in terms of the $L_1$ distance between different counterfactual outcome distributions, rather than the typical mean difference in outcome values. Comparing entire counterfactual outcome distributions can provide more nuanced and valuable measures for exploring causal effects beyond the average treatment effect. First, we propose a novel way to estimate counterfactual outcome densities, which is of independent interest. Then we develop an efficient estimator of our target causal effect. We go on to provide error bounds and asymptotic properties of the proposed estimator, along with bootstrap-based confidence intervals. Finally, we illustrate the methods via simulations and real data.

Submitted 26 February, 2021; v1 submitted 7 June, 2018; originally announced June 2018.

Comments: 46 pages

arXiv:1501.04228 [pdf]

Improved IIR Low-Pass Smoothers and Differentiators with Tunable Delay

Authors: Hugh L. Kennedy

Abstract: Regression analysis using orthogonal polynomials in the time domain is used to derive closed-form expressions for causal and non-causal filters with an infinite impulse response (IIR) and a maximally-flat magnitude and delay response. The phase response of the resulting low-order smoothers and differentiators, with low-pass characteristics, may be tuned to yield the desired delay in the pass band or for zero gain at the Nyquist frequency. The filter response is improved when the shape of the exponential weighting function is modified and discrete associated Laguerre polynomials are used in the analysis. As an illustrative example, the derivative filters are used to generate an optical-flow field and to detect moving ground targets, in real video data collected from an airborne platform with an electro-optic sensor.

Submitted 20 August, 2015; v1 submitted 17 January, 2015; originally announced January 2015.

Comments: To appear in Proc. International Conference on Digital Image Computing: Techniques and Applications (DICTA), Adelaide, 23rd-25th Nov. 2015

arXiv:1410.0582 [pdf]

doi 10.1016/j.sigpro.2015.03.005

Multidimensional Digital Smoothing Filters for Target Detection

Authors: Hugh L. Kennedy

Abstract: Recursive, causal and non-causal, multidimensional digital filters, with infinite impulse responses and maximally flat magnitude and delay responses in the low-frequency region, are designed to negate correlated clutter and interference in the background and to accumulate power due to dim targets in the foreground of a surveillance sensor. Expressions relating mean impulse-response duration, frequency selectivity and group delay, to low-order linear-difference-equation coefficients are derived using discrete Laguerre polynomials and discounted least-squares regression, then verified through simulation.

Submitted 1 April, 2015; v1 submitted 2 October, 2014; originally announced October 2014.

Comments: With galley proof fixes

Journal ref: Signal Processing, Volume 114, September 2015, Pages 251-264

arXiv:1408.3526 [pdf]

doi 10.1109/ICASSP.2015.7178137

Parallel software implementation of recursive multidimensional digital filters for point-target detection in cluttered infrared scenes

Authors: Hugh L. Kennedy

Abstract: A technique for the enhancement of point targets in clutter is described. The local 3-D spectrum at each pixel is estimated recursively. An optical flow-field for the textured background is then generated using the 3-D autocorrelation function and the local velocity estimates are used to apply high-pass velocity-selective spatiotemporal filters, with finite impulse responses (FIRs), to subtract the background clutter signal, leaving the foreground target signal, plus noise. Parallel software implementations using a multicore central processing unit (CPU) and a graphical processing unit (GPU) are investigated.

Submitted 21 August, 2015; v1 submitted 15 August, 2014; originally announced August 2014.

Comments: To appear in Proc. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Added header and DOI

arXiv:1408.2590 [pdf]

doi 10.1117/1.JEI.23.6.063019

Multidimensional Digital Filters for Point-Target Detection in Cluttered Infrared Scenes

Authors: Hugh L. Kennedy

Abstract: A 3-D spatiotemporal prediction-error filter (PEF), is used to enhance foreground/background contrast in (real and simulated) sensor image sequences. Relative velocity is utilized to extract point-targets that would otherwise be indistinguishable on spatial frequency alone. An optical-flow field is generated using local estimates of the 3-D autocorrelation function via the application of the fast Fourier transform (FFT) and inverse FFT. Velocity estimates are then used to tune in a background-whitening PEF that is matched to the motion and texture of the local background. Finite-impulse-response (FIR) filters are designed and implemented in the frequency domain. An analytical expression for the frequency response of velocity-tuned FIR filters, of odd or even dimension, with an arbitrary delay in each dimension, is derived.

Submitted 16 January, 2015; v1 submitted 11 August, 2014; originally announced August 2014.

Comments: Accepted version

Journal ref: J. Electron. Imaging. 23 (6), 063019 (December 17, 2014)

arXiv:1408.2294 [pdf]

Digital Filter Designs for Recursive Frequency Analysis

Authors: Hugh L. Kennedy

Abstract: Digital filters for recursively computing the discrete Fourier transform (DFT) and estimating the frequency spectrum of sampled signals are examined, with an emphasis on magnitude-response and numerical stability. In this tutorial-style treatment, existing recursive techniques are reviewed, explained and compared within a coherent framework; some fresh insights are provided and new enhancements/modifications are proposed. It is shown that the replacement of resonators by (non-recursive) modulators in sliding DFT (SDFT) analyzers with either a finite impulse response (FIR), or an infinite impulse response (IIR), does improve performance somewhat; however stability is not guaranteed, as the cancellation of marginally stable poles by zeros is still involved. The FIR deadbeat observer is shown to be more reliable than the SDFT methods, an IIR variant is presented, and ways of fine-tuning its response are discussed. A novel technique for stabilizing IIR SDFT analyzers with a fading memory, so that all poles are inside the unit circle, is also derived. Slepian and sum-of-cosine windows are adapted to improve the frequency responses for the various FIR and IIR DFT methods.

Submitted 25 August, 2015; v1 submitted 10 August, 2014; originally announced August 2014.

Comments: To appear in Journal of Circuits, Systems, and Computers (JCSC). Accepted draft version, Aug. 2015. Added summary tables. Expanded Conclusion and Summary Section. Fixed a few errors/typos

Showing 1–16 of 16 results for author: Kennedy, H