Search | arXiv e-print repository

A new set of tools for goodness-of-fit validation

Authors: Gilles R. Ducharme, Teresa Ledwina

Abstract: We introduce two new tools to assess the validity of statistical distributions. These tools are based on components derived from a new statistical quantity, the $comparison$ $curve$. The first tool is a graphical representation of these components on a $bar$ $plot$ (B plot), which can provide a detailed appraisal of the validity of the statistical model, in particular when supplemented by acceptance regions related to the model. The knowledge gained from this representation can sometimes suggest an existing $goodness$-$of$-$fit$ test to supplement this visual assessment with a control of the type I error. Otherwise, an adaptive test may be preferable and the second tool is the combination of these components to produce a powerful $χ^2$-type goodness-of-fit test. Because the number of these components can be large, we introduce a new selection rule to decide, in a data driven fashion, on their proper number to take into consideration. In a simulation, our goodness-of-fit tests are seen to be powerwise competitive with the best solutions that have been recommended in the context of a fully specified model as well as when some parameters must be estimated. Practical examples show how to use these tools to derive principled information about where the model departs from the data.

Submitted 15 May, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

Comments: 35 pages, 10 figures, submitted to the Electronic Journal of Statistics

MSC Class: 62A09 (Primary) 62F03 (Secondary)

arXiv:2003.00520 [pdf, ps, other]

Smooth Tests of Goodness-of-fit for the Newcomb-Benford distribution

Authors: G. R. Ducharme, S. Kaci, C. Vovor-Dassu

Abstract: The Newcomb-Benford probability distribution is becoming very popular in many areas using statistics, notably in fraud detection. In such contexts, it is important to be able to determine if a data set arises from this distribution while controlling the risk of a Type 1 error, i.e. falsely identifying a fraud, and a Type 2 error, i.e. not detecting that a fraud occurred. The statistical tool to do this work is a goodness-of-fit test. For the Newcomb-Benford distribution, the most popular such test is Pearson's chi-square test whose power, related to the Type 2 error, is known to be weak. Consequently, other tests have been recently introduced. The goal of the present work is to build new goodness-of-fit tests for this distribution, based on the smooth test principle. These tests are then compared to some of their competitors. It turns out that the proposals of the paper are globally preferable to existing tests and should be seriously considered in fraud detection contexts, among others.

Submitted 1 March, 2020; originally announced March 2020.

Comments: in French
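
Illustrative note: the abstract above names Pearson's chi-square test as the most popular goodness-of-fit test for the Newcomb-Benford distribution. The following is a minimal sketch of that baseline test only, not of the smooth tests proposed in the paper. The function names `first_digit` and `pearson_benford` are hypothetical, and the sketch assumes the first-digit form of the law, P(d) = log10(1 + 1/d) for d = 1, ..., 9, with the statistic compared against the 95% chi-square quantile with 8 degrees of freedom.

```python
import math
from collections import Counter

# First-digit probabilities under the Newcomb-Benford law: P(d) = log10(1 + 1/d).
BENFORD = {d: math.log10(1 + 1 / d) for d in range(1, 10)}

# 95% quantile of the chi-square distribution with 8 degrees of freedom
# (9 digit classes minus 1, no estimated parameters).
CHI2_CRIT_8DF_5PCT = 15.507


def first_digit(x):
    """Return the leading nonzero digit of a nonzero number (hypothetical helper)."""
    # Scientific notation with 17 significant digits, e.g. '3.1400000000000000e+02'.
    return int(f"{abs(x):.16e}"[0])


def pearson_benford(values):
    """Pearson chi-square statistic of the observed first digits against Benford."""
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    counts = Counter(digits)
    return sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p) for d, p in BENFORD.items()
    )


def rejects_benford(values, critical_value=CHI2_CRIT_8DF_5PCT):
    """True if the test rejects the Newcomb-Benford hypothesis at the 5% level."""
    return pearson_benford(values) > critical_value
```

For instance, `rejects_benford([2**k for k in range(1, 201)])` applies the test to the first 200 powers of two, whose leading digits are a classic example of Benford-like data. The asymptotic critical value is only an approximation for small samples.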

arXiv:1902.03622 [pdf, ps, other]

A goodness-of-fit test for elliptical distributions with diagnostic capabilities

Authors: Gilles R. Ducharme, Pierre Lafaye de Micheaux

Abstract: This paper develops a smooth test of goodness-of-fit for elliptical distributions. The test is adaptively omnibus, invariant to affine-linear transformations and has a convenient expression that can be broken into components. These components have diagnostic capabilities and can be used to identify specific departures. This helps in correcting the null model when the test rejects. As an example, the results are applied to the multivariate normal distribution for which the R package ECGofTestDx is available. It is shown that the proposed test strategy encompasses and generalizes a number of existing approaches. Some other cases are studied, such as the bivariate Laplace, logistic and Pearson type II distribution. A simulation experiment shows the usefulness of the diagnostic tools.

Submitted 10 February, 2019; originally announced February 2019.

Comments: 35 p. pre-print

MSC Class: 62F03

Showing 1–3 of 3 results for author: Ducharme, G