-
A new set of tools for goodness-of-fit validation
Authors:
Gilles R. Ducharme,
Teresa Ledwina
Abstract:
We introduce two new tools to assess the validity of statistical distributions. These tools are based on components derived from a new statistical quantity, the $comparison$ $curve$. The first tool is a graphical representation of these components on a $bar$ $plot$ (B plot), which can provide a detailed appraisal of the validity of the statistical model, in particular when supplemented by acceptan…
▽ More
We introduce two new tools to assess the validity of statistical distributions. These tools are based on components derived from a new statistical quantity, the $comparison$ $curve$. The first tool is a graphical representation of these components on a $bar$ $plot$ (B plot), which can provide a detailed appraisal of the validity of the statistical model, in particular when supplemented by acceptance regions related to the model. The knowledge gained from this representation can sometimes suggest an existing $goodness$-$of$-$fit$ test to supplement this visual assessment with a control of the type I error. Otherwise, an adaptive test may be preferable and the second tool is the combination of these components to produce a powerful $χ^2$-type goodness-of-fit test. Because the number of these components can be large, we introduce a new selection rule to decide, in a data driven fashion, on their proper number to take into consideration. In a simulation, our goodness-of-fit tests are seen to be powerwise competitive with the best solutions that have been recommended in the context of a fully specified model as well as when some parameters must be estimated. Practical examples show how to use these tools to derive principled information about where the model departs from the data.
△ Less
Submitted 15 May, 2024; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Smooths Tests of Goodness-of-fit for the Newcomb-Benford distribution
Authors:
G. R. Ducharme,
S. Kaci,
C. Vovor-Dassu
Abstract:
The Newcomb-Benford probability distribution is becoming very popular in many areas using statistics, notably in fraud detection. In such contexts, it is important to be able to determine if a data set arises from this distribution while controlling the risk of a Type 1 error, i.e. falsely identifying a fraud, and a Type 2 error, i.e. not detecting that a fraud occurred. The statistical tool to do…
▽ More
The Newcomb-Benford probability distribution is becoming very popular in many areas using statistics, notably in fraud detection. In such contexts, it is important to be able to determine if a data set arises from this distribution while controlling the risk of a Type 1 error, i.e. falsely identifying a fraud, and a Type 2 error, i.e. not detecting that a fraud occurred. The statistical tool to do this work is a goodness-of-fit test. For the Newcomb-Benford distribution, the most popular such test is Pearson's chi-square test whose power, related to the Type 2 error, is known to be weak. Consequently, other tests have been recently introduced. The goal of the present work is to build new goodness-of-fit tests for this distribution, based on the smooth test principle. These tests are then compared to some of their competitors. It turns out that the proposals of the paper are globally preferable to existing tests and should be seriously considered in fraud detection contexts, among others.
△ Less
Submitted 1 March, 2020;
originally announced March 2020.
-
A goodness-of-fit test for elliptical distributions with diagnostic capabilities
Authors:
Gilles R. Ducharme,
Pierre Lafaye de Micheaux
Abstract:
This paper develops a smooth test of goodness-of-fit for elliptical distributions. The test is adaptively omnibus, invariant to affine-linear transformations and has a convenient expression that can be broken into components. These components have diagnostic capabilities and can be used to identify specific departures. This helps in correcting the null model when the test rejects. As an example, t…
▽ More
This paper develops a smooth test of goodness-of-fit for elliptical distributions. The test is adaptively omnibus, invariant to affine-linear transformations and has a convenient expression that can be broken into components. These components have diagnostic capabilities and can be used to identify specific departures. This helps in correcting the null model when the test rejects. As an example, the results are applied to the multivariate normal distribution for which the R package ECGofTestDx is available. It is shown that the proposed test strategy encompasses and generalizes a number of existing approaches. Some other cases are studied, such as the bivariate Laplace, logistic and Pearson type II distribution. A simulation experiment shows the usefulness of the diagnostic tools.
△ Less
Submitted 10 February, 2019;
originally announced February 2019.