-
The Effect of Multiple Imputation of Routine Pathology Variables on Laboratory Diagnosis of Hepatitis C Infection
Authors:
N. Menon,
B. A. Lidbury,
A. M. Richardson
Abstract:
Pathology tests are central to modern healthcare in terms of diagnosis and patient management. Aggregated pathology results provide opportunities for research into fundamental and applied questions in health and medicine, but data analytic challenges appear since test profiles vary between medical practitioners, resulting in missing data. In this study we provide an analytical investigation of the…
▽ More
Pathology tests are central to modern healthcare in terms of diagnosis and patient management. Aggregated pathology results provide opportunities for research into fundamental and applied questions in health and medicine, but data analytic challenges appear since test profiles vary between medical practitioners, resulting in missing data. In this study we provide an analytical investigation of the laboratory diagnosis of Hepatitis C (HCV) infection and focus on how to maximize the predictive value of routine pathology data. We recommend using the Influx - Outflux measures to help construct the imputation model when using multiple imputation.
Data from 14,320 community-patients aged 15 - 100 years were accessed via ACT Pathology (The Canberra Hospital, Australia). Influx and Outflux were calculated to identify which variables were potentially powerful predictors of missing values. Available Case analysis and Multiple Imputation were used to accommodate missing values in the dataset. Logistic regression model and stepwise selection method were used for analysing the imputed datasets. The predictive power of all methods was compared.
The predictive power of the models on multiply imputed data was similar to the power of the models based on complete data. The advantage of multiply imputed data was that it allowed for the inclusion of all the completed variables in the logistic models, thus identifying a broader selection of test results that could lead to the enhanced laboratory prediction of HCV.
Multiple imputation is an important statistical resource allowing all individuals in a study to contribute whatever data they have supplied to the analysis. MI in combination with the values of Influx and Outflux identifies potential predictors of HepC infection. Variables age, gender and alanine aminotransferase have been shown to be strong laboratory predictors of HCV infection.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Textual analysis of clinical notes on pathology request forms to determine sensitivity and specificity of Hepatitis B and C virus infection status
Authors:
Eric H. Kim,
Brett A. Lidbury,
Alice M. Richardson
Abstract:
Background: It is not established whether clinical notes provided on pathology request forms are useful as decision support data when assessing Hepatitis B and C viral infection status. Objective: To determine sensitivity, specificity, and predictive value of clinical notes for identifying infection status of Hepatitis B and C. Methods: The study comprises 179 cases and 166 cases tested for HBsAg…
▽ More
Background: It is not established whether clinical notes provided on pathology request forms are useful as decision support data when assessing Hepatitis B and C viral infection status. Objective: To determine sensitivity, specificity, and predictive value of clinical notes for identifying infection status of Hepatitis B and C. Methods: The study comprises 179 cases and 166 cases tested for HBsAg and anti-HCV serological markers, respectively, and accompanied by a written description (clinical note) provided on pathology request forms by the clinician on duty. The clinical note sensitivity, specificity, positive (PPV) and negative (NPV) predictive values were calculated using serological HBsAg and anti-HCV tests as gold standards. Results: The sensitivity and specificity of clinical notes for Hepatitis B infection status were 90 percent and 56 percent, respectively. The sensitivity and specificity of clinical notes for Hepatitis C infection status were 86 percent and 21 percent, respectively. Conclusions: Clinical note information identifies moderate-to-high sensitivity with regards to Hepatitis B and C viral infection status, however, given low specificity in both groups, the clinical note is not favourable for ruling disease in, possibly due to high rate of false positives.
△ Less
Submitted 27 February, 2022;
originally announced February 2022.
-
An Investigation into Outlier Elimination and Calculation Methods in the Determination of Reference Intervals using Serum Immunoglobulin A as a Model Data Collection
Authors:
Aidan Zellner,
Alice M. Richardson,
Brett A. Lidbury,
Peter Hobson,
Tony Badrick
Abstract:
Background: Reference intervals are essential to interpret diagnostic tests, but their determination has become controversial. Methods: In this paper parametric, non-parametric and robust reference intervals with Tukey and block elimination are calculated from a dataset of over 32,000 serum immunoglobulin A (IgA) measurements. Results: The outlier elimination method was significantly more determin…
▽ More
Background: Reference intervals are essential to interpret diagnostic tests, but their determination has become controversial. Methods: In this paper parametric, non-parametric and robust reference intervals with Tukey and block elimination are calculated from a dataset of over 32,000 serum immunoglobulin A (IgA) measurements. Results: The outlier elimination method was significantly more determinative of the reference intervals than the calculation method. The Tukey elimination procedure consistently eliminated significantly more values than the block method of Dixon and Reed across all age ranges. If Tukey elimination was applied, variation between reference intervals produced by the different calculation methods was minimal. Block elimination rarely eliminated values. The non-parametric reference intervals were more sensitive to outliers, which in the IgA context, led to higher and wider reference intervals for the older age groups. There were only minimal differences between robust and parametric reference intervals. Conclusions: This suggests that Tukey elimination should be preferred over the block D/R method for datasets similar to the one used in this study. These are predominantly new observations, as previous literature has focused on the calculation technique and not discussed outlier elimination. This suggests the robust method is not advantageous over the parametric method and therefore due to its complexity is not particularly useful, contrary to CLSI Guidelines.
△ Less
Submitted 22 July, 2019;
originally announced July 2019.