Search | arXiv e-print repository

Active information, missing data and prevalence estimation

Authors: Ola Hössjer, Daniel Andrés Díaz-Pachón, Chen Zhao, J. Sunil Rao

Abstract: The topic of this paper is prevalence estimation from the perspective of active information. Prevalence among tested individuals has an upward bias under the assumption that individuals' willingness to be tested for the disease increases with the strength of their symptoms. Active information due to testing bias quantifies the degree at which the willingness to be tested correlates with infection… ▽ More The topic of this paper is prevalence estimation from the perspective of active information. Prevalence among tested individuals has an upward bias under the assumption that individuals' willingness to be tested for the disease increases with the strength of their symptoms. Active information due to testing bias quantifies the degree at which the willingness to be tested correlates with infection status. Interpreting incomplete testing as a missing data problem, the missingness mechanism impacts the degree at which the bias of the original prevalence estimate can be removed. The reduction in prevalence, when testing bias is adjusted for, translates into an active information due to bias correction, with opposite sign to active information due to testing bias. Prevalence and active information estimates are asymptotically normal, a behavior also illustrated through simulations. △ Less

Submitted 10 June, 2022; originally announced June 2022.

Comments: 18 pages, 5 tables, 2 figures

MSC Class: 62D10; 94A17; 62B10; 62F12; 62P10; 92B15; 94A17; 94A16; 94A20

arXiv:2202.08928 [pdf, other]

"Back to the future" projections for COVID-19 surges

Authors: J. Sunil Rao, Tianhao Liu, Daniel Andrés Díaz-Pachón

Abstract: We argue that information from countries who had earlier COVID-19 surges can be used to inform another country's current model, then generating what we call back-to-the-future (BTF) projections. We show that these projections can be used to accurately predict future COVID-19 surges prior to an inflection point of the daily infection curve. We show, across 12 different countries from all populated… ▽ More We argue that information from countries who had earlier COVID-19 surges can be used to inform another country's current model, then generating what we call back-to-the-future (BTF) projections. We show that these projections can be used to accurately predict future COVID-19 surges prior to an inflection point of the daily infection curve. We show, across 12 different countries from all populated continents around the world, that our method can often predict future surges in scenarios where the traditional approaches would always predict no future surges. However, as expected, BTF projections cannot accurately predict a surge due to the emergence of a new variant. To generate BTF projections, we make use of a matching scheme for asynchronous time series combined with a response coaching SIR model. △ Less

Submitted 17 February, 2022; originally announced February 2022.

Comments: 21 pages, 7 figures

MSC Class: 92D25 (Primary) 92C60 92B15 62P10 62M10 (Secondary)

arXiv:2007.07426 [pdf, ps, other]

doi 10.1016/j.jtbi.2020.110556

A simple correction for COVID-19 sampling bias

Authors: Daniel Andrés Díaz-Pachón, J Sunil Rao

Abstract: COVID-19 testing has become a standard approach for estimating prevalence which then assist in public health decision making to contain and mitigate the spread of the disease. The sampling designs used are often biased in that they do not reflect the true underlying populations. For instance, individuals with strong symptoms are more likely to be tested than those with no symptoms. This results in… ▽ More COVID-19 testing has become a standard approach for estimating prevalence which then assist in public health decision making to contain and mitigate the spread of the disease. The sampling designs used are often biased in that they do not reflect the true underlying populations. For instance, individuals with strong symptoms are more likely to be tested than those with no symptoms. This results in biased estimates of prevalence (too high). Typical post-sampling corrections are not always possible. Here we present a simple bias correction methodology derived and adapted from a correction for publication bias in meta analysis studies. The methodology is general enough to allow a wide variety of customization making it more useful in practice. Implementation is easily done using already collected information. Via a simulation and two real datasets, we show that the bias corrections can provide dramatic reductions in estimation error. △ Less

Submitted 11 January, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

Comments: 14 pages. Title changed. The whole Section 7 with information from Lombardy, Italy, was added (another real dataset). Some typos were corrected. In spite of several lengthy additions, no substantial changes were done to the paper. The goal of the additions was more to clarify than to correct

MSC Class: 62D99

Journal ref: Journal of Theoretical Biology Journal of Theoretical Biology, Volume 512, 7 March 2021, 110556

arXiv:physics/0207113 [pdf, ps, other]

doi 10.1103/PhysRevE.66.031913

Simplifying the mosaic description of DNA sequences

Authors: Rajeev K. Azad, J. Subba Rao, Wentian Li, Ramakrishna Ramaswamy

Abstract: By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbours, may however share compositional similarity with one or more distant (non--neighbouring) domains. We thus obtain a coarse--grained description of the given DNA string in terms o… ▽ More By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbours, may however share compositional similarity with one or more distant (non--neighbouring) domains. We thus obtain a coarse--grained description of the given DNA string in terms of a smaller set of distinct domain labels. This yields a minimal domain description of a given DNA sequence, significantly reducing its organizational complexity. This procedure gives a new means of evaluating genomic complexity as one examines organisms ranging from bacteria to human. The mosaic organization of DNA sequences could have originated from the insertion of fragments of one genome (the parasite) inside another (the host), and we present numerical experiments that are suggestive of this scenario. △ Less

Submitted 27 July, 2002; originally announced July 2002.

Comments: 16 pages, 1 figure, Accepted for publication in Phys. Rev. E

Showing 1–4 of 4 results for author: Rao, J S