Search | arXiv e-print repository

Estimating Cost Savings from Early Cancer Diagnosis

Authors: Zura Kakushadze, Rakesh Raghubanshi, Willie Yu

Abstract: We estimate treatment cost-savings from early cancer diagnosis. For breast, lung, prostate and colorectal cancers and melanoma, which account for more than 50% of new incidences projected in 2017, we combine published cancer treatment cost estimates by stage with incidence rates by stage at diagnosis. We extrapolate to other cancer sites by using estimated national expenditures and incidence rates… ▽ More We estimate treatment cost-savings from early cancer diagnosis. For breast, lung, prostate and colorectal cancers and melanoma, which account for more than 50% of new incidences projected in 2017, we combine published cancer treatment cost estimates by stage with incidence rates by stage at diagnosis. We extrapolate to other cancer sites by using estimated national expenditures and incidence rates. A rough estimate for the U.S. national annual treatment cost-savings from early cancer diagnosis is in 11 digits. Using this estimate and cost-neutrality, we also estimate a rough upper bound on the cost of a routine early cancer screening test. △ Less

Submitted 2 April, 2019; v1 submitted 30 August, 2017; originally announced September 2017.

Comments: 22 pages; a trivial grammar typo corrected

Journal ref: Data 2(3) (2017) 30

arXiv:1707.08504 [pdf, other]

Mutation Clusters from Cancer Exome

Authors: Zura Kakushadze, Willie Yu

Abstract: We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1,389 published genome samples across 14 cancer… ▽ More We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1,389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics such as novel blood-test methods currently in development. △ Less

Submitted 26 July, 2017; originally announced July 2017.

Comments: 84 pages

Journal ref: Genes 8(8) (2017) 201

arXiv:1703.00703 [pdf, other]

*K-means and Cluster Models for Cancer Signatures

Authors: Zura Kakushadze, Willie Yu

Abstract: We present *K-means clustering algorithm and source code by expanding statistical clustering methods applied in https://ssrn.com/abstract=2802753 to quantitative finance. *K-means is statistically deterministic without specifying initial centers, etc. We apply *K-means to extracting cancer signatures from genome data without using nonnegative matrix factorization (NMF). *K-means' computational cos… ▽ More We present *K-means clustering algorithm and source code by expanding statistical clustering methods applied in https://ssrn.com/abstract=2802753 to quantitative finance. *K-means is statistically deterministic without specifying initial centers, etc. We apply *K-means to extracting cancer signatures from genome data without using nonnegative matrix factorization (NMF). *K-means' computational cost is a fraction of NMF's. Using 1,389 published samples for 14 cancer types, we find that 3 cancers (liver cancer, lung cancer and renal cell carcinoma) stand out and do not have cluster-like structures. Two clusters have especially high within-cluster correlations with 11 other cancers indicating common underlying structures. Our approach opens a novel avenue for studying such structures. *K-means is universal and can be applied in other fields. We discuss some potential applications in quantitative finance. △ Less

Submitted 18 July, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

Comments: 124 pages, 69 figures; a trivial typo corrected; to appear in Biomolecular Detection and Quantification

Journal ref: Biomolecular Detection and Quantification 13 (2017) 7-31

arXiv:1604.08743 [pdf, other]

doi 10.1016/j.physa.2016.06.089

Factor Models for Cancer Signatures

Authors: Zura Kakushadze, Willie Yu

Abstract: We present a novel method for extracting cancer signatures by applying statistical risk models (http://ssrn.com/abstract=2732453) from quantitative finance to cancer genome data. Using 1389 whole genome sequenced samples from 14 cancers, we identify an "overall" mode of somatic mutational noise. We give a prescription for factoring out this noise and source code for fixing the number of signatures… ▽ More We present a novel method for extracting cancer signatures by applying statistical risk models (http://ssrn.com/abstract=2732453) from quantitative finance to cancer genome data. Using 1389 whole genome sequenced samples from 14 cancers, we identify an "overall" mode of somatic mutational noise. We give a prescription for factoring out this noise and source code for fixing the number of signatures. We apply nonnegative matrix factorization (NMF) to genome data aggregated by cancer subtype and filtered using our method. The resultant signatures have substantially lower variability than those from unfiltered data. Also, the computational cost of signature extraction is cut by about a factor of 10. We find 3 novel cancer signatures, including a liver cancer dominant signature (96% contribution) and a renal cell carcinoma signature (70% contribution). Our method accelerates finding new cancer signatures and improves their overall stability. Reciprocally, the methods for extracting cancer signatures could have interesting applications in quantitative finance. △ Less

Submitted 22 January, 2017; v1 submitted 29 April, 2016; originally announced April 2016.

Comments: 70 pages, 21 figures; a few trivial typos corrected

Journal ref: Physica A 462 (2016) 527-559

Showing 1–4 of 4 results for author: Kakushadze, Z