Showing 1–1 of 1 results for author: Taleb, N N
-
Informational Rescaling of PCA Maps with Application to Genetic Distance
Authors:
Nassim Nicholas Taleb,
Pierre Zalloua,
Khaled Elbassioni,
Andreas Henschel,
Daniel E. Platt
Abstract:
We discuss the inadequacy of covariances/correlations and other measures in L2 as relative distance metrics under some conditions. We propose a computationally simple heuristic to transform a map based on standard principal component analysis (PCA) (when the variables are asymptotically Gaussian) into an entropy-based map where distances are based on mutual information (MI). Rescaling Principal Co…
▽ More
We discuss the inadequacy of covariances/correlations and other measures in L2 as relative distance metrics under some conditions. We propose a computationally simple heuristic to transform a map based on standard principal component analysis (PCA) (when the variables are asymptotically Gaussian) into an entropy-based map where distances are based on mutual information (MI). Rescaling Principal Component based distances using MI allows a representation of relative statistical associations when, as in genetics, it is applied on bit measurements between individuals' genomic mutual information.
This entropy rescaled PCA, while preserving order relationships (along a dimension), changes the relative distances to make them linear to information. We show the effect on the entire world population and some subsamples, which leads to significant differences with the results of current research.
△ Less
Submitted 4 March, 2024; v1 submitted 14 March, 2023;
originally announced March 2023.