Correcting sampling biases via importance reweighting for spatial modeling

Prokhorov, Boris; Koldasbayeva, Diana; Zaytsev, Alexey

Computer Science > Machine Learning

arXiv:2309.04824 (cs)

[Submitted on 9 Sep 2023 (v1), last revised 14 Sep 2023 (this version, v2)]

Title:Correcting sampling biases via importance reweighting for spatial modeling

Authors:Boris Prokhorov, Diana Koldasbayeva, Alexey Zaytsev

View PDF

Abstract:In machine learning models, the estimation of errors is often complex due to distribution bias, particularly in spatial data such as those found in environmental studies. We introduce an approach based on the ideas of importance sampling to obtain an unbiased estimate of the target error. By taking into account difference between desirable error and available data, our method reweights errors at each sample point and neutralizes the shift. Importance sampling technique and kernel density estimation were used for reweighteing. We validate the effectiveness of our approach using artificial data that resemble real-world spatial datasets. Our findings demonstrate advantages of the proposed approach for the estimation of the target error, offering a solution to a distribution shift problem. Overall error of predictions dropped from 7% to just 2% and it gets smaller for larger samples.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2309.04824 [cs.LG]
	(or arXiv:2309.04824v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.04824

Submission history

From: Alexey Zaytsev [view email]
[v1] Sat, 9 Sep 2023 15:36:28 UTC (2,909 KB)
[v2] Thu, 14 Sep 2023 06:33:59 UTC (2,909 KB)

Computer Science > Machine Learning

Title:Correcting sampling biases via importance reweighting for spatial modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Correcting sampling biases via importance reweighting for spatial modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators