-
On the Relationship Between Information-Theoretic Privacy Metrics And Probabilistic Information Privacy
Authors:
Chong Xiao Wang,
Wee Peng Tay
Abstract:
Information-theoretic (IT) measures based on $f$-divergences have recently gained interest as a measure of privacy leakage as they allow for trading off privacy against utility using only a single-value characterization. However, their operational interpretations in the privacy context are unclear. In this paper, we relate the notion of probabilistic information privacy (IP) to several IT privacy…
▽ More
Information-theoretic (IT) measures based on $f$-divergences have recently gained interest as a measure of privacy leakage as they allow for trading off privacy against utility using only a single-value characterization. However, their operational interpretations in the privacy context are unclear. In this paper, we relate the notion of probabilistic information privacy (IP) to several IT privacy metrics based on $f$-divergences. We interpret probabilistic IP under both the detection and estimation frameworks and link it to differential privacy, thus allowing a precise operational interpretation of these IT privacy metrics. We show that the $χ^2$-divergence privacy metric is stronger than those based on total variation distance and Kullback-Leibler divergence. Therefore, we further develop a data-driven empirical risk framework based on the $χ^2$-divergence privacy metric and realized using deep neural networks. This framework is agnostic to the adversarial attack model. Empirical experiments demonstrate the efficacy of our approach.
△ Less
Submitted 19 January, 2023;
originally announced January 2023.
-
Semi-Nonparametric Estimation of Distribution Divergence in Non-Euclidean Spaces
Authors:
Chong Xiao Wang,
Wee Peng Tay
Abstract:
This paper explores methods for estimating or approximating the total variation distance and the chi-squared divergence of probability measures within topological sample spaces, using independent and identically distributed samples. Our focus is on the practical scenario where the sample space is homeomorphic to subsets of Euclidean space, with the specific homeomorphism remaining unknown. Our pro…
▽ More
This paper explores methods for estimating or approximating the total variation distance and the chi-squared divergence of probability measures within topological sample spaces, using independent and identically distributed samples. Our focus is on the practical scenario where the sample space is homeomorphic to subsets of Euclidean space, with the specific homeomorphism remaining unknown. Our proposed methods rely on the integral probability metric with witness functions in universal reproducing kernel Hilbert spaces (RKHSs). The estimators we develop consist of learnable parametric functions map** the sample space to Euclidean space, paired with universal kernels defined in Euclidean space. This approach effectively overcomes the challenge of constructing universal kernels directly on non-Euclidean spaces. Furthermore, the estimators we devise demonstrate asymptotic consistency, and we provide a detailed statistical analysis, shedding light on their practical implementation.
△ Less
Submitted 18 December, 2023; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Data-driven Regularized Inference Privacy
Authors:
Chong Xiao Wang,
Wee Peng Tay
Abstract:
Data is used widely by service providers as input to inference systems to perform decision making for authorized tasks. The raw data however allows a service provider to infer other sensitive information it has not been authorized for. We propose a data-driven inference privacy preserving framework to sanitize data so as to prevent leakage of sensitive information that is present in the raw data,…
▽ More
Data is used widely by service providers as input to inference systems to perform decision making for authorized tasks. The raw data however allows a service provider to infer other sensitive information it has not been authorized for. We propose a data-driven inference privacy preserving framework to sanitize data so as to prevent leakage of sensitive information that is present in the raw data, while ensuring that the sanitized data is still compatible with the service provider's legacy inference system. We develop an inference privacy framework based on the variational method and include maximum mean discrepancy and domain adaption as techniques to regularize the domain of the sanitized data to ensure its legacy compatibility. However, the variational method leads to weak privacy in cases where the underlying data distribution is hard to approximate. It may also face difficulties when handling continuous private variables. To overcome this, we propose an alternative formulation of the privacy metric using maximal correlation and we present empirical methods to estimate it. Finally, we develop a deep learning model as an example of the proposed inference privacy framework. Numerical experiments verify the feasibility of our approach.
△ Less
Submitted 10 October, 2020;
originally announced October 2020.
-
Arbitrarily Strong Utility-Privacy Tradeoff in Multi-Agent Systems
Authors:
Chong Xiao Wang,
Yang Song,
Wee Peng Tay
Abstract:
Each agent in a network makes a local observation that is linearly related to a set of public and private parameters. The agents send their observations to a fusion center to allow it to estimate the public parameters. To prevent leakage of the private parameters, each agent first sanitizes its local observation using a local privacy mechanism before transmitting it to the fusion center. We invest…
▽ More
Each agent in a network makes a local observation that is linearly related to a set of public and private parameters. The agents send their observations to a fusion center to allow it to estimate the public parameters. To prevent leakage of the private parameters, each agent first sanitizes its local observation using a local privacy mechanism before transmitting it to the fusion center. We investigate the utility-privacy tradeoff in terms of the Cramér-Rao lower bounds for estimating the public and private parameters. We study the class of privacy mechanisms given by linear compression and noise perturbation, and derive necessary and sufficient conditions for achieving arbitrarily strong utility-privacy tradeoff in a multi-agent system for both the cases where prior information is available and unavailable, respectively. We also provide a method to find the maximum estimation privacy achievable without compromising the utility and propose an alternating algorithm to optimize the utility-privacy tradeoff in the case where arbitrarily strong utility-privacy tradeoff is not achievable.
△ Less
Submitted 10 August, 2020; v1 submitted 15 January, 2020;
originally announced January 2020.
-
Compressive Privacy for a Linear Dynamical System
Authors:
Yang Song,
Chong Xiao Wang,
Wee Peng Tay
Abstract:
We consider a linear dynamical system in which the state vector consists of both public and private states. One or more sensors make measurements of the state vector and sends information to a fusion center, which performs the final state estimation. To achieve an optimal tradeoff between the utility of estimating the public states and protection of the private states, the measurements at each tim…
▽ More
We consider a linear dynamical system in which the state vector consists of both public and private states. One or more sensors make measurements of the state vector and sends information to a fusion center, which performs the final state estimation. To achieve an optimal tradeoff between the utility of estimating the public states and protection of the private states, the measurements at each time step are linearly compressed into a lower dimensional space. Under the centralized setting where all measurements are collected by a single sensor, we propose an optimization problem and an algorithm to find the best compression matrix. Under the decentralized setting where measurements are made separately at multiple sensors, each sensor optimizes its own local compression matrix. We propose methods to separate the overall optimization problem into multiple sub-problems that can be solved locally at each sensor. We consider the cases where there is no message exchange between the sensors; and where each sensor takes turns to transmit messages to the other sensors. Simulations and empirical experiments demonstrate the efficiency of our proposed approach in allowing the fusion center to estimate the public states with good accuracy while preventing it from estimating the private states accurately.
△ Less
Submitted 18 July, 2019; v1 submitted 9 December, 2018;
originally announced December 2018.
-
Practical Implementation of Spatial Modulation
Authors:
N. Serafimovski,
A. Younis,
R. Mesleh,
P. Chambers,
M. Di Renzo,
C. X. Wang,
P. M. Grant,
M. A. Beach,
H. Haas
Abstract:
In this work we seek to characterise the performance of spatial modulation (SM) and spatial multiplexing (SMX) with an experimental test bed. Two National Instruments (NI)-PXIe devices are used for the system testing, one for the transmitter and one for the receiver. The digital signal processing that formats the information data in preparation of transmission is described along with the digital s…
▽ More
In this work we seek to characterise the performance of spatial modulation (SM) and spatial multiplexing (SMX) with an experimental test bed. Two National Instruments (NI)-PXIe devices are used for the system testing, one for the transmitter and one for the receiver. The digital signal processing that formats the information data in preparation of transmission is described along with the digital signal processing that recovers the information data. In addition, the hardware limitations of the system are also analysed. The average bit error ratio (ABER) of the system is validated through both theoretical analysis and simulation results for SM and SMX under line of sight (LoS) channel conditions.
△ Less
Submitted 3 June, 2013; v1 submitted 3 May, 2013;
originally announced May 2013.