-
A High-Order Perturbation of Envelopes (HOPE) Method for Vector Electromagnetic Scattering by Periodic Inhomogeneous Media: Analytic Continuation
Authors:
David Nicholls,
Liet Vo
Abstract:
Electromagnetic waves interacting with three--dimensional periodic structures occur in many applications of great scientific and engineering interest. These three dimensional interactions are extremely complicated and subtle, so it is unsurprising that practitioners find their rapid, robust, and accurate numerical simulation to be of paramount interest. Among the wide array of possible numerical a…
▽ More
Electromagnetic waves interacting with three--dimensional periodic structures occur in many applications of great scientific and engineering interest. These three dimensional interactions are extremely complicated and subtle, so it is unsurprising that practitioners find their rapid, robust, and accurate numerical simulation to be of paramount interest. Among the wide array of possible numerical approaches, the High--Order Spectral algorithms are often preferred due to their surpassing fidelity with a moderate number of unknowns, and here we describe an algorithm that fits into this class. In addition, we take a perturbative approach to the problem which views the deviation of the permittivity from a reference value as the deformation and we conduct a regular perturbation theory. This work concludes a line of research on these methods which began with two-dimensional problems governed by the Helmholtz equation and moved to small perturbations in the fully three-dimensional vector Maxwell equations. We now extend these latter results to large (real) perturbations constituting a rigorous analytic continuation.
△ Less
Submitted 14 June, 2024;
originally announced July 2024.
-
Weighted Missing Linear Discriminant Analysis: An Explainable Approach for Classification with Missing Data
Authors:
Tuan L. Vo,
Uyen Dang,
Thu Nguyen
Abstract:
As Artificial Intelligence (AI) models are gradually being adopted in real-life applications, the explainability of the model used is critical, especially in high-stakes areas such as medicine, finance, etc. Among the commonly used models, Linear Discriminant Analysis (LDA) is a widely used classification tool that is also explainable thanks to its ability to model class distributions and maximize…
▽ More
As Artificial Intelligence (AI) models are gradually being adopted in real-life applications, the explainability of the model used is critical, especially in high-stakes areas such as medicine, finance, etc. Among the commonly used models, Linear Discriminant Analysis (LDA) is a widely used classification tool that is also explainable thanks to its ability to model class distributions and maximize class separation through linear feature combinations. Nevertheless, real-world data is frequently incomplete, presenting significant challenges for classification tasks and model explanations. In this paper, we propose a novel approach to LDA under missing data, termed \textbf{\textit{Weighted missing Linear Discriminant Analysis (WLDA)}}, to directly classify observations in data that contains missing values without imputation effectively by estimating the parameters directly on missing data and use a weight matrix for missing values to penalize missing entries during classification. Furthermore, we also analyze the theoretical properties and examine the explainability of the proposed technique in a comprehensive manner. Experimental results demonstrate that WLDA outperforms conventional methods by a significant margin, particularly in scenarios where missing values are present in both training and test sets.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Explainability of Machine Learning Models under Missing Data
Authors:
Tuan L. Vo,
Thu Nguyen,
Hugo L. Hammer,
Michael A. Riegler,
Pal Halvorsen
Abstract:
Missing data is a prevalent issue that can significantly impair model performance and interpretability. This paper briefly summarizes the development of the field of missing data with respect to Explainable Artificial Intelligence and experimentally investigates the effects of various imputation methods on the calculation of Shapley values, a popular technique for interpreting complex machine lear…
▽ More
Missing data is a prevalent issue that can significantly impair model performance and interpretability. This paper briefly summarizes the development of the field of missing data with respect to Explainable Artificial Intelligence and experimentally investigates the effects of various imputation methods on the calculation of Shapley values, a popular technique for interpreting complex machine learning models. We compare different imputation strategies and assess their impact on feature importance and interaction as determined by Shapley values. Moreover, we also theoretically analyze the effects of missing values on Shapley values. Importantly, our findings reveal that the choice of imputation method can introduce biases that could lead to changes in the Shapley values, thereby affecting the interpretability of the model. Moreover, and that a lower test prediction mean square error (MSE) may not imply a lower MSE in Shapley values and vice versa. Also, while Xgboost is a method that could handle missing data directly, using Xgboost directly on missing data can seriously affect interpretability compared to imputing the data before training Xgboost. This study provides a comprehensive evaluation of imputation methods in the context of model interpretation, offering practical guidance for selecting appropriate techniques based on dataset characteristics and analysis objectives. The results underscore the importance of considering imputation effects to ensure robust and reliable insights from machine learning models.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Imputation using training labels and classification via label imputation
Authors:
Thu Nguyen,
Tuan L. Vo,
Pål Halvorsen,
Michael A. Riegler
Abstract:
Missing data is a common problem in practical settings. Various imputation methods have been developed to deal with missing data. However, even though the label is usually available in the training data, the common practice of imputation usually only relies on the input and ignores the label. In this work, we illustrate how stacking the label into the input can significantly improve the imputation…
▽ More
Missing data is a common problem in practical settings. Various imputation methods have been developed to deal with missing data. However, even though the label is usually available in the training data, the common practice of imputation usually only relies on the input and ignores the label. In this work, we illustrate how stacking the label into the input can significantly improve the imputation of the input. In addition, we propose a classification strategy that initializes the predicted test label with missing values and stacks the label with the input for imputation. This allows imputing the label and the input at the same time. Also, the technique is capable of handling data training with missing labels without any prior imputation and is applicable to continuous, categorical, or mixed-type data. Experiments show promising results in terms of accuracy.
△ Less
Submitted 23 April, 2024; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH
Authors:
Nghia T. Nguyen,
Long Luu,
Phuong L. Vo,
Thi Thanh Sang Nguyen,
Cuong T. Do,
Ngoc-thanh Nguyen
Abstract:
Dynamic adaptive streaming over HTTP (DASH) has been widely used in video streaming recently. In DASH, the client downloads video chunks in order from a server. The rate adaptation function at the video client enhances the user's quality-of-experience (QoE) by choosing a suitable quality level for each video chunk to download based on the network condition. Today networks such as content delivery…
▽ More
Dynamic adaptive streaming over HTTP (DASH) has been widely used in video streaming recently. In DASH, the client downloads video chunks in order from a server. The rate adaptation function at the video client enhances the user's quality-of-experience (QoE) by choosing a suitable quality level for each video chunk to download based on the network condition. Today networks such as content delivery networks, edge caching networks, content-centric networks,... usually replicate video contents on multiple cache nodes. We study video streaming from multiple sources in this work. In multi-source streaming, video chunks may arrive out of order due to different conditions of the network paths. Hence, to guarantee a high QoE, the video client needs not only rate adaptation but also chunk scheduling. Reinforcement learning (RL) has emerged as the state-of-the-art control method in various fields in recent years. This paper proposes two algorithms for streaming from multiple sources: RL-based adaptation with greedy scheduling (RLAGS) and RL-based adaptation and scheduling (RLAS). We also build a simulation environment for training and evaluating. The efficiency of the proposed algorithms is proved via extensive simulations with real-trace data.
△ Less
Submitted 25 July, 2023;
originally announced August 2023.
-
A High-Order Perturbation of Envelopes (HOPE) Method for Vector Electromagnetic Scattering by Periodic Inhomogeneous Media
Authors:
David P. Nicholls,
Liet Vo
Abstract:
The scattering of electromagnetic waves by three--dimensional periodic structures is important for many problems of crucial scientific and engineering interest. Due to the complexity and three-dimensional nature of these waves, the fast, accurate, and reliable numerical simulations of these are indispensable for engineers and scientists alike. For this, High Order Spectral methods are frequently e…
▽ More
The scattering of electromagnetic waves by three--dimensional periodic structures is important for many problems of crucial scientific and engineering interest. Due to the complexity and three-dimensional nature of these waves, the fast, accurate, and reliable numerical simulations of these are indispensable for engineers and scientists alike. For this, High Order Spectral methods are frequently employed and here we describe an algorithm in this class. Our approach is perturbative in nature where we view the deviation of the permittivity from a constant value as the deformation and we pursue regular perturbation theory. This work extends our previous contribution regarding the Helmholtz equation to the full vector Maxwell equations, by providing a rigorous analyticity theory, both in deformation size and spatial variable (provided that the permittivity is, itself, analytic).
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP
Authors:
Phuong L. Vo,
Nghia T. Nguyen,
Long Luu,
Canh T. Dinh,
Nguyen H. Tran,
Tuan-Anh Le
Abstract:
In video streaming over HTTP, the bitrate adaptation selects the quality of video chunks depending on the current network condition. Some previous works have applied deep reinforcement learning (DRL) algorithms to determine the chunk's bitrate from the observed states to maximize the quality-of-experience (QoE). However, to build an intelligent model that can predict in various environments, such…
▽ More
In video streaming over HTTP, the bitrate adaptation selects the quality of video chunks depending on the current network condition. Some previous works have applied deep reinforcement learning (DRL) algorithms to determine the chunk's bitrate from the observed states to maximize the quality-of-experience (QoE). However, to build an intelligent model that can predict in various environments, such as 3G, 4G, Wifi, \textit{etc.}, the states observed from these environments must be sent to a server for training centrally. In this work, we integrate federated learning (FL) to DRL-based rate adaptation to train a model appropriate for different environments. The clients in the proposed framework train their model locally and only update the weights to the server. The simulations show that our federated DRL-based rate adaptations, called FDRLABR with different DRL algorithms, such as deep Q-learning, advantage actor-critic, and proximal policy optimization, yield better performance than the traditional bitrate adaptation methods in various environments.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Blockwise Principal Component Analysis for monotone missing data imputation and dimensionality reduction
Authors:
Tu T. Do,
Mai Anh Vu,
Tuan L. Vo,
Hoang Thien Ly,
Thu Nguyen,
Steven A. Hicks,
Michael A. Riegler,
Pål Halvorsen,
Binh T. Nguyen
Abstract:
Monotone missing data is a common problem in data analysis. However, imputation combined with dimensionality reduction can be computationally expensive, especially with the increasing size of datasets. To address this issue, we propose a Blockwise principal component analysis Imputation (BPI) framework for dimensionality reduction and imputation of monotone missing data. The framework conducts Pri…
▽ More
Monotone missing data is a common problem in data analysis. However, imputation combined with dimensionality reduction can be computationally expensive, especially with the increasing size of datasets. To address this issue, we propose a Blockwise principal component analysis Imputation (BPI) framework for dimensionality reduction and imputation of monotone missing data. The framework conducts Principal Component Analysis (PCA) on the observed part of each monotone block of the data and then imputes on merging the obtained principal components using a chosen imputation technique. BPI can work with various imputation techniques and can significantly reduce imputation time compared to conducting dimensionality reduction after imputation. This makes it a practical and efficient approach for large datasets with monotone missing data. Our experiments validate the improvement in speed. In addition, our experiments also show that while applying MICE imputation directly on missing data may not yield convergence, applying BPI with MICE for the data may lead to convergence.
△ Less
Submitted 10 January, 2024; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Higher order time discretization method for a class of semilinear stochastic partial differential equations with multiplicative noise
Authors:
Yukun Li,
Liet Vo,
Guanqian Wang
Abstract:
In this paper, we consider a new approach for semi-discretization in time and spatial discretization of a class of semi-linear stochastic partial differential equations (SPDEs) with multiplicative noise. The drift term of the SPDEs is only assumed to satisfy a one-sided Lipschitz condition and the diffusion term is assumed to be globally Lipschitz continuous. Our new strategy for time discretizati…
▽ More
In this paper, we consider a new approach for semi-discretization in time and spatial discretization of a class of semi-linear stochastic partial differential equations (SPDEs) with multiplicative noise. The drift term of the SPDEs is only assumed to satisfy a one-sided Lipschitz condition and the diffusion term is assumed to be globally Lipschitz continuous. Our new strategy for time discretization is based on the Milstein method from stochastic differential equations. We use the energy method for its error analysis and show a strong convergence order of nearly $1$ for the approximate solution. The proof is based on new Hölder continuity estimates of the SPDE solution and the nonlinear term. For the general polynomial-type drift term, there are difficulties in deriving even the stability of the numerical solutions. We propose an interpolation-based finite element method for spatial discretization to overcome the difficulties. Then we obtain $H^1$ stability, higher moment $H^1$ stability, $L^2$ stability, and higher moment $L^2$ stability results using numerical and stochastic techniques. The nearly optimal convergence orders in time and space are hence obtained by coupling all previous results. Numerical experiments are presented to implement the proposed numerical scheme and to validate the theoretical results.
△ Less
Submitted 7 July, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Higher order time discretization method for the stochastic Stokes equations with multiplicative noise
Authors:
Liet Vo
Abstract:
In this paper, we propose a new approach for the time-discretization of the incompressible stochastic Stokes equations with multiplicative noise. Our new strategy is based on the classical Milstein method from stochastic differential equations. We use the energy method for its error analysis and show a strong convergence order of at most $1$ for both velocity and pressure approximations. The proof…
▽ More
In this paper, we propose a new approach for the time-discretization of the incompressible stochastic Stokes equations with multiplicative noise. Our new strategy is based on the classical Milstein method from stochastic differential equations. We use the energy method for its error analysis and show a strong convergence order of at most $1$ for both velocity and pressure approximations. The proof is based on a new Hölder continuity estimate of the velocity solution. While the errors of the velocity approximation are estimated in the standard $L^2$- and $H^1$-norms, the pressure errors are carefully analyzed in a special norm because of the low regularity of the pressure solution. In addition, a new interpretation of the pressure solution, which is very useful in computation, is also introduced. Numerical experiments are also provided to validate the error estimates and their sharpness.
△ Less
Submitted 6 December, 2022; v1 submitted 4 November, 2022;
originally announced November 2022.
-
High moment and pathwise error estimates for fully discrete mixed finite element approximattions of stochastic Navier-Stokes equations with additive noise
Authors:
Xiaobing Feng,
Liet Vo
Abstract:
This paper is concerned with high moment and pathwise error estimates for fully discrete mixed finite element approximattions of stochastic Navier-Stokes equations with general additive noise. The implicit Euler-Maruyama scheme and standard mixed finite element methods are employed respectively for the time and space discretizations. High moment error estimates for both velocity and a time-avraged…
▽ More
This paper is concerned with high moment and pathwise error estimates for fully discrete mixed finite element approximattions of stochastic Navier-Stokes equations with general additive noise. The implicit Euler-Maruyama scheme and standard mixed finite element methods are employed respectively for the time and space discretizations. High moment error estimates for both velocity and a time-avraged pressure approximations in strong $L^2$ and energy norms are obtained, pathwise error estimates are derived by using the Kolmogorov Theorem. Unlike their derterministic counterparts, the spatial error constants grow in the order of $O(k^{-\frac12})$, where $k$ denotes time step size. Numerical experiments are also provided to validate the error estimates and their sharpness.
△ Less
Submitted 2 October, 2022; v1 submitted 25 September, 2022;
originally announced September 2022.
-
Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-stage Span Labeling
Authors:
Duc-Vu Nguyen,
Linh-Bao Vo,
Ngoc-Linh Tran,
Kiet Van Nguyen,
Ngan Luu-Thuy Nguyen
Abstract:
Chinese word segmentation and part-of-speech tagging are necessary tasks in terms of computational linguistics and application of natural language processing. Many re-searchers still debate the demand for Chinese word segmentation and part-of-speech tagging in the deep learning era. Nevertheless, resolving ambiguities and detecting unknown words are challenging problems in this field. Previous stu…
▽ More
Chinese word segmentation and part-of-speech tagging are necessary tasks in terms of computational linguistics and application of natural language processing. Many re-searchers still debate the demand for Chinese word segmentation and part-of-speech tagging in the deep learning era. Nevertheless, resolving ambiguities and detecting unknown words are challenging problems in this field. Previous studies on joint Chinese word segmentation and part-of-speech tagging mainly follow the character-based tagging model focusing on modeling n-gram features. Unlike previous works, we propose a neural model named SpanSegTag for joint Chinese word segmentation and part-of-speech tagging following the span labeling in which the probability of each n-gram being the word and the part-of-speech tag is the main problem. We use the biaffine operation over the left and right boundary representations of consecutive characters to model the n-grams. Our experiments show that our BERT-based model SpanSegTag achieved competitive performances on the CTB5, CTB6, and UD, or significant improvements on CTB7 and CTB9 benchmark datasets compared with the current state-of-the-art method using BERT or ZEN encoders.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Span Labeling Approach for Vietnamese and Chinese Word Segmentation
Authors:
Duc-Vu Nguyen,
Linh-Bao Vo,
Dang Van Thin,
Ngan Luu-Thuy Nguyen
Abstract:
In this paper, we propose a span labeling approach to model n-gram information for Vietnamese word segmentation, namely SPAN SEG. We compare the span labeling approach with the conditional random field by using encoders with the same architecture. Since Vietnamese and Chinese have similar linguistic phenomena, we evaluated the proposed method on the Vietnamese treebank benchmark dataset and five C…
▽ More
In this paper, we propose a span labeling approach to model n-gram information for Vietnamese word segmentation, namely SPAN SEG. We compare the span labeling approach with the conditional random field by using encoders with the same architecture. Since Vietnamese and Chinese have similar linguistic phenomena, we evaluated the proposed method on the Vietnamese treebank benchmark dataset and five Chinese benchmark datasets. Through our experimental results, the proposed approach SpanSeg achieves higher performance than the sequence tagging approach with the state-of-the-art F-score of 98.31% on the Vietnamese treebank benchmark, when they both apply the contextual pre-trained language model XLM-RoBERTa and the predicted word boundary information. Besides, we do fine-tuning experiments for the span labeling approach on BERT and ZEN pre-trained language model for Chinese with fewer parameters, faster inference time, and competitive or higher F-scores than the previous state-of-the-art approach, word segmentation with word-hood memory networks, on five Chinese benchmarks.
△ Less
Submitted 30 September, 2021;
originally announced October 2021.
-
High moment and pathwise error estimates for fully discrete mixed finite element approximations of the Stochastic Stokes Equations with Multiplicative Noises
Authors:
Liet Vo
Abstract:
This paper is concerned with high moment and pathwise error estimates for both velocity and pressure approximations of the Euler-Maruyama scheme for time discretization and its two fully discrete mixed finite element discretizations. The main idea for deriving the high moment error estimates for the velocity approximation is to use a bootstrap technique starting from the second moment error estima…
▽ More
This paper is concerned with high moment and pathwise error estimates for both velocity and pressure approximations of the Euler-Maruyama scheme for time discretization and its two fully discrete mixed finite element discretizations. The main idea for deriving the high moment error estimates for the velocity approximation is to use a bootstrap technique starting from the second moment error estimate. The pathwise error estimate, which is sub-optimal in the energy norm, is obtained by using Kolmogorov's theorem based on the high moment error estimates. Unlike for the velocity error estimate, the higher moment and pathwise error estimates for the pressure approximation are derived in a time-averaged norm. In addition, the impact of noise types on the rates of convergence for both velocity and pressure approximations is also addressed.
△ Less
Submitted 30 June, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
An efficient iterative method for solving parameter-dependent and random convention-diffusion problems
Authors:
Xiaobing Feng,
Yan Luo,
Liet Vo,
Zhu Wang
Abstract:
This paper develops and analyzes a general iterative framework for solving parameter-dependent and random convection-diffusion problems. It is inspired by the multi-modes method of [7,8] and the ensemble method of [20] and extends those methods into a more general and unified framework. The main idea of the framework is to reformulate the underlying problem into another problem with parameter-inde…
▽ More
This paper develops and analyzes a general iterative framework for solving parameter-dependent and random convection-diffusion problems. It is inspired by the multi-modes method of [7,8] and the ensemble method of [20] and extends those methods into a more general and unified framework. The main idea of the framework is to reformulate the underlying problem into another problem with parameter-independent convection and diffusion coefficients and a parameter-dependent (and solution-dependent) right-hand side, a fixed-point iteration is then employed to compute the solution of the reformulated problem. The main benefit of the proposed approach is that an efficient direct solver and a block Krylov subspace iterative solver can be used at each iteration, allowing to reuse the $LU$ matrix factorization or to do an efficient matrix-matrix multiplication for all parameters, which in turn results in significant computation saving. Convergence and rates of convergence are established for the iterative method both at the variational continuous level and at the finite element discrete level under some structure conditions. Several strategies for establishing reformulations of parameter-dependent and random diffusion and convection-diffusion problems are proposed and their computational complexity is analyzed. Several 1-D and 2-D numerical experiments are also provided to demonstrate the efficiency of the proposed iterative method and to validate the theoretical convergence results.
△ Less
Submitted 20 October, 2021; v1 submitted 25 May, 2021;
originally announced May 2021.
-
Analysis of Chorin-Type Projection Methods for the Stochastic Stokes Equations with General Multiplicative Noises
Authors:
Xiaobing Feng,
Liet Vo
Abstract:
This paper is concerned with numerical analysis of two fully discrete Chorin-type projection methods for the stochastic Stokes equations with general non-solenoidal multiplicative noise. The first scheme is the standard Chorin scheme and the second one is a modified Chorin scheme which is designed by employing the Helmholtz decomposition on the noise function at each time step to produce a project…
▽ More
This paper is concerned with numerical analysis of two fully discrete Chorin-type projection methods for the stochastic Stokes equations with general non-solenoidal multiplicative noise. The first scheme is the standard Chorin scheme and the second one is a modified Chorin scheme which is designed by employing the Helmholtz decomposition on the noise function at each time step to produce a projected divergence-free noise and a "pseudo pressure" after combining the original pressure and the curl-free part of the decomposition. Optimal order rates of the convergence are proved for both velocity and pressure approximations of these two (semi-discrete) Chorin schemes. It is crucial to measure the errors in appropriate norms. The fully discrete finite element methods are formulated by discretizing both semi-discrete Chorin schemes in space by the standard finite element method. Suboptimal order error estimates are derived for both fully discrete methods. It is proved that all spatial error constants contain a growth factor $k^{-1/2}$, where $k$ denotes the time step size, which explains the deteriorating performance of the standard Chorin scheme when $k\to 0$ and the space mesh size is fixed as observed earlier in the numerical tests of [9]. Numerical results are also provided to guage the performance of the proposed numerical methods and to validate the sharpness of the theoretical error estimates.
△ Less
Submitted 1 August, 2021; v1 submitted 28 October, 2020;
originally announced October 2020.
-
Optimally Convergent Mixed Finite Element Methods for the Stochastic Stokes Equations
Authors:
Xiaobing Feng,
Andreas Prohl,
Liet Vo
Abstract:
We propose some new mixed finite element methods for the time dependent stochastic Stokes equations with multiplicative noise, which use the Helmholtz decomposition of the driving multiplicative noise. It is known [16] that the pressure solution has a low regularity, which manifests in sub-optimal convergence rates for well-known inf-sup stable mixed finite element methods in numerical simulations…
▽ More
We propose some new mixed finite element methods for the time dependent stochastic Stokes equations with multiplicative noise, which use the Helmholtz decomposition of the driving multiplicative noise. It is known [16] that the pressure solution has a low regularity, which manifests in sub-optimal convergence rates for well-known inf-sup stable mixed finite element methods in numerical simulations, see [10]. We show that eliminating this gradient part from the noise in the numerical scheme leads to optimally convergent mixed finite element methods, and that this conceptual idea may be used to retool numerical methods that are well-known in the deterministic setting, including pressure stabilization methods, so that their optimal convergence properties can still be maintained in the stochastic setting. Computational experiments are also provided to validate the theoretical results and to illustrate the conceptional usefulness of the proposed numerical approach.
△ Less
Submitted 7 June, 2020; v1 submitted 6 May, 2020;
originally announced May 2020.
-
Privacy with Estimation Guarantees
Authors:
Hao Wang,
Lisa Vo,
Flavio P. Calmon,
Muriel Médard,
Ken R. Duffy,
Mayank Varia
Abstract:
We study the central problem in data privacy: how to share data with an analyst while providing both privacy and utility guarantees to the user that owns the data. In this setting, we present an estimation-theoretic analysis of the privacy-utility trade-off (PUT). Here, an analyst is allowed to reconstruct (in a mean-squared error sense) certain functions of the data (utility), while other private…
▽ More
We study the central problem in data privacy: how to share data with an analyst while providing both privacy and utility guarantees to the user that owns the data. In this setting, we present an estimation-theoretic analysis of the privacy-utility trade-off (PUT). Here, an analyst is allowed to reconstruct (in a mean-squared error sense) certain functions of the data (utility), while other private functions should not be reconstructed with distortion below a certain threshold (privacy). We demonstrate how chi-square information captures the fundamental PUT in this case and provide bounds for the best PUT. We propose a convex program to compute privacy-assuring map**s when the functions to be disclosed and hidden are known a priori and the data distribution is known. We derive lower bounds on the minimum mean-squared error of estimating a target function from the disclosed data and evaluate the robustness of our approach when an empirical distribution is used to compute the privacy-assuring map**s instead of the true data distribution. We illustrate the proposed approach through two numerical experiments.
△ Less
Submitted 20 March, 2020; v1 submitted 1 October, 2017;
originally announced October 2017.
-
The Multi-path Utility Maximization and Multi-path TCP Design
Authors:
Phuong L. Vo,
Anh T. Le,
Choong S. Hong
Abstract:
The network utility maximization problem (NUM) for multi-path is a problem which is non-strictly convex and non-separable. Using Jensen's inequality, we approximate the NUM to a strictly convex and separable problem which can be solved efficiently by the dual decomposition method. After a series of approximations, the result of the approximation problem converges to the globally optimal solution o…
▽ More
The network utility maximization problem (NUM) for multi-path is a problem which is non-strictly convex and non-separable. Using Jensen's inequality, we approximate the NUM to a strictly convex and separable problem which can be solved efficiently by the dual decomposition method. After a series of approximations, the result of the approximation problem converges to the globally optimal solution of the original problem.
Moreover, because of the separable and dual-based natures of the proposed algorithm, we utilize the reverse engineering frameworks of the current TCPs to develop a series of multi-path TCPs which are totally compatible with current TCPs. The multi-path users using our protocols can run simultaneously with the single-path users using the current TCPs. The simulations of our Multi-path Reno on ns-2 show the compatibility and the fairness among multi-path and single-path users.
△ Less
Submitted 4 January, 2013; v1 submitted 19 August, 2011;
originally announced August 2011.
-
The Successive Approximation Approach for NUM Frameworks with Elastic and Inelastic Traffic
Authors:
Phuong L. Vo,
Nguyen H. Tran,
Choong Seon Hong
Abstract:
The concave utility in the Network Utility Maximization (NUM) problem is only suitable for elastic flows. However, the networks with the multiclass traffic, the utility of inelastic traffic is usually represented by the sigmoidal function which is a nonconcave function. Hence, the basic NUM problem becomes a nonconvex optimization problem. Solving the nonconvex NUM distributively is a difficult pr…
▽ More
The concave utility in the Network Utility Maximization (NUM) problem is only suitable for elastic flows. However, the networks with the multiclass traffic, the utility of inelastic traffic is usually represented by the sigmoidal function which is a nonconcave function. Hence, the basic NUM problem becomes a nonconvex optimization problem. Solving the nonconvex NUM distributively is a difficult problem. The current works utilize the standard dual-based algorithm for the convex NUM and find the criteria for the global optimal convergence of the algorithm. It turns out that the link capacity must higher than a certain value to achieve the global optimum.
We propose a new distributed algorithm that converges to the suboptimal solution of the nonconvex NUM for all of link capacity. We approximate the logarithm of the original problem to the convex problem which is solved efficiently by the standard dual-base distributed algorithm. After a sequence of approximations, the solutions converge to the KKT solution of the original problem. In many of our experiments, it also converges to the global optimal solution of the NUM. Moreover, we extend our work to solve the joint rate and power NUM problem with elastic and inelastic traffic in a wireless network. Our techniques can be applied to any log-concave utilities.
△ Less
Submitted 15 April, 2012; v1 submitted 18 August, 2011;
originally announced August 2011.