-
Quickest Change Detection with Controlled Sensing
Authors:
Venugopal V. Veeravalli,
Georgios Fellouris,
George V. Moustakides
Abstract:
In the problem of quickest change detection, a change occurs at some unknown time in the distribution of a sequence of random vectors that are monitored in real time, and the goal is to detect this change as quickly as possible subject to a certain false alarm constraint. In this work we consider this problem in the presence of parametric uncertainty in the post-change regime and controlled sensin…
▽ More
In the problem of quickest change detection, a change occurs at some unknown time in the distribution of a sequence of random vectors that are monitored in real time, and the goal is to detect this change as quickly as possible subject to a certain false alarm constraint. In this work we consider this problem in the presence of parametric uncertainty in the post-change regime and controlled sensing. That is, the post-change distribution contains an unknown parameter, and the distribution of each observation, before and after the change, is affected by a control action. In this context, in addition to a stop** rule that determines the time at which it is declared that the change has occurred, one also needs to determine a sequential control policy, which chooses the control action at each time based on the already collected observations. We formulate this problem mathematically using Lorden's minimax criterion, and assuming that there are finitely many possible actions and post-change parameter values. We then propose a specific procedure for this problem that employs an adaptive CuSum statistic in which (i) the estimate of the parameter is based on a fixed number of the more recent observations, and (ii) each action is selected to maximize the Kullback-Leibler divergence of the next observation based on the current parameter estimate, apart from a small number of exploration times. We show that this procedure, which we call the Windowed Chernoff-CuSum (WCC), is first-order asymptotically optimal under Lorden's minimax criterion, for every possible possible value of the unknown post-change parameter, as the mean time to false alarm goes to infinity. We also provide simulation results to illustrate the performance of the WCC procedure.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
A combinatorial proof for the secretary problem with multiple choices
Authors:
Xujun Liu,
Olgica Milenkovic,
George V. Moustakides
Abstract:
The Secretary problem is a classical sequential decision-making question that can be succinctly described as follows: a set of rank-ordered applicants are interviewed sequentially for a single position. Once an applicant is interviewed, an immediate and irrevocable decision is made if the person is to be offered the job or not and only applicants observed so far can be used in the decision process…
▽ More
The Secretary problem is a classical sequential decision-making question that can be succinctly described as follows: a set of rank-ordered applicants are interviewed sequentially for a single position. Once an applicant is interviewed, an immediate and irrevocable decision is made if the person is to be offered the job or not and only applicants observed so far can be used in the decision process. The problem of interest is to identify the stop** rule that maximizes the probability of hiring the highest-ranked applicant. A multiple-choice version of the Secretary problem, known as the Dowry problem, assumes that one is given a fixed integer budget for the total number of selections allowed to choose the best applicant. It has been solved using tools from dynamic programming and optimal stop** theory. We provide the first combinatorial proof for a related new \emph{query-based model} for which we are allowed to solicit the response of an expert to determine if an applicant is optimal. Since the selection criteria differ from those of the Dowry problem we obtain nonidentical expected stop** times.
Our result indicates that an optimal strategy is the $(a_s, a_{s-1}, \ldots, a_1)$-strategy, i.e., for the $i^{th}$ selection, where $1 \le i \le s$ and $1 \le j = s+1-i \le s$, we reject the first $a_j$ applicants, wait until the decision of the $(i-1)^{th}$ selection (if $i \ge 2$), and then accept the next applicant whose qualification is better than all previously appeared applicants. Furthermore, our optimal strategy is right-hand based, i.e., the optimal strategies for two models with $s_1$ and $s_2$ selections in total ($s_1 < s_2$) share the same sequence $a_1, a_2, \ldots, a_{s_1}$ when it is viewed from the right. When the total number of applicants tends to infinity, our result agrees with the thresholds obtained by Gilbert and Mosteller.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Optimal Stop** Methodology for the Secretary Problem with Random Queries
Authors:
George V. Moustakides,
Xujun Liu,
Olgica Milenkovic
Abstract:
Candidates arrive sequentially for an interview process which results in them being ranked relative to their predecessors. Based on the ranks available at each time, one must develop a decision mechanism that selects or dismisses the current candidate in an effort to maximize the chance to select the best. This classical version of the ``Secretary problem'' has been studied in depth using mostly c…
▽ More
Candidates arrive sequentially for an interview process which results in them being ranked relative to their predecessors. Based on the ranks available at each time, one must develop a decision mechanism that selects or dismisses the current candidate in an effort to maximize the chance to select the best. This classical version of the ``Secretary problem'' has been studied in depth using mostly combinatorial approaches, along with numerous other variants. In this work we consider a particular new version where during reviewing one is allowed to query an external expert to improve the probability of making the correct decision. Unlike existing formulations, we consider experts that are not necessarily infallible and may provide suggestions that can be faulty. For the solution of our problem we adopt a probabilistic methodology and view the querying times as consecutive stop** times which we optimize with the help of optimal stop** theory. For each querying time we must also design a mechanism to decide whether we should terminate the search at the querying time or not. This decision is straightforward under the usual assumption of infallible experts but, when experts are faulty, it has a far more intricate structure.
△ Less
Submitted 2 August, 2023; v1 submitted 15 July, 2021;
originally announced July 2021.
-
Query-Based Selection of Optimal Candidates under the Mallows Model
Authors:
Xujun Liu,
Olgica Milenkovic,
George V. Moustakides
Abstract:
We study the secretary problem in which rank-ordered lists are generated by the Mallows model and the goal is to identify the highest-ranked candidate through a sequential interview process which does not allow rejected candidates to be revisited. The main difference between our formulation and existing models is that, during the selection process, we are given a fixed number of opportunities to q…
▽ More
We study the secretary problem in which rank-ordered lists are generated by the Mallows model and the goal is to identify the highest-ranked candidate through a sequential interview process which does not allow rejected candidates to be revisited. The main difference between our formulation and existing models is that, during the selection process, we are given a fixed number of opportunities to query an infallible expert whether the current candidate is the highest-ranked or not. If the response is positive, the selection process terminates, otherwise, the search continues until a new potentially optimal candidate is identified. Our optimal interview strategy, as well as the expected number of candidates interviewed and the expected number of queries used, can be determined through the evaluation of well-defined recurrence relations. Specifically, if we are allowed to query $s-1$ times and to make a final selection without querying (thus, making $s$ selections in total) then the optimum scheme is characterized by $s$ thresholds that depend on the parameter $θ$ of the Mallows distribution but are independent on the maximum number of queries.
△ Less
Submitted 2 March, 2023; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Image De-Quantization Using Generative Models as Priors
Authors:
Kalliopi Basioti,
George V. Moustakides
Abstract:
Image quantization is used in several applications aiming in reducing the number of available colors in an image and therefore its size. De-quantization is the task of reversing the quantization effect and recovering the original multi-chromatic level image. Existing techniques achieve de-quantization by imposing suitable constraints on the ideal image in order to make the recovery problem feasibl…
▽ More
Image quantization is used in several applications aiming in reducing the number of available colors in an image and therefore its size. De-quantization is the task of reversing the quantization effect and recovering the original multi-chromatic level image. Existing techniques achieve de-quantization by imposing suitable constraints on the ideal image in order to make the recovery problem feasible since it is otherwise ill-posed. Our goal in this work is to develop a de-quantization mechanism through a rigorous mathematical analysis which is based on the classical statistical estimation theory. In this effort we incorporate generative modeling of the ideal image as a suitable prior information. The resulting technique is simple and capable of de-quantizing successfully images that have experienced severe quantization effects. Interestingly, our method can recover images even if the quantization process is not exactly known and contains unknown parameters.
△ Less
Submitted 17 July, 2020; v1 submitted 15 July, 2020;
originally announced July 2020.
-
Image Restoration from Parametric Transformations using Generative Models
Authors:
Kalliopi Basioti,
George V. Moustakides
Abstract:
When images are statistically described by a generative model we can use this information to develop optimum techniques for various image restoration problems as inpainting, super-resolution, image coloring, generative model inversion, etc. With the help of the generative model it is possible to formulate, in a natural way, these restoration problems as Statistical estimation problems. Our approac…
▽ More
When images are statistically described by a generative model we can use this information to develop optimum techniques for various image restoration problems as inpainting, super-resolution, image coloring, generative model inversion, etc. With the help of the generative model it is possible to formulate, in a natural way, these restoration problems as Statistical estimation problems. Our approach, by combining maximum a-posteriori probability with maximum likelihood estimation, is capable of restoring images that are distorted by transformations even when the latter contain unknown parameters. The resulting optimization is completely defined with no parameters requiring tuning. This must be compared with the current state of the art which requires exact knowledge of the transformations and contains regularizer terms with weights that must be properly defined. Finally, we must mention that we extend our method to accommodate mixtures of multiple images where each image is described by its own generative model and we are able of successfully separating each participating image from a single mixture.
△ Less
Submitted 16 June, 2020; v1 submitted 26 May, 2020;
originally announced May 2020.
-
Designing GANs: A Likelihood Ratio Approach
Authors:
Kalliopi Basioti,
George V. Moustakides
Abstract:
We are interested in the design of generative networks. The training of these mathematical structures is mostly performed with the help of adversarial (min-max) optimization problems. We propose a simple methodology for constructing such problems assuring, at the same time, consistency of the corresponding solution. We give characteristic examples developed by our method, some of which can be reco…
▽ More
We are interested in the design of generative networks. The training of these mathematical structures is mostly performed with the help of adversarial (min-max) optimization problems. We propose a simple methodology for constructing such problems assuring, at the same time, consistency of the corresponding solution. We give characteristic examples developed by our method, some of which can be recognized from other applications, and some are introduced here for the first time. We present a new metric, the likelihood ratio, that can be employed online to examine the convergence and stability during the training of different Generative Adversarial Networks (GANs). Finally, we compare various possibilities by applying them to well-known datasets using neural networks of different configurations and sizes.
△ Less
Submitted 15 July, 2021; v1 submitted 3 February, 2020;
originally announced February 2020.
-
Training Neural Networks for Likelihood/Density Ratio Estimation
Authors:
George V. Moustakides,
Kalliopi Basioti
Abstract:
Various problems in Engineering and Statistics require the computation of the likelihood ratio function of two probability densities. In classical approaches the two densities are assumed known or to belong to some known parametric family. In a data-driven version we replace this requirement with the availability of data sampled from the densities of interest. For most well known problems in Detec…
▽ More
Various problems in Engineering and Statistics require the computation of the likelihood ratio function of two probability densities. In classical approaches the two densities are assumed known or to belong to some known parametric family. In a data-driven version we replace this requirement with the availability of data sampled from the densities of interest. For most well known problems in Detection and Hypothesis testing we develop solutions by providing neural network based estimates of the likelihood ratio or its transformations. This task necessitates the definition of proper optimizations which can be used for the training of the network. The main purpose of this work is to offer a simple and unified methodology for defining such optimization problems with guarantees that the solution is indeed the desired function. Our results are extended to cover estimates for likelihood ratios of conditional densities and estimates for statistics encountered in local approaches.
△ Less
Submitted 5 November, 2019; v1 submitted 1 November, 2019;
originally announced November 2019.
-
Optimizing Shallow Networks for Binary Classification
Authors:
Kalliopi Basioti,
George V. Moustakides
Abstract:
Data driven classification that relies on neural networks is based on optimization criteria that involve some form of distance between the output of the network and the desired label. Using the same mathematical analysis, for a multitude of such measures, we can show that their optimum solution matches the ideal likelihood ratio test classifier. In this work we introduce a different family of opti…
▽ More
Data driven classification that relies on neural networks is based on optimization criteria that involve some form of distance between the output of the network and the desired label. Using the same mathematical analysis, for a multitude of such measures, we can show that their optimum solution matches the ideal likelihood ratio test classifier. In this work we introduce a different family of optimization problems which is not covered by the existing approaches and, therefore, opens possibilities for new training algorithms for neural network based classification. We give examples that lead to algorithms that are simple in implementation, exhibit stable convergence characteristics and are antagonistic to the most popular existing techniques.
△ Less
Submitted 23 June, 2019; v1 submitted 24 May, 2019;
originally announced May 2019.
-
Kernel-Based Training of Generative Networks
Authors:
Kalliopi Basioti,
George V. Moustakides,
Emmanouil Z. Psarakis
Abstract:
Generative adversarial networks (GANs) are designed with the help of min-max optimization problems that are solved with stochastic gradient-type algorithms which are known to be non-robust. In this work we revisit a non-adversarial method based on kernels which relies on a pure minimization problem and propose a simple stochastic gradient algorithm for the computation of its solution. Using simpli…
▽ More
Generative adversarial networks (GANs) are designed with the help of min-max optimization problems that are solved with stochastic gradient-type algorithms which are known to be non-robust. In this work we revisit a non-adversarial method based on kernels which relies on a pure minimization problem and propose a simple stochastic gradient algorithm for the computation of its solution. Using simplified tools from Stochastic Approximation theory we demonstrate that batch versions of the algorithm or smoothing of the gradient do not improve convergence. These observations allow for the development of a training algorithm that enjoys reduced computational complexity and increased robustness while exhibiting similar synthesis characteristics as classical GANs.
△ Less
Submitted 23 November, 2018;
originally announced November 2018.
-
Minimax Optimality of Shiryaev-Roberts Procedure for Quickest Drift Change Detection of a Brownian motion
Authors:
Taposh Banerjee,
George V. Moustakides
Abstract:
The problem of detecting a change in the drift of a Brownian motion is considered. The change point is assumed to have a modified exponential prior distribution with unknown parameters. A worst-case analysis with respect to these parameters is adopted leading to a min-max problem formulation. Analytical and numerical justifications are provided towards establishing that the Shiryaev-Roberts proced…
▽ More
The problem of detecting a change in the drift of a Brownian motion is considered. The change point is assumed to have a modified exponential prior distribution with unknown parameters. A worst-case analysis with respect to these parameters is adopted leading to a min-max problem formulation. Analytical and numerical justifications are provided towards establishing that the Shiryaev-Roberts procedure with a specially designed starting point is exactly optimal for the proposed mathematical setup.
△ Less
Submitted 9 October, 2016;
originally announced October 2016.
-
Opportunistic Detection Rules: Finite and Asymptotic Analysis
Authors:
Wenyi Zhang,
George V. Moustakides,
H. Vincent Poor
Abstract:
Opportunistic detection rules (ODRs) are variants of fixed-sample-size detection rules in which the statistician is allowed to make an early decision on the alternative hypothesis opportunistically based on the sequentially observed samples. From a sequential decision perspective, ODRs are also mixtures of one-sided and truncated sequential detection rules. Several results regarding ODRs are estab…
▽ More
Opportunistic detection rules (ODRs) are variants of fixed-sample-size detection rules in which the statistician is allowed to make an early decision on the alternative hypothesis opportunistically based on the sequentially observed samples. From a sequential decision perspective, ODRs are also mixtures of one-sided and truncated sequential detection rules. Several results regarding ODRs are established in this paper. In the finite regime, the maximum sample size is modeled either as a fixed finite number, or a geometric random variable with a fixed finite mean. For both cases, the corresponding Bayesian formulations are investigated. The former case is a slight variation of the well-known finite-length sequential hypothesis testing procedure in the literature, whereas the latter case is new, for which the Bayesian optimal ODR is shown to be a sequence of likelihood ratio threshold tests with two different thresholds: a running threshold, which is determined by solving a stationary state equation, is used when future samples are still available, and a terminal threshold (simply the ratio between the priors scaled by costs) is used when the statistician reaches the final sample and thus has to make a decision immediately. In the asymptotic regime, the tradeoff among the exponents of the (false alarm and miss) error probabilities and the normalized expected stop** time under the alternative hypothesis is completely characterized and proved to be tight, via an information-theoretic argument. Within the tradeoff region, one noteworthy fact is that the performance of the Stein-Chernoff Lemma is attainable by ODRs.
△ Less
Submitted 12 February, 2016;
originally announced February 2016.
-
Detecting Sparse Mixtures: Rate of Decay of Error Probability
Authors:
Jonathan G. Ligo,
George V. Moustakides,
Venugopal V. Veeravalli
Abstract:
We study the rate of decay of the probability of error for distinguishing between a sparse signal with noise, modeled as a sparse mixture, from pure noise. This problem has many applications in signal processing, evolutionary biology, bioinformatics, astrophysics and feature selection for machine learning. We let the mixture probability tend to zero as the number of observations tends to infinity…
▽ More
We study the rate of decay of the probability of error for distinguishing between a sparse signal with noise, modeled as a sparse mixture, from pure noise. This problem has many applications in signal processing, evolutionary biology, bioinformatics, astrophysics and feature selection for machine learning. We let the mixture probability tend to zero as the number of observations tends to infinity and derive oracle rates at which the error probability can be driven to zero for a general class of signal and noise distributions via the likelihood ratio test. In contrast to the problem of detection of non-sparse signals, we see the log-probability of error decays sublinearly rather than linearly and is characterized through the $χ^2$-divergence rather than the Kullback-Leibler divergence for "weak" signals and can be independent of divergence for "strong" signals. Our contribution is the first characterization of the rate of decay of the error probability for this problem for both the false alarm and miss probabilities.
△ Less
Submitted 24 December, 2016; v1 submitted 24 September, 2015;
originally announced September 2015.
-
Sampling-based Roadmap Planners are Probably Near-Optimal after Finite Computation
Authors:
Andrew Dobson,
George V. Moustakides,
Kostas E. Bekris
Abstract:
Sampling-based motion planners have proven to be efficient solutions to a variety of high-dimensional, geometrically complex motion planning problems with applications in several domains. The traditional view of these approaches is that they solve challenges efficiently by giving up formal guarantees and instead attain asymptotic properties in terms of completeness and optimality. Recent work has…
▽ More
Sampling-based motion planners have proven to be efficient solutions to a variety of high-dimensional, geometrically complex motion planning problems with applications in several domains. The traditional view of these approaches is that they solve challenges efficiently by giving up formal guarantees and instead attain asymptotic properties in terms of completeness and optimality. Recent work has argued based on Monte Carlo experiments that these approaches also exhibit desirable probabilistic properties in terms of completeness and optimality after finite computation. The current paper formalizes these guarantees. It proves a formal bound on the probability that solutions returned by asymptotically optimal roadmap-based methods (e.g., PRM*) are within a bound of the optimal path length I* with clearance ε after a finite iteration n. This bound has the form P(|In - I* | {\leq} δI*) {\leq} Psuccess, where δ is an error term for the length a path in the PRM* graph, In. This bound is proven for general dimension Euclidean spaces and evaluated in simulation. A discussion on how this bound can be used in practice, as well as bounds for sparse roadmaps are also provided.
△ Less
Submitted 8 April, 2014;
originally announced April 2014.
-
Sequential and Decentralized Estimation of Linear Regression Parameters in Wireless Sensor Networks
Authors:
Yasin Yilmaz,
George V. Moustakides,
Xiaodong Wang
Abstract:
Sequential estimation of a vector of linear regression coefficients is considered under both centralized and decentralized setups. In sequential estimation, the number of observations used for estimation is determined by the observed samples, hence is random, as opposed to fixed-sample-size estimation. Specifically, after receiving a new sample, if a target accuracy level is reached, we stop and e…
▽ More
Sequential estimation of a vector of linear regression coefficients is considered under both centralized and decentralized setups. In sequential estimation, the number of observations used for estimation is determined by the observed samples, hence is random, as opposed to fixed-sample-size estimation. Specifically, after receiving a new sample, if a target accuracy level is reached, we stop and estimate using the samples collected so far; otherwise we continue to receive another sample. It is known that finding an optimum sequential estimator, which minimizes the average sample number for a given target accuracy level, is an intractable problem with a general stop** rule that depends on the complete observation history. By properly restricting the search space to stop** rules that depend on a specific subset of the complete observation history, we derive the optimum sequential estimator in the centralized case via optimal stop** theory. However, finding the optimum stop** rule in this case requires numerical computations that {\em quadratically} scales with the number of parameters to be estimated. For the decentralized setup with stringent energy constraints, under an alternative problem formulation that is conditional on the observed regressors, we first derive a simple optimum scheme whose computational complexity is {\em constant} with respect to the number of parameters. Then, following this simple optimum scheme we propose a decentralized sequential estimator whose computational complexity and energy consumption scales {\em linearly} with the number of parameters. Specifically, in the proposed decentralized scheme a close-to-optimum average stop** time performance is achieved by infrequently transmitting a single pulse with very short duration.
△ Less
Submitted 17 December, 2014; v1 submitted 24 January, 2013;
originally announced January 2013.
-
Channel-aware Decentralized Detection via Level-triggered Sampling
Authors:
Yasin Yilmaz,
George V. Moustakides,
Xiaodong Wang
Abstract:
We consider decentralized detection through distributed sensors that perform level-triggered sampling and communicate with a fusion center via noisy channels. Each sensor computes its local log-likelihood ratio (LLR), samples it using the level-triggered sampling, and upon sampling transmits a single bit to the FC. Upon receiving a bit from a sensor, the FC updates the global LLR and performs a se…
▽ More
We consider decentralized detection through distributed sensors that perform level-triggered sampling and communicate with a fusion center via noisy channels. Each sensor computes its local log-likelihood ratio (LLR), samples it using the level-triggered sampling, and upon sampling transmits a single bit to the FC. Upon receiving a bit from a sensor, the FC updates the global LLR and performs a sequential probability ratio test (SPRT) step. We derive the fusion rules under various types of channels. We further provide an asymptotic analysis on the average detection delay for the proposed channel-aware scheme, and show that the asymptotic detection delay is characterized by a KL information number. The delay analysis facilitates the choice of appropriate signaling schemes under different channel types for sending the 1-bit information from sensors to the FC.
△ Less
Submitted 9 September, 2012; v1 submitted 26 May, 2012;
originally announced May 2012.
-
Optimal Joint Target Detection and Parameter Estimation By MIMO Radar
Authors:
Ali Tajer,
Guido H. Jajamovich,
Xiaodong Wang,
George V. Moustakides
Abstract:
We consider multiple-input multiple-output (MIMO) radar systems with widely-spaced antennas. Such antenna configuration facilitates capturing the inherent diversity gain due to independent signal dispersion by the target scatterers. We consider a new MIMO radar framework for detecting a target that lies in an unknown location. This is in contrast with conventional MIMO radars which break the spa…
▽ More
We consider multiple-input multiple-output (MIMO) radar systems with widely-spaced antennas. Such antenna configuration facilitates capturing the inherent diversity gain due to independent signal dispersion by the target scatterers. We consider a new MIMO radar framework for detecting a target that lies in an unknown location. This is in contrast with conventional MIMO radars which break the space into small cells and aim at detecting the presence of a target in a specified cell. We treat this problem through offering a novel composite hypothesis testing framework for target detection when (i) one or more parameters of the target are unknown and we are interested in estimating them, and (ii) only a finite number of observations are available. The test offered optimizes a metric which accounts for both detection and estimation accuracies. In this paper as the parameter of interest we focus on the vector of time-delays that the waveforms undergo from being emitted by the transmit antennas until being observed by the receive antennas. The analytical and empirical results establish that for the proposed joint target detection and time-delay estimation framework, MIMO radars exhibit significant gains over phased-array radars for extended targets which consist of multiple independent scatterers. For point targets modeled as single scatterers, however, the detection/estimation accuracies of MIMO and phased-array radars for this specific setup (joint target detection and time-delay estimation) are comparable.
△ Less
Submitted 7 August, 2009;
originally announced August 2009.