-
Pseudonorms and p-adic birational Torelli theorem
Authors:
Chen-Yu Chi
Abstract:
A p-adic analogue of the pseudonorm version of the birational Torelli type theorem is obtained via a comparison theorem of image closures. Among other results obtained, we have a criterion for existence of rational points of canonically polarized surfaces over finite fields.
A p-adic analogue of the pseudonorm version of the birational Torelli type theorem is obtained via a comparison theorem of image closures. Among other results obtained, we have a criterion for existence of rational points of canonically polarized surfaces over finite fields.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
FACT: High-Dimensional Random Forests Inference
Authors:
Chien-Ming Chi,
Yingying Fan,
**chi Lv
Abstract:
Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis t…
▽ More
Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis testing, and suggest a framework of the self-normalized feature-residual correlation test (FACT) for evaluating the significance of a given feature in the random forests model with bias-resistance property, where our null hypothesis concerns whether the feature is conditionally independent of the response given all other features. Such an endeavor on random forests inference is empowered by some recent developments on high-dimensional random forests consistency. Under a fairly general high-dimensional nonparametric model setting with dependent features, we formally establish that FACT can provide theoretically justified feature importance test with controlled type I error and enjoy appealing power property. The theoretical results and finite-sample advantages of the newly suggested method are illustrated with several simulation examples and an economic forecasting application.
△ Less
Submitted 12 November, 2023; v1 submitted 4 July, 2022;
originally announced July 2022.
-
The Turán number for the edge blow-up of trees: the missing case
Authors:
Cheng Chi,
Long-Tu Yuan
Abstract:
The edge blow-up of a graph is the graph obtained from replacing each edge of it by a clique of the same size where the new vertices of the cliques are all different. Wang, Hou, Liu and Ma determined the Turán number of the edge blow-up of trees except one particular case. Answering an problem posed by them, we determined the Turán number of this particular case.
The edge blow-up of a graph is the graph obtained from replacing each edge of it by a clique of the same size where the new vertices of the cliques are all different. Wang, Hou, Liu and Ma determined the Turán number of the edge blow-up of trees except one particular case. Answering an problem posed by them, we determined the Turán number of this particular case.
△ Less
Submitted 10 June, 2022;
originally announced June 2022.
-
A Deep Reinforcement Learning Framework For Column Generation
Authors:
Cheng Chi,
Amine Mohamed Aboussalah,
Elias B. Khalil,
Juyoung Wang,
Zoha Sherkat-Masoumi
Abstract:
Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Wind…
▽ More
Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Windows (VRPTW). In VRPTW, for example, each binary variable represents the decision to include or exclude a \textit{route}, of which there are exponentially many; CG incrementally grows the subset of columns being used, ultimately converging to an optimal solution. We propose RLCG, the first Reinforcement Learning (RL) approach for CG. Unlike typical column selection rules which myopically select a column based on local information at each iteration, we treat CG as a sequential decision-making problem: the column selected in a given iteration affects subsequent column selections. This perspective lends itself to a Deep Reinforcement Learning approach that uses Graph Neural Networks (GNNs) to represent the variable-constraint structure in the LP of interest. We perform an extensive set of experiments using the publicly available BPPLIB benchmark for CSP and Solomon benchmark for VRPTW. RLCG converges faster and reduces the number of CG iterations by 22.4\% for CSP and 40.9\% for VRPTW on average compared to a commonly used greedy policy. Our code is available at https://github.com/chichengmessi/reinforcement-learning-for-column-generation.git.
△ Less
Submitted 12 January, 2023; v1 submitted 2 June, 2022;
originally announced June 2022.
-
High-Dimensional Knockoffs Inference for Time Series Data
Authors:
Chien-Ming Chi,
Yingying Fan,
Ching-Kang Ing,
**chi Lv
Abstract:
The model-X knockoffs framework provides a flexible tool for achieving finite-sample false discovery rate (FDR) control in variable selection in arbitrary dimensions without assuming any dependence structure of the response on covariates. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on…
▽ More
The model-X knockoffs framework provides a flexible tool for achieving finite-sample false discovery rate (FDR) control in variable selection in arbitrary dimensions without assuming any dependence structure of the response on covariates. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on the setting of independent and identically distributed observations. Yet time series data is prevalent in practical applications in various fields such as economics and social sciences. This motivates the study of model-X knockoffs inference for time series data. In this paper, we make some initial attempt to establish the theoretical and methodological foundation for the model-X knockoffs inference for time series data. We suggest the method of time series knockoffs inference (TSKI) by exploiting the ideas of subsampling and e-values to address the difficulty caused by the serial dependence. We also generalize the robust knockoffs inference to the time series setting and relax the assumption of known covariate distribution required by model-X knockoffs, because such an assumption is overly stringent for time series data. We establish sufficient conditions under which TSKI achieves the asymptotic FDR control. Our technical analysis reveals the effects of serial dependence and unknown covariate distribution on the FDR control. We conduct power analysis of TSKI using the Lasso coefficient difference knockoff statistic under linear time series models. The finite-sample performance of TSKI is illustrated with several simulation examples and an economic inflation study.
△ Less
Submitted 19 May, 2023; v1 submitted 18 December, 2021;
originally announced December 2021.
-
Asymptotic Properties of High-Dimensional Random Forests
Authors:
Chien-Ming Chi,
Patrick Vossler,
Yingying Fan,
**chi Lv
Abstract:
As a flexible nonparametric learning tool, the random forests algorithm has been widely applied to various real applications with appealing empirical performance, even in the presence of high-dimensional feature space. Unveiling the underlying mechanisms has led to some important recent theoretical results on the consistency of the random forests algorithm and its variants. However, to our knowled…
▽ More
As a flexible nonparametric learning tool, the random forests algorithm has been widely applied to various real applications with appealing empirical performance, even in the presence of high-dimensional feature space. Unveiling the underlying mechanisms has led to some important recent theoretical results on the consistency of the random forests algorithm and its variants. However, to our knowledge, almost all existing works concerning random forests consistency in high dimensional setting were established for various modified random forests models where the splitting rules are independent of the response; a few exceptions assume simple data generating models with binary features. In light of this, in this paper we derive the consistency rates for the random forests algorithm associated with the sample CART splitting criterion, which is the one used in the original version of the algorithm, in a general high-dimensional nonparametric regression setting through a bias-variance decomposition analysis. Our new theoretical results show that random forests can indeed adapt to high dimensionality and allow for discontinuous regression function. Our bias analysis characterizes explicitly how the random forests bias depends on the sample size, tree height, and column subsampling parameter. Some limitations on our current results are also discussed.
△ Less
Submitted 24 September, 2022; v1 submitted 29 April, 2020;
originally announced April 2020.
-
On the extension of holomorphic sections from reduced unions of strata of divisors
Authors:
Chen-Yu Chi
Abstract:
In this paper we study the problem of extension of holomorphic sections of line bundles/vector bundles from reduced unions of strata of divisors. An extension theorem of Ohsawa--Takegoshi type is proved. As consequences we deduce several qualitative results on extension from snc divisors and generic global generation of vector bundles.
In this paper we study the problem of extension of holomorphic sections of line bundles/vector bundles from reduced unions of strata of divisors. An extension theorem of Ohsawa--Takegoshi type is proved. As consequences we deduce several qualitative results on extension from snc divisors and generic global generation of vector bundles.
△ Less
Submitted 29 August, 2019; v1 submitted 15 May, 2019;
originally announced May 2019.
-
Recovering Trees with Convex Clustering
Authors:
Eric C. Chi,
Stefan Steinerberger
Abstract:
Convex clustering refers, for given $\left\{x_1, \dots, x_n\right\} \subset \mathbb{R}^p$, to the minimization of \begin{eqnarray*} u(γ) & = & \underset{u_1, \dots, u_n }{\arg\min}\;\sum_{i=1}^{n}{\lVert x_i - u_i \rVert^2} + γ\sum_{i,j=1}^{n}{w_{ij} \lVert u_i - u_j\rVert},\\ \end{eqnarray*} where $w_{ij} \geq 0$ is an affinity that quantifies the similarity between $x_i$ and $x_j$. We prove that…
▽ More
Convex clustering refers, for given $\left\{x_1, \dots, x_n\right\} \subset \mathbb{R}^p$, to the minimization of \begin{eqnarray*} u(γ) & = & \underset{u_1, \dots, u_n }{\arg\min}\;\sum_{i=1}^{n}{\lVert x_i - u_i \rVert^2} + γ\sum_{i,j=1}^{n}{w_{ij} \lVert u_i - u_j\rVert},\\ \end{eqnarray*} where $w_{ij} \geq 0$ is an affinity that quantifies the similarity between $x_i$ and $x_j$. We prove that if the affinities $w_{ij}$ reflect a tree structure in the $\left\{x_1, \dots, x_n\right\}$, then the convex clustering solution path reconstructs the tree exactly. The main technical ingredient implies the following combinatorial byproduct: for every set $\left\{x_1, \dots, x_n \right\} \subset \mathbb{R}^p$ of $n \geq 2$ distinct points, there exist at least $n/6$ points with the property that for any of these points $x$ there is a unit vector $v \in \mathbb{R}^p$ such that, when viewed from $x$, `most' points lie in the direction $v$ \begin{eqnarray*} \frac{1}{n-1}\sum_{i=1 \atop x_i \neq x}^{n}{ \left\langle \frac{x_i - x}{\lVert x_i - x \rVert}, v \right\rangle} & \geq & \frac{1}{4}. \end{eqnarray*}
△ Less
Submitted 28 June, 2018; v1 submitted 28 June, 2018;
originally announced June 2018.
-
An MM Algorithm for Split Feasibility Problems
Authors:
Jason Xu,
Eric C. Chi,
Meng Yang,
Kenneth Lange
Abstract:
The classical multi-set split feasibility problem seeks a point in the intersection of finitely many closed convex domain constraints, whose image under a linear map** also lies in the intersection of finitely many closed convex range constraints. Split feasibility generalizes important inverse problems including convex feasibility, linear complementarity, and regression with constraint sets. Wh…
▽ More
The classical multi-set split feasibility problem seeks a point in the intersection of finitely many closed convex domain constraints, whose image under a linear map** also lies in the intersection of finitely many closed convex range constraints. Split feasibility generalizes important inverse problems including convex feasibility, linear complementarity, and regression with constraint sets. When a feasible point does not exist, solution methods that proceed by minimizing a proximity function can be used to obtain optimal approximate solutions to the problem. We present an extension of the proximity function approach that generalizes the linear split feasibility problem to allow for non-linear map**s. Our algorithm is based on the principle of majorization-minimization, is amenable to quasi-Newton acceleration, and comes complete with convergence guarantees under mild assumptions. Furthermore, we show that the Euclidean norm appearing in the proximity function of the non-linear split feasibility problem can be replaced by arbitrary Bregman divergences. We explore several examples illustrating the merits of non-linear formulations over the linear case, with a focus on optimization for intensity-modulated radiation therapy.
△ Less
Submitted 17 January, 2017; v1 submitted 16 December, 2016;
originally announced December 2016.
-
Identifiability of the Simplex Volume Minimization Criterion for Blind Hyperspectral Unmixing: The No Pure-Pixel Case
Authors:
Chia-Hsiang Lin,
Wing-Kin Ma,
Wei-Chiang Li,
Chong-Yung Chi,
ArulMurugan Ambikapathi
Abstract:
In blind hyperspectral unmixing (HU), the pure-pixel assumption is well-known to be powerful in enabling simple and effective blind HU solutions. However, the pure-pixel assumption is not always satisfied in an exact sense, especially for scenarios where pixels are heavily mixed. In the no pure-pixel case, a good blind HU approach to consider is the minimum volume enclosing simplex (MVES). Empiric…
▽ More
In blind hyperspectral unmixing (HU), the pure-pixel assumption is well-known to be powerful in enabling simple and effective blind HU solutions. However, the pure-pixel assumption is not always satisfied in an exact sense, especially for scenarios where pixels are heavily mixed. In the no pure-pixel case, a good blind HU approach to consider is the minimum volume enclosing simplex (MVES). Empirical experience has suggested that MVES algorithms can perform well without pure pixels, although it was not totally clear why this is true from a theoretical viewpoint. This paper aims to address the latter issue. We develop an analysis framework wherein the perfect endmember identifiability of MVES is studied under the noiseless case. We prove that MVES is indeed robust against lack of pure pixels, as long as the pixels do not get too heavily mixed and too asymmetrically spread. The theoretical results are verified by numerical simulations.
△ Less
Submitted 26 February, 2015; v1 submitted 19 June, 2014;
originally announced June 2014.
-
Stable Estimation of a Covariance Matrix Guided by Nuclear Norm Penalties
Authors:
Eric C. Chi,
Kenneth Lange
Abstract:
Estimation of covariance matrices or their inverses plays a central role in many statistical methods. For these methods to work reliably, estimated matrices must not only be invertible but also well-conditioned. In this paper we present an intuitive prior that shrinks the classic sample covariance estimator towards a stable target. We prove that our estimator is consistent and asymptotically effic…
▽ More
Estimation of covariance matrices or their inverses plays a central role in many statistical methods. For these methods to work reliably, estimated matrices must not only be invertible but also well-conditioned. In this paper we present an intuitive prior that shrinks the classic sample covariance estimator towards a stable target. We prove that our estimator is consistent and asymptotically efficient. Thus, it gracefully transitions towards the sample covariance matrix as the number of samples grows relative to the number of covariates. We also demonstrate the utility of our estimator in two standard situations -- discriminant analysis and EM clustering -- when the number of samples is dominated by or comparable to the number of covariates.
△ Less
Submitted 22 November, 2013; v1 submitted 14 May, 2013;
originally announced May 2013.
-
Splitting Methods for Convex Clustering
Authors:
Eric C. Chi,
Kenneth Lange
Abstract:
Clustering is a fundamental problem in many scientific applications. Standard methods such as $k$-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of $k$-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer.…
▽ More
Clustering is a fundamental problem in many scientific applications. Standard methods such as $k$-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of $k$-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer. In this work we present two splitting methods for solving the convex clustering problem. The first is an instance of the alternating direction method of multipliers (ADMM); the second is an instance of the alternating minimization algorithm (AMA). In contrast to previously considered algorithms, our ADMM and AMA formulations provide simple and unified frameworks for solving the convex clustering problem under the previously studied norms and open the door to potentially novel norms. We demonstrate the performance of our algorithm on both simulated and real data examples. While the differences between the two algorithms appear to be minor on the surface, complexity analysis and numerical experiments show AMA to be significantly more efficient.
△ Less
Submitted 18 March, 2014; v1 submitted 1 April, 2013;
originally announced April 2013.
-
On the Toda systems of VHS type
Authors:
Chen-Yu Chi
Abstract:
We consider the Toda systems of VHS type with singular sources and provide a criterion for the existence of solutions with prescribed asymptotic behaviour near singularities. We also prove the uniqueness of solution. Our approach uses Simpson's theory of constructing Higgs-Hermitian-Yang-Mills metrics from stability.
We consider the Toda systems of VHS type with singular sources and provide a criterion for the existence of solutions with prescribed asymptotic behaviour near singularities. We also prove the uniqueness of solution. Our approach uses Simpson's theory of constructing Higgs-Hermitian-Yang-Mills metrics from stability.
△ Less
Submitted 28 March, 2013; v1 submitted 6 February, 2013;
originally announced February 2013.
-
Distance Majorization and Its Applications
Authors:
Eric C. Chi,
Hua Zhou,
Kenneth Lange
Abstract:
The problem of minimizing a continuously differentiable convex function over an intersection of closed convex sets is ubiquitous in applied mathematics. It is particularly interesting when it is easy to project onto each separate set, but nontrivial to project onto their intersection. Algorithms based on Newton's method such as the interior point method are viable for small to medium-scale problem…
▽ More
The problem of minimizing a continuously differentiable convex function over an intersection of closed convex sets is ubiquitous in applied mathematics. It is particularly interesting when it is easy to project onto each separate set, but nontrivial to project onto their intersection. Algorithms based on Newton's method such as the interior point method are viable for small to medium-scale problems. However, modern applications in statistics, engineering, and machine learning are posing problems with potentially tens of thousands of parameters or more. We revisit this convex programming problem and propose an algorithm that scales well with dimensionality. Our proposal is an instance of a sequential unconstrained minimization technique and revolves around three ideas: the majorization-minimization (MM) principle, the classical penalty method for constrained optimization, and quasi-Newton acceleration of fixed-point algorithms. The performance of our distance majorization algorithms is illustrated in several applications.
△ Less
Submitted 11 June, 2013; v1 submitted 16 November, 2012;
originally announced November 2012.
-
Techniques for Solving Sudoku Puzzles
Authors:
Eric C. Chi,
Kenneth Lange
Abstract:
Solving Sudoku puzzles is one of the most popular pastimes in the world. Puzzles range in difficulty from easy to very challenging; the hardest puzzles tend to have the most empty cells. The current paper explains and compares three algorithms for solving Sudoku puzzles. Backtracking, simulated annealing, and alternating projections are generic methods for attacking combinatorial optimization prob…
▽ More
Solving Sudoku puzzles is one of the most popular pastimes in the world. Puzzles range in difficulty from easy to very challenging; the hardest puzzles tend to have the most empty cells. The current paper explains and compares three algorithms for solving Sudoku puzzles. Backtracking, simulated annealing, and alternating projections are generic methods for attacking combinatorial optimization problems. Our results favor backtracking. It infallibly solves a Sudoku puzzle or deduces that a unique solution does not exist. However, backtracking does not scale well in high-dimensional combinatorial optimization. Hence, it is useful to expose students in the mathematical sciences to the other two solution techniques in a concrete setting. Simulated annealing shares a common structure with MCMC (Markov chain Monte Carlo) and enjoys wide applicability. The method of alternating projections solves the feasibility problem in convex programming. Converting a discrete optimization problem into a continuous optimization problem opens up the possibility of handling combinatorial problems of much higher dimensionality.
△ Less
Submitted 16 May, 2013; v1 submitted 10 March, 2012;
originally announced March 2012.
-
A Look at the Generalized Heron Problem through the Lens of Majorization-Minimization
Authors:
Eric C. Chi,
Kenneth Lange
Abstract:
In a recent issue of this journal, Mordukhovich et al.\ pose and solve an interesting non-differentiable generalization of the Heron problem in the framework of modern convex analysis. In the generalized Heron problem one is given $k+1$ closed convex sets in $\Real^d$ equipped with its Euclidean norm and asked to find the point in the last set such that the sum of the distances to the first $k$ se…
▽ More
In a recent issue of this journal, Mordukhovich et al.\ pose and solve an interesting non-differentiable generalization of the Heron problem in the framework of modern convex analysis. In the generalized Heron problem one is given $k+1$ closed convex sets in $\Real^d$ equipped with its Euclidean norm and asked to find the point in the last set such that the sum of the distances to the first $k$ sets is minimal. In later work the authors generalize the Heron problem even further, relax its convexity assumptions, study its theoretical properties, and pursue subgradient algorithms for solving the convex case. Here, we revisit the original problem solely from the numerical perspective. By exploiting the majorization-minimization (MM) principle of computational statistics and rudimentary techniques from differential calculus, we are able to construct a very fast algorithm for solving the Euclidean version of the generalized Heron problem.
△ Less
Submitted 23 May, 2012; v1 submitted 2 March, 2012;
originally announced March 2012.
-
On Tensors, Sparsity, and Nonnegative Factorizations
Authors:
Eric C. Chi,
Tamara G. Kolda
Abstract:
Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive tensor factorization model of such data, along with appropriate algorithms and theory. To do so, we propose that the random variation is best described via a Poisso…
▽ More
Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive tensor factorization model of such data, along with appropriate algorithms and theory. To do so, we propose that the random variation is best described via a Poisson distribution, which better describes the zeros observed in the data as compared to the typical assumption of a Gaussian distribution. Under a Poisson assumption, we fit a model to observed data using the negative log-likelihood score. We present a new algorithm for Poisson tensor factorization called CANDECOMP-PARAFAC Alternating Poisson Regression (CP-APR) that is based on a majorization-minimization approach. It can be shown that CP-APR is a generalization of the Lee-Seung multiplicative updates. We show how to prevent the algorithm from converging to non-KKT points and prove convergence of CP-APR under mild conditions. We also explain how to implement CP-APR for large-scale sparse tensors and present results on several data sets, both real and simulated.
△ Less
Submitted 14 August, 2012; v1 submitted 11 December, 2011;
originally announced December 2011.
-
Extensions of multiply twisted pluri-canonical forms
Authors:
Chen-Yu Chi,
Chin-Lung Wang,
Sz-Sheng Wang
Abstract:
Given a projective variety X, a smooth divisor D, and semipositive line bundles (L_1,h_1),,...,(L_m,h_m), we consider the "multiply twisted pluricanonical bundle" F:=m(K_X+D)+L_1+...+L_m on X and F_D:=mK_D+(L_1+...+L_m)|_D. Let I_j be the multiplier ideal sheaves associated to h_j, j=1,...,m. We show that, under a certain conditions on curvature, H^0(D,F_D\otimes I_1I_2...I_m) lies in the image of…
▽ More
Given a projective variety X, a smooth divisor D, and semipositive line bundles (L_1,h_1),,...,(L_m,h_m), we consider the "multiply twisted pluricanonical bundle" F:=m(K_X+D)+L_1+...+L_m on X and F_D:=mK_D+(L_1+...+L_m)|_D. Let I_j be the multiplier ideal sheaves associated to h_j, j=1,...,m. We show that, under a certain conditions on curvature, H^0(D,F_D\otimes I_1I_2...I_m) lies in the image of the restriction map H^0(X,F)->H^0(D,F_D). The format of our result is inspired both by Paun's simplification of Siu's proof of invariance of plurigenera and an earlier similar result due to Demailly. The main ingredient is a modification of Siu-Paun's induction construction and an extension theorem of Ohsawa-Takegoshi type (O-T). We also include a detail proof of O-T. The key feature is that the ideal sheaf we use is the product of the multiplier ideals associated to the singular metrics h_1,...,h_m, which contains the multiplier ideal sheaf of the product of the metrics h_1\otimes...\otimes h_m.
△ Less
Submitted 23 January, 2011; v1 submitted 11 January, 2011;
originally announced January 2011.
-
Making Tensor Factorizations Robust to Non-Gaussian Noise
Authors:
Eric C. Chi,
Tamara G. Kolda
Abstract:
Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we pr…
▽ More
Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we propose a loss function based on the 1-norm because it can accommodate both Gaussian and grossly non-Gaussian perturbations. We also present an alternating majorization-minimization algorithm for fitting a CP model using our proposed loss function.
△ Less
Submitted 14 October, 2010;
originally announced October 2010.
-
A new geometric approach to problems in birational geometry
Authors:
Chen-Yu Chi,
Shing-Tung Yau
Abstract:
A classical set of birational invariants of a variety are its spaces of pluricanonical forms and some of their canonically defined subspaces. Each of these vector spaces admits a typical metric structure which is also birationally invariant. These vector spaces so metrized will be referred to as the pseudonormed spaces of the original varieties. A fundamental question is the following: given two…
▽ More
A classical set of birational invariants of a variety are its spaces of pluricanonical forms and some of their canonically defined subspaces. Each of these vector spaces admits a typical metric structure which is also birationally invariant. These vector spaces so metrized will be referred to as the pseudonormed spaces of the original varieties. A fundamental question is the following: given two mildly singular projective varieties with some of the first variety's pseudonormed spaces being isometric to the corresponding ones of the second variety's, can one construct a birational map between them which induces these isometries? In this work a positive answer to this question is given for varieties of general type. This can be thought of as a theorem of Torelli type for birational equivalence.
△ Less
Submitted 18 November, 2008;
originally announced November 2008.
-
The Equivalence of Semidefinite Relaxation MIMO Detectors for Higher-Order QAM
Authors:
Wing-Kin Ma,
Chao-Cheng Su,
Joakim Jalden,
Tsung-Hui Chang,
Chong-Yung Chi
Abstract:
In multi-input-multi-output (MIMO) detection, semidefinite relaxation (SDR) has been shown to be an efficient high-performance approach. Developed initially for BPSK and QPSK, SDR has been found to be capable of providing near-optimal performance (for those constellations). This has stimulated a number of recent research endeavors that aim to apply SDR to the high-order QAM cases. These independ…
▽ More
In multi-input-multi-output (MIMO) detection, semidefinite relaxation (SDR) has been shown to be an efficient high-performance approach. Developed initially for BPSK and QPSK, SDR has been found to be capable of providing near-optimal performance (for those constellations). This has stimulated a number of recent research endeavors that aim to apply SDR to the high-order QAM cases. These independently developed SDRs are different in concept and structure, and presently no serious analysis has been given to compare these methods. This paper analyzes the relationship of three such SDR methods, namely the polynomial-inspired SDR (PI-SDR) by Wiesel et al., the bound-constrained SDR (BC-SDR) by Sidiropoulos and Luo, and the virtually-antipodal SDR (VA-SDR) by Mao et al. The result that we have proven is somehow unexpected: the three SDRs are equivalent. Simply speaking, we show that solving any one SDR is equivalent to solving the other SDRs. This paper also discusses some implications arising from the SDR equivalence, and provides simulation results to verify our theoretical findings.
△ Less
Submitted 26 September, 2008;
originally announced September 2008.