Search | arXiv e-print repository

Pseudonorms and p-adic birational Torelli theorem

Abstract: A p-adic analogue of the pseudonorm version of the birational Torelli type theorem is obtained via a comparison theorem of image closures. Among other results obtained, we have a criterion for existence of rational points of canonically polarized surfaces over finite fields. A p-adic analogue of the pseudonorm version of the birational Torelli type theorem is obtained via a comparison theorem of image closures. Among other results obtained, we have a criterion for existence of rational points of canonically polarized surfaces over finite fields. △ Less

Submitted 16 November, 2022; originally announced November 2022.

arXiv:2207.01678 [pdf, other]

FACT: High-Dimensional Random Forests Inference

Authors: Chien-Ming Chi, Yingying Fan, **chi Lv

Abstract: Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis t… ▽ More Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis testing, and suggest a framework of the self-normalized feature-residual correlation test (FACT) for evaluating the significance of a given feature in the random forests model with bias-resistance property, where our null hypothesis concerns whether the feature is conditionally independent of the response given all other features. Such an endeavor on random forests inference is empowered by some recent developments on high-dimensional random forests consistency. Under a fairly general high-dimensional nonparametric model setting with dependent features, we formally establish that FACT can provide theoretically justified feature importance test with controlled type I error and enjoy appealing power property. The theoretical results and finite-sample advantages of the newly suggested method are illustrated with several simulation examples and an economic forecasting application. △ Less

Submitted 12 November, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

Comments: 42 pages, 3 figures

arXiv:2206.05162 [pdf, ps, other]

The Turán number for the edge blow-up of trees: the missing case

Authors: Cheng Chi, Long-Tu Yuan

Abstract: The edge blow-up of a graph is the graph obtained from replacing each edge of it by a clique of the same size where the new vertices of the cliques are all different. Wang, Hou, Liu and Ma determined the Turán number of the edge blow-up of trees except one particular case. Answering an problem posed by them, we determined the Turán number of this particular case. The edge blow-up of a graph is the graph obtained from replacing each edge of it by a clique of the same size where the new vertices of the cliques are all different. Wang, Hou, Liu and Ma determined the Turán number of the edge blow-up of trees except one particular case. Answering an problem posed by them, we determined the Turán number of this particular case. △ Less

Submitted 10 June, 2022; originally announced June 2022.

arXiv:2206.02568 [pdf, other]

A Deep Reinforcement Learning Framework For Column Generation

Authors: Cheng Chi, Amine Mohamed Aboussalah, Elias B. Khalil, Juyoung Wang, Zoha Sherkat-Masoumi

Abstract: Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Wind… ▽ More Column Generation (CG) is an iterative algorithm for solving linear programs (LPs) with an extremely large number of variables (columns). CG is the workhorse for tackling large-scale \textit{integer} linear programs, which rely on CG to solve LP relaxations within a branch and price algorithm. Two canonical applications are the Cutting Stock Problem (CSP) and Vehicle Routing Problem with Time Windows (VRPTW). In VRPTW, for example, each binary variable represents the decision to include or exclude a \textit{route}, of which there are exponentially many; CG incrementally grows the subset of columns being used, ultimately converging to an optimal solution. We propose RLCG, the first Reinforcement Learning (RL) approach for CG. Unlike typical column selection rules which myopically select a column based on local information at each iteration, we treat CG as a sequential decision-making problem: the column selected in a given iteration affects subsequent column selections. This perspective lends itself to a Deep Reinforcement Learning approach that uses Graph Neural Networks (GNNs) to represent the variable-constraint structure in the LP of interest. We perform an extensive set of experiments using the publicly available BPPLIB benchmark for CSP and Solomon benchmark for VRPTW. RLCG converges faster and reduces the number of CG iterations by 22.4\% for CSP and 40.9\% for VRPTW on average compared to a commonly used greedy policy. Our code is available at https://github.com/chichengmessi/reinforcement-learning-for-column-generation.git. △ Less

Submitted 12 January, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

Journal ref: Advances in Neural Information Processing Systems (NeurIPS), 2022

arXiv:2112.09851 [pdf, other]

High-Dimensional Knockoffs Inference for Time Series Data

Authors: Chien-Ming Chi, Yingying Fan, Ching-Kang Ing, **chi Lv

Abstract: The model-X knockoffs framework provides a flexible tool for achieving finite-sample false discovery rate (FDR) control in variable selection in arbitrary dimensions without assuming any dependence structure of the response on covariates. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on… ▽ More The model-X knockoffs framework provides a flexible tool for achieving finite-sample false discovery rate (FDR) control in variable selection in arbitrary dimensions without assuming any dependence structure of the response on covariates. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on the setting of independent and identically distributed observations. Yet time series data is prevalent in practical applications in various fields such as economics and social sciences. This motivates the study of model-X knockoffs inference for time series data. In this paper, we make some initial attempt to establish the theoretical and methodological foundation for the model-X knockoffs inference for time series data. We suggest the method of time series knockoffs inference (TSKI) by exploiting the ideas of subsampling and e-values to address the difficulty caused by the serial dependence. We also generalize the robust knockoffs inference to the time series setting and relax the assumption of known covariate distribution required by model-X knockoffs, because such an assumption is overly stringent for time series data. We establish sufficient conditions under which TSKI achieves the asymptotic FDR control. Our technical analysis reveals the effects of serial dependence and unknown covariate distribution on the FDR control. We conduct power analysis of TSKI using the Lasso coefficient difference knockoff statistic under linear time series models. The finite-sample performance of TSKI is illustrated with several simulation examples and an economic inflation study. △ Less

Submitted 19 May, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

Comments: 65 pages, 4 figures

MSC Class: 62P20 ACM Class: A.0

arXiv:2004.13953 [pdf, other]

Asymptotic Properties of High-Dimensional Random Forests

Authors: Chien-Ming Chi, Patrick Vossler, Yingying Fan, **chi Lv

Abstract: As a flexible nonparametric learning tool, the random forests algorithm has been widely applied to various real applications with appealing empirical performance, even in the presence of high-dimensional feature space. Unveiling the underlying mechanisms has led to some important recent theoretical results on the consistency of the random forests algorithm and its variants. However, to our knowled… ▽ More As a flexible nonparametric learning tool, the random forests algorithm has been widely applied to various real applications with appealing empirical performance, even in the presence of high-dimensional feature space. Unveiling the underlying mechanisms has led to some important recent theoretical results on the consistency of the random forests algorithm and its variants. However, to our knowledge, almost all existing works concerning random forests consistency in high dimensional setting were established for various modified random forests models where the splitting rules are independent of the response; a few exceptions assume simple data generating models with binary features. In light of this, in this paper we derive the consistency rates for the random forests algorithm associated with the sample CART splitting criterion, which is the one used in the original version of the algorithm, in a general high-dimensional nonparametric regression setting through a bias-variance decomposition analysis. Our new theoretical results show that random forests can indeed adapt to high dimensionality and allow for discontinuous regression function. Our bias analysis characterizes explicitly how the random forests bias depends on the sample size, tree height, and column subsampling parameter. Some limitations on our current results are also discussed. △ Less

Submitted 24 September, 2022; v1 submitted 29 April, 2020; originally announced April 2020.

Comments: 64 pages, 5 figures, to appear in The Annals of Statistics

arXiv:1905.06481 [pdf, ps, other]

On the extension of holomorphic sections from reduced unions of strata of divisors

Authors: Chen-Yu Chi

Abstract: In this paper we study the problem of extension of holomorphic sections of line bundles/vector bundles from reduced unions of strata of divisors. An extension theorem of Ohsawa--Takegoshi type is proved. As consequences we deduce several qualitative results on extension from snc divisors and generic global generation of vector bundles. In this paper we study the problem of extension of holomorphic sections of line bundles/vector bundles from reduced unions of strata of divisors. An extension theorem of Ohsawa--Takegoshi type is proved. As consequences we deduce several qualitative results on extension from snc divisors and generic global generation of vector bundles. △ Less

Submitted 29 August, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

Comments: 28 pages. This version consists of (a revision of) the part about adjoint bundles of the original version; the rest parts about pluricanonical bundles are removed and will appear in another article

arXiv:1806.11096 [pdf, other]

Recovering Trees with Convex Clustering

Authors: Eric C. Chi, Stefan Steinerberger

Abstract: Convex clustering refers, for given $\left\{x_1, \dots, x_n\right\} \subset \mathbb{R}^p$, to the minimization of \begin{eqnarray*} u(γ) & = & \underset{u_1, \dots, u_n }{\arg\min}\;\sum_{i=1}^{n}{\lVert x_i - u_i \rVert^2} + γ\sum_{i,j=1}^{n}{w_{ij} \lVert u_i - u_j\rVert},\\ \end{eqnarray*} where $w_{ij} \geq 0$ is an affinity that quantifies the similarity between $x_i$ and $x_j$. We prove that… ▽ More Convex clustering refers, for given $\left\{x_1, \dots, x_n\right\} \subset \mathbb{R}^p$, to the minimization of \begin{eqnarray*} u(γ) & = & \underset{u_1, \dots, u_n }{\arg\min}\;\sum_{i=1}^{n}{\lVert x_i - u_i \rVert^2} + γ\sum_{i,j=1}^{n}{w_{ij} \lVert u_i - u_j\rVert},\\ \end{eqnarray*} where $w_{ij} \geq 0$ is an affinity that quantifies the similarity between $x_i$ and $x_j$. We prove that if the affinities $w_{ij}$ reflect a tree structure in the $\left\{x_1, \dots, x_n\right\}$, then the convex clustering solution path reconstructs the tree exactly. The main technical ingredient implies the following combinatorial byproduct: for every set $\left\{x_1, \dots, x_n \right\} \subset \mathbb{R}^p$ of $n \geq 2$ distinct points, there exist at least $n/6$ points with the property that for any of these points $x$ there is a unit vector $v \in \mathbb{R}^p$ such that, when viewed from $x$, `most' points lie in the direction $v$ \begin{eqnarray*} \frac{1}{n-1}\sum_{i=1 \atop x_i \neq x}^{n}{ \left\langle \frac{x_i - x}{\lVert x_i - x \rVert}, v \right\rangle} & \geq & \frac{1}{4}. \end{eqnarray*} △ Less

Submitted 28 June, 2018; v1 submitted 28 June, 2018; originally announced June 2018.

Comments: 26 pages, 7 figures

arXiv:1612.05614 [pdf, other]

An MM Algorithm for Split Feasibility Problems

Authors: Jason Xu, Eric C. Chi, Meng Yang, Kenneth Lange

Abstract: The classical multi-set split feasibility problem seeks a point in the intersection of finitely many closed convex domain constraints, whose image under a linear map** also lies in the intersection of finitely many closed convex range constraints. Split feasibility generalizes important inverse problems including convex feasibility, linear complementarity, and regression with constraint sets. Wh… ▽ More The classical multi-set split feasibility problem seeks a point in the intersection of finitely many closed convex domain constraints, whose image under a linear map** also lies in the intersection of finitely many closed convex range constraints. Split feasibility generalizes important inverse problems including convex feasibility, linear complementarity, and regression with constraint sets. When a feasible point does not exist, solution methods that proceed by minimizing a proximity function can be used to obtain optimal approximate solutions to the problem. We present an extension of the proximity function approach that generalizes the linear split feasibility problem to allow for non-linear map**s. Our algorithm is based on the principle of majorization-minimization, is amenable to quasi-Newton acceleration, and comes complete with convergence guarantees under mild assumptions. Furthermore, we show that the Euclidean norm appearing in the proximity function of the non-linear split feasibility problem can be replaced by arbitrary Bregman divergences. We explore several examples illustrating the merits of non-linear formulations over the linear case, with a focus on optimization for intensity-modulated radiation therapy. △ Less

Submitted 17 January, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

Comments: 31 pages, 5 figures, 1 table

arXiv:1406.5273 [pdf, ps, other]

doi 10.1109/TGRS.2015.2424719

Identifiability of the Simplex Volume Minimization Criterion for Blind Hyperspectral Unmixing: The No Pure-Pixel Case

Authors: Chia-Hsiang Lin, Wing-Kin Ma, Wei-Chiang Li, Chong-Yung Chi, ArulMurugan Ambikapathi

Abstract: In blind hyperspectral unmixing (HU), the pure-pixel assumption is well-known to be powerful in enabling simple and effective blind HU solutions. However, the pure-pixel assumption is not always satisfied in an exact sense, especially for scenarios where pixels are heavily mixed. In the no pure-pixel case, a good blind HU approach to consider is the minimum volume enclosing simplex (MVES). Empiric… ▽ More In blind hyperspectral unmixing (HU), the pure-pixel assumption is well-known to be powerful in enabling simple and effective blind HU solutions. However, the pure-pixel assumption is not always satisfied in an exact sense, especially for scenarios where pixels are heavily mixed. In the no pure-pixel case, a good blind HU approach to consider is the minimum volume enclosing simplex (MVES). Empirical experience has suggested that MVES algorithms can perform well without pure pixels, although it was not totally clear why this is true from a theoretical viewpoint. This paper aims to address the latter issue. We develop an analysis framework wherein the perfect endmember identifiability of MVES is studied under the noiseless case. We prove that MVES is indeed robust against lack of pure pixels, as long as the pixels do not get too heavily mixed and too asymmetrically spread. The theoretical results are verified by numerical simulations. △ Less

Submitted 26 February, 2015; v1 submitted 19 June, 2014; originally announced June 2014.

arXiv:1305.3312 [pdf, other]

doi 10.1016/j.csda.2014.06.018

Stable Estimation of a Covariance Matrix Guided by Nuclear Norm Penalties

Authors: Eric C. Chi, Kenneth Lange

Abstract: Estimation of covariance matrices or their inverses plays a central role in many statistical methods. For these methods to work reliably, estimated matrices must not only be invertible but also well-conditioned. In this paper we present an intuitive prior that shrinks the classic sample covariance estimator towards a stable target. We prove that our estimator is consistent and asymptotically effic… ▽ More Estimation of covariance matrices or their inverses plays a central role in many statistical methods. For these methods to work reliably, estimated matrices must not only be invertible but also well-conditioned. In this paper we present an intuitive prior that shrinks the classic sample covariance estimator towards a stable target. We prove that our estimator is consistent and asymptotically efficient. Thus, it gracefully transitions towards the sample covariance matrix as the number of samples grows relative to the number of covariates. We also demonstrate the utility of our estimator in two standard situations -- discriminant analysis and EM clustering -- when the number of samples is dominated by or comparable to the number of covariates. △ Less

Submitted 22 November, 2013; v1 submitted 14 May, 2013; originally announced May 2013.

Comments: 25 pages, 3 figures

Journal ref: Computational Statistics & Data Analysis 80:117-128, 2014

arXiv:1304.0499 [pdf, other]

doi 10.1080/10618600.2014.948181

Splitting Methods for Convex Clustering

Authors: Eric C. Chi, Kenneth Lange

Abstract: Clustering is a fundamental problem in many scientific applications. Standard methods such as $k$-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of $k$-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer.… ▽ More Clustering is a fundamental problem in many scientific applications. Standard methods such as $k$-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of $k$-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer. In this work we present two splitting methods for solving the convex clustering problem. The first is an instance of the alternating direction method of multipliers (ADMM); the second is an instance of the alternating minimization algorithm (AMA). In contrast to previously considered algorithms, our ADMM and AMA formulations provide simple and unified frameworks for solving the convex clustering problem under the previously studied norms and open the door to potentially novel norms. We demonstrate the performance of our algorithm on both simulated and real data examples. While the differences between the two algorithms appear to be minor on the surface, complexity analysis and numerical experiments show AMA to be significantly more efficient. △ Less

Submitted 18 March, 2014; v1 submitted 1 April, 2013; originally announced April 2013.

Comments: 37 pages, 6 figures

MSC Class: 62H30; 90C25; 90C90

Journal ref: Journal of Computational and Graphical Statistics, 24(4):994-1013, 2015

arXiv:1302.1287 [pdf, ps, other]

On the Toda systems of VHS type

Authors: Chen-Yu Chi

Abstract: We consider the Toda systems of VHS type with singular sources and provide a criterion for the existence of solutions with prescribed asymptotic behaviour near singularities. We also prove the uniqueness of solution. Our approach uses Simpson's theory of constructing Higgs-Hermitian-Yang-Mills metrics from stability. We consider the Toda systems of VHS type with singular sources and provide a criterion for the existence of solutions with prescribed asymptotic behaviour near singularities. We also prove the uniqueness of solution. Our approach uses Simpson's theory of constructing Higgs-Hermitian-Yang-Mills metrics from stability. △ Less

Submitted 28 March, 2013; v1 submitted 6 February, 2013; originally announced February 2013.

MSC Class: 14H60 (Primary); 35Q35; 53c55 (Secondary)

arXiv:1211.3907 [pdf, other]

doi 10.1007/s10107-013-0697-1

Distance Majorization and Its Applications

Authors: Eric C. Chi, Hua Zhou, Kenneth Lange

Abstract: The problem of minimizing a continuously differentiable convex function over an intersection of closed convex sets is ubiquitous in applied mathematics. It is particularly interesting when it is easy to project onto each separate set, but nontrivial to project onto their intersection. Algorithms based on Newton's method such as the interior point method are viable for small to medium-scale problem… ▽ More The problem of minimizing a continuously differentiable convex function over an intersection of closed convex sets is ubiquitous in applied mathematics. It is particularly interesting when it is easy to project onto each separate set, but nontrivial to project onto their intersection. Algorithms based on Newton's method such as the interior point method are viable for small to medium-scale problems. However, modern applications in statistics, engineering, and machine learning are posing problems with potentially tens of thousands of parameters or more. We revisit this convex programming problem and propose an algorithm that scales well with dimensionality. Our proposal is an instance of a sequential unconstrained minimization technique and revolves around three ideas: the majorization-minimization (MM) principle, the classical penalty method for constrained optimization, and quasi-Newton acceleration of fixed-point algorithms. The performance of our distance majorization algorithms is illustrated in several applications. △ Less

Submitted 11 June, 2013; v1 submitted 16 November, 2012; originally announced November 2012.

Comments: 29 pages, 6 figures

MSC Class: 65K05; 90C25; 90C30; 62J02

Journal ref: Mathematical Programming Series A, 146:409-436, 2014

arXiv:1203.2295 [pdf, other]

Techniques for Solving Sudoku Puzzles

Authors: Eric C. Chi, Kenneth Lange

Abstract: Solving Sudoku puzzles is one of the most popular pastimes in the world. Puzzles range in difficulty from easy to very challenging; the hardest puzzles tend to have the most empty cells. The current paper explains and compares three algorithms for solving Sudoku puzzles. Backtracking, simulated annealing, and alternating projections are generic methods for attacking combinatorial optimization prob… ▽ More Solving Sudoku puzzles is one of the most popular pastimes in the world. Puzzles range in difficulty from easy to very challenging; the hardest puzzles tend to have the most empty cells. The current paper explains and compares three algorithms for solving Sudoku puzzles. Backtracking, simulated annealing, and alternating projections are generic methods for attacking combinatorial optimization problems. Our results favor backtracking. It infallibly solves a Sudoku puzzle or deduces that a unique solution does not exist. However, backtracking does not scale well in high-dimensional combinatorial optimization. Hence, it is useful to expose students in the mathematical sciences to the other two solution techniques in a concrete setting. Simulated annealing shares a common structure with MCMC (Markov chain Monte Carlo) and enjoys wide applicability. The method of alternating projections solves the feasibility problem in convex programming. Converting a discrete optimization problem into a continuous optimization problem opens up the possibility of handling combinatorial problems of much higher dimensionality. △ Less

Submitted 16 May, 2013; v1 submitted 10 March, 2012; originally announced March 2012.

Comments: 11 pages, 5 figures

arXiv:1203.0578 [pdf, other]

doi 10.4169/amer.math.monthly.121.02.095

A Look at the Generalized Heron Problem through the Lens of Majorization-Minimization

Authors: Eric C. Chi, Kenneth Lange

Abstract: In a recent issue of this journal, Mordukhovich et al.\ pose and solve an interesting non-differentiable generalization of the Heron problem in the framework of modern convex analysis. In the generalized Heron problem one is given $k+1$ closed convex sets in $\Real^d$ equipped with its Euclidean norm and asked to find the point in the last set such that the sum of the distances to the first $k$ se… ▽ More In a recent issue of this journal, Mordukhovich et al.\ pose and solve an interesting non-differentiable generalization of the Heron problem in the framework of modern convex analysis. In the generalized Heron problem one is given $k+1$ closed convex sets in $\Real^d$ equipped with its Euclidean norm and asked to find the point in the last set such that the sum of the distances to the first $k$ sets is minimal. In later work the authors generalize the Heron problem even further, relax its convexity assumptions, study its theoretical properties, and pursue subgradient algorithms for solving the convex case. Here, we revisit the original problem solely from the numerical perspective. By exploiting the majorization-minimization (MM) principle of computational statistics and rudimentary techniques from differential calculus, we are able to construct a very fast algorithm for solving the Euclidean version of the generalized Heron problem. △ Less

Submitted 23 May, 2012; v1 submitted 2 March, 2012; originally announced March 2012.

Comments: 21 pages, 3 figures

Journal ref: The American Mathematical Monthly, 121(2):95-108, 2014

arXiv:1112.2414 [pdf, other]

doi 10.1137/110859063

On Tensors, Sparsity, and Nonnegative Factorizations

Authors: Eric C. Chi, Tamara G. Kolda

Abstract: Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive tensor factorization model of such data, along with appropriate algorithms and theory. To do so, we propose that the random variation is best described via a Poisso… ▽ More Tensors have found application in a variety of fields, ranging from chemometrics to signal processing and beyond. In this paper, we consider the problem of multilinear modeling of sparse count data. Our goal is to develop a descriptive tensor factorization model of such data, along with appropriate algorithms and theory. To do so, we propose that the random variation is best described via a Poisson distribution, which better describes the zeros observed in the data as compared to the typical assumption of a Gaussian distribution. Under a Poisson assumption, we fit a model to observed data using the negative log-likelihood score. We present a new algorithm for Poisson tensor factorization called CANDECOMP-PARAFAC Alternating Poisson Regression (CP-APR) that is based on a majorization-minimization approach. It can be shown that CP-APR is a generalization of the Lee-Seung multiplicative updates. We show how to prevent the algorithm from converging to non-KKT points and prove convergence of CP-APR under mild conditions. We also explain how to implement CP-APR for large-scale sparse tensors and present results on several data sets, both real and simulated. △ Less

Submitted 14 August, 2012; v1 submitted 11 December, 2011; originally announced December 2011.

Journal ref: SIAM Journal on Matrix Analysis and Applications 33(4):1272-1299, 2012

arXiv:1101.2077 [pdf, ps, other]

Extensions of multiply twisted pluri-canonical forms

Authors: Chen-Yu Chi, Chin-Lung Wang, Sz-Sheng Wang

Abstract: Given a projective variety X, a smooth divisor D, and semipositive line bundles (L_1,h_1),,...,(L_m,h_m), we consider the "multiply twisted pluricanonical bundle" F:=m(K_X+D)+L_1+...+L_m on X and F_D:=mK_D+(L_1+...+L_m)|_D. Let I_j be the multiplier ideal sheaves associated to h_j, j=1,...,m. We show that, under a certain conditions on curvature, H^0(D,F_D\otimes I_1I_2...I_m) lies in the image of… ▽ More Given a projective variety X, a smooth divisor D, and semipositive line bundles (L_1,h_1),,...,(L_m,h_m), we consider the "multiply twisted pluricanonical bundle" F:=m(K_X+D)+L_1+...+L_m on X and F_D:=mK_D+(L_1+...+L_m)|_D. Let I_j be the multiplier ideal sheaves associated to h_j, j=1,...,m. We show that, under a certain conditions on curvature, H^0(D,F_D\otimes I_1I_2...I_m) lies in the image of the restriction map H^0(X,F)->H^0(D,F_D). The format of our result is inspired both by Paun's simplification of Siu's proof of invariance of plurigenera and an earlier similar result due to Demailly. The main ingredient is a modification of Siu-Paun's induction construction and an extension theorem of Ohsawa-Takegoshi type (O-T). We also include a detail proof of O-T. The key feature is that the ideal sheaf we use is the product of the multiplier ideals associated to the singular metrics h_1,...,h_m, which contains the multiplier ideal sheaf of the product of the metrics h_1\otimes...\otimes h_m. △ Less

Submitted 23 January, 2011; v1 submitted 11 January, 2011; originally announced January 2011.

Comments: 26 pages

Journal ref: Pure Appl. Math. Quarterly 7 (2011), no.4, 1129-1164, special issue dedicated to Eckart Viehweg

arXiv:1010.3043 [pdf, other]

Making Tensor Factorizations Robust to Non-Gaussian Noise

Authors: Eric C. Chi, Tamara G. Kolda

Abstract: Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we pr… ▽ More Tensors are multi-way arrays, and the Candecomp/Parafac (CP) tensor factorization has found application in many different domains. The CP model is typically fit using a least squares objective function, which is a maximum likelihood estimate under the assumption of i.i.d. Gaussian noise. We demonstrate that this loss function can actually be highly sensitive to non-Gaussian noise. Therefore, we propose a loss function based on the 1-norm because it can accommodate both Gaussian and grossly non-Gaussian perturbations. We also present an alternating majorization-minimization algorithm for fitting a CP model using our proposed loss function. △ Less

Submitted 14 October, 2010; originally announced October 2010.

Comments: Contributed presentation at the NIPS Workshop on Tensors, Kernels, and Machine Learning, Whistler, BC, Canada, December 10, 2010

arXiv:0811.2965 [pdf, ps, other]

doi 10.1073/pnas.0809030105

A new geometric approach to problems in birational geometry

Authors: Chen-Yu Chi, Shing-Tung Yau

Abstract: A classical set of birational invariants of a variety are its spaces of pluricanonical forms and some of their canonically defined subspaces. Each of these vector spaces admits a typical metric structure which is also birationally invariant. These vector spaces so metrized will be referred to as the pseudonormed spaces of the original varieties. A fundamental question is the following: given two… ▽ More A classical set of birational invariants of a variety are its spaces of pluricanonical forms and some of their canonically defined subspaces. Each of these vector spaces admits a typical metric structure which is also birationally invariant. These vector spaces so metrized will be referred to as the pseudonormed spaces of the original varieties. A fundamental question is the following: given two mildly singular projective varieties with some of the first variety's pseudonormed spaces being isometric to the corresponding ones of the second variety's, can one construct a birational map between them which induces these isometries? In this work a positive answer to this question is given for varieties of general type. This can be thought of as a theorem of Torelli type for birational equivalence. △ Less

Submitted 18 November, 2008; originally announced November 2008.

Comments: 13 pages, to appear in PNAS

arXiv:0809.4529 [pdf, ps, other]

doi 10.1109/JSTSP.2009.2035798

The Equivalence of Semidefinite Relaxation MIMO Detectors for Higher-Order QAM

Authors: Wing-Kin Ma, Chao-Cheng Su, Joakim Jalden, Tsung-Hui Chang, Chong-Yung Chi

Abstract: In multi-input-multi-output (MIMO) detection, semidefinite relaxation (SDR) has been shown to be an efficient high-performance approach. Developed initially for BPSK and QPSK, SDR has been found to be capable of providing near-optimal performance (for those constellations). This has stimulated a number of recent research endeavors that aim to apply SDR to the high-order QAM cases. These independ… ▽ More In multi-input-multi-output (MIMO) detection, semidefinite relaxation (SDR) has been shown to be an efficient high-performance approach. Developed initially for BPSK and QPSK, SDR has been found to be capable of providing near-optimal performance (for those constellations). This has stimulated a number of recent research endeavors that aim to apply SDR to the high-order QAM cases. These independently developed SDRs are different in concept and structure, and presently no serious analysis has been given to compare these methods. This paper analyzes the relationship of three such SDR methods, namely the polynomial-inspired SDR (PI-SDR) by Wiesel et al., the bound-constrained SDR (BC-SDR) by Sidiropoulos and Luo, and the virtually-antipodal SDR (VA-SDR) by Mao et al. The result that we have proven is somehow unexpected: the three SDRs are equivalent. Simply speaking, we show that solving any one SDR is equivalent to solving the other SDRs. This paper also discusses some implications arising from the SDR equivalence, and provides simulation results to verify our theoretical findings. △ Less

Submitted 26 September, 2008; originally announced September 2008.

Comments: Submitted to IEEE Journal of Selected Topics in Signal Processing, Aug 2008

Showing 1–21 of 21 results for author: Chi, C