Skip to main content

Showing 1–28 of 28 results for author: Qu, Q

Searching in archive math. Search in all archives.
.
  1. arXiv:2403.16341  [pdf, other

    math.NA

    NonlinearSolve.jl: High-Performance and Robust Solvers for Systems of Nonlinear Equations in Julia

    Authors: Avik Pal, Flemming Holtorf, Axel Larsson, Torkel Loman, Utkarsh, Frank Schäefer, Qingyu Qu, Alan Edelman, Chris Rackauckas

    Abstract: Efficiently solving nonlinear equations underpins numerous scientific and engineering disciplines, yet scaling these solutions for complex system models remains a challenge. This paper presents NonlinearSolve.jl - a suite of high-performance open-source nonlinear equation solvers implemented natively in the Julia programming language. NonlinearSolve.jl distinguishes itself by offering a unified AP… ▽ More

    Submitted 28 March, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  2. arXiv:2311.15210  [pdf, other

    cs.LG math.ST

    Topology combined machine learning for consonant recognition

    Authors: **yao Feng, Siheng Yi, Qingrui Qu, Zhiwang Yu, Yifei Zhu

    Abstract: In artificial-intelligence-aided signal processing, existing deep learning models often exhibit a black-box structure, and their validity and comprehensibility remain elusive. The integration of topological methods, despite its relatively nascent application, serves a dual purpose of making models more interpretable as well as extracting structural information from time-dependent data for smarter… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  3. arXiv:2311.02960  [pdf, other

    cs.LG cs.CV math.OC

    Understanding Deep Representation Learning via Layerwise Feature Compression and Discrimination

    Authors: Peng Wang, Xiao Li, Can Yaras, Zhihui Zhu, Laura Balzano, Wei Hu, Qing Qu

    Abstract: Over the past decade, deep learning has proven to be a highly effective tool for learning meaningful features from raw data. However, it remains an open question how deep networks perform hierarchical feature learning across layers. In this work, we attempt to unveil this mystery by investigating the structures of intermediate features. Motivated by our empirical findings that linear layers mimic… ▽ More

    Submitted 9 January, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 61 pages, 14 figures

  4. arXiv:2210.02192  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    Are All Losses Created Equal: A Neural Collapse Perspective

    Authors: **xin Zhou, Chong You, Xiao Li, Kangning Liu, Sheng Liu, Qing Qu, Zhihui Zhu

    Abstract: While cross entropy (CE) is the most commonly used loss to train deep neural networks for classification tasks, many alternative losses have been developed to obtain better empirical performance. Among them, which one is the best to use is still a mystery, because there seem to be multiple factors affecting the answer, such as properties of the dataset, the choice of network architecture, and so o… ▽ More

    Submitted 8 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 32 page, 10 figures, NeurIPS 2022

  5. arXiv:2203.01238  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features

    Authors: **xin Zhou, Xiao Li, Tianyu Ding, Chong You, Qing Qu, Zhihui Zhu

    Abstract: When training deep neural networks for classification tasks, an intriguing empirical phenomenon has been widely observed in the last-layer classifiers and features, where (i) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equiangular Tight Frame (ETF) up to scaling, and (ii) cross-example within-class variability of last-layer activations collapses to zero… ▽ More

    Submitted 12 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  6. arXiv:2202.01896  [pdf, other

    math.OC cs.AI cs.LG

    Yordle: An Efficient Imitation Learning for Branch and Bound

    Authors: Qingyu Qu, Xijun Li, Yunfan Zhou

    Abstract: Combinatorial optimization problems have aroused extensive research interests due to its huge application potential. In practice, there are highly redundant patterns and characteristics during solving the combinatorial optimization problem, which can be captured by machine learning models. Thus, the 2021 NeurIPS Machine Learning for Combinatorial Optimization (ML4CO) competition is proposed with t… ▽ More

    Submitted 19 February, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2201.06213

  7. arXiv:2201.06216  [pdf, other

    math.OC cs.AI cs.LG

    Learning to Reformulate for Linear Programming

    Authors: Xijun Li, Qingyu Qu, Fangzhou Zhu, Jia Zeng, Mingxuan Yuan, Kun Mao, Jie Wang

    Abstract: It has been verified that the linear programming (LP) is able to formulate many real-life optimization problems, which can obtain the optimum by resorting to corresponding solvers such as OptVerse, Gurobi and CPLEX. In the past decades, a serial of traditional operation research algorithms have been proposed to obtain the optimum of a given LP in a fewer solving time. Recently, there is a trend of… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  8. arXiv:2201.06213  [pdf, other

    cs.LG cs.AI math.OC

    An Improved Reinforcement Learning Algorithm for Learning to Branch

    Authors: Qingyu Qu, Xijun Li, Yunfan Zhou, Jia Zeng, Mingxuan Yuan, Jie Wang, **hu Lv, Kexin Liu, Kun Mao

    Abstract: Most combinatorial optimization problems can be formulated as mixed integer linear programming (MILP), in which branch-and-bound (B\&B) is a general and widely used method. Recently, learning to branch has become a hot research topic in the intersection of machine learning and combinatorial optimization. In this paper, we propose a novel reinforcement learning-based B\&B algorithm. Similar to offl… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  9. arXiv:2109.11154  [pdf, other

    math.OC cs.LG stat.ML

    Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

    Authors: Lijun Ding, Liwei Jiang, Yudong Chen, Qing Qu, Zhihui Zhu

    Abstract: We study the robust recovery of a low-rank matrix from sparsely and grossly corrupted Gaussian measurements, with no prior knowledge on the intrinsic rank. We consider the robust matrix factorization approach. We employ a robust $\ell_1$ loss function and deal with the challenge of the unknown rank by using an overspecified factored representation of the matrix variable. We then solve the associat… ▽ More

    Submitted 26 October, 2021; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: 75 pages, 3 figures

  10. arXiv:2105.02375  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    A Geometric Analysis of Neural Collapse with Unconstrained Features

    Authors: Zhihui Zhu, Tianyu Ding, **xin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu

    Abstract: We provide the first global optimization landscape analysis of $Neural\;Collapse$ -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As recently reported by Papyan et al., this phenomenon implies that ($i$) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 42 pages, 8 figures, 1 table; the first two authors contributed to this work equally

  11. arXiv:2007.06753  [pdf, other

    cs.LG cs.CV cs.IT math.OC stat.ML

    From Symmetry to Geometry: Tractable Nonconvex Problems

    Authors: Yuqian Zhang, Qing Qu, John Wright

    Abstract: As science and engineering have become increasingly data-driven, the role of optimization has expanded to touch almost every stage of the data analysis pipeline, from signal and data acquisition to modeling and prediction. The optimization problems encountered in practice are often nonconvex. While challenges vary from problem to problem, one common source of nonconvexity is nonlinearity in the da… ▽ More

    Submitted 8 July, 2022; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: review paper, 38 pages, 10 figures, revision: correction of typos, adding more discussion on recent advances on deep learning

  12. arXiv:2006.10557  [pdf, ps, other

    math.DG

    The navigation problems and the curvature properties on conic Kropina manifolds

    Authors: Xinyue Cheng, Qiuhong Qu, Suiyun Xu

    Abstract: In this paper, we study navigation problems on conic Kropina manifolds. Let $F(x, y)$ be a conic Kropina metric on an $n$-dimensional manifold $M$ and $V$ be a conformal vector field on $(M, F)$ with $F(x, - V_{x})\leq 1$. Let $\widetilde{F}= \widetilde{F} (x,y)$ be the solution of the navigation problem with navigation data $(F, V)$. We prove that $\widetilde{F}$ must be either a Randers metric o… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    MSC Class: 53B40; 53C60

  13. arXiv:2006.08857  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    Robust Recovery via Implicit Bias of Discrepant Learning Rates for Double Over-parameterization

    Authors: Chong You, Zhihui Zhu, Qing Qu, Yi Ma

    Abstract: Recent advances have shown that implicit bias of gradient descent on over-parameterized models enables the recovery of low-rank matrices from linear measurements, even with no prior knowledge on the intrinsic rank. In contrast, for robust low-rank matrix recovery from grossly corrupted measurements, over-parameterization leads to overfitting without prior knowledge on both the intrinsic rank and s… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  14. arXiv:2001.06970  [pdf, other

    cs.LG cs.IT eess.IV math.OC stat.ML

    Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications

    Authors: Qing Qu, Zhihui Zhu, Xiao Li, Manolis C. Tsakiris, John Wright, René Vidal

    Abstract: The problem of finding the sparsest vector (direction) in a low dimensional subspace can be considered as a homogeneous variant of the sparse recovery problem, which finds applications in robust subspace recovery, dictionary learning, sparse blind deconvolution, and many other problems in signal processing and machine learning. However, in contrast to the classical sparse recovery problem, the mos… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: QQ and ZZ contributed equally to the work. Invited review paper for IEEE Signal Processing Magazine Special Issue on non-convex optimization for signal processing and machine learning. This article contains 26 pages with 11 figures

  15. arXiv:1912.02427  [pdf, other

    cs.LG cs.IT eess.SP math.OC stat.ML

    Analysis of the Optimization Landscapes for Overcomplete Representation Learning

    Authors: Qing Qu, Yuexiang Zhai, Xiao Li, Yuqian Zhang, Zhihui Zhu

    Abstract: We study nonconvex optimization landscapes for learning overcomplete representations, including learning (i) sparsely used overcomplete dictionaries and (ii) convolutional dictionaries, where these unsupervised learning problems find many applications in high-dimensional data analysis. Despite the empirical success of simple nonconvex algorithms, theoretical justifications of why these methods wor… ▽ More

    Submitted 10 December, 2019; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 68 pages, 5 figures

  16. arXiv:1911.05047  [pdf, other

    math.OC cs.IT cs.LG

    Weakly Convex Optimization over Stiefel Manifold Using Riemannian Subgradient-Type Methods

    Authors: Xiao Li, Shixiang Chen, Zengde Deng, Qing Qu, Zhihui Zhu, Anthony Man Cho So

    Abstract: We consider a class of nonsmooth optimization problems over the Stiefel manifold, in which the objective function is weakly convex in the ambient Euclidean space. Such problems are ubiquitous in engineering applications but still largely unexplored. We present a family of Riemannian subgradient-type methods -- namely Riemannain subgradient, incremental subgradient, and stochastic subgradient metho… ▽ More

    Submitted 24 March, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: 30 pages. Accepted to SIAM Journal on Optimization

    MSC Class: 68Q25; 65K10; 90C90; 90C26; 90C06

  17. arXiv:1908.10959  [pdf, other

    eess.SP cs.LG eess.IV math.OC stat.ML

    Short-and-Sparse Deconvolution -- A Geometric Approach

    Authors: Yenson Lau, Qing Qu, Han-Wen Kuo, Pengcheng Zhou, Yuqian Zhang, John Wright

    Abstract: Short-and-sparse deconvolution (SaSD) is the problem of extracting localized, recurring motifs in signals with spatial or temporal structure. Variants of this problem arise in applications such as image deblurring, microscopy, neural spike sorting, and more. The problem is challenging in both theory and practice, as natural optimization formulations are nonconvex. Moreover, practical deconvolution… ▽ More

    Submitted 1 October, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: *YL and QQ contributed equally to this work; 30 figures, 45 pages; This version: added an experiment comparing with other methods, corrected typos and added references

  18. arXiv:1908.10776  [pdf, ps, other

    eess.SP cs.LG eess.IV math.OC stat.ML

    A Nonconvex Approach for Exact and Efficient Multichannel Sparse Blind Deconvolution

    Authors: Qing Qu, Xiao Li, Zhihui Zhu

    Abstract: We study the multi-channel sparse blind deconvolution (MCS-BD) problem, whose task is to simultaneously recover a kernel $\mathbf a$ and multiple sparse inputs $\{\mathbf x_i\}_{i=1}^p$ from their circulant convolution $\mathbf y_i = \mathbf a \circledast \mathbf x_i $ ($i=1,\cdots,p$). We formulate the task as a nonconvex optimization problem over the sphere. Under mild statistical assumptions of… ▽ More

    Submitted 29 February, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: 62 pages, 6 figures; short version accepted as a spotlight paper at NeurIPS'19 (https://papers.nips.cc/paper/8656-a-nonconvex-approach-for-exact-and-efficient-multichannel-sparse-blind-deconvolution) ; A long journal version is under revision at SIIMS

  19. arXiv:1712.00716  [pdf, other

    stat.CO cs.IT math.NA math.OC stat.ML

    Convolutional Phase Retrieval via Gradient Descent

    Authors: Qing Qu, Yuqian Zhang, Yonina C. Eldar, John Wright

    Abstract: We study the convolutional phase retrieval problem, of recovering an unknown signal $\mathbf x \in \mathbb C^n $ from $m$ measurements consisting of the magnitude of its cyclic convolution with a given kernel $\mathbf a \in \mathbb C^m $. This model is motivated by applications such as channel estimation, optics, and underwater acoustic communication, where the signal of interest is acted on by a… ▽ More

    Submitted 5 October, 2019; v1 submitted 3 December, 2017; originally announced December 2017.

    Comments: 64 pages , 9 figures, appeared in NeurIPS 2017. Accepted at IEEE Transactions on Information Theory. This is the final (minor) update: fixed typos and grammar issues

  20. arXiv:1602.06664  [pdf, other

    cs.IT math.OC stat.ML

    A Geometric Analysis of Phase Retrieval

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: Can we recover a complex signal from its Fourier magnitudes? More generally, given a set of $m$ measurements, $y_k = |\mathbf a_k^* \mathbf x|$ for $k = 1, \dots, m$, is it possible to recover $\mathbf x \in \mathbb{C}^n$ (i.e., length-$n$ complex vector)? This **generalized phase retrieval** (GPR) problem is a fundamental task in various disciplines, and has been the subject of much recent invest… ▽ More

    Submitted 1 January, 2017; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: 61 pages, 5 figures. A short version can be found here http://sunju.org/docs/PR_G4_16.pdf . Revised according to reviewers' feedback

    Journal ref: Foundations of Computational Mathematics, 18(5):1131--1198, 2018

  21. arXiv:1511.04777  [pdf, other

    cs.IT cs.CV math.OC stat.ML

    Complete Dictionary Recovery over the Sphere II: Recovery by Riemannian Trust-region Method

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and fin… ▽ More

    Submitted 1 September, 2016; v1 submitted 15 November, 2015; originally announced November 2015.

    Comments: The second of two papers based on the report arXiv:1504.06785. Accepted by IEEE Transaction on Information Theory; revised according to the reviewers' comments

    Journal ref: IEEE Trans. Information Theory, 63(2): 885 - 914 (2017)

  22. arXiv:1511.03607  [pdf, other

    cs.IT cs.CV math.OC stat.ML

    Complete Dictionary Recovery over the Sphere I: Overview and the Geometric Picture

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and fin… ▽ More

    Submitted 1 September, 2016; v1 submitted 11 November, 2015; originally announced November 2015.

    Comments: Accepted by IEEE Transaction on Information Theory; revised according to the reviewers' comments

    Journal ref: IEEE Trans. Information Theory, 63(2): 853 - 884 (2017)

  23. arXiv:1510.06096  [pdf, other

    math.OC cs.IT stat.ML

    When Are Nonconvex Problems Not Scary?

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: In this note, we focus on smooth nonconvex optimization problems that obey: (1) all local minimizers are also global; and (2) around any saddle point or local maximizer, the objective has a negative directional curvature. Concrete applications such as dictionary learning, generalized phase retrieval, and orthogonal tensor decomposition are known to induce such structures. We describe a second-orde… ▽ More

    Submitted 22 April, 2016; v1 submitted 20 October, 2015; originally announced October 2015.

    Comments: 6 pages, 3 figures. New examples on phase synchronization and community detection added; emphasis on all local minimizers being global added; exposition is polished. This is a concise expository article that avoids much technical rigor. We will make a separate submission with full technical details in future

  24. arXiv:1505.03627  [pdf, ps, other

    math.DG

    Killing Vector Fields on Multiply Warped Products with a Semi-symmetric Metric Connection

    Authors: Quan Qu

    Abstract: In this paper, we define a semi-symmetric metric Killing vector field, then study semi-symmetric metric Killing vector fields on warped and multiply warped products with a semi-symmetric metric connection. We also study Killing and 2-Killing vector fields on multiply warped products.

    Submitted 14 May, 2015; originally announced May 2015.

    Comments: 13 pages

    MSC Class: 53B05; 53C21; 53C25; 53C50; 53C80

  25. arXiv:1505.03319  [pdf, ps, other

    math.DG

    Quasi-Einstein and Generalized Quasi-Einstein Warped Products with an Affine Connection

    Authors: Quan Qu

    Abstract: In this paper, we study the quasi-Einstein and generalized quasi-Einstein warped products with a semi-symmetric non-metric connection. We give the expressions of the Ricci tensors and scalar curvatures for the bases and fibres. In some cases we give some obstructions to the existence of the quasi-Einstein and generalized quasi-Einstein warped products with a semi-symmetric non-metric connection.

    Submitted 13 May, 2015; originally announced May 2015.

    Comments: 16 pages

    MSC Class: 53B05; 53C25

  26. arXiv:1504.06785  [pdf, other

    cs.IT cs.CV cs.LG math.OC stat.ML

    Complete Dictionary Recovery over the Sphere

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb R^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to the theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals, and… ▽ More

    Submitted 17 November, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: 104 pages, 5 figures. Due to length constraint of publication, this long paper are subsequently divided into two papers (arXiv:1511.03607 and arXiv:1511.04777). Further updates will be made only to the two papers

    MSC Class: 68P30; 58C05; 94A12; 94A08; 68T05; 90C26; 90C48; 90C55

  27. arXiv:1412.4659  [pdf, other

    cs.IT cs.CV cs.LG math.OC stat.ML

    Finding a sparse vector in a subspace: Linear sparsity using alternating directions

    Authors: Qing Qu, Ju Sun, John Wright

    Abstract: Is it possible to find the sparsest vector (direction) in a generic subspace $\mathcal{S} \subseteq \mathbb{R}^p$ with $\mathrm{dim}(\mathcal{S})= n < p$? This problem can be considered a homogeneous variant of the sparse recovery problem, and finds connections to sparse dictionary learning, sparse PCA, and many other problems in signal processing and machine learning. In this paper, we focus on a… ▽ More

    Submitted 19 July, 2016; v1 submitted 15 December, 2014; originally announced December 2014.

    Comments: Accepted by IEEE Trans. Information Theory. The paper has been revised by the reviewers' comments. The proofs have been streamlined

    Journal ref: IEEE Transaction on Information Theory, 62(10):5855 - 5880, 2016

  28. arXiv:1410.0170  [pdf, ps, other

    math.DG

    Multiply Warped Products with a Quarter-symmetric Connection

    Authors: Quan Qu, Yong Wang

    Abstract: In this paper, we study the Einstein warped products and multiply warped products with a quarter-symmetric connection. We also study warped products and multiply warped products with a quarter-symmetric connection with constant scalar curvature. Then apply our results to generalized Robertson-Walker spacetimes with a quarter-symmetric connection and generalized Kasner space-times with a quarter-sy… ▽ More

    Submitted 1 October, 2014; originally announced October 2014.

    Comments: 41 pages. arXiv admin note: text overlap with arXiv:1207.5092