Skip to main content

Showing 1–50 of 466 results for author: Zhang, T

Searching in archive math. Search in all archives.
.
  1. arXiv:2406.19976  [pdf, other

    cs.LG math.OC

    ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

    Authors: Rui Pan, Jipeng Zhang, Xingyuan Pan, Renjie Pi, Xiaoyu Wang, Tong Zhang

    Abstract: Bilevel optimization has shown its utility across various machine learning settings, yet most algorithms in practice require second-order information, making it challenging to scale them up. Only recently, a paradigm of first-order algorithms emerged, capable of effectively addressing bilevel optimization problems. Nevertheless, the practical efficiency of this paradigm remains unverified, particu… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.15244  [pdf, other

    cs.LG math.OC

    Large Batch Analysis for Adagrad Under Anisotropic Smoothness

    Authors: Yuxing Liu, Rui Pan, Tong Zhang

    Abstract: Adaptive gradient algorithms have been widely adopted in training large-scale deep neural networks, especially large foundation models. Despite their huge success in practice, their theoretical advantages over stochastic gradient descent (SGD) have not been fully understood, especially in the large batch-size setting commonly used in practice. This is because the only theoretical result that can d… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.04558  [pdf, other

    cs.LG math.OC

    On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization

    Authors: Motahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada

    Abstract: Constrained optimization offers a powerful framework to prescribe desired behaviors in neural network models. Typically, constrained problems are solved via their min-max Lagrangian formulations, which exhibit unstable oscillatory dynamics when optimized using gradient descent-ascent. The adoption of constrained optimization techniques in the machine learning community is currently limited by the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Published at ICML 2024. Code available at https://github.com/motahareh-sohrabi/nuPI

  4. arXiv:2405.19003  [pdf, other

    math.NA

    A structure-preserving scheme for computing effective diffusivity and anomalous diffusion phenomena of random flows

    Authors: Tan Zhang, Zhongjian Wang, Jack Xin, Zhiwen Zhang

    Abstract: This paper aims to investigate the diffusion behavior of particles moving in stochastic flows under a structure-preserving scheme. We compute the effective diffusivity for normal diffusive random flows and establish the power law between spatial and temporal variables for cases with anomalous diffusion phenomena. From a Lagrangian approach, we separate the corresponding stochastic differential equ… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 39pages, 10 figures, planning to submit for Journal of Scientific Computing or Numerische Mathematik

    MSC Class: 37M25; 60J60; 60H35; 65P10; 65M75; 76M50

  5. arXiv:2405.17764  [pdf, other

    cs.CL cs.AI math.ST

    On the Sequence Evaluation based on Stochastic Processes

    Authors: Tianhao Zhang, Zhexiao Lin, Zhecheng Sheng, Chen Jiang, Dongyeop Kang

    Abstract: Modeling and analyzing long sequences of text is an essential task for Natural Language Processing. Success in capturing long text dynamics using neural language models will facilitate many downstream tasks such as coherence evaluation, text generation, machine translation and so on. This paper presents a novel approach to model sequences through a stochastic process. We introduce a likelihood-bas… ▽ More

    Submitted 15 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.00414  [pdf, ps, other

    math.PR

    Ergodicity for 2D Navier-Stokes equations with a degenerate pure jump noise

    Authors: Xuhui Peng, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we establish the ergodicity for stochastic 2D Navier-Stokes equations driven by a highly degenerate pure jump Lévy noise. The noise could appear in as few as four directions. This gives an affirmative anwser to a longstanding problem. The case of Gaussian noise was treated in Hairer and Mattingly [\emph{Ann. of Math.}, 164(3):993--1032, 2006]. To obtain the uniqueness of invariant m… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  7. arXiv:2404.12849  [pdf, ps, other

    math.FA math.OA

    An improvement and generalization of Rotfel'd type inequalities for sectorial matrices

    Authors: Nan Fanghong, Teng Zhang

    Abstract: Byusing equivalence conditions for sectorial matrices obtained by Alakhrass and Sababheh in 2020, we improve a Rotfel'd type inequality for sectorial matrices derived by P. Zhang in 2015 and generalize a result derived by Y. Mao et al. in 2024.

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 10pages

    MSC Class: 15A45; 15A60

  8. arXiv:2404.10656  [pdf, ps, other

    math.CO

    The foundation of generalized parallel connections, 2-sums, and segment-cosegment exchanges of matroids

    Authors: Matthew Baker, Oliver Lorscheid, Zach Walsh, Tianyi Zhang

    Abstract: We show that, under suitable hypotheses, the foundation of a generalized parallel connection of matroids is the relative tensor product of the foundations. Using this result, we show that the foundation of a 2-sum of matroids is the absolute tensor product of the foundations, and that the foundation of a matroid is invariant under segment-cosegment exchange.

    Submitted 16 April, 2024; originally announced April 2024.

    MSC Class: 05B35

  9. arXiv:2403.18658  [pdf, ps, other

    math.ST stat.ML

    Theoretical Guarantees for the Subspace-Constrained Tyler's Estimator

    Authors: Gilad Lerman, Feng Yu, Teng Zhang

    Abstract: This work analyzes the subspace-constrained Tyler's estimator (STE) designed for recovering a low-dimensional subspace within a dataset that may be highly corrupted with outliers. It assumes a weak inlier-outlier model and allows the fraction of inliers to be smaller than a fraction that leads to computational hardness of the robust subspace recovery problem. It shows that in this setting, if the… ▽ More

    Submitted 12 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  10. arXiv:2403.17919  [pdf, other

    cs.LG cs.AI cs.CL math.OC

    LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

    Authors: Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang

    Abstract: The machine learning community has witnessed impressive advancements since large language models (LLMs) first appeared. Yet, their massive memory consumption has become a significant roadblock to large-scale training. For instance, a 7B model typically requires at least 60 GB of GPU memory with full parameter training, which presents challenges for researchers without access to high-resource envir… ▽ More

    Submitted 25 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  11. arXiv:2403.14969  [pdf, ps, other

    math.DS

    Dynamics of a memory-based diffusion model with spatial heterogeneity and nonlinear boundary condition

    Authors: Quanli Ji, Ranchao Wu, Tonghua Zhang

    Abstract: In this work, we study the dynamics of a spatially heterogeneous single population model with the memory effect and nonlinear boundary condition. By virtue of the implicit function theorem and Lyapunov-Schmidt reduction, spatially nonconstant positive steady state solutions appear from two trivial solutions, respectively. By using bifurcation analysis, the Hopf bifurcation associated with one spat… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  12. arXiv:2403.06183  [pdf, other

    cs.LG math.OC math.ST stat.ML

    An Improved Analysis of Langevin Algorithms with Prior Diffusion for Non-Log-Concave Sampling

    Authors: Xunpeng Huang, Hanze Dong, Difan Zou, Tong Zhang

    Abstract: Understanding the dimension dependency of computational complexity in high-dimensional sampling problem is a fundamental problem, both from a practical and theoretical perspective. Compared with samplers with unbiased stationary distribution, e.g., Metropolis-adjusted Langevin algorithm (MALA), biased samplers, e.g., Underdamped Langevin Dynamics (ULD), perform better in low-accuracy cases just be… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 32 pages

  13. arXiv:2403.05679  [pdf, other

    stat.ME math.ST stat.AP

    Debiased Projected Two-Sample Comparisonscfor Single-Cell Expression Data

    Authors: Tianyu Zhang, **g Lei, Kathryn Roeder

    Abstract: We study several variants of the high-dimensional mean inference problem motivated by modern single-cell genomics data. By taking advantage of low-dimensional and localized signal structures commonly seen in such data, our proposed methods not only have the usual frequentist validity but also provide useful information on the potential locations of the signal if the null hypothesis is rejected. Ou… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  14. arXiv:2403.02704  [pdf, ps, other

    math.OC

    Projected Gradient Descent Algorithm for Low-Rank Matrix Estimation

    Authors: Teng Zhang, Xing Fan

    Abstract: Most existing methodologies of estimating low-rank matrices rely on Burer-Monteiro factorization, but these approaches can suffer from slow convergence, especially when dealing with solutions characterized by a large condition number, defined by the ratio of the largest to the $r$-th singular values, where $r$ is the search rank. While methods such as Scaled Gradient Descent have been proposed to… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  15. arXiv:2403.01388  [pdf, ps, other

    math.PR

    Wong-Zakai approximations and support theorems for SDEs under Lyapunov conditions

    Authors: Qi Li, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we establish the Stroock-Varadhan type support theorems for stochastic differential equations (SDEs) under Lyapunov conditions, which significantly improve the existing results in the literature where the coefficients of the SDEs are required to be globally Lipschitz and of linear growth. Our conditions are very mild to include many important models, e.g. Threshold Ornstein-Ulenbeck… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  16. arXiv:2403.01125  [pdf, ps, other

    math.PR

    Large Deviation Principle of Stochastic Evolution Equations with reflection

    Authors: Zdzisław Brzeźniak, Qi Li, Tusheng Zhang

    Abstract: In this paper, we establish a large deviation principle for stochastic evolution equations with reflection in an infinite dimensional ball. Weak convergence approach plays an important role.

    Submitted 2 March, 2024; originally announced March 2024.

  17. arXiv:2402.17468  [pdf, ps, other

    math.AG

    Noether inequality for irregular threefolds of general type

    Authors: Yong Hu, Tong Zhang

    Abstract: Let $X$ be a smooth irregular $3$-fold of general type over $\mathbb{C}$. We prove that the optimal Noether inequality $$ \mathrm{vol}(X) \ge \frac{4}{3}p_g(X) $$ holds if $p_g(X) \ge 16$ or if $X$ has a Gorenstein minimal model. Moreover, when $X$ attains the equality and $p_g(X) \ge 16$, its canonical model can be explicitly described.

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Comments are very welcome

  18. arXiv:2402.17421  [pdf, ps, other

    math.CO

    Toughness and Aα-spectral radius in graphs

    Authors: Sizhong Zhou, Yuli Zhang, Tao Zhang, Hongxia Liu

    Abstract: Let $α\in[0,1)$, and let $G$ be a connected graph of order $n$ with $n\geq f(α)$, where $f(α)=6$ for $α\in[0,\frac{2}{3}]$ and $f(α)=\frac{4}{1-α}$ for $α\in(\frac{2}{3},1)$. A graph $G$ is said to be $t$-tough if $|S|\geq tc(G-S)$ for each subset $S$ of $V(G)$ with $c(G-S)\geq2$, where $c(G-S)$ is the number of connected components in $G-S$. The $A_α$-spectral radius of $G$ is denoted by… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages

    MSC Class: 05C50

  19. arXiv:2402.16522  [pdf, other

    math.PR

    Uniform large deviations and metastability of random dynamical systems

    Authors: Jifa Jiang, Jian Wang, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we first provide a criterion on uniform large deviation principles (ULDP) of stochastic differential equations under Lyapunov conditions on the coefficients, which can be applied to stochastic systems with coefficients of polynomial growth and possible degenerate driving noises. In the second part, using the ULDP criterion we preclude the concentration of limiting measures of invari… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    MSC Class: 60B10; 60F10; 60H10; 37A50; 37C70

  20. arXiv:2402.03644  [pdf, ps, other

    math.CO

    Signed Mahonian Polynomials on Derangements in Classical Weyl Groups

    Authors: Kathy Q. Ji, Dax T. X. Zhang

    Abstract: The polynomial of the major index ${\rm maj}_W (σ)$ over the subset $T$ of the Coxeter group $W$ is called the Mahonian polynomial over $T$, where ${\rm maj}_W (σ)$ is a Mahonian statistic of an element $σ\in T$, whereas the polynomial of the major index ${\rm maj}_W (σ)$ with the sign $(-1)^{\ell_W(σ)}$ over the subset $T$ is referred to as the signed Mahonian polynomial over $T$, where… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 32 pages

  21. arXiv:2401.15571  [pdf, ps, other

    math.CO cs.IT

    New results on sparse representations in unions of orthonormal bases

    Authors: Tao Zhang, Gennian Ge

    Abstract: The problem of sparse representation has significant applications in signal processing. The spark of a dictionary plays a crucial role in the study of sparse representation. Donoho and Elad initially explored the spark, and they provided a general lower bound. When the dictionary is a union of several orthonormal bases, Gribonval and Nielsen presented an improved lower bound for spark. In this pap… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  22. arXiv:2401.14632  [pdf, ps, other

    math.CO

    Newton polytopes of dual $k$-Schur polynomials

    Authors: Bo Wang, Candice X. T. Zhang, Zhong-Xue Zhang

    Abstract: Rado's theorem about permutahedra and dominance order on partitions reveals that each Schur polynomial is M-convex, or equivalently, it has a saturated Newton polytope and this polytope is a generalized permutahedron as well. In this paper we show that the support of each dual $k$-Schur polynomial indexed by a $k$-bounded partition coincides with that of the Schur polynomial indexed by the same pa… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 20 pages, 8 figures

  23. arXiv:2401.07038  [pdf, other

    math.ST econ.EM

    A simple stochastic nonlinear AR model with application to bubble

    Authors: Xuanling Yang, Dong Li, Ting Zhang

    Abstract: Economic and financial time series can feature locally explosive behavior when a bubble is formed. The economic or financial bubble, especially its dynamics, is an intriguing topic that has been attracting longstanding attention. To illustrate the dynamics of the local explosion itself, the paper presents a novel, simple, yet useful time series model, called the stochastic nonlinear autoregressive… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 41 pages, 6 figures

  24. arXiv:2401.06325  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo

    Authors: Xunpeng Huang, Difan Zou, Hanze Dong, Yian Ma, Tong Zhang

    Abstract: To sample from a general target distribution $p_*\propto e^{-f_*}$ beyond the isoperimetric condition, Huang et al. (2023) proposed to perform sampling through reverse diffusion, giving rise to Diffusion-based Monte Carlo (DMC). Specifically, DMC follows the reverse SDE of a diffusion process that transforms the target distribution to the standard Gaussian, utilizing a non-parametric score estimat… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 54 pages

  25. arXiv:2401.05456  [pdf, ps, other

    math.FA math.OA

    Proof of Audenaert-Kittaneh's Conjecture

    Authors: Teng Zhang

    Abstract: By using Hadamard's 3-lines theorem for a certain analytic function defined in terms of the trace, we show that Audenaert-Kittaneh's Conjecture related to $p$-Schatten class.

    Submitted 26 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 8 pages

    MSC Class: 47A30; 46C15; 47B10; 47B15

  26. arXiv:2401.01220  [pdf, other

    math.NA

    Solving multiscale dynamical systems by deep learning

    Authors: Zhi-Qin John Xu, Junjie Yao, Yuxiao Yi, Liangkai Hang, Weinan E, Yaoyu Zhang, Tianhan Zhang

    Abstract: Multiscale dynamical systems, modeled by high-dimensional stiff ordinary differential equations (ODEs) with wide-ranging characteristic timescales, arise across diverse fields of science and engineering, but their numerical solvers often encounter severe efficiency bottlenecks. This paper introduces a novel DeePODE method, which consists of a global multiscale sampling method and a fitting by deep… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 7 pages, 6 figures

  27. arXiv:2312.14567  [pdf, other

    cs.LG math.OC

    Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

    Authors: Rui Pan, Yuxing Liu, Xiaoyu Wang, Tong Zhang

    Abstract: Heavy-ball momentum with decaying learning rates is widely used with SGD for optimizing deep learning models. In contrast to its empirical popularity, the understanding of its theoretical property is still quite limited, especially under the standard anisotropic gradient noise condition for quadratic regression problems. Although it is widely conjectured that heavy-ball momentum method can provide… ▽ More

    Submitted 17 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Published at ICLR 2024

  28. arXiv:2312.13080  [pdf, other

    math-ph hep-th math.RT

    Langlands Dualities through Bethe/Gauge Correspondence for 3d Gauge Theories

    Authors: Xiang-Mao Ding, Ting Zhang

    Abstract: For non-simple laced Lie algebras, the $\text{B}_{N}$ and $\text{C}_{N}$ are Langlands dual to each other in mathematical. In this article, we give another Bethe/Gauge correspondence between 3d (or 2d) classical Lie group supersymmetry gauge theory with closed and open $\text{XXZ}$ (or $\text{XXX}$) spin chain. Here, the representations of the $\text{ADE}$ Lie algebras are self-dual, and while for… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 25 pages, one figure

  29. arXiv:2312.07023  [pdf, ps, other

    math.OC

    On the proximal point algorithms for solving the monotone inclusion problem

    Authors: Tao Zhang, Shiru Li, Yong Xia

    Abstract: We consider finding a zero point of the maximally monotone operator $T$. First, instead of using the proximal point algorithm (PPA) for this purpose, we employ PPA to solve its Yosida regularization $T_λ$. Then, based on an $O(a_{k+1})$ ($a_{k+1}\geq \varepsilon>0$) resolvent index of $T$, it turns out that we can establish a convergence rate of $O (1/{\sqrt{\sum_{i=0}^{k}a_{i+1}^2}})$ for both th… ▽ More

    Submitted 22 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    MSC Class: 47H09; 47H10; 90C25; 90C30

  30. arXiv:2311.13794  [pdf, ps, other

    math.OC cs.IT

    Error estimation for the non-convex cosparse optimization problem

    Authors: Zisheng Liu, Ting Zhang

    Abstract: When the signal does not have a sparse structure but has sparsity under a certain transformation domain, Nam et al. \cite{NS} introduced the cosparse analysis model, which provides a dual perspective on the sparse representation model. This paper mainly discusses the error estimation of non-convex $\ell_p(0<p<1)$ relaxation cosparse optimization model with noise condition. Compared with the existi… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    MSC Class: 90C25

  31. arXiv:2311.12902  [pdf, other

    cs.LG math.DS math.NA

    Local Convolution Enhanced Global Fourier Neural Operator For Multiscale Dynamic Spaces Prediction

    Authors: Xuanle Zhao, Yue Sun, Tielin Zhang, Bo Xu

    Abstract: Neural operators extend the capabilities of traditional neural networks by allowing them to handle map**s between function spaces for the purpose of solving partial differential equations (PDEs). One of the most notable methods is the Fourier Neural Operator (FNO), which is inspired by Green's function method and approximate operator kernel directly in the frequency domain. In this work, we focu… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures

  32. arXiv:2311.02696  [pdf, ps, other

    math.FA math.CA

    Commutators for certain fractional type operators on weighted spaces and Orlicz-Morrey spaces

    Authors: Huoxiong Wu Tong Zhang

    Abstract: In this paper, we focus on a class of fractional type integral operators that can be served as extensions of Riesz potential with kernels $$K(x,y)=\frac{Ω_1(x-A_1 y)}{|x-A_1 y |^{\frac{n}{q_1}}} \cdots \frac{Ω_m(x-A_m y)}{|x-A_m y |^{\frac{n}{q_m}}},$$ where $α\in [0,n), m\geqslant1, \sum_{i=1}^m\frac{n}{q_i}=n-α$, $\{A_i\}^m_{i=1}$ are invertible matrixes, $Ω_i$ is homogeneous of degree 0 on… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  33. arXiv:2311.02490  [pdf, other

    math.NA math.OC stat.ML

    Improved Convergence Rates of Windowed Anderson Acceleration for Symmetric Fixed-Point Iterations

    Authors: Casey Garner, Gilad Lerman, Teng Zhang

    Abstract: This paper studies the commonly utilized windowed Anderson acceleration (AA) algorithm for fixed-point methods, $x^{(k+1)}=q(x^{(k)})$. It provides the first proof that when the operator $q$ is linear and symmetric the windowed AA, which uses a sliding window of prior iterates, improves the root-linear convergence factor over the fixed-point iterations. When $q$ is nonlinear, yet has a symmetric J… ▽ More

    Submitted 8 March, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 32 pages, 14 figures

    MSC Class: 65F10; 65H10; 68W40

  34. arXiv:2310.19952  [pdf, ps, other

    math.CO

    Foundations of matroids -- Part 2: Further theory, examples, and computational methods

    Authors: Matthew Baker, Oliver Lorscheid, Tianyi Zhang

    Abstract: In this sequel to "Foundations of matroids - Part 1", we establish several presentations of the foundation of a matroid in terms of small building blocks. For example, we show that the foundation of a matroid M is the colimit of the foundations of all embedded minors of M isomorphic to one of the matroids $U^2_4$, $U^2_5$, $U^3_5$, $C_5$, $C_5^\ast$, $U^2_4\oplus U^1_2$, $F_7$, $F_7^\ast$, and we… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 69 pages

  35. arXiv:2310.13614  [pdf, ps, other

    math.RA math.RT

    Lie triple 2-algebras

    Authors: Tao Zhang, Zhang-Ju Liu

    Abstract: We invent a new cohomology theory for Lie triple algebras. Using this cohomology, we introduce the notions of 2-term $L_\infty$-triple algebras and Lie triple 2-algebras. We prove that the category of 2-term $L_\infty$-triple algebras is equivalent to the category of Lie triple 2-algebras. Crossed modules of Lie triple algebras are studied in detail.

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 26 pages. arXiv admin note: text overlap with arXiv:1601.01196 by other authors

    MSC Class: 17A99; 17B56; 18N25; 18G45

  36. arXiv:2310.13334  [pdf, ps, other

    math.OC

    Convergence analysis on the alternating direction method of multipliers for the cosparse optimization problem

    Authors: Zisheng Liu, Ting Zhang

    Abstract: From a dual perspective of the sparse representation model, Nam et al. proposed the cosparse analysis model. In this paper, we aim to investigate the convergence of the alternating direction method of multipliers (ADMM) for the cosparse optimization problem. First, we examine the variational inequality representation of the cosparse optimization problem by introducing auxiliary variables. Second,… ▽ More

    Submitted 22 November, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 15 page

    MSC Class: 90C25

  37. arXiv:2310.12140  [pdf, other

    math.ST stat.ME stat.ML

    Online Estimation with Rolling Validation: Adaptive Nonparametric Estimation with Streaming Data

    Authors: Tianyu Zhang, **g Lei

    Abstract: Online nonparametric estimators are gaining popularity due to their efficient computation and competitive generalization abilities. An important example includes variants of stochastic gradient descent. These algorithms often take one sample point at a time and instantly update the parameter estimate of interest. In this work we consider model selection and hyperparameter tuning for such online al… ▽ More

    Submitted 4 April, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  38. arXiv:2310.08930  [pdf, ps, other

    math.NA math.CA

    When $D$-companion matrix meets incomplete polynomials

    Authors: Teng Zhang

    Abstract: In this paper, we provide a simple proof of a generalization of the Gauss-Lucas theorem. By using methods of D-companion matrix, we get the majorization relationship between the zeros of convex combinations of incomplete polynomials and an origin polynomial. Moreover, we prove that the set of all zeros of all convex combinations of incomplete polynomials coincides with the closed convex hull of ze… ▽ More

    Submitted 7 January, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 13 pages

    MSC Class: 15A42; 15A18

  39. arXiv:2310.07719  [pdf, ps, other

    math.RA math.RT

    Deformations and extensions of homotopy associative algebras

    Authors: Tao Zhang

    Abstract: The representation and the cohomology theory of associative 2-algebras are developed. We study the deformations and abelian extensions of associative 2-algebras in details.

    Submitted 28 December, 2023; v1 submitted 28 July, 2023; originally announced October 2023.

    Comments: 29 pages, 0 figures, continuation of arXiv:1306.6225. arXiv admin note: text overlap with arXiv:2304.13071

    MSC Class: 16E99; 18N40

  40. arXiv:2310.03247  [pdf, ps, other

    math.NA math.AP

    Robust globally divergence-free Weak Galerkin finite element method for incompressible Magnetohydrodynamics flow

    Authors: Min Zhang, Tong Zhang, Abstract: This paper develops a weak Galerkin (WG) finite element method of arbitrary order for the steady incompressible Magnetohydrodynamics equations. The WG scheme uses piecewise polynomials of degrees $k(k\geq 1),k,k-1$, and $k-1$ respectively for the approximations of the velocity, the magnetic field, the pressure, and the magnetic pseudo-pressure in the interior of elements, and uses piecewise polyno… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  41. arXiv:2309.10315  [pdf, ps, other

    math.RA math-ph

    On (co-)morphisms of $n$-Lie-Rinehart algebras with applications to Nambu-Poisson manifolds

    Authors: Yanhui Bi, Zhixiong Chen, Tao Zhang

    Abstract: In this paper, we give a unified description of morphisms and comorphisms of $n$-Lie-Rinehart algebras. We show that these morphisms and comorphisms can be regarded as two subalgebras of the $ψ$-sum of $n$-Lie-Rinehart algebras. We also provide similar descriptions for morphisms and comorphisms of $n$-Lie algebroids. It is proved that the category of vector bundles with Nambu-Poisson structures of… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 23

  42. arXiv:2309.05903  [pdf, ps, other

    math.CO

    Interlacing property of a family of generating polynomials over Dyck paths

    Authors: Bo Wang, Candice X. T. Zhang

    Abstract: In the study of a tantalizing symmetry on Catalan objects, Bóna et al. introduced a family of polynomials $\{W_{n,k}(x)\}_{n\geq k\geq 0}$ defined by \begin{align*} W_{n,k}(x)=\sum_{m=0}^{k}w_{n,k,m}x^{m}, \end{align*} where $w_{n,k,m}$ counts the number of Dyck paths of semilength $n$ with $k$ occurrences of $UD$ and $m$ occurrences of $UUD$. They proposed two conjectures on the interlacing prope… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  43. arXiv:2309.04885  [pdf, other

    cs.LG math.SG

    Symplectic Structure-Aware Hamiltonian (Graph) Embeddings

    Authors: Jiaxu Liu, ** Yi, Tianle Zhang, Xiaowei Huang

    Abstract: In traditional Graph Neural Networks (GNNs), the assumption of a fixed embedding manifold often limits their adaptability to diverse graph geometries. Recently, Hamiltonian system-inspired GNNs have been proposed to address the dynamic nature of such embeddings by incorporating physical laws into node feature updates. We present Symplectic Structure-Aware Hamiltonian GNN (SAH-GNN), a novel approac… ▽ More

    Submitted 1 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 5 pages main content with 5 pages appendix

  44. arXiv:2309.01251  [pdf, ps, other

    math.PR

    Reflection of Stochastic Evolution Equations in Infinite Dimensional Domains

    Authors: Zdzisław Brzeźniak, Tusheng Zhang

    Abstract: In this paper, we establish the existence and the uniqueness of solutions of stochastic evolution equations (SEEs) with reflection in an infinite dimensional ball. Our framework is sufficiently general to include e.g. the stochastic Navier-Stokes equations.

    Submitted 3 September, 2023; originally announced September 2023.

    MSC Class: Primary 60H15 Secondary 60J60; 35R60

  45. arXiv:2309.00432  [pdf, ps, other

    math.NT math.AG math.RA

    Drinfeld Modular Curves Subordinate to Conjugacy Classes of Nilpotent Upper-Triangular Matrices

    Authors: Zhuo Chen, Chuangqiang Hu, Tao Zhang, Xiaopeng Zheng

    Abstract: We introduce normalized Drinfeld modular curves that parameterize rank $m$ Drinfeld modules compatible with a $T$-torsion structure arising from a given conjugacy class of nilpotent upper-triangular $n\times n$ matrices with rank $\geqslant n-m$ over a finite field $\mathbb{F}_q$. This creates a deep link connecting the classification of nilpotent upper-triangular matrices and the decomposition of… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 43 pages

    MSC Class: 11G09; 11R58; 14H05; 11G20

  46. arXiv:2308.14479  [pdf, other

    math.NA

    A convergent interacting particle method for computing KPP front speeds in random flows

    Authors: Tan Zhang, Zhongjian Wang, Jack Xin, Zhiwen Zhang

    Abstract: We aim to efficiently compute spreading speeds of reaction-diffusion-advection (RDA) fronts in divergence free random flows under the Kolmogorov-Petrovsky-Piskunov (KPP) nonlinearity. We study a stochastic interacting particle method (IPM) for the reduced principal eigenvalue (Lyapunov exponent) problem of an associated linear advection-diffusion operator with spatially random coefficients. The Fo… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 36 pages, 17 figures, planning to submit for SIAM/ASA Journal on Uncertainty Quantification

    MSC Class: 35K57; 47D08; 65C35; 65L20; 65N25

  47. arXiv:2308.13776  [pdf, ps, other

    math.OC

    A VMiPG method for composite optimization with nonsmooth term having no closed-form proximal map**

    Authors: Taiwei Zhang, Shaohua Pan, Ruyu Liu

    Abstract: This paper concerns the minimization of the sum of a twice continuously differentiable function $f$ and a nonsmooth convex function $g$ without closed-form proximal map**. For this class of nonconvex and nonsmooth problems, we propose a line-search based variable metric inexact proximal gradient (VMiPG) method with uniformly bounded positive definite variable metric linear operators. This method… ▽ More

    Submitted 6 April, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

  48. arXiv:2308.06856  [pdf, ps, other

    math.AP

    On the large time asymptotics of bi-laplacian Schrödinger equation with general data

    Authors: Avy Soffer, Jiayan Wu, Xiaoxu Wu, Ting Zhang

    Abstract: We study the bi-laplacian Schrödinger equation with a general interaction term, which can be either linear or nonlinear, and is time-dependent. We prove that the global solutions for this equation are asymptotically given by a free wave and a weakly localized part. The proof relies on constructing the Free Channel Wave Operator in a new way, based on the method developed from recent studies \cite{… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 27 pages. Comments welcome

    MSC Class: 35Q55

  49. arXiv:2308.05650  [pdf, other

    math.NA

    Asymptotic-preserving neural networks for multiscale Vlasov-Poisson-Fokker-Planck system in the high-field regime

    Authors: Shi **, Zheng Ma, Tian-ai Zhang

    Abstract: The Vlasov-Poisson-Fokker-Planck (VPFP) system is a fundamental model in plasma physics that describes the Brownian motion of a large ensemble of particles within a surrounding bath. Under the high-field scaling, both collision and field are dominant. This paper introduces two Asymptotic-Preserving Neural Network (APNN) methods within a physics-informed neural network (PINN) framework for solving… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  50. arXiv:2308.04023  [pdf, ps, other

    math.DS math.GR

    Patterson-Sullivan measures for relatively Anosov groups

    Authors: Richard Canary, Andrew Zimmer, Tengren Zhang

    Abstract: We establish existence, uniqueness and ergodicity results for Patterson-Sullivan measures for relatively Anosov groups. As applications we obtain an entropy gap theorem and a strict concavity result for entropies associated to linear functionals.

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 40 pages

    MSC Class: 37A17; 22F10; 22E40; 53C30