Search | arXiv e-print repository

Attainability and criticality for multipolar Rellich inequality

Authors: Yongyang **, Shoufeng Shen, Li Tang

Abstract: In this paper we obtain optimal multipolar Rellich inequality for biharmonic Schrodinger operator with positive multi-singular potentials. Moreover, we prove the attainability of the best constant and the criticality of the biharmonic Schrodinger operator. In this paper we obtain optimal multipolar Rellich inequality for biharmonic Schrodinger operator with positive multi-singular potentials. Moreover, we prove the attainability of the best constant and the criticality of the biharmonic Schrodinger operator. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.12174 [pdf, other]

Expected Bipartite Matching Distance in A $D$-dimensional $L^p$ Space: Approximate Closed-form Formulas and Applications to Mobility Services

Authors: Shiyu Shen, Yuhui Zhai, Yanfeng Ouyang

Abstract: Although many well-known algorithms can solve the bipartite matching problem instance efficiently, it remains an open question how one could estimate the expected optimal matching distance for arbitrary numbers of randomly distributed vertices in a $D$-dimensional $L^p$ space (referred to as a random bipartite matching problem, or RBMP). This paper proposes an analytical model with closed-form for… ▽ More Although many well-known algorithms can solve the bipartite matching problem instance efficiently, it remains an open question how one could estimate the expected optimal matching distance for arbitrary numbers of randomly distributed vertices in a $D$-dimensional $L^p$ space (referred to as a random bipartite matching problem, or RBMP). This paper proposes an analytical model with closed-form formulas (without statistical curve-fitting) that estimate both the probability distribution and expectation of the optimal matching distance of RBMP. Simpler asymptotic approximations of the formulas are also developed for some special cases. A series of Monte-Carlo simulation experiments are conducted to verify the accuracy of the proposed formulas under varying conditions. These proposed distance estimates could be key for strategic performance evaluation and resource planning in a wide variety of application contexts. To illustrate their usefulness, we focus on mobility service systems where matches must be made between customers and service vehicles that are randomly distributed over time and space. We show how the proposed distance formulas provide a theoretical foundation for the empirically assumed Cobb-Douglas matching function for taxi systems, and reveal conditions under which the matching function can be suitable. Our formulas can also be easily incorporated into optimization models to select taxi operation strategies (e.g., whether newly arriving customers shall be instantly matched or pooled into a batch for matching). Agent-based simulations are conducted to verify the predicted performance of the demand pooling strategy for two types of e-hailing taxi systems. The results not only demonstrate the accuracy of the proposed model estimates under various service conditions, but also offer valuable managerial insights for service operators to optimize their strategies. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2405.14583 [pdf, ps, other]

Anosov vector fields and Fried sections

Authors: Jean-Michel Bismut, Shu Shen

Abstract: The purpose of this paper is to prove that if $Y$ is a compact manifold, if $Z$ is an Anosov vector field on $Y$, and if $F$ is a flat vector bundle, then there is a corresponding canonical nonzero section $τ_ν\left(i_{Z}\right)$ of the determinant line $ν=\mathrm{det} H\left(Y,F\right)$. In families, this section is $C^{1}$. When $F$ is flat on the total space of the corresponding fibration, our… ▽ More The purpose of this paper is to prove that if $Y$ is a compact manifold, if $Z$ is an Anosov vector field on $Y$, and if $F$ is a flat vector bundle, then there is a corresponding canonical nonzero section $τ_ν\left(i_{Z}\right)$ of the determinant line $ν=\mathrm{det} H\left(Y,F\right)$. In families, this section is $C^{1}$. When $F$ is flat on the total space of the corresponding fibration, our section is flat with respect to the Gauss-Manin connection on $ν$. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2403.13226 [pdf, ps, other]

Non-preservation of $α$-concavity for the porous medium equation in higher dimensions

Authors: Xi Sisi Shen, Pranay Talla

Abstract: In this short note, we prove that $α$-concavity of the pressure is not preserved for the porous medium equation in dimensions $n=3$ and higher for any $α\in [0,1]\backslash \{\frac{1}{2}\}$. Together with the result of Chau-Weinkove for $n=2$, this fully resolves an open problem posed by Vásquez on whether pressure concavity is preserved in general for the porous medium equation. In this short note, we prove that $α$-concavity of the pressure is not preserved for the porous medium equation in dimensions $n=3$ and higher for any $α\in [0,1]\backslash \{\frac{1}{2}\}$. Together with the result of Chau-Weinkove for $n=2$, this fully resolves an open problem posed by Vásquez on whether pressure concavity is preserved in general for the porous medium equation. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: 8 pages

arXiv:2401.10120 [pdf, other]

Binary Quantum Control Optimization with Uncertain Hamiltonians

Authors: Xinyu Fei, Lucas T. Brady, Jeffrey Larson, Sven Leyffer, Siqian Shen

Abstract: Optimizing the controls of quantum systems plays a crucial role in advancing quantum technologies. The time-varying noises in quantum systems and the widespread use of inhomogeneous quantum ensembles raise the need for high-quality quantum controls under uncertainties. In this paper, we consider a stochastic discrete optimization formulation of a binary optimal quantum control problem involving Ha… ▽ More Optimizing the controls of quantum systems plays a crucial role in advancing quantum technologies. The time-varying noises in quantum systems and the widespread use of inhomogeneous quantum ensembles raise the need for high-quality quantum controls under uncertainties. In this paper, we consider a stochastic discrete optimization formulation of a binary optimal quantum control problem involving Hamiltonians with predictable uncertainties. We propose a sample-based reformulation that optimizes both risk-neutral and risk-averse measurements of control policies, and solve these with two gradient-based algorithms using sum-up-rounding approaches. Furthermore, we discuss the differentiability of the objective function and prove upper bounds of the gaps between the optimal solutions to binary control problems and their continuous relaxations. We conduct numerical studies on various sized problem instances based of two applications of quantum pulse optimization; we evaluate different strategies to mitigate the impact of uncertainties in quantum systems. We demonstrate that the controls of our stochastic optimization model achieve significantly higher quality and robustness compared to the controls of a deterministic model. △ Less

Submitted 19 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

arXiv:2311.02383 [pdf, ps, other]

Crank equidistribution and $(k,j)$-overlined partitions

Authors: Adithya Chakravarthy, Joshua Males, Shuyang Shen

Abstract: In a paper published in 2023, Wagner introduced and studied Jacobi forms with complex multiplication, and gave several applications. One such application was in constructing a new doubly-infinite family of partition-theoretic objects, called $(k,j)$-coloured overpartitions and labelled by $\overline{p}_{k,j}$, and using the Jacobi forms to construct crank functions which explain the Ramanujan-type… ▽ More In a paper published in 2023, Wagner introduced and studied Jacobi forms with complex multiplication, and gave several applications. One such application was in constructing a new doubly-infinite family of partition-theoretic objects, called $(k,j)$-coloured overpartitions and labelled by $\overline{p}_{k,j}$, and using the Jacobi forms to construct crank functions which explain the Ramanujan-type congruences satisfied by $\overline{p}_{k,j}$. In this note, we give an asymptotic formula for the number of $(k,j)$-coloured overpartitions and prove that any crank constructed by Wagner is asymptotically equidistributed on arithmetic progressions, following several recent papers in the literature. △ Less

Submitted 4 November, 2023; originally announced November 2023.

Comments: 11 pages, 1 table. Comments welcome!

arXiv:2311.02008 [pdf, ps, other]

Sharp Global Well-posedness and Scattering of the Boltzmann Equation

Authors: Xuwen Chen, Shunlin Shen, Zhifei Zhang

Abstract: We consider the 3D Boltzmann equation for the Maxwellian particle and soft potential with an angular cutoff. We prove sharp global well-posedness with initial data small in the scaling-critical space. The solution also remains in $L^{1}$ if the initial datum is in $L^{1}$, even at such low regularity. The key to existence, uniqueness and regularity criteria is the new bilinear spacetime estimates… ▽ More We consider the 3D Boltzmann equation for the Maxwellian particle and soft potential with an angular cutoff. We prove sharp global well-posedness with initial data small in the scaling-critical space. The solution also remains in $L^{1}$ if the initial datum is in $L^{1}$, even at such low regularity. The key to existence, uniqueness and regularity criteria is the new bilinear spacetime estimates for the gain term, the proof of which is based on novel techniques from nonlinear dispersive PDEs including the atomic $U$-$V$ spaces, multi-linear frequency analysis, dispersive estimates, etc. To our knowledge, this is the first 3D sharp global result for the Boltzmann equation. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.05042 [pdf, ps, other]

Well/Ill-posedness of the Boltzmann Equation with Soft Potential

Authors: Xuwen Chen, Shunlin Shen, Zhifei Zhang

Abstract: We consider the Boltzmann equation with the soft potential and angular cutoff. Inspired by the methods from dispersive PDEs, we establish its sharp local well-posedness and ill-posedness in $H^{s}$ Sobolev space. We find the well/ill-posedness separation at regularity $s=\frac{d-1}{2}$, strictly $\frac{1}{2}$-derivative higher than the scaling-invariant index $s=\frac{d-2}{2}$, the usually expecte… ▽ More We consider the Boltzmann equation with the soft potential and angular cutoff. Inspired by the methods from dispersive PDEs, we establish its sharp local well-posedness and ill-posedness in $H^{s}$ Sobolev space. We find the well/ill-posedness separation at regularity $s=\frac{d-1}{2}$, strictly $\frac{1}{2}$-derivative higher than the scaling-invariant index $s=\frac{d-2}{2}$, the usually expected separation point. △ Less

Submitted 8 October, 2023; originally announced October 2023.

arXiv:2308.03132 [pdf, other]

Switching Time Optimization for Binary Quantum Optimal Control

Authors: Xinyu Fei, Lucas T. Brady, Jeffrey Larson, Sven Leyffer, Siqian Shen

Abstract: Quantum optimal control is a technique for controlling the evolution of a quantum system and has been applied to a wide range of problems in quantum physics. We study a binary quantum control optimization problem, where control decisions are binary-valued and the problem is solved in diverse quantum algorithms. In this paper, we utilize classical optimization and computing techniques to develop an… ▽ More Quantum optimal control is a technique for controlling the evolution of a quantum system and has been applied to a wide range of problems in quantum physics. We study a binary quantum control optimization problem, where control decisions are binary-valued and the problem is solved in diverse quantum algorithms. In this paper, we utilize classical optimization and computing techniques to develop an algorithmic framework that sequentially optimizes the number of control switches and the duration of each control interval on a continuous time horizon. Specifically, we first solve the continuous relaxation of the binary control problem based on time discretization and then use a heuristic to obtain a controller sequence with a penalty on the number of switches. Then, we formulate a switching time optimization model and apply sequential least-squares programming with accelerated time-evolution simulation to solve the model. We demonstrate that our computational framework can obtain binary controls with high-quality performance and also reduce computational time via solving a family of quantum control instances in various quantum physics applications. △ Less

Submitted 6 August, 2023; originally announced August 2023.

arXiv:2308.02687 [pdf, other]

A Multi-objective Mixed-integer Programming Approach for Supply Chain Disruption Response with Lead-Time Awareness

Authors: Juan-Alberto Estrada-Garcia, Mingjie Bi, Dawn M. Tilbury, Kira Barton, Siqian Shen

Abstract: Supply chain (SC) risk management is influenced by both spatial and temporal attributes of different entities (suppliers, retailers, and customers). Each entity has given capacity and lead time for processing and transporting products to downstream entities. Under disruptive events, lead time and capacities may vary, which affects the overall SC performance. There have been many studies on SC disr… ▽ More Supply chain (SC) risk management is influenced by both spatial and temporal attributes of different entities (suppliers, retailers, and customers). Each entity has given capacity and lead time for processing and transporting products to downstream entities. Under disruptive events, lead time and capacities may vary, which affects the overall SC performance. There have been many studies on SC disruption mitigation, but often without considering lead time and the magnitude of lateness. In this paper, we formulate a mixed-integer programming (MIP) model to optimize SC operations via a routing and scheduling approach, to model the delivery time of products at different entities as they flow throughout the SC network. We minimize a weighted sum of multiple objectives involving costs related to transportation, shortage, and delivery lateness. We also develop a discrete-event simulation framework to evaluate the performance of solutions to the MIP model under lead time uncertainty. Via extensive numerical studies, we show how the attributes of SC entities affect the performance, so that we can improve SC design and operations under various uncertainties. △ Less

Submitted 4 August, 2023; originally announced August 2023.

arXiv:2307.03665 [pdf, ps, other]

doi 10.1112/blms.12976

The continuity equation for Hermitian metrics: Calabi estimates, Chern scalar curvature and Oeljeklaus-Toma manifolds

Authors: Shuang Liang, Xi Sisi Shen, Kevin Smith

Abstract: We prove local Calabi and higher order estimates for solutions to the continuity equation introduced by La Nave-Tian and extended to Hermitian metrics by Sherman-Weinkove. We apply the estimates to show that on a compact complex manifold the Chern scalar curvature of a solution must blow up at a finite-time singularity. Additionally, starting from certain classes of initial data on Oeljeklaus-Toma… ▽ More We prove local Calabi and higher order estimates for solutions to the continuity equation introduced by La Nave-Tian and extended to Hermitian metrics by Sherman-Weinkove. We apply the estimates to show that on a compact complex manifold the Chern scalar curvature of a solution must blow up at a finite-time singularity. Additionally, starting from certain classes of initial data on Oeljeklaus-Toma manifolds we prove Gromov-Hausdorff and smooth convergence of the metric to a particular non-negative $(1,1)$-form as $t\to\infty$. △ Less

Submitted 7 July, 2023; originally announced July 2023.

Comments: 20 pages

arXiv:2304.03447 [pdf, ps, other]

On the mean-field and semiclassical limit from quantum $N$-body dynamics

Authors: Xuwen Chen, Shunlin Shen, Zhifei Zhang

Abstract: We study the mean-field and semiclassical limit of the quantum many-body dynamics with a repulsive $δ$-type potential $N^{3β}V(N^βx)$ and a Coulomb potential, which leads to a macroscopic fluid equation, the Euler-Poisson equation with pressure. We prove quantitative strong convergence of the quantum mass and momentum densities up to the first blow up time of the limiting equation. The main ingred… ▽ More We study the mean-field and semiclassical limit of the quantum many-body dynamics with a repulsive $δ$-type potential $N^{3β}V(N^βx)$ and a Coulomb potential, which leads to a macroscopic fluid equation, the Euler-Poisson equation with pressure. We prove quantitative strong convergence of the quantum mass and momentum densities up to the first blow up time of the limiting equation. The main ingredient is a functional inequality on the $δ$-type potential for the almost optimal case $β\in(0,1)$, for which we give an analysis of the singular correlation structure between particles. △ Less

Submitted 6 April, 2023; originally announced April 2023.

arXiv:2302.02817 [pdf, ps, other]

doi 10.1016/j.aim.2024.109616

Mirror symmetry for parabolic Higgs bundles via $p$-adic integration

Authors: Shiyu Shen

Abstract: Applying the technique of $p$-adic integration, we prove the topological mirror symmetry conjecture of Hausel-Thaddeus for the moduli spaces of (strongly) parabolic Higgs bundles for the structure groups $\text{SL}_n$ and $\text{PGL}_n$, building on previous work of Groechenig-Wyss-Ziegler on the non-parabolic case. We also prove the $E$-polynomial of the smooth moduli space of parabolic… ▽ More Applying the technique of $p$-adic integration, we prove the topological mirror symmetry conjecture of Hausel-Thaddeus for the moduli spaces of (strongly) parabolic Higgs bundles for the structure groups $\text{SL}_n$ and $\text{PGL}_n$, building on previous work of Groechenig-Wyss-Ziegler on the non-parabolic case. We also prove the $E$-polynomial of the smooth moduli space of parabolic $\text{GL}_n$-Higgs bundles is independent of the degree of the underlying vector bundles. △ Less

Submitted 15 April, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

Comments: 33 pages. Refereed version

arXiv:2301.05981 [pdf, other]

doi 10.1109/CDC51059.2022.9992450

Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures

Authors: Xian Yu, Siqian Shen

Abstract: Traditional reinforcement learning (RL) aims to maximize the expected total reward, while the risk of uncertain outcomes needs to be controlled to ensure reliable performance in a risk-averse setting. In this paper, we consider the problem of maximizing dynamic risk of a sequence of rewards in infinite-horizon Markov Decision Processes (MDPs). We adapt the Expected Conditional Risk Measures (ECRMs… ▽ More Traditional reinforcement learning (RL) aims to maximize the expected total reward, while the risk of uncertain outcomes needs to be controlled to ensure reliable performance in a risk-averse setting. In this paper, we consider the problem of maximizing dynamic risk of a sequence of rewards in infinite-horizon Markov Decision Processes (MDPs). We adapt the Expected Conditional Risk Measures (ECRMs) to the infinite-horizon risk-averse MDP and prove its time consistency. Using a convex combination of expectation and conditional value-at-risk (CVaR) as a special one-step conditional risk measure, we reformulate the risk-averse MDP as a risk-neutral counterpart with augmented action space and manipulation on the immediate rewards. We further prove that the related Bellman operator is a contraction map**, which guarantees the convergence of any value-based RL algorithms. Accordingly, we develop a risk-averse deep Q-learning framework, and our numerical studies based on two simple MDPs show that the risk-averse setting can reduce the variance and enhance robustness of the results. △ Less

Submitted 14 January, 2023; originally announced January 2023.

Journal ref: 2022 IEEE 61st Conference on Decision and Control (CDC), Cancun, Mexico, 2022, pp. 2307-2312

arXiv:2212.12088 [pdf, other]

Frequency-Secured Unit Commitment: Tight Approximation using Bernstein Polynomials

Authors: Bo Zhou, Ruiwei Jiang, Siqian Shen

Abstract: As we replace conventional synchronous generators with renewable energy, the frequency security of power systems is at higher risk. This calls for a more careful consideration of unit commitment (UC) and primary frequency response (PFR) reserves. This paper studies frequency-secured UC under significant wind power uncertainty. We coordinate the thermal units and wind farms to provide frequency sup… ▽ More As we replace conventional synchronous generators with renewable energy, the frequency security of power systems is at higher risk. This calls for a more careful consideration of unit commitment (UC) and primary frequency response (PFR) reserves. This paper studies frequency-secured UC under significant wind power uncertainty. We coordinate the thermal units and wind farms to provide frequency support, wherein we optimize the variable inverter droop factors of the wind farms for higher economy. In addition, we adopt distributionally robust chance constraints (DRCCs) to handle the wind power uncertainty. To depict the frequency dynamics, we incorporate a differential-algebraic equation (DAE) with the dead band into the UC model. Notably, we apply Bernstein polynomials to derive tight inner approximation of the DAE and obtain mixed-integer linear constraints, which can be computed in off-the-shelf solvers. Case studies demonstrate the tightness and effectiveness of the proposed method in guaranteeing frequency security. △ Less

Submitted 22 December, 2022; originally announced December 2022.

arXiv:2212.11775 [pdf, other]

An efficient peridynamics-based statistical multiscale method for fracture in composite structure with randomly distributed particles

Authors: Zihao Yang, Shaoqi Zheng, Shangkun Shen, Fei Han

Abstract: The fracture simulation of random particle reinforced composite structures remains a challenge. Current techniques either assumed a homogeneous model, ignoring the microstructure characteristics of composite structures, or considered a micro-mechanical model, involving intractable computational costs. This paper proposes a peridynamics-based statistical multiscale (PSM) framework to simulate the m… ▽ More The fracture simulation of random particle reinforced composite structures remains a challenge. Current techniques either assumed a homogeneous model, ignoring the microstructure characteristics of composite structures, or considered a micro-mechanical model, involving intractable computational costs. This paper proposes a peridynamics-based statistical multiscale (PSM) framework to simulate the macroscopic structure fracture with high efficiency. The heterogeneities of composites, including the shape, spatial distribution and volume fraction of particles, are characterized within the representative volume elements (RVEs), and their impact on structure failure are extracted as two types of peridynamic parameters, namely, statistical critical stretch and equivalent micromodulus. At the microscale level, a bond-based peridynamic (BPD) model with energy-based micromodulus correction technique is introduced to simulate the fracture in RVEs, and then the computational model of statistical critical stretch is established through micromechanical analysis. Moreover, based on the statistical homogenization approach, the computational model of effective elastic tensor is also established. Then, the equivalent micromodulus can be derived from the effective elastic tensor, according to the energy density equivalence between classical continuum mechanics (CCM) and BPD models. At the macroscale level, a macroscale BPD model with the statistical critical stretch and the equivalent micromodulus is constructed to simulate the fracture in the macroscopic homogenized structures. The algorithm framework of the PSM method is also described. Two- and three-dimensional numerical examples illustrate the validity, accuracy and efficiency of the proposed method. △ Less

Submitted 15 November, 2022; originally announced December 2022.

arXiv:2212.10695 [pdf, ps, other]

Complex K-theory of moduli spaces of Higgs bundles

Authors: Michael Groechenig, Shiyu Shen

Abstract: We establish an isomorphism of complex $K$-theory of the moduli space $\check{\mathcal{M}}$ of $``SL_n"$-Higgs bundles of degree $d$ and rank $n$ (in the sense of Hausel--Thaddeus) and twisted complex $K$-theory of the orbifold $\hat{\mathcal{M}}$ of $PGL_n$-Higgs bundles of degree $e$, where $(n,d)=(n,e)=1$. Along the way we prove the vanishing of torsion for $H^*(\check{\mathcal{M}})$ and certai… ▽ More We establish an isomorphism of complex $K$-theory of the moduli space $\check{\mathcal{M}}$ of $``SL_n"$-Higgs bundles of degree $d$ and rank $n$ (in the sense of Hausel--Thaddeus) and twisted complex $K$-theory of the orbifold $\hat{\mathcal{M}}$ of $PGL_n$-Higgs bundles of degree $e$, where $(n,d)=(n,e)=1$. Along the way we prove the vanishing of torsion for $H^*(\check{\mathcal{M}})$ and certain twisted complex $K$-theory groups of $\hat{\mathcal{M}}$. We also extend Arinkin's autoduality of compactified Jacobian to a derived equivalence between $SL_n$ and $PGL_n$-Hitchin systems over the elliptic locus. In the appendix we develop a formalism of $G$-sheaves of spectra, generalising equivariant homotopy theory to a relative setting. △ Less

Submitted 20 December, 2022; originally announced December 2022.

Comments: 49 pages, one appendix, comments welcome

arXiv:2212.00867 [pdf, other]

Pre-averaging fractional processes contaminated by noise, with an application to turbulence

Authors: David Chen, Yu Cheng, Carsten Chong, Pierre Gentine, Wangdong Jia, Bryce Monier, Shiyang Shen

Abstract: In this article, we consider the problem of estimating fractional processes based on noisy high-frequency data. Generalizing the idea of pre-averaging to a fractional setting, we exhibit a sequence of consistent estimators for the unknown parameters of interest by proving a law of large numbers for associated variation functionals. In contrast to the semimartingale setting, the optimal window size… ▽ More In this article, we consider the problem of estimating fractional processes based on noisy high-frequency data. Generalizing the idea of pre-averaging to a fractional setting, we exhibit a sequence of consistent estimators for the unknown parameters of interest by proving a law of large numbers for associated variation functionals. In contrast to the semimartingale setting, the optimal window size for pre-averaging depends on the unknown roughness parameter of the underlying process. We evaluate the performance of our estimators in a simulation study and use them to empirically verify Kolmogorov's 2/3-law in turbulence data contaminated by instrument noise. △ Less

Submitted 1 December, 2022; originally announced December 2022.

MSC Class: 60F25; 60G22; 62M09; 76M35; 62G05; 76F55

arXiv:2209.05473 [pdf, ps, other]

The continuity equation on Hopf and Inoue surfaces

Authors: Xi Sisi Shen, Kevin Smith

Abstract: We study the continuity equation of La Nave-Tian, extended to the Hermitian setting by Sherman-Weinkove, on Hopf and Inoue surfaces. We prove a priori estimates for solutions in both cases, and Gromov-Hausdorff convergence of Inoue surfaces to a circle. We study the continuity equation of La Nave-Tian, extended to the Hermitian setting by Sherman-Weinkove, on Hopf and Inoue surfaces. We prove a priori estimates for solutions in both cases, and Gromov-Hausdorff convergence of Inoue surfaces to a circle. △ Less

Submitted 12 September, 2022; originally announced September 2022.

Comments: 15 pages

arXiv:2208.00026 [pdf, ps, other]

Canonical almost-Kähler metrics dual to general plane-fronted wave Lorentzian metrics

Authors: Mehdi Lejmi, Xi Sisi Shen

Abstract: In the compact setting, Aazami and Ream \cite{Aazami:2022th} proved that Riemannian metrics dual to a class of Lorentzian metrics, called (compact) general plane-fronted waves, are almost-Kähler. In this note, we explain how to construct extremal and second-Chern-Einstein non-Kähler almost-Kähler metrics dual to those general plane-fronted waves. In the compact setting, Aazami and Ream \cite{Aazami:2022th} proved that Riemannian metrics dual to a class of Lorentzian metrics, called (compact) general plane-fronted waves, are almost-Kähler. In this note, we explain how to construct extremal and second-Chern-Einstein non-Kähler almost-Kähler metrics dual to those general plane-fronted waves. △ Less

Submitted 29 July, 2022; originally announced August 2022.

Comments: 12 pages

MSC Class: 53C55

arXiv:2202.09062 [pdf, ps, other]

A cylindrical coordinates approach concerning internal waves for the Antarctic Circumpolar Current

Authors: Lili Fan, Shuge Shen

Abstract: In this paper, we devise a new exact and partially explicit solution to the governing equations of geophysical fluid dynamics for an inviscid and incompressible azimuth flow with a discontinuous density distribution and subjected to forcing terms in terms of cylindrical coordinates. The obtained solution represents a steady, purely azimuthal, stratified flow with an associated free surface and an… ▽ More In this paper, we devise a new exact and partially explicit solution to the governing equations of geophysical fluid dynamics for an inviscid and incompressible azimuth flow with a discontinuous density distribution and subjected to forcing terms in terms of cylindrical coordinates. The obtained solution represents a steady, purely azimuthal, stratified flow with an associated free surface and an interface that is suitable for describing the Antarctic Circumpolar Current. Resorting to a functional analysis, we demonstrate that the relationship between the imposed pressure at the free surface and the resulting surface deformation is well-defined and show that the continuity of the pressure along the interface generates an equation that describes implicitly the shape of the interface. Moreover, a particular example is considered to show that the interface can be determined explicitly. Finally, we derive an infinite regularity about the interface and obtain the expected monotonicity properties between the surface pressure and its distortion. △ Less

Submitted 18 February, 2022; originally announced February 2022.

arXiv:2112.14897 [pdf, ps, other]

The derivation of the compressible Euler equation from quantum many-body dynamics

Authors: Xuwen Chen, Shunlin Shen, Jiahao Wu, Zhifei Zhang

Abstract: We study the three dimensional many-particle quantum dynamics in mean-field setting. We forge together the hierarchy method and the modulated energy method. We prove rigorously that the compressible Euler equation is the limit as the particle number tends to infinity and the Planck's constant tends to zero. We establish strong and quantitative microscopic to macroscopic convergence of mass and mom… ▽ More We study the three dimensional many-particle quantum dynamics in mean-field setting. We forge together the hierarchy method and the modulated energy method. We prove rigorously that the compressible Euler equation is the limit as the particle number tends to infinity and the Planck's constant tends to zero. We establish strong and quantitative microscopic to macroscopic convergence of mass and momentum densities up to the 1st blow up time of the limiting Euler equation. We justify that the macroscopic pressure emerges from the space-time averages of microscopic interactions, which are in fact, Strichartz-type bounds. We have hence found a physical meaning for Strichartz type bounds which were first raised by Klainerman and Machedon in this context. △ Less

Submitted 29 December, 2021; originally announced December 2021.

arXiv:2107.05785 [pdf, ps, other]

Optimal Hardy inequalities associated with multipolar Schrödinger operators

Authors: Yongyang **, Li Tang, Can Ye, Shoufeng Shen

Abstract: We proved some optimal Hardy inequalities in RNwhich is closely related to multipolar Schrödinger operators with mean-value type potentials, these sharp inequalities imply some multipolar type Heisenberg inequalities. We also obtained someimproved multipolar Hardy inequalities on bounded domains, moreover, we got the range of the best Hardy constant for a specific Hardy inequality. We proved some optimal Hardy inequalities in RNwhich is closely related to multipolar Schrödinger operators with mean-value type potentials, these sharp inequalities imply some multipolar type Heisenberg inequalities. We also obtained someimproved multipolar Hardy inequalities on bounded domains, moreover, we got the range of the best Hardy constant for a specific Hardy inequality. △ Less

Submitted 12 July, 2021; originally announced July 2021.

MSC Class: 26D10; 42B37

arXiv:2107.01958 [pdf, ps, other]

Bismut hypoelliptic Laplacians for manifolds with boundaries

Authors: Francis Nier, Shu Shen

Abstract: Boundary conditions for Bismut's hypoelliptic Laplacian which naturally correspond to Dirichlet and Neumann boundary conditions for Hodge Laplacians are considered. Those are related with specific boundary conditions for the differential and its various adjoints. Once the closed realizations of those operators are well understood, the commutation of the differential with the resolvent of the hypoe… ▽ More Boundary conditions for Bismut's hypoelliptic Laplacian which naturally correspond to Dirichlet and Neumann boundary conditions for Hodge Laplacians are considered. Those are related with specific boundary conditions for the differential and its various adjoints. Once the closed realizations of those operators are well understood, the commutation of the differential with the resolvent of the hypoelliptic Laplacian is checked with other properties like the PT-symmetry, which are important for the spectral analysis. △ Less

Submitted 9 September, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

arXiv:2105.11005 [pdf, other]

On the Value of Multistage Risk-Averse Stochastic Facility Location With or Without Prioritization

Authors: Xian Yu, Siqian Shen

Abstract: We consider a multiperiod stochastic capacitated facility location problem under uncertain demand and budget in each period. Using a scenario tree representation of the uncertainties, we formulate a multistage stochastic integer program to dynamically locate facilities in each period and compare it with a two-stage approach that determines the facility locations up front. In the multistage model,… ▽ More We consider a multiperiod stochastic capacitated facility location problem under uncertain demand and budget in each period. Using a scenario tree representation of the uncertainties, we formulate a multistage stochastic integer program to dynamically locate facilities in each period and compare it with a two-stage approach that determines the facility locations up front. In the multistage model, in each stage, a decision maker optimizes facility locations and recourse flows from open facilities to demand sites, to minimize certain risk measures of the cost associated with current facility location and shipment decisions. When the budget is also uncertain, a popular modeling framework is to prioritize the candidate sites. In the two-stage model, the priority list is decided in advance and fixed through all periods, while in the multistage model, the priority list can change adaptively. In each period, the decision maker follows the priority list to open facilities according to the realized budget, and optimizes recourse flows given the realized demand. Using expected conditional risk measures (ECRMs), we derive tight lower bounds for the gaps between the optimal objective values of risk-averse multistage models and their two-stage counterparts in both settings with and without prioritization. Moreover, we propose two approximation algorithms to efficiently solve risk-averse two-stage and multistage models without prioritization, which are asymptotically optimal under an expanding market assumption. We also design a set of super-valid inequalities for risk-averse two-stage and multistage stochastic programs with prioritization to reduce the computational time. We conduct numerical studies using both randomly generated and real-world instances with diverse sizes, to demonstrate the tightness of the analytical bounds and efficacy of the approximation algorithms and prioritization cuts. △ Less

Submitted 16 July, 2022; v1 submitted 23 May, 2021; originally announced May 2021.

arXiv:2104.06592 [pdf, ps, other]

doi 10.1007/s40818-022-00130-9

The unconditional uniqueness for the energy-supercritical NLS

Authors: Xuwen Chen, Shunlin Shen, Zhifei Zhang

Abstract: We consider the cubic and quintic nonlinear Schrödinger equations (NLS) under the $\mathbb{R}^{d}$ and $\mathbb{T}^{d}$ energy-supercritical setting. Via a newly developed unified scheme, we prove the unconditional uniqueness for solutions to NLS at critical regularity for all dimensions. Thus, together with [18,19], the unconditional uniqueness problems for $H^{1}$-critical and $H^{1}$-supercriti… ▽ More We consider the cubic and quintic nonlinear Schrödinger equations (NLS) under the $\mathbb{R}^{d}$ and $\mathbb{T}^{d}$ energy-supercritical setting. Via a newly developed unified scheme, we prove the unconditional uniqueness for solutions to NLS at critical regularity for all dimensions. Thus, together with [18,19], the unconditional uniqueness problems for $H^{1}$-critical and $H^{1}$-supercritical cubic and quintic NLS are completely and uniformly resolved at critical regularity for these domains. One application of our theorem is to prove that defocusing blowup solutions of the type in [54] is the only possible $C([0,T);\dot{H}^{s_{c}})$ solution if exist in these domains. △ Less

Submitted 27 June, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

Comments: Revised per the referee reports

Journal ref: Annals of PDE 8 (2022), no. 2, Paper No. 14, 1-82

arXiv:2103.04266 [pdf, other]

Resource Distribution Under Spatiotemporal Uncertainty of Disease Spread: Stochastic versus Robust Approaches

Authors: Beste Basciftci, Xian Yu, Siqian Shen

Abstract: We consider the problem of optimizing locations of distribution centers (DCs) and plans for distributing resources such as test kits and vaccines, under spatiotemporal uncertainties of disease spread and demand for the resources. We aim to balance the operational cost (including costs of deploying facilities, ship**, and storage) and quality of service (reflected by demand coverage), while ensur… ▽ More We consider the problem of optimizing locations of distribution centers (DCs) and plans for distributing resources such as test kits and vaccines, under spatiotemporal uncertainties of disease spread and demand for the resources. We aim to balance the operational cost (including costs of deploying facilities, ship**, and storage) and quality of service (reflected by demand coverage), while ensuring equity and fairness of resource distribution across multiple populations. We compare a sample-based stochastic programming (SP) approach with a distributionally robust optimization (DRO) approach using a moment-based ambiguity set. Numerical studies are conducted on instances of distributing COVID-19 vaccines in the United States and test kits, to compare SP and DRO models with a deterministic formulation using estimated demand and with the current resource distribution plans implemented in the US. We demonstrate the results over distinct phases of the pandemic to estimate the cost and speed of resource distribution depending on scale and coverage, and show the ``demand-driven'' properties of the SP and DRO solutions. Our results further indicate that if the worst-case unmet demand is prioritized, then the DRO approach is preferred despite of its higher overall cost. Nevertheless, the SP approach can provide an intermediate plan under budgetary restrictions without significant compromises in demand coverage. △ Less

Submitted 16 July, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

arXiv:2103.04259 [pdf, other]

Sequential Competitive Facility Location: Exact and Approximate Algorithms

Authors: Mingyao Qi, Ruiwei Jiang, Siqian Shen

Abstract: We study a competitive facility location problem (CFLP), where two firms sequentially open new facilities within their budgets, in order to maximize their market shares of demand that follows a probabilistic choice model. This process is a Stackelberg game and admits a bilevel mixed-integer nonlinear program (MINLP) formulation. We derive an equivalent, single-level MINLP reformulation and exploit… ▽ More We study a competitive facility location problem (CFLP), where two firms sequentially open new facilities within their budgets, in order to maximize their market shares of demand that follows a probabilistic choice model. This process is a Stackelberg game and admits a bilevel mixed-integer nonlinear program (MINLP) formulation. We derive an equivalent, single-level MINLP reformulation and exploit the problem structures to derive two valid inequalities, based on submodularity and concave overestimation, respectively. We use the two valid inequalities in a branch-and-cut algorithm to find globally optimal solutions. Then, we propose an approximation algorithm to find good-quality solutions with a constant approximation guarantee. We develop several extensions by considering general facility-opening costs, outside competitors, as well as diverse facility-planning decisions, and discuss solution approaches for each extension. We conduct numerical studies to demonstrate that the exact algorithm significantly accelerates the computation of CFLP on large-sized instances that have not been solved optimally or even heuristically by existing methods, and the approximation algorithm can quickly find high-quality solutions. We derive managerial insights based on sensitivity analysis of different settings that affect customers' probabilistic choices and the ensuing demand. △ Less

Submitted 17 July, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

arXiv:2102.08129 [pdf, other]

doi 10.1007/978-3-031-27234-9

Coherent sheaves, superconnections, and RRG

Authors: Jean-Michel Bismut, Shu Shen, Zhaoting Wei

Abstract: Given a compact complex manifold, the purpose of this paper is to construct the Chern character for coherent sheaves with values in Bott-Chern cohomology, and to prove a corresponding Riemann-Roch-Grothendieck formula. Our paper is based on a fundamental construction of Block. Given a compact complex manifold, the purpose of this paper is to construct the Chern character for coherent sheaves with values in Bott-Chern cohomology, and to prove a corresponding Riemann-Roch-Grothendieck formula. Our paper is based on a fundamental construction of Block. △ Less

Submitted 1 May, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

Comments: 161 pages, 1 figure. In version 2, references to earlier work have been added

MSC Class: 18G80; 19L10; 35H10

Journal ref: Progress in Mathematics 347, Birkhauser/Springer, Cham, 2023

arXiv:2011.09683 [pdf, ps, other]

A Chern-Calabi flow on Hermitian manifolds

Authors: Xi Sisi Shen

Abstract: We study an analogue of the Calabi flow in the non-Kähler setting for compact Hermitian manifolds with vanishing first Bott-Chern class. We prove a priori estimates for the evolving metric along the flow given a uniform bound on the Chern scalar curvature. If the Chern scalar curvature remains uniformly bounded for all time, we show that the flow converges smoothly to the unique Chern-Ricci-flat m… ▽ More We study an analogue of the Calabi flow in the non-Kähler setting for compact Hermitian manifolds with vanishing first Bott-Chern class. We prove a priori estimates for the evolving metric along the flow given a uniform bound on the Chern scalar curvature. If the Chern scalar curvature remains uniformly bounded for all time, we show that the flow converges smoothly to the unique Chern-Ricci-flat metric in the $\partial\bar{\partial}$-class of the initial metric. △ Less

Submitted 2 February, 2022; v1 submitted 19 November, 2020; originally announced November 2020.

Comments: 15 pages. Final version to appear in Journal of Geometric Analysis

arXiv:2010.15063 [pdf, other]

Combinatorial-Probabilistic Trade-Off: Community Properties Test in the Stochastic Block Models

Authors: Shuting Shen, Junwei Lu

Abstract: In this paper, we propose an inferential framework testing the general community combinatorial properties of the stochastic block model. Instead of estimating the community assignments, we aim to test the hypothesis on whether a certain community property is satisfied. For instance, we propose to test whether a given set of nodes belong to the same community or whether different network communitie… ▽ More In this paper, we propose an inferential framework testing the general community combinatorial properties of the stochastic block model. Instead of estimating the community assignments, we aim to test the hypothesis on whether a certain community property is satisfied. For instance, we propose to test whether a given set of nodes belong to the same community or whether different network communities have the same size. We propose a general inference framework that can be applied to all symmetric community properties. To ease the challenges caused by the combinatorial nature of communities properties, we develop a novel shadowing bootstrap testing method. By utilizing the symmetry, our method can find a shadowing representative of the true assignment and the number of assignments to be tested in the alternative can be largely reduced. In theory, we introduce a combinatorial distance between two community classes and show a combinatorial-probabilistic trade-off phenomenon in the community properties test. Our test is honest as long as the product of combinatorial distance between two communities and the probabilistic distance between two assignment probabilities is sufficiently large. On the other hand, we shows that such trade-off also exists in the information-theoretic lower bound of the community property test. We also implement numerical experiments on both the synthetic data and the protein interaction application to show the validity of our method. △ Less

Submitted 29 October, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

MSC Class: 05C80

arXiv:2010.10630 [pdf, other]

An Optimization-and-Simulation Framework for Redesigning University Campus Bus System with Social Distancing

Authors: Gongyu Chen, Xinyu Fei, Huiwen Jia, Xian Yu, Siqian Shen

Abstract: The outbreak of coronavirus disease 2019 (COVID-19) has led to significant challenges for schools, workplaces and communities to return to operations during the pandemic, requiring policymakers to balance individuals' safety and operational efficiency. In this paper, we present our work using mixed-integer programming and simulation for redesigning routes and bus schedules for University of Michig… ▽ More The outbreak of coronavirus disease 2019 (COVID-19) has led to significant challenges for schools, workplaces and communities to return to operations during the pandemic, requiring policymakers to balance individuals' safety and operational efficiency. In this paper, we present our work using mixed-integer programming and simulation for redesigning routes and bus schedules for University of Michigan (UM)'s campus bus system during the COVID-19 pandemic. We propose a hub-and-spoke design and utilize real data of student activities to identify hub locations and bus stops to be used in the new routes. Using the same total number of buses to operate, each new bus route has 50\% or fewer seats being used and takes maximumly 15 minutes, to reduce disease transmission through expiratory aerosol. We sample a variety of scenarios that cover variations of peak demand, social-distancing requirements, and break-down buses, to demonstrate the system resiliency of the new routes and schedules via simulation. The new bus routes are implemented and used by all UM campuses during the academic year 2020-2021, to ensure social distancing and short travel time. Our approach can be generalized to redesign public transit systems with social distancing requirement during the pandemic to reduce passengers' infection risk. △ Less

Submitted 29 September, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

arXiv:2009.04897 [pdf, ps, other]

doi 10.1007/s00220-021-04113-y

Analytic torsion, dynamical zeta function, and the Fried conjecture for admissible twists

Authors: Shu Shen

Abstract: We show an equality between the analytic torsion and the absolute value at the zero point of the Ruelle dynamical zeta function on a closed odd dimensional locally symmetric space twisted by an acyclic flat vector bundle obtained by the restriction of a representation of the underlying Lie group. This generalises author's previous result for unitarily flat vector bundles, and the results of Bröcke… ▽ More We show an equality between the analytic torsion and the absolute value at the zero point of the Ruelle dynamical zeta function on a closed odd dimensional locally symmetric space twisted by an acyclic flat vector bundle obtained by the restriction of a representation of the underlying Lie group. This generalises author's previous result for unitarily flat vector bundles, and the results of Bröcker, Müller, and Wotzke on closed hyperbolic manifolds. △ Less

Submitted 9 September, 2020; originally announced September 2020.

Comments: 40 pages. arXiv admin note: text overlap with arXiv:2009.03427

arXiv:2009.03427 [pdf, other]

Complex valued analytic torsion and dynamical zeta function on locally symmetric spaces

Authors: Shu Shen

Abstract: We show that the Ruelle dynamical zeta function on a closed odd dimensional locally symmetric space twisted by an arbitrary flat vector bundle has a meromorphic extension to the whole complex plane and that its leading term in the Laurent series at the zero point is related to the regularised determinant of the flat Laplacian of Cappell-Miller. When the flat vector bundle is close to an acyclic an… ▽ More We show that the Ruelle dynamical zeta function on a closed odd dimensional locally symmetric space twisted by an arbitrary flat vector bundle has a meromorphic extension to the whole complex plane and that its leading term in the Laurent series at the zero point is related to the regularised determinant of the flat Laplacian of Cappell-Miller. When the flat vector bundle is close to an acyclic and unitary one, we show that the dynamical zeta function is regular at the zero point and that its value is equal to the complex valued analytic torsion of Cappell-Miller. This generalises author's previous results for unitarily flat vector bundles as well as Müller and Spilioti's results on hyperbolic manifolds. △ Less

Submitted 7 September, 2020; originally announced September 2020.

Comments: 51 pages, 1 figure

arXiv:2007.08043 [pdf, ps, other]

doi 10.1088/1361-6544/ac21a5

Dynamical Zeta Functions in the Nonorientable Case

Authors: Yonah Borns-Weil, Shu Shen

Abstract: We use a simple argument to extend the microlocal proofs of meromorphicity of dynamical zeta functions to the nonorientable case. In the special case of geodesic flow on a connected non-orientable negatively curved closed surface, we compute the order of vanishing of the zeta function at the zero point to be the first Betti number of the surface. We use a simple argument to extend the microlocal proofs of meromorphicity of dynamical zeta functions to the nonorientable case. In the special case of geodesic flow on a connected non-orientable negatively curved closed surface, we compute the order of vanishing of the zeta function at the zero point to be the first Betti number of the surface. △ Less

Submitted 1 September, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

Comments: 15 pages

arXiv:2006.06377 [pdf, ps, other]

STL-SGD: Speeding Up Local SGD with Stagewise Communication Period

Authors: Shuheng Shen, Yifei Cheng, **gchang Liu, Linli Xu

Abstract: Distributed parallel stochastic gradient descent algorithms are workhorses for large scale machine learning tasks. Among them, local stochastic gradient descent (Local SGD) has attracted significant attention due to its low communication complexity. Previous studies prove that the communication complexity of Local SGD with a fixed or an adaptive communication period is in the order of… ▽ More Distributed parallel stochastic gradient descent algorithms are workhorses for large scale machine learning tasks. Among them, local stochastic gradient descent (Local SGD) has attracted significant attention due to its low communication complexity. Previous studies prove that the communication complexity of Local SGD with a fixed or an adaptive communication period is in the order of $O (N^{\frac{3}{2}} T^{\frac{1}{2}})$ and $O (N^{\frac{3}{4}} T^{\frac{3}{4}})$ when the data distributions on clients are identical (IID) or otherwise (Non-IID), where $N$ is the number of clients and $T$ is the number of iterations. In this paper, to accelerate the convergence by reducing the communication complexity, we propose \textit{ST}agewise \textit{L}ocal \textit{SGD} (STL-SGD), which increases the communication period gradually along with decreasing learning rate. We prove that STL-SGD can keep the same convergence rate and linear speedup as mini-batch SGD. In addition, as the benefit of increasing the communication period, when the objective is strongly convex or satisfies the Polyak-Łojasiewicz condition, the communication complexity of STL-SGD is $O (N \log{T})$ and $O (N^{\frac{1}{2}} T^{\frac{1}{2}})$ for the IID case and the Non-IID case respectively, achieving significant improvements over Local SGD. Experiments on both convex and non-convex problems demonstrate the superior performance of STL-SGD. △ Less

Submitted 15 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

Comments: Accepted by AAAI2021

arXiv:2006.00719 [pdf, other]

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning

Authors: Zhewei Yao, Amir Gholami, Sheng Shen, Mustafa Mustafa, Kurt Keutzer, Michael W. Mahoney

Abstract: We introduce ADAHESSIAN, a second order stochastic optimization algorithm which dynamically incorporates the curvature of the loss function via ADAptive estimates of the HESSIAN. Second order algorithms are among the most powerful optimization algorithms with superior convergence properties as compared to first order methods such as SGD and Adam. The main disadvantage of traditional second order m… ▽ More We introduce ADAHESSIAN, a second order stochastic optimization algorithm which dynamically incorporates the curvature of the loss function via ADAptive estimates of the HESSIAN. Second order algorithms are among the most powerful optimization algorithms with superior convergence properties as compared to first order methods such as SGD and Adam. The main disadvantage of traditional second order methods is their heavier per iteration computation and poor accuracy as compared to first order methods. To address these, we incorporate several novel approaches in ADAHESSIAN, including: (i) a fast Hutchinson based method to approximate the curvature matrix with low computational overhead; (ii) a root-mean-square exponential moving average to smooth out variations of the Hessian diagonal across different iterations; and (iii) a block diagonal averaging to reduce the variance of Hessian diagonal elements. We show that ADAHESSIAN achieves new state-of-the-art results by a large margin as compared to other adaptive optimization methods, including variants of Adam. In particular, we perform extensive tests on CV, NLP, and recommendation system tasks and find that ADAHESSIAN: (i) achieves 1.80%/1.45% higher accuracy on ResNets20/32 on Cifar10, and 5.55% higher accuracy on ImageNet as compared to Adam; (ii) outperforms AdamW for transformers by 0.13/0.33 BLEU score on IWSLT14/WMT14 and 2.7/1.0 PPL on PTB/Wikitext-103; (iii) outperforms AdamW for SqueezeBert by 0.41 points on GLUE; and (iv) achieves 0.032% better score than Adagrad for DLRM on the Criteo Ad Kaggle dataset. Importantly, we show that the cost per iteration of ADAHESSIAN is comparable to first order methods, and that it exhibits robustness towards its hyperparameters. △ Less

Submitted 28 April, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

Journal ref: AAAI 2021

arXiv:2005.03287 [pdf, ps, other]

On the unique solution of the generalized absolute value equation

Authors: Shi-Liang Wu, Shu-Qian Shen

Abstract: In this paper, some useful necessary and sufficient conditions for the unique solution of the generalized absolute value equation (GAVE) $Ax-B|x|=b$ with $A, B\in \mathbb{R}^{n\times n}$ from the optimization field are first presented, which cover the fundamental theorem for the unique solution of the linear system $Ax=b$ with $A\in \mathbb{R}^{n\times n}$. Not only that, some new sufficient condi… ▽ More In this paper, some useful necessary and sufficient conditions for the unique solution of the generalized absolute value equation (GAVE) $Ax-B|x|=b$ with $A, B\in \mathbb{R}^{n\times n}$ from the optimization field are first presented, which cover the fundamental theorem for the unique solution of the linear system $Ax=b$ with $A\in \mathbb{R}^{n\times n}$. Not only that, some new sufficient conditions for the unique solution of the GAVE are obtained, which are weaker than the previous published works. △ Less

Submitted 7 May, 2020; originally announced May 2020.

Comments: 8 pages, submitted

arXiv:2004.00042 [pdf, ps, other]

The Kähler-Ricci flow, holomorphic vector fields and Fano bundles

Authors: Xi Sisi Shen

Abstract: We study the behavior of the Kähler-Ricci flow on compact manifolds develo** finite-time singularities, in particular, when the flow contracts exceptional divisors or collapses the Fano fibers of a holomorphic fiber bundle. We present a technique using holomorphic vector fields to prove estimates related to the work of Song-Weinkove and Fu-Zhang. We study the behavior of the Kähler-Ricci flow on compact manifolds develo** finite-time singularities, in particular, when the flow contracts exceptional divisors or collapses the Fano fibers of a holomorphic fiber bundle. We present a technique using holomorphic vector fields to prove estimates related to the work of Song-Weinkove and Fu-Zhang. △ Less

Submitted 31 March, 2020; originally announced April 2020.

Comments: 17 pages

arXiv:2003.09693 [pdf, ps, other]

doi 10.1016/j.jfa.2021.108934

The rigorous derivation of the $\mathbb{T}^{2}$ focusing cubic NLS from 3D

Authors: Shunlin Shen

Abstract: We derive rigorously the 2D periodic focusing cubic NLS as the mean-field limit of the 3D focusing quantum many-body dynamics describing a dilute Bose gas with periodic boundary condition in the $x$-direction and a well of infinite-depth in the $z$-direction. Physical experiments for these systems are scarce. We find that, to fulfill the empirical requirement for observing NLS dynamics in experime… ▽ More We derive rigorously the 2D periodic focusing cubic NLS as the mean-field limit of the 3D focusing quantum many-body dynamics describing a dilute Bose gas with periodic boundary condition in the $x$-direction and a well of infinite-depth in the $z$-direction. Physical experiments for these systems are scarce. We find that, to fulfill the empirical requirement for observing NLS dynamics in experiments, namely, the kinetic energy dominates the potential energy, it is necessary to impose an extra restriction on the system parameters. This restriction gives rises to an unusual coupling constant. △ Less

Submitted 25 June, 2020; v1 submitted 21 March, 2020; originally announced March 2020.

Comments: v2, improved result

Journal ref: Journal of Functional Analysis, 280 (2021), no. 8, 108934, 72pp

arXiv:2002.12518 [pdf, ps, other]

Multistage Distributionally Robust Mixed-Integer Programming with Decision-Dependent Moment-Based Ambiguity Sets

Authors: Xian Yu, Siqian Shen

Abstract: We study multistage distributionally robust mixed-integer programs under endogenous uncertainty, where the probability distribution of stage-wise uncertainty depends on the decisions made in previous stages. We first consider two ambiguity sets defined by decision-dependent bounds on the first and second moments of uncertain parameters and by mean and covariance matrix that exactly match decision-… ▽ More We study multistage distributionally robust mixed-integer programs under endogenous uncertainty, where the probability distribution of stage-wise uncertainty depends on the decisions made in previous stages. We first consider two ambiguity sets defined by decision-dependent bounds on the first and second moments of uncertain parameters and by mean and covariance matrix that exactly match decision-dependent empirical ones, respectively. For both sets, we show that the subproblem in each stage can be recast as a mixed-integer linear program (MILP). Moreover, we extend the general moment-based ambiguity set in (Delage and Ye, 2010) to the multistage decision-dependent setting, and derive mixed-integer semidefinite programming (MISDP) reformulations of stage-wise subproblems. We develop methods for attaining lower and upper bounds of the optimal objective value of the multistage MISDPs, and approximate them using a series of MILPs. We deploy the Stochastic Dual Dynamic integer Programming (SDDiP) method for solving the problem under the three ambiguity sets with risk-neutral or risk-averse objective functions, and conduct numerical studies on multistage facility-location instances having diverse sizes under different parameter and uncertainty settings. Our results show that the SDDiP quickly finds optimal solutions for moderate-sized instances under the first two ambiguity sets, and also finds good approximate bounds for the multistage MISDPs derived under the third ambiguity set. We also demonstrate the efficacy of incorporating decision-dependent distributional ambiguity in multistage decision-making processes. △ Less

Submitted 24 September, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

MSC Class: 90C11; 90C15; 90C22

arXiv:2001.00857 [pdf, ps, other]

Sharp Hardy-Rellich Type Inequalities Associated with Dunkl Operators

Authors: Li Tang, Haiting Chen, Shoufeng Shen, Yongyang **

Abstract: In this paper, we obtained the Dunkl analogy of classical Lp Hardy inequality for $p > N + 2γ$ with sharp constant $\left(\frac{p-N-2γ}{p}\right)^{p}$, where $2γ$ is the degree of weight function associated with Dunkl operators, and $L^p$ Hardy inequalities with distant function in some G-invariant domains. Moreover we proved two sharp Hardy-Rellich type inequalities for Dunkl operators. In this paper, we obtained the Dunkl analogy of classical Lp Hardy inequality for $p > N + 2γ$ with sharp constant $\left(\frac{p-N-2γ}{p}\right)^{p}$, where $2γ$ is the degree of weight function associated with Dunkl operators, and $L^p$ Hardy inequalities with distant function in some G-invariant domains. Moreover we proved two sharp Hardy-Rellich type inequalities for Dunkl operators. △ Less

Submitted 15 January, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

arXiv:1912.12844 [pdf, other]

Variance Reduced Local SGD with Lower Communication Complexity

Authors: Xianfeng Liang, Shuheng Shen, **gchang Liu, Zhen Pan, Enhong Chen, Yifei Cheng

Abstract: To accelerate the training of machine learning models, distributed stochastic gradient descent (SGD) and its variants have been widely adopted, which apply multiple workers in parallel to speed up training. Among them, Local SGD has gained much attention due to its lower communication cost. Nevertheless, when the data distribution on workers is non-identical, Local SGD requires… ▽ More To accelerate the training of machine learning models, distributed stochastic gradient descent (SGD) and its variants have been widely adopted, which apply multiple workers in parallel to speed up training. Among them, Local SGD has gained much attention due to its lower communication cost. Nevertheless, when the data distribution on workers is non-identical, Local SGD requires $O(T^{\frac{3}{4}} N^{\frac{3}{4}})$ communications to maintain its \emph{linear iteration speedup} property, where $T$ is the total number of iterations and $N$ is the number of workers. In this paper, we propose Variance Reduced Local SGD (VRL-SGD) to further reduce the communication complexity. Benefiting from eliminating the dependency on the gradient variance among workers, we theoretically prove that VRL-SGD achieves a \emph{linear iteration speedup} with a lower communication complexity $O(T^{\frac{1}{2}} N^{\frac{3}{2}})$ even if workers access non-identical datasets. We conduct experiments on three machine learning tasks, and the experimental results demonstrate that VRL-SGD performs impressively better than Local SGD when the data among workers are quite diverse. △ Less

Submitted 30 December, 2019; originally announced December 2019.

Comments: 25 pages, 6 figures. The paper presents a novel variance reduction algorithm for Local SGD

arXiv:1912.05577 [pdf, ps, other]

Distributionally Robust Facility Location Problem under Decision-dependent Stochastic Demand

Authors: Beste Basciftci, Shabbir Ahmed, Siqian Shen

Abstract: Facility location decisions significantly impact customer behavior and consequently the resulting demand in a wide range of businesses. Furthermore, sequentially realized uncertain demand enforces strategically determining locations under partial information. To address these issues, we study a facility location problem where the distribution of customer demand is dependent on location decisions.… ▽ More Facility location decisions significantly impact customer behavior and consequently the resulting demand in a wide range of businesses. Furthermore, sequentially realized uncertain demand enforces strategically determining locations under partial information. To address these issues, we study a facility location problem where the distribution of customer demand is dependent on location decisions. We represent moment information of stochastic demand as a piecewise linear function of facility-location decisions. Then, we propose a decision-dependent distributionally robust optimization model, and develop its exact mixed-integer linear programming reformulation. We further derive valid inequalities to strengthen the formulation. We conduct an extensive computational study, in which we compare our model with the existing (decision-independent) stochastic and robust models. Our results demonstrate superior performance of the proposed approach with remarkable improvement in profit and quality of service by extensively testing problem characteristics, in addition to computational speed-ups due to the formulation enhancements. These results draw attention to the need of considering the impact of location decisions on customer demand within this strategic-level planning problem. △ Less

Submitted 7 January, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

arXiv:1910.11731 [pdf, ps, other]

doi 10.1112/S0010437X22007412

Geometric orbital integrals and the center of the envelo** algebra

Authors: Jean-Michel Bismut, Shu Shen

Abstract: The purpose of this paper is to extend the explicit geometric evaluation of semisimple orbital integrals for smooth kernels for the Casimir operator obtained by the first author to the case of kernels for arbitrary elements in the center of the envelo** algebra. The purpose of this paper is to extend the explicit geometric evaluation of semisimple orbital integrals for smooth kernels for the Casimir operator obtained by the first author to the case of kernels for arbitrary elements in the center of the envelo** algebra. △ Less

Submitted 16 February, 2021; v1 submitted 25 October, 2019; originally announced October 2019.

Journal ref: Compositio Mathematica, 158(6), 1189-1253, 2022

arXiv:1909.13445 [pdf, ps, other]

Estimates for metrics of constant Chern scalar curvature

Authors: Xi Sisi Shen

Abstract: We prove a priori estimates for constant Chern scalar curvature metrics on a compact complex manifold conditional on an upper bound on the entropy, extending a recent result by Chen-Cheng in the Kähler setting. We prove a priori estimates for constant Chern scalar curvature metrics on a compact complex manifold conditional on an upper bound on the entropy, extending a recent result by Chen-Cheng in the Kähler setting. △ Less

Submitted 3 July, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

Comments: 19 pages; minor revisions; to appear in Math. Res. Lett

arXiv:1906.12043 [pdf, other]

Faster Distributed Deep Net Training: Computation and Communication Decoupled Stochastic Gradient Descent

Authors: Shuheng Shen, Linli Xu, **gchang Liu, Xianfeng Liang, Yifei Cheng

Abstract: With the increase in the amount of data and the expansion of model scale, distributed parallel training becomes an important and successful technique to address the optimization challenges. Nevertheless, although distributed stochastic gradient descent (SGD) algorithms can achieve a linear iteration speedup, they are limited significantly in practice by the communication cost, making it difficult… ▽ More With the increase in the amount of data and the expansion of model scale, distributed parallel training becomes an important and successful technique to address the optimization challenges. Nevertheless, although distributed stochastic gradient descent (SGD) algorithms can achieve a linear iteration speedup, they are limited significantly in practice by the communication cost, making it difficult to achieve a linear time speedup. In this paper, we propose a computation and communication decoupled stochastic gradient descent (CoCoD-SGD) algorithm to run computation and communication in parallel to reduce the communication cost. We prove that CoCoD-SGD has a linear iteration speedup with respect to the total computation capability of the hardware resources. In addition, it has a lower communication complexity and better time speedup comparing with traditional distributed SGD algorithms. Experiments on deep neural network training demonstrate the significant improvements of CoCoD-SGD: when training ResNet18 and VGG16 with 16 Geforce GTX 1080Ti GPUs, CoCoD-SGD is up to 2-3$\times$ faster than traditional synchronous SGD. △ Less

Submitted 20 September, 2019; v1 submitted 28 June, 2019; originally announced June 2019.

Comments: IJCAI2019, 20 pages, 21 figures

arXiv:1906.05994 [pdf, other]

Benders Cut Classification via Support Vector Machines for Solving Two-stage Stochastic Programs

Authors: Huiwen Jia, Siqian Shen

Abstract: We consider Benders decomposition for solving two-stage stochastic programs with complete recourse based on finite samples of the uncertain parameters. We define the Benders cuts binding at the final optimal solution or the ones significantly improving bounds over iterations as valuable cuts. We propose a learning-enhanced Benders decomposition (LearnBD) algorithm, which adds a cut classification… ▽ More We consider Benders decomposition for solving two-stage stochastic programs with complete recourse based on finite samples of the uncertain parameters. We define the Benders cuts binding at the final optimal solution or the ones significantly improving bounds over iterations as valuable cuts. We propose a learning-enhanced Benders decomposition (LearnBD) algorithm, which adds a cut classification step in each iteration to selectively generate cuts that are more likely to be valuable cuts. The LearnBD algorithm includes two phases: (i) sampling cuts and collecting information from training problems and (ii) solving testing problems with a support vector machine (SVM) cut classifier. We run the LearnBD algorithm on instances of capacitated facility location and multi-commodity network design under uncertain demand. Our results show that SVM cut classifier works effectively for identifying valuable cuts, and the LearnBD algorithm reduces the total solving time of all instances for different problems with various sizes and complexities. △ Less

Submitted 14 October, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

arXiv:1906.05988 [pdf, other]

Distributionally Robust Partially Observable Markov Decision Process with Moment-based Ambiguity

Authors: Hideaki Nakao, Ruiwei Jiang, Siqian Shen

Abstract: We consider a distributionally robust Partially Observable Markov Decision Process (DR-POMDP), where the distribution of the transition-observation probabilities is unknown at the beginning of each decision period, but their realizations can be inferred using side information at the end of each period after an action being taken. We build an ambiguity set of the joint distribution using bounded mo… ▽ More We consider a distributionally robust Partially Observable Markov Decision Process (DR-POMDP), where the distribution of the transition-observation probabilities is unknown at the beginning of each decision period, but their realizations can be inferred using side information at the end of each period after an action being taken. We build an ambiguity set of the joint distribution using bounded moments via conic constraints and seek an optimal policy to maximize the worst-case (minimum) reward for any distribution in the set. We show that the value function of DR-POMDP is piecewise linear convex with respect to the belief state and propose a heuristic search value iteration method for obtaining lower and upper bounds of the value function. We conduct numerical studies and demonstrate the computational performance of our approach via testing instances of a dynamic epidemic control problem. Our results show that DR-POMDP can produce more robust policies under misspecified distributions of transition-observation probabilities as compared to POMDP, but has less costly solutions than robust POMDP. The DR-POMDP policies are also insensitive to varying parameter in the ambiguity set and to noise added to the true transition-observation probability values obtained at the end of each decision period. △ Less

Submitted 7 December, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

MSC Class: 90C39; 90C40; 90C47; 93E20

arXiv:1811.06396 [pdf, other]

Asynchronous Stochastic Composition Optimization with Variance Reduction

Authors: Shuheng Shen, Linli Xu, **gchang Liu, Junliang Guo, Qing Ling

Abstract: Composition optimization has drawn a lot of attention in a wide variety of machine learning domains from risk management to reinforcement learning. Existing methods solving the composition optimization problem often work in a sequential and single-machine manner, which limits their applications in large-scale problems. To address this issue, this paper proposes two asynchronous parallel variance r… ▽ More Composition optimization has drawn a lot of attention in a wide variety of machine learning domains from risk management to reinforcement learning. Existing methods solving the composition optimization problem often work in a sequential and single-machine manner, which limits their applications in large-scale problems. To address this issue, this paper proposes two asynchronous parallel variance reduced stochastic compositional gradient (AsyVRSC) algorithms that are suitable to handle large-scale data sets. The two algorithms are AsyVRSC-Shared for the shared-memory architecture and AsyVRSC-Distributed for the master-worker architecture. The embedded variance reduction techniques enable the algorithms to achieve linear convergence rates. Furthermore, AsyVRSC-Shared and AsyVRSC-Distributed enjoy provable linear speedup, when the time delays are bounded by the data dimensionality or the sparsity ratio of the partial gradients, respectively. Extensive experiments are conducted to verify the effectiveness of the proposed algorithms. △ Less

Submitted 15 November, 2018; originally announced November 2018.

Comments: 30 pages, 19 figures

Showing 1–50 of 69 results for author: Shen, S