Search | arXiv e-print repository

arXiv:2406.19605 [pdf, other]

A Customized Augmented Lagrangian Method for Block-Structured Integer Programming

Authors: Rui Wang, Chuwen Zhang, Shanwen Pu, Jianjun Gao, Zaiwen Wen

Abstract: Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between… ▽ More Integer programming with block structures has received considerable attention recently and is widely used in many practical applications such as train timetabling and vehicle routing problems. It is known to be NP-hard due to the presence of integer variables. We define a novel augmented Lagrangian function by directly penalizing the inequality constraints and establish the strong duality between the primal problem and the augmented Lagrangian dual problem. Then, a customized augmented Lagrangian method is proposed to address the block-structures. In particular, the minimization of the augmented Lagrangian function is decomposed into multiple subproblems by decoupling the linking constraints and these subproblems can be efficiently solved using the block coordinate descent method. We also establish the convergence property of the proposed method. To make the algorithm more practical, we further introduce several refinement techniques to identify high-quality feasible solutions. Numerical experiments on a few interesting scenarios show that our proposed algorithm often achieves a satisfactory solution and is quite effective. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.11410 [pdf, other]

HARE: HumAn pRiors, a key to small language model Efficiency

Authors: Lingyun Zhang, Bin **, Gaojian Ge, Lunhui Liu, Xuewen Shen, Mingyong Wu, Houqian Zhang, Yongneng Jiang, Shiqi Chen, Shi Pu

Abstract: Human priors play a crucial role in efficiently utilizing data in deep learning. However, with the development of large language models (LLMs), there is an increasing emphasis on scaling both model size and data volume, which often diminishes the importance of human priors in data construction. Influenced by these trends, existing Small Language Models (SLMs) mainly rely on web-scraped large-scale… ▽ More Human priors play a crucial role in efficiently utilizing data in deep learning. However, with the development of large language models (LLMs), there is an increasing emphasis on scaling both model size and data volume, which often diminishes the importance of human priors in data construction. Influenced by these trends, existing Small Language Models (SLMs) mainly rely on web-scraped large-scale training data, neglecting the proper incorporation of human priors. This oversight limits the training efficiency of language models in resource-constrained settings. In this paper, we propose a principle to leverage human priors for data construction. This principle emphasizes achieving high-performance SLMs by training on a concise dataset that accommodates both semantic diversity and data quality consistency, while avoiding benchmark data leakage. Following this principle, we train an SLM named HARE-1.1B. Extensive experiments on large-scale benchmark datasets demonstrate that HARE-1.1B performs favorably against state-of-the-art SLMs, validating the effectiveness of the proposed principle. Additionally, this provides new insights into efficient language model training in resource-constrained environments from the view of human priors. △ Less

Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

arXiv:2405.16491 [pdf, other]

Nuclear deformation effects in photoproduction of $ρ$ mesons in ultraperipheral isobaric collisions

Authors: Shuo Lin, **-Yu Hu, Hao-Jie Xu, Shi Pu, Qun Wang

Abstract: We have investigated the $ρ^{0}$ meson photoproduction in ultraperipheral isobaric collisions between $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at $\sqrt{s_{NN}}=200$ GeV, employing the dipole model with the equivalent photon approximation. By implementing the Woods-Saxon distribution to represent the nuclear mass density, which is derived from… ▽ More We have investigated the $ρ^{0}$ meson photoproduction in ultraperipheral isobaric collisions between $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at $\sqrt{s_{NN}}=200$ GeV, employing the dipole model with the equivalent photon approximation. By implementing the Woods-Saxon distribution to represent the nuclear mass density, which is derived from density functional theory with an inclusion of nuclear deformation effects, we have calculated the transverse momentum $q_{T}$ spectra in isobaric collisions. We observe the characteristic dip behavior in these spectra, indicative of diffraction phenomena in high-energy physics. We notice that the deformation effects cause a nearly linear increase with $q_{T}^{2}$ for $q_{T}^{2}\lesssim0.015$ $\textrm{GeV}^{2}$, aligning with experimental observations. We offer a simple explanation for the observed behavior in these spectra by introducing the effective width of the nuclei in the thickness function. We also extend our discussion on the $ρ^{0}$ meson photoproduction with the targets $^{63}\textrm{Cu}$,$^{197}\textrm{Au}$, and $^{238}\textrm{U}$. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 9 pages, 5 figures

arXiv:2405.03105 [pdf, ps, other]

Thermodynamic stability in relativistic viscous and spin hydrodynamics

Authors: Xiang Ren, Chen Yang, Dong-Lin Wang, Shi Pu

Abstract: We have applied thermodynamic stability analysis to derive the stability and causality conditions for conventional relativistic viscous hydrodynamics and spin hydrodynamics. We obtain the thermodynamic stability conditions for second-order relativistic hydrodynamics with shear and bulk viscous tensors, finding them identical to those derived from linear mode analysis. We then derive the thermodyna… ▽ More We have applied thermodynamic stability analysis to derive the stability and causality conditions for conventional relativistic viscous hydrodynamics and spin hydrodynamics. We obtain the thermodynamic stability conditions for second-order relativistic hydrodynamics with shear and bulk viscous tensors, finding them identical to those derived from linear mode analysis. We then derive the thermodynamic stability conditions for minimal causal extended second-order spin hydrodynamics in canonical form, both with and without viscous tensors. Without viscous tensors, the constraints from thermodynamic stability exactly match those from linear mode analysis. In the presence of viscous tensors, the thermodynamic stability imposes more stringent constraints than those obtained from linear mode analysis. Our results suggest that conditions derived from thermodynamic stability analysis can guarantee both causality and stability in linear mode analysis. △ Less

Submitted 5 May, 2024; originally announced May 2024.

Comments: 30 pages

arXiv:2404.05454 [pdf, other]

B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data

Authors: Runze You, Shi Pu

Abstract: This paper considers the distributed learning problem where a group of agents cooperatively minimizes the summation of their local cost functions based on peer-to-peer communication. Particularly, we propose a highly efficient algorithm, termed ``B-ary Tree Push-Pull'' (BTPP), that employs two B-ary spanning trees for distributing the information related to the parameters and stochastic gradients… ▽ More This paper considers the distributed learning problem where a group of agents cooperatively minimizes the summation of their local cost functions based on peer-to-peer communication. Particularly, we propose a highly efficient algorithm, termed ``B-ary Tree Push-Pull'' (BTPP), that employs two B-ary spanning trees for distributing the information related to the parameters and stochastic gradients across the network. The simple method is efficient in communication since each agent interacts with at most $(B+1)$ neighbors per iteration. More importantly, BTPP achieves linear speedup for smooth nonconvex objective functions with only $\tilde{O}(n)$ transient iterations, significantly outperforming the state-of-the-art results to the best of our knowledge. △ Less

Submitted 6 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

arXiv:2403.05172 [pdf, other]

Learning Expressive And Generalizable Motion Features For Face Forgery Detection

Authors: **gyi Zhang, Peng Zhang, **g**g Wang, Di Xie, Shiliang Pu

Abstract: Previous face forgery detection methods mainly focus on appearance features, which may be easily attacked by sophisticated manipulation. Considering the majority of current face manipulation methods generate fake faces based on a single frame, which do not take frame consistency and coordination into consideration, artifacts on frame sequences are more effective for face forgery detection. However… ▽ More Previous face forgery detection methods mainly focus on appearance features, which may be easily attacked by sophisticated manipulation. Considering the majority of current face manipulation methods generate fake faces based on a single frame, which do not take frame consistency and coordination into consideration, artifacts on frame sequences are more effective for face forgery detection. However, current sequence-based face forgery detection methods use general video classification networks directly, which discard the special and discriminative motion information for face manipulation detection. To this end, we propose an effective sequence-based forgery detection framework based on an existing video classification method. To make the motion features more expressive for manipulation detection, we propose an alternative motion consistency block instead of the original motion features module. To make the learned features more generalizable, we propose an auxiliary anomaly detection block. With these two specially designed improvements, we make a general video classification network achieve promising results on three popular face forgery datasets. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: Accepted to ICASSP 2023

arXiv:2403.05117 [pdf, other]

Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning

Authors: Hang Du, Xuejun Yan, **g**g Wang, Di Xie, Shiliang Pu

Abstract: Recently, arbitrary-scale point cloud upsampling mechanism became increasingly popular due to its efficiency and convenience for practical applications. To achieve this, most previous approaches formulate it as a problem of surface approximation and employ point-based networks to learn surface representations. However, learning surfaces from sparse point clouds is more challenging, and thus they o… ▽ More Recently, arbitrary-scale point cloud upsampling mechanism became increasingly popular due to its efficiency and convenience for practical applications. To achieve this, most previous approaches formulate it as a problem of surface approximation and employ point-based networks to learn surface representations. However, learning surfaces from sparse point clouds is more challenging, and thus they often suffer from the low-fidelity geometry approximation. To address it, we propose an arbitrary-scale Point cloud Upsampling framework using Voxel-based Network (\textbf{PU-VoxelNet}). Thanks to the completeness and regularity inherited from the voxel representation, voxel-based networks are capable of providing predefined grid space to approximate 3D surface, and an arbitrary number of points can be reconstructed according to the predicted density distribution within each grid cell. However, we investigate the inaccurate grid sampling caused by imprecise density predictions. To address this issue, a density-guided grid resampling method is developed to generate high-fidelity points while effectively avoiding sampling outliers. Further, to improve the fine-grained details, we present an auxiliary training supervision to enforce the latent geometric consistency among local surface patches. Extensive experiments indicate the proposed approach outperforms the state-of-the-art approaches not only in terms of fixed upsampling rates but also for arbitrary-scale upsampling. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: Accepted to AAAI 2024. The source code is available at https://github.com/hikvision-research/3DVision

arXiv:2403.02806 [pdf, other]

Atacama Large Aperture Submillimeter Telescope (AtLAST) Science: Surveying the distant Universe

Authors: Eelco van Kampen, Tom Bakx, Carlos De Breuck, Chian-Chou Chen, Helmut Dannerbauer, Benjamin Magnelli, Francisco Miguel Montenegro-Montes, Teppei Okumura, Sy-Yun Pu, Matus Rybak, Amelie Saintonge, Claudia Cicone, Evanthia Hatziminaoglou, Juliette Hilhorst, Pamela Klaassen, Minju Lee, Christopher C. Lovell, Andreas Lundgren, Luca Di Mascolo, Tony Mroczkowski, Laura Sommovigo, Mark Booth, Martin A. Cordiner, Rob Ivison, Doug Johnstone , et al. (5 additional authors not shown)

Abstract: During the most active period of star formation in galaxies, which occurs in the redshift range 1<z<3, strong bursts of star formation result in significant quantities of dust, which obscures new stars being formed as their UV/optical light is absorbed and then re-emitted in the infrared, which redshifts into the mm/sub-mm bands for these early times. To get a complete picture of the high-z galaxy… ▽ More During the most active period of star formation in galaxies, which occurs in the redshift range 1<z<3, strong bursts of star formation result in significant quantities of dust, which obscures new stars being formed as their UV/optical light is absorbed and then re-emitted in the infrared, which redshifts into the mm/sub-mm bands for these early times. To get a complete picture of the high-z galaxy population, we need to survey a large patch of the sky in the sub-mm with sufficient angular resolution to resolve all galaxies, but we also need the depth to fully sample their cosmic evolution, and therefore obtain their redshifts using direct mm spectroscopy with a very wide frequency coverage. This requires a large single-dish sub-mm telescope with fast map** speeds at high sensitivity and angular resolution, a large bandwidth with good spectral resolution and multiplex spectroscopic capabilities. The proposed 50-m Atacama Large Aperture Submillimeter Telescope (AtLAST) will deliver these specifications. We discuss how AtLAST allows us to study the whole population of high-z galaxies, including the dusty star-forming ones which can only be detected and studied in the sub-mm, and obtain a wealth of information for each of these up to z~7: gas content, cooling budget, star formation rate, dust mass, and dust temperature. We present worked examples of surveys that AtLAST can perform, both deep and wide, and also focused on galaxies in proto-clusters. In addition we show how such surveys with AtLAST can measure the growth rate and the Hubble constant with high accuracy, and demonstrate the power of the line-intensity map** method in the mm/sub-mm wavebands to constrain the cosmic expansion history at high redshifts, as good examples of what can uniquely be done by AtLAST in this research field. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 17 pages, 10 figures, submitted to Open Research Europe as part of the AtLAST collection

arXiv:2403.00258 [pdf, ps, other]

"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

Authors: Lingyu Gu, Yongqi Du, Yuan Zhang, Di Xie, Shiliang Pu, Robert C. Qiu, Zhenyu Liao

Abstract: Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to address this key limitation, efforts have been devoted to the compression (e.g., sparsification and/or quantization) of these large-scale machine learning models, s… ▽ More Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to address this key limitation, efforts have been devoted to the compression (e.g., sparsification and/or quantization) of these large-scale machine learning models, so that they can be deployed on low-power IoT devices. In this paper, building upon recent advances in neural tangent kernel (NTK) and random matrix theory (RMT), we provide a novel compression approach to wide and fully-connected \emph{deep} neural nets. Specifically, we demonstrate that in the high-dimensional regime where the number of data points $n$ and their dimension $p$ are both large, and under a Gaussian mixture model for the data, there exists \emph{asymptotic spectral equivalence} between the NTK matrices for a large family of DNN models. This theoretical result enables "lossless" compression of a given DNN to be performed, in the sense that the compressed network yields asymptotically the same NTK as the original (dense and unquantized) network, with its weights and activations taking values \emph{only} in $\{ 0, \pm 1 \}$ up to a scaling. Experiments on both synthetic and real-world data are conducted to support the advantages of the proposed compression scheme, with code available at \url{https://github.com/Model-Compression/Lossless_Compression}. △ Less

Submitted 29 February, 2024; originally announced March 2024.

Comments: 32 pages, 4 figures, and 2 tables. Fixing typos in Theorems 1 and 2 from NeurIPS 2022 proceeding (https://proceedings.neurips.cc/paper_files/paper/2022/hash/185087ea328b4f03ea8fd0c8aa96f747-Abstract-Conference.html)

arXiv:2402.18627 [pdf, other]

Topologically protected emergent Fermi surface in an Abrikosov vortex lattice

Authors: Songyang Pu, Jay D. Sau, Rui-Xing Zhang

Abstract: We show that a three-dimensional (3D) fully gapped type-II superconductor can feature emergent in-gap Fermi surfaces of Caroli-de Gennes Matricon (CdGM) quasiparticles in the presence of an Abrikosov vortex lattice. In particular, these CdGM Fermi surfaces manifest in the emergent 3D band structure enabled by the intervortex tunneling physics, and their stability is guaranteed by a $\mathbb{Z}_2$… ▽ More We show that a three-dimensional (3D) fully gapped type-II superconductor can feature emergent in-gap Fermi surfaces of Caroli-de Gennes Matricon (CdGM) quasiparticles in the presence of an Abrikosov vortex lattice. In particular, these CdGM Fermi surfaces manifest in the emergent 3D band structure enabled by the intervortex tunneling physics, and their stability is guaranteed by a $\mathbb{Z}_2$ topological index. By develo** an effective analytical theory, we find that each vortex line carrying a 1D nodal dispersion is a sufficient condition for the vortex lattice to form CdGM Fermi surfaces. Following this prediction, in-gap CdGM Fermi surfaces are numerically confirmed in a microscopic vortex-lattice simulation of a superconducting Dirac semimetal with an $s$-wave spin-singlet pairing, which is directly applicable to a large class of type-II superconductors such as LiFeAs. Remarkably, the CdGM Fermi surfaces persist even when the normal state is deformed to a doped insulator of trivial band topology. Our work establishes the vortex lattice as a new experimentally feasible control knob for emergent topological phenomena in conventional superconductors. △ Less

Submitted 17 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: 6 + 9 pages, 3 + 6 figures

arXiv:2402.17294 [pdf, ps, other]

Advancing Continuous Distribution Generation: An Exponentiated Odds Ratio Generator Approach

Authors: Xinyu Chen, Yuanqi Xie, Achraf Cohen, Shusen Pu

Abstract: This paper presents a new methodology for generating continuous statistical distributions, integrating the exponentiated odds ratio within the framework of survival analysis. This new method enhances the flexibility and adaptability of distribution models to effectively address the complexities inherent in contemporary datasets. The core of this advancement is illustrated by introducing a particul… ▽ More This paper presents a new methodology for generating continuous statistical distributions, integrating the exponentiated odds ratio within the framework of survival analysis. This new method enhances the flexibility and adaptability of distribution models to effectively address the complexities inherent in contemporary datasets. The core of this advancement is illustrated by introducing a particular subfamily, the "Type-2 Gumbel Weibull-G Family of Distributions." We provide a comprehensive analysis of the mathematical properties of these distributions, encompassing statistical properties such as density functions, moments, hazard rate and quantile functions, Rényi entropy, order statistics, and the concept of stochastic ordering. To establish the robustness of our approach, we apply five distinct methods for parameter estimation. The practical applicability of the Type-2 Gumbel Weibull-G distributions is further supported through the analysis of three real-world datasets. These empirical applications illustrate the exceptional statistical precision of our distributions compared to existing models, thereby reinforcing their significant value in both theoretical and practical statistical applications. △ Less

Submitted 27 February, 2024; originally announced February 2024.

MSC Class: 62E99; 60E05

arXiv:2402.09714 [pdf, other]

An Accelerated Distributed Stochastic Gradient Method with Momentum

Authors: Kun Huang, Shi Pu, Angelia Nedić

Abstract: In this paper, we introduce an accelerated distributed stochastic gradient method with momentum for solving the distributed optimization problem, where a group of $n$ agents collaboratively minimize the average of the local objective functions over a connected network. The method, termed ``Distributed Stochastic Momentum Tracking (DSMT)'', is a single-loop algorithm that utilizes the momentum trac… ▽ More In this paper, we introduce an accelerated distributed stochastic gradient method with momentum for solving the distributed optimization problem, where a group of $n$ agents collaboratively minimize the average of the local objective functions over a connected network. The method, termed ``Distributed Stochastic Momentum Tracking (DSMT)'', is a single-loop algorithm that utilizes the momentum tracking technique as well as the Loopless Chebyshev Acceleration (LCA) method. We show that DSMT can asymptotically achieve comparable convergence rates as centralized stochastic gradient descent (SGD) method under a general variance condition regarding the stochastic gradients. Moreover, the number of iterations (transient times) required for DSMT to achieve such rates behaves as $\mathcal{O}(n^{5/3}/(1-λ))$ for minimizing general smooth objective functions, and $\mathcal{O}(\sqrt{n/(1-λ)})$ under the Polyak-Łojasiewicz (PL) condition. Here, the term $1-λ$ denotes the spectral gap of the mixing matrix related to the underlying network topology. Notably, the obtained results do not rely on multiple inter-node communications or stochastic gradient accumulation per iteration, and the transient times are the shortest under the setting to the best of our knowledge. △ Less

Submitted 18 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: 32 pages, 5 figures

arXiv:2402.04540 [pdf, other]

Spin polarization in relativistic heavy-ion collisions

Authors: Francesco Becattini, Matteo Buzzegoli, Takafumi Niida, Shi Pu, Ai-Hong Tang, Qun Wang

Abstract: Polarization has opened a new physics chapter in relativistic heavy-ion collisions. Since the first prediction and experimental observation of global spin polarization, a lot of progress has been made in understanding its features, both at experimental and theoretical level. In this paper, we give an overview on the recent advances in this field. The covered topics include a review of measurements… ▽ More Polarization has opened a new physics chapter in relativistic heavy-ion collisions. Since the first prediction and experimental observation of global spin polarization, a lot of progress has been made in understanding its features, both at experimental and theoretical level. In this paper, we give an overview on the recent advances in this field. The covered topics include a review of measurements of global and local spin polarization of hyperons and the global spin alignment of vector mesons. We account for the basic theoretical framework to describe spin polarization in a relativistic fluid such as the Quark Gluon Plasma, including statistical quantum field theory and local thermodynamic equilibrium, spin hydrodynamics, relativistic kinetic theory with spin and coalescence models. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: RevTeX 4, 41 pages, 12 figures, review article as a book chapter for QGP6

arXiv:2402.03672 [pdf, other]

The spin alignment of rho mesons in a pion gas

Authors: Yi-Liang Yin, Wen-Bo Dong, **-Yi Pang, Shi Pu, Qun Wang

Abstract: We study the spin alignment of neutral rho mesons in a pion gas using spin kinetic or Boltzmann equations. The $ρππ$ coupling is given by the chiral effective theory. The collision terms at the leading and next-to-leading order in spin Boltzmann equations are derived. The evolution of the spin density matrix of the neutral rho meson is simulated with different initial conditions. The numerical res… ▽ More We study the spin alignment of neutral rho mesons in a pion gas using spin kinetic or Boltzmann equations. The $ρππ$ coupling is given by the chiral effective theory. The collision terms at the leading and next-to-leading order in spin Boltzmann equations are derived. The evolution of the spin density matrix of the neutral rho meson is simulated with different initial conditions. The numerical results show that the interaction of pions and neutral rho mesons creates very small spin alignment in the central rapidity region if there is no rho meson in the system at the initial time. Such a small spin alignment in the central rapidity region will decay rapidly toward zero in later time. If there are rho mesons with a sizable spin alignment at the initial time the spin alignment will also decrease rapidly. We also considered the effect on $ρ_{00}$ from the elliptic flow of pions in the blast wave model. With vanishing spin alignment at the initial time, the deviation of $ρ_{00}$ from 1/3 is positive but very small. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: RevTex 4, 17 pages, 12 figures

arXiv:2401.17352 [pdf, other]

doi 10.1103/PhysRevLett.132.236503

Microscopic Model for Fractional Quantum Hall Nematics

Authors: Songyang Pu, Ajit C. Balram, Joseph Taylor, Eduardo Fradkin, Zlatko Papić

Abstract: Geometric fluctuations of the density mode in a fractional quantum Hall (FQH) state can give rise to a nematic FQH phase, a topological state with a spontaneously broken rotational symmetry. While experiments on FQH states in the second Landau level have reported signatures of putative FQH nematics in anisotropic transport, a realistic model for this state has been lacking. We show that the standa… ▽ More Geometric fluctuations of the density mode in a fractional quantum Hall (FQH) state can give rise to a nematic FQH phase, a topological state with a spontaneously broken rotational symmetry. While experiments on FQH states in the second Landau level have reported signatures of putative FQH nematics in anisotropic transport, a realistic model for this state has been lacking. We show that the standard model of particles in the lowest Landau level interacting via the Coulomb potential realizes the FQH nematic transition, which is reached by a progressive reduction of the strength of the shortest-range Haldane pseudopotential. Using exact diagonalization and variational wave functions, we demonstrate that the FQH nematic transition occurs when the system's neutral gap closes in the long-wavelength limit while the charge gap remains open. We confirm the symmetry-breaking nature of the transition by demonstrating the existence of a "circular moat" potential in the manifold of states with broken rotational symmetry, while its geometric character is revealed through the strong fluctuations of the nematic susceptibility and Hall viscosity. △ Less

Submitted 9 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: Four figures in main text with supplementary information

Journal ref: Phys. Rev. Lett. 132, 236503 (2024)

arXiv:2401.09703 [pdf, other]

Fast Updating Truncated SVD for Representation Learning with Sparse Matrices

Authors: Haoran Deng, Yang Yang, Jiahe Li, Cheng Chen, Weihao Jiang, Shiliang Pu

Abstract: Updating a truncated Singular Value Decomposition (SVD) is crucial in representation learning, especially when dealing with large-scale data matrices that continuously evolve in practical scenarios. Aligning SVD-based models with fast-paced updates becomes increasingly important. Existing methods for updating truncated SVDs employ Rayleigh-Ritz projection procedures, where projection matrices are… ▽ More Updating a truncated Singular Value Decomposition (SVD) is crucial in representation learning, especially when dealing with large-scale data matrices that continuously evolve in practical scenarios. Aligning SVD-based models with fast-paced updates becomes increasingly important. Existing methods for updating truncated SVDs employ Rayleigh-Ritz projection procedures, where projection matrices are augmented based on original singular vectors. However, these methods suffer from inefficiency due to the densification of the update matrix and the application of the projection to all singular vectors. To address these limitations, we introduce a novel method for dynamically approximating the truncated SVD of a sparse and temporally evolving matrix. Our approach leverages sparsity in the orthogonalization process of augmented matrices and utilizes an extended decomposition to independently store projections in the column space of singular vectors. Numerical experiments demonstrate a remarkable efficiency improvement of an order of magnitude compared to previous methods. Remarkably, this improvement is achieved while maintaining a comparable precision to existing approaches. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2312.09979 [pdf, other]

LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

Authors: Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuan**g Huang

Abstract: Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks. Increasing instruction data substantially is a direct solution to align the model with a broader range of downstream tasks or notably improve its performance on a specific task. However, we find that large-scale increase… ▽ More Supervised fine-tuning (SFT) is a crucial step for large language models (LLMs), enabling them to align with human instructions and enhance their capabilities in downstream tasks. Increasing instruction data substantially is a direct solution to align the model with a broader range of downstream tasks or notably improve its performance on a specific task. However, we find that large-scale increases in instruction data can damage the world knowledge previously stored in LLMs. To address this challenge, we propose LoRAMoE, a novelty framework that introduces several low-rank adapters (LoRA) and integrates them by using a router network, like a plugin version of Mixture of Experts (MoE). It freezes the backbone model and forces a portion of LoRAs to focus on leveraging world knowledge to solve downstream tasks, to alleviate world knowledge-edge forgetting. Experimental results show that, as the instruction data increases, LoRAMoE can significantly improve the ability to process downstream tasks, while maintaining the world knowledge stored in the LLM. △ Less

Submitted 8 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

Comments: 14 pages, 7 figures

arXiv:2312.09068 [pdf, other]

Global and local polarization of $Λ$ hyperons across RHIC-BES energies

Authors: Xiang-Yu Wu, Cong Yi, Guang-You Qin, Shi Pu

Abstract: We report our recent study on the global and local polarization of $Λ$ hyperons in Au+Au collisions at RHIC-BES energies within the (3+1)-dimensional CLVisc hydrodynamics framework. We present our numerical results for the global polarization as the function of collision energies and the local polarization along the beam direction as functions of azimuthal angle in $20-50$% centrality at… ▽ More We report our recent study on the global and local polarization of $Λ$ hyperons in Au+Au collisions at RHIC-BES energies within the (3+1)-dimensional CLVisc hydrodynamics framework. We present our numerical results for the global polarization as the function of collision energies and the local polarization along the beam direction as functions of azimuthal angle in $20-50$% centrality at $\sqrt{s_{NN}}$=7.7 GeV Au+Au collision energy. We have discussed the effects of initial conditions, Spin Hall effect and baryon diffusion. △ Less

Submitted 14 December, 2023; originally announced December 2023.

Comments: 4 pages, 3 figures. Contribution to the proceedings of Quark Matter 2023 (Houston, TX, 3-9 Sep. 2023)

arXiv:2312.06779 [pdf, other]

Fingerprints of Composite Fermion Lambda Levels in Scanning Tunneling Microscopy

Authors: Songyang Pu, Ajit C. Balram, Yuwen Hu, Yen-Chen Tsui, Minhao He, Nicolas Regnault, Michael P. Zaletel, Ali Yazdani, Zlatko Papić

Abstract: Composite fermion (CF) is a topological quasiparticle that emerges from a non-perturbative attachment of vortices to electrons in strongly correlated two-dimensional materials. Similar to non-interacting fermions that form Landau levels in a magnetic field, CFs can fill analogous ``Lambda'' levels, giving rise to the fractional quantum Hall (FQH) effect of electrons. Here, we show that Lambda leve… ▽ More Composite fermion (CF) is a topological quasiparticle that emerges from a non-perturbative attachment of vortices to electrons in strongly correlated two-dimensional materials. Similar to non-interacting fermions that form Landau levels in a magnetic field, CFs can fill analogous ``Lambda'' levels, giving rise to the fractional quantum Hall (FQH) effect of electrons. Here, we show that Lambda levels can be directly visualized through the characteristic peak structure in the signal obtained via spectroscopy with the scanning tunneling microscopy (STM) on a FQH state. Complementary to transport, which probes low-energy properties of CFs, we show that \emph{high-energy} features in STM spectra can be interpreted in terms of Lambda levels. We numerically demonstrate that STM spectra can be accurately modeled using Jain's CF theory. Our results show that STM provides a powerful tool for revealing the anatomy of FQH states and identifying physics beyond the non-interacting CF paradigm. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: Six figures including supplementary materials

arXiv:2311.15197 [pdf, other]

Spin polarization and spin alignment from quantum kinetic theory with self-energy corrections

Authors: Shuo Fang, Shi Pu, Di-Lun Yang

Abstract: We derive the quantum kinetic theory for massive fermions with collision terms and self-energy corrections based on quantum field theory. We adopt an effective power counting scheme with $\hbar$ expansion to obtain the leading-order perturbative solutions of the vector and axial Wigner functions and the corresponding kinetic equations. We observe that both the onshell relation and the structure of… ▽ More We derive the quantum kinetic theory for massive fermions with collision terms and self-energy corrections based on quantum field theory. We adopt an effective power counting scheme with $\hbar$ expansion to obtain the leading-order perturbative solutions of the vector and axial Wigner functions and the corresponding kinetic equations. We observe that both the onshell relation and the structure of Wigner functions, along with the kinetic equations, are modified due to the presence of self-energies and their space-time gradients. We further apply our formalism to investigate the spin polarization phenomena in relativistic heavy ion collisions and derive the modification to the spin polarization spectrum of massive quarks. We find that the gradient of vector self-energy plays a similar role to the background electromagnetic fields, which induces a more dominant contribution than the collisional effects by a naive power counting in the gradient expansion and weak coupling. Our findings could further modify the spin polarization of strange quarks and spin alignment of $φ$ mesons beyond local thermal equilibrium. △ Less

Submitted 27 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

Comments: 52 pages, 1 table

Journal ref: Phys.Rev.D 109 (2024) 3, 034034

arXiv:2310.08298 [pdf, other]

MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition

Authors: Shuhui Wu, Yongliang Shen, Zeqi Tan, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu

Abstract: Distantly supervised named entity recognition (DS-NER) aims to locate entity mentions and classify their types with only knowledge bases or gazetteers and unlabeled corpus. However, distant annotations are noisy and degrade the performance of NER models. In this paper, we propose a noise-robust prototype network named MProto for the DS-NER task. Different from previous prototype-based NER methods,… ▽ More Distantly supervised named entity recognition (DS-NER) aims to locate entity mentions and classify their types with only knowledge bases or gazetteers and unlabeled corpus. However, distant annotations are noisy and degrade the performance of NER models. In this paper, we propose a noise-robust prototype network named MProto for the DS-NER task. Different from previous prototype-based NER methods, MProto represents each entity type with multiple prototypes to characterize the intra-class variance among entity representations. To optimize the classifier, each token should be assigned an appropriate ground-truth prototype and we consider such token-prototype assignment as an optimal transport (OT) problem. Furthermore, to mitigate the noise from incomplete labeling, we propose a novel denoised optimal transport (DOT) algorithm. Specifically, we utilize the assignment result between Other class tokens and all prototypes to distinguish unlabeled entity tokens from true negatives. Experiments on several DS-NER benchmarks demonstrate that our MProto achieves state-of-the-art performance. The source code is now available on Github. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP-2023, camera ready version

arXiv:2309.11708 [pdf, other]

Stability and causality criteria in linear mode analysis: stability means causality

Authors: Dong-Lin Wang, Shi Pu

Abstract: Causality and stability are fundamental requirements for the differential equations describing predictable relativistic many-body systems. In this work, we investigate the stability and causality criteria in linear mode analysis. We discuss the updated stability criterion in 3+1 dimensional systems and introduce the improved sufficient criterion for causality. Our findings clearly demonstrate that… ▽ More Causality and stability are fundamental requirements for the differential equations describing predictable relativistic many-body systems. In this work, we investigate the stability and causality criteria in linear mode analysis. We discuss the updated stability criterion in 3+1 dimensional systems and introduce the improved sufficient criterion for causality. Our findings clearly demonstrate that stability implies causality in linear mode analysis. Furthermore, based on the theorems present in this work, we conclude that if updated stability criterion and improved causality criterion are fulfilled in one inertial frame of reference (IFR), they hold for all IFR. △ Less

Submitted 20 February, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 6+8 pages, 1 figure; references added, typos corrected

Journal ref: Phys. Rev. D 109, L031504 (2024)

arXiv:2309.04527 [pdf, other]

doi 10.1103/PhysRevResearch.6.013105

Deformed Fredkin model for the $ν{=}5/2$ Moore-Read state on thin cylinders

Authors: Cristian Voinea, Songyang Pu, Ammar Kirmani, Pouyan Ghaemi, Armin Rahmani, Zlatko Papić

Abstract: We propose a frustration-free model for the Moore-Read quantum Hall state on sufficiently thin cylinders with circumferences $\lesssim 7$ magnetic lengths. While the Moore-Read Hamiltonian involves complicated long-range interactions between triplets of electrons in a Landau level, our effective model is a simpler one-dimensional chain of qubits with deformed Fredkin gates. We show that the ground… ▽ More We propose a frustration-free model for the Moore-Read quantum Hall state on sufficiently thin cylinders with circumferences $\lesssim 7$ magnetic lengths. While the Moore-Read Hamiltonian involves complicated long-range interactions between triplets of electrons in a Landau level, our effective model is a simpler one-dimensional chain of qubits with deformed Fredkin gates. We show that the ground state of the Fredkin model has high overlap with the Moore-Read wave function and accurately reproduces the latter's entanglement properties. Moreover, we demonstrate that the model captures the dynamical response of the Moore-Read state to a geometric quench, induced by suddenly changing the anisotropy of the system. We elucidate the underlying mechanism of the quench dynamics and show that it coincides with the linearized bimetric field theory. The minimal model introduced here can be directly implemented as a first step towards quantum simulation of the Moore-Read state, as we demonstrate by deriving an efficient circuit approximation to the ground state and implementing it on IBM quantum processor. △ Less

Submitted 8 September, 2023; originally announced September 2023.

Comments: 18 pages, 15 figures

Journal ref: Phys. Rev. Research 6, 013105 (2024)

arXiv:2308.14038 [pdf, other]

Momentum dependence of $φ$ meson's spin alignment

Authors: Xin-Li Sheng, Shi Pu, Qun Wang

Abstract: We study the rapidity and azimuthal angle dependences of the global spin alignment $ρ_{00}$ for $φ$ mesons with respect to the reaction plane in Au+Au collisions at RHIC by the relativistic coalescence model in the spin transport theory. The global spin alignment of $φ$ mesons arises from local fluctuations of strong force fields whose values are extracted from the STAR's data. The calculated resu… ▽ More We study the rapidity and azimuthal angle dependences of the global spin alignment $ρ_{00}$ for $φ$ mesons with respect to the reaction plane in Au+Au collisions at RHIC by the relativistic coalescence model in the spin transport theory. The global spin alignment of $φ$ mesons arises from local fluctuations of strong force fields whose values are extracted from the STAR's data. The calculated results show that $ρ_{00}<1/3$ at the rapidity $Y=0$, and then it increases with rapidity and becomes $ρ_{00}>1/3$ at $Y=1$. Such a rapidity dependence is dominated by the relative motion of the $φ$ meson in the bulk matter. We also give prediction for the azimuthal angle dependence of $ρ_{00}$ at different rapidities. △ Less

Submitted 27 August, 2023; originally announced August 2023.

Comments: RevTex 4, 5 pages, 4 figures

arXiv:2308.08171 [pdf, other]

doi 10.1609/aaai.v38i8.28646

Learning to Pivot as a Smart Expert

Authors: Tianhao Liu, Shanwen Pu, Dongdong Ge, Yinyu Ye

Abstract: Linear programming has been practically solved mainly by simplex and interior point methods. Compared with the weakly polynomial complexity obtained by the interior point methods, the existence of strongly polynomial bounds for the length of the pivot path generated by the simplex methods remains a mystery. In this paper, we propose two novel pivot experts that leverage both global and local infor… ▽ More Linear programming has been practically solved mainly by simplex and interior point methods. Compared with the weakly polynomial complexity obtained by the interior point methods, the existence of strongly polynomial bounds for the length of the pivot path generated by the simplex methods remains a mystery. In this paper, we propose two novel pivot experts that leverage both global and local information of the linear programming instances for the primal simplex method and show their excellent performance numerically. The experts can be regarded as a benchmark to evaluate the performance of classical pivot rules, although they are hard to directly implement. To tackle this challenge, we employ a graph convolutional neural network model, trained via imitation learning, to mimic the behavior of the pivot expert. Our pivot rule, learned empirically, displays a significant advantage over conventional methods in various linear programming problems, as demonstrated through a series of rigorous experiments. △ Less

Submitted 31 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

arXiv:2306.13880 [pdf, other]

Causality and stability analysis for the minimal causal spin hydrodynamics

Authors: Xin-Qing Xie, Dong-Lin Wang, Chen Yang, Shi Pu

Abstract: We perform the linear analysis of causality and stability for a minimal extended spin hydrodynamics up to second order of the gradient expansion. The first order spin hydrodynamics, with a rank-3 spin tensor being antisymmetric for only the last two indices, are proved to be acausal and unstable. We then consider the minimal causal spin hydrodynamics up to second order of the gradient expansion. W… ▽ More We perform the linear analysis of causality and stability for a minimal extended spin hydrodynamics up to second order of the gradient expansion. The first order spin hydrodynamics, with a rank-3 spin tensor being antisymmetric for only the last two indices, are proved to be acausal and unstable. We then consider the minimal causal spin hydrodynamics up to second order of the gradient expansion. We derive the necessary causality and stability conditions for this minimal causal spin hydrodynamics. Interestingly, the satisfaction of the stability conditions relies on the equations of state for the spin density and chemical potentials. Moreover, different with the conventional relativistic dissipative hydrodynamics, the stability of the theory seems to be broken at the finite wave-vector when the stability conditions are fulfilled at small and large wave-vector limits. It implies that the behavior in small and large wave-vector limits may be insufficient to determine the stability conditions for spin hydrodynamics in linear mode analysis. △ Less

Submitted 21 November, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

Comments: 45 pages, 2 figures, typos corrected, published version

Journal ref: Phys. Rev. D 108, 094031 (2023)

arXiv:2306.12037 [pdf, other]

Distributed Random Reshuffling Methods with Improved Convergence

Authors: Kun Huang, Linli Zhou, Shi Pu

Abstract: This paper proposes two distributed random reshuffling methods, namely Gradient Tracking with Random Reshuffling (GT-RR) and Exact Diffusion with Random Reshuffling (ED-RR), to solve the distributed optimization problem over a connected network, where a set of agents aim to minimize the average of their local cost functions. Both algorithms invoke random reshuffling (RR) update for each agent, inh… ▽ More This paper proposes two distributed random reshuffling methods, namely Gradient Tracking with Random Reshuffling (GT-RR) and Exact Diffusion with Random Reshuffling (ED-RR), to solve the distributed optimization problem over a connected network, where a set of agents aim to minimize the average of their local cost functions. Both algorithms invoke random reshuffling (RR) update for each agent, inherit favorable characteristics of RR for minimizing smooth nonconvex objective functions, and improve the performance of previous distributed random reshuffling methods both theoretically and empirically. Specifically, both GT-RR and ED-RR achieve the convergence rate of $O(1/[(1-λ)^{1/3}m^{1/3}T^{2/3}])$ in driving the (minimum) expected squared norm of the gradient to zero, where $T$ denotes the number of epochs, $m$ is the sample size for each agent, and $1-λ$ represents the spectral gap of the mixing matrix. When the objective functions further satisfy the Polyak-Łojasiewicz (PL) condition, we show GT-RR and ED-RR both achieve $O(1/[(1-λ)mT^2])$ convergence rate in terms of the averaged expected differences between the agents' function values and the global minimum value. Notably, both results are comparable to the convergence rates of centralized RR methods (up to constant factors depending on the network topology) and outperform those of previous distributed random reshuffling algorithms. Moreover, we support the theoretical findings with a set of numerical experiments. △ Less

Submitted 16 April, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: 16 pages, 8 figures

arXiv:2306.08967 [pdf, other]

doi 10.1145/3580305.3599250

Accelerating Dynamic Network Embedding with Billions of Parameter Updates to Milliseconds

Authors: Haoran Deng, Yang Yang, Jiahe Li, Haoyang Cai, Shiliang Pu, Weihao Jiang

Abstract: Network embedding, a graph representation learning method illustrating network topology by map** nodes into lower-dimension vectors, is challenging to accommodate the ever-changing dynamic graphs in practice. Existing research is mainly based on node-by-node embedding modifications, which falls into the dilemma of efficient calculation and accuracy. Observing that the embedding dimensions are us… ▽ More Network embedding, a graph representation learning method illustrating network topology by map** nodes into lower-dimension vectors, is challenging to accommodate the ever-changing dynamic graphs in practice. Existing research is mainly based on node-by-node embedding modifications, which falls into the dilemma of efficient calculation and accuracy. Observing that the embedding dimensions are usually much smaller than the number of nodes, we break this dilemma with a novel dynamic network embedding paradigm that rotates and scales the axes of embedding space instead of a node-by-node update. Specifically, we propose the Dynamic Adjacency Matrix Factorization (DAMF) algorithm, which achieves an efficient and accurate dynamic network embedding by rotating and scaling the coordinate system where the network embedding resides with no more than the number of edge modifications changes of node embeddings. Moreover, a dynamic Personalized PageRank is applied to the obtained network embeddings to enhance node embeddings and capture higher-order neighbor information dynamically. Experiments of node classification, link prediction, and graph reconstruction on different-sized dynamic graphs suggest that DAMF advances dynamic network embedding. Further, we unprecedentedly expand dynamic network embedding experiments to billion-edge graphs, where DAMF updates billion-level parameters in less than 10ms. △ Less

Submitted 15 June, 2023; originally announced June 2023.

arXiv:2305.12800 [pdf, other]

doi 10.1109/ICASSP49357.2023.10095424

Single Domain Dynamic Generalization for Iris Presentation Attack Detection

Authors: Yachun Li, **g**g Wang, Yuhui Chen, Di Xie, Shiliang Pu

Abstract: Iris presentation attack detection (PAD) has achieved great success under intra-domain settings but easily degrades on unseen domains. Conventional domain generalization methods mitigate the gap by learning domain-invariant features. However, they ignore the discriminative information in the domain-specific features. Moreover, we usually face a more realistic scenario with only one single domain a… ▽ More Iris presentation attack detection (PAD) has achieved great success under intra-domain settings but easily degrades on unseen domains. Conventional domain generalization methods mitigate the gap by learning domain-invariant features. However, they ignore the discriminative information in the domain-specific features. Moreover, we usually face a more realistic scenario with only one single domain available for training. To tackle the above issues, we propose a Single Domain Dynamic Generalization (SDDG) framework, which simultaneously exploits domain-invariant and domain-specific features on a per-sample basis and learns to generalize to various unseen domains with numerous natural images. Specifically, a dynamic block is designed to adaptively adjust the network with a dynamic adaptor. And an information maximization loss is further combined to increase diversity. The whole network is integrated into the meta-learning paradigm. We generate amplitude perturbed images and cover diverse domains with natural images. Therefore, the network can learn to generalize to the perturbed domains in the meta-test phase. Extensive experiments show the proposed method is effective and outperforms the state-of-the-art on LivDet-Iris 2017 dataset. △ Less

Submitted 22 May, 2023; originally announced May 2023.

Comments: ICASSP 2023 Camera Ready

arXiv:2305.11004 [pdf, other]

Insert or Attach: Taxonomy Completion via Box Embedding

Authors: Wei Xue, Yongliang Shen, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu

Abstract: Taxonomy completion, enriching existing taxonomies by inserting new concepts as parents or attaching them as children, has gained significant interest. Previous approaches embed concepts as vectors in Euclidean space, which makes it difficult to model asymmetric relations in taxonomy. In addition, they introduce pseudo-leaves to convert attachment cases into insertion cases, leading to an incorrec… ▽ More Taxonomy completion, enriching existing taxonomies by inserting new concepts as parents or attaching them as children, has gained significant interest. Previous approaches embed concepts as vectors in Euclidean space, which makes it difficult to model asymmetric relations in taxonomy. In addition, they introduce pseudo-leaves to convert attachment cases into insertion cases, leading to an incorrect bias in network learning dominated by numerous pseudo-leaves. Addressing these, our framework, TaxBox, leverages box containment and center closeness to design two specialized geometric scorers within the box embedding space. These scorers are tailored for insertion and attachment operations and can effectively capture intrinsic relationships between concepts by optimizing on a granular box constraint loss. We employ a dynamic ranking loss mechanism to balance the scores from these scorers, allowing adaptive adjustments of insertion and attachment scores. Experiments on four real-world datasets show that TaxBox significantly outperforms previous methods, yielding substantial improvements over prior methods in real-world datasets, with average performance boosts of 6.7%, 34.9%, and 51.4% in MRR, Hit@1, and Prec@1, respectively. △ Less

Submitted 18 June, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

arXiv:2305.04175 [pdf, other]

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

Authors: Shengfang Zhai, Yinpeng Dong, Qingni Shen, Shi Pu, Yuejian Fang, Hang Su

Abstract: With the help of conditioning mechanisms, the state-of-the-art diffusion models have achieved tremendous success in guided image generation, particularly in text-to-image synthesis. To gain a better understanding of the training process and potential risks of text-to-image synthesis, we perform a systematic investigation of backdoor attack on text-to-image diffusion models and propose BadT2I, a ge… ▽ More With the help of conditioning mechanisms, the state-of-the-art diffusion models have achieved tremendous success in guided image generation, particularly in text-to-image synthesis. To gain a better understanding of the training process and potential risks of text-to-image synthesis, we perform a systematic investigation of backdoor attack on text-to-image diffusion models and propose BadT2I, a general multimodal backdoor attack framework that tampers with image synthesis in diverse semantic levels. Specifically, we perform backdoor attacks on three levels of the vision semantics: Pixel-Backdoor, Object-Backdoor and Style-Backdoor. By utilizing a regularization loss, our methods efficiently inject backdoors into a large-scale text-to-image diffusion model while preserving its utility with benign inputs. We conduct empirical experiments on Stable Diffusion, the widely-used text-to-image diffusion model, demonstrating that the large-scale diffusion model can be easily backdoored within a few fine-tuning steps. We conduct additional experiments to explore the impact of different types of textual triggers, as well as the backdoor persistence during further training, providing insights for the development of backdoor defense methods. Besides, our investigation may contribute to the copyright protection of text-to-image models in the future. △ Less

Submitted 22 October, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

Comments: Carmera-ready version. To appear in ACM MM 2023. Code will be released at: https://github.com/sf-zhai/BadT2I

arXiv:2304.08777 [pdf, other]

Probing vortical structures in heavy-ion collisions at RHIC-BES energies through helicity polarization

Authors: Cong Yi, Xiang-Yu Wu, Di-Lun Yang, Jian-Hua Gao, Shi Pu, Guang-You Qin

Abstract: We investigate the hydrodynamic helicity polarization of $Λ$ hyperons, defined as the projection of the spin polarization vector along the directions of particle momenta, at RHIC-BES energies by utilizing the relativistic (3+1)D CLVisc hydrodynamics framework with SMASH initial conditions. As opposed to local spin polarization at high energy collisions, our hydrodynamic simulations demonstrate tha… ▽ More We investigate the hydrodynamic helicity polarization of $Λ$ hyperons, defined as the projection of the spin polarization vector along the directions of particle momenta, at RHIC-BES energies by utilizing the relativistic (3+1)D CLVisc hydrodynamics framework with SMASH initial conditions. As opposed to local spin polarization at high energy collisions, our hydrodynamic simulations demonstrate that the helicity polarization induced by the kinetic vorticity dominates over other contributions at intermediate and low collision energies. Our findings provide an opportunity to probe the fine structure of local kinetic vorticity as a function of azimuthal angle at intermediate and low collision energies by map** our predictions to the future measurements in experiments. △ Less

Submitted 18 April, 2023; originally announced April 2023.

Comments: 8 pages, 3 figures

arXiv:2304.02950 [pdf, other]

Multi-view Adversarial Discriminator: Mine the Non-causal Factors for Object Detection in Unseen Domains

Authors: Mingjun Xu, Lingyun Qin, Weijie Chen, Shiliang Pu, Lei Zhang

Abstract: Domain shift degrades the performance of object detection models in practical applications. To alleviate the influence of domain shift, plenty of previous work try to decouple and learn the domain-invariant (common) features from source domains via domain adversarial learning (DAL). However, inspired by causal mechanisms, we find that previous methods ignore the implicit insignificant non-causal f… ▽ More Domain shift degrades the performance of object detection models in practical applications. To alleviate the influence of domain shift, plenty of previous work try to decouple and learn the domain-invariant (common) features from source domains via domain adversarial learning (DAL). However, inspired by causal mechanisms, we find that previous methods ignore the implicit insignificant non-causal factors hidden in the common features. This is mainly due to the single-view nature of DAL. In this work, we present an idea to remove non-causal factors from common features by multi-view adversarial training on source domains, because we observe that such insignificant non-causal factors may still be significant in other latent spaces (views) due to the multi-mode structure of data. To summarize, we propose a Multi-view Adversarial Discriminator (MAD) based domain generalization model, consisting of a Spurious Correlations Generator (SCG) that increases the diversity of source domain by random augmentation and a Multi-View Domain Classifier (MVDC) that maps features to multiple latent spaces, such that the non-causal factors are removed and the domain-invariant features are purified. Extensive experiments on six benchmarks show our MAD obtains state-of-the-art performance. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: CVPR 2023 (Highlight, top 2.5%). Pytorch vs. MindSpore Code at "https://github.com/K2OKOH/MAD"

arXiv:2304.00519 [pdf, other]

doi 10.1103/PhysRevD.107.116006

Electric and magnetic conductivities in magnetized fermion systems

Authors: Hao-Hao Peng, Xin-Li Sheng, Shi Pu, Qun Wang

Abstract: In Wigner function approach with relaxation time approximation, we calculate electric and magnetic conductivities of a fermion system in the strong magnetic field. The linear response has been calculated to the perturbation of electromagnetic fields on the background constant magnetic field. The Wigner function is separated into an equilibrium part in the background magnetic field and an off-equil… ▽ More In Wigner function approach with relaxation time approximation, we calculate electric and magnetic conductivities of a fermion system in the strong magnetic field. The linear response has been calculated to the perturbation of electromagnetic fields on the background constant magnetic field. The Wigner function is separated into an equilibrium part in the background magnetic field and an off-equilibrium part induced by perturbative fields. The analytical expression for the equilibrium part and the corresponding equilibrium conditions are given. For the off-equilibrium part, we obtain the kinetic equation at the leading order in $\hbar$ from the master equation of the Wigner function. When perturbative fields only depend on the proper time, the off-equilibrium part can be analytically solved from which the vector and axial vector currents are obtained. We obtain the longitudinal and transverse Ohm conductivities as well as Hall conductivity as the linear response of the vector current to the perturbative electric field. The behaviors of these conductivities as functions of the evolving time, relaxation time, particle mass, and strength of the background magnetic field are investigated both analytically and numerically. △ Less

Submitted 2 April, 2023; originally announced April 2023.

Comments: 25 pages, 6 figures

arXiv:2303.17167 [pdf, other]

Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation

Authors: Hang Du, Xuejun Yan, **g**g Wang, Di Xie, Shiliang Pu

Abstract: Most existing approaches for point cloud normal estimation aim to locally fit a geometric surface and calculate the normal from the fitted surface. Recently, learning-based methods have adopted a routine of predicting point-wise weights to solve the weighted least-squares surface fitting problem. Despite achieving remarkable progress, these methods overlook the approximation error of the fitting p… ▽ More Most existing approaches for point cloud normal estimation aim to locally fit a geometric surface and calculate the normal from the fitted surface. Recently, learning-based methods have adopted a routine of predicting point-wise weights to solve the weighted least-squares surface fitting problem. Despite achieving remarkable progress, these methods overlook the approximation error of the fitting problem, resulting in a less accurate fitted surface. In this paper, we first carry out in-depth analysis of the approximation error in the surface fitting problem. Then, in order to bridge the gap between estimated and precise surface normals, we present two basic design principles: 1) applies the $Z$-direction Transform to rotate local patches for a better surface fitting with a lower approximation error; 2) models the error of the normal estimation as a learnable term. We implement these two principles using deep neural networks, and integrate them with the state-of-the-art (SOTA) normal estimation methods in a plug-and-play manner. Extensive experiments verify our approaches bring benefits to point cloud normal estimation and push the frontier of state-of-the-art performance on both synthetic and real-world datasets. △ Less

Submitted 30 March, 2023; originally announced March 2023.

Comments: The first two authors contributed equally to this work. The source code are available at https://github.com/hikvision-research/3DVision. Accepted to CVPR 2023

arXiv:2303.07091 [pdf, other]

A Linearly Convergent Robust Compressed Push-Pull Method for Decentralized Optimization

Authors: Yiwei Liao, Zhuorui Li, Shi Pu

Abstract: In the modern paradigm of multi-agent networks, communication has become one of the main bottlenecks for decentralized optimization, where a large number of agents are involved in minimizing the average of the local cost functions. In this paper, we propose a robust compressed push-pull algorithm (RCPP) that combines gradient tracking with communication compression. In particular, RCPP is compatib… ▽ More In the modern paradigm of multi-agent networks, communication has become one of the main bottlenecks for decentralized optimization, where a large number of agents are involved in minimizing the average of the local cost functions. In this paper, we propose a robust compressed push-pull algorithm (RCPP) that combines gradient tracking with communication compression. In particular, RCPP is compatible with a much more general class of compression operators that allow both relative and absolute compression errors. We show that RCPP achieves linear convergence rate for smooth objective functions satisfying the Polyak-Łojasiewicz condition over general directed networks. Numerical examples verify the theoretical findings and demonstrate the efficiency, flexibility, and robustness of the proposed algorithm. △ Less

Submitted 13 March, 2023; originally announced March 2023.

arXiv:2303.06555 [pdf, other]

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

Authors: Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu

Abstract: This paper proposes a unified diffusion framework (dubbed UniDiffuser) to fit all distributions relevant to a set of multi-modal data in one model. Our key insight is -- learning diffusion models for marginal, conditional, and joint distributions can be unified as predicting the noise in the perturbed data, where the perturbation levels (i.e. timesteps) can be different for different modalities. I… ▽ More This paper proposes a unified diffusion framework (dubbed UniDiffuser) to fit all distributions relevant to a set of multi-modal data in one model. Our key insight is -- learning diffusion models for marginal, conditional, and joint distributions can be unified as predicting the noise in the perturbed data, where the perturbation levels (i.e. timesteps) can be different for different modalities. Inspired by the unified view, UniDiffuser learns all distributions simultaneously with a minimal modification to the original diffusion model -- perturbs data in all modalities instead of a single modality, inputs individual timesteps in different modalities, and predicts the noise of all modalities instead of a single modality. UniDiffuser is parameterized by a transformer for diffusion models to handle input types of different modalities. Implemented on large-scale paired image-text data, UniDiffuser is able to perform image, text, text-to-image, image-to-text, and image-text pair generation by setting proper timesteps without additional overhead. In particular, UniDiffuser is able to produce perceptually realistic samples in all tasks and its quantitative results (e.g., the FID and CLIP score) are not only superior to existing general-purpose models but also comparable to the bespoken models (e.g., Stable Diffusion and DALL-E 2) in representative tasks (e.g., text-to-image generation). △ Less

Submitted 30 May, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

Comments: Accepted to ICML2023

arXiv:2302.00912 [pdf]

Advances and Challenges in Multimodal Remote Sensing Image Registration

Authors: Bai Zhu, Liang Zhou, Simiao Pu, Jianwei Fan, Yuanxin Ye

Abstract: Over the past few decades, with the rapid development of global aerospace and aerial remote sensing technology, the types of sensors have evolved from the traditional monomodal sensors (e.g., optical sensors) to the new generation of multimodal sensors [e.g., multispectral, hyperspectral, light detection and ranging (LiDAR) and synthetic aperture radar (SAR) sensors]. These advanced devices can dy… ▽ More Over the past few decades, with the rapid development of global aerospace and aerial remote sensing technology, the types of sensors have evolved from the traditional monomodal sensors (e.g., optical sensors) to the new generation of multimodal sensors [e.g., multispectral, hyperspectral, light detection and ranging (LiDAR) and synthetic aperture radar (SAR) sensors]. These advanced devices can dynamically provide various and abundant multimodal remote sensing images with different spatial, temporal, and spectral resolutions according to different application requirements. Since then, it is of great scientific significance to carry out the research of multimodal remote sensing image registration, which is a crucial step for integrating the complementary information among multimodal data and making comprehensive observations and analysis of the Earths surface. In this work, we will present our own contributions to the field of multimodal image registration, summarize the advantages and limitations of existing multimodal image registration methods, and then discuss the remaining challenges and make a forward-looking prospect for the future development of the field. △ Less

Submitted 7 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

Comments: 10 pages, 4 figures

arXiv:2301.12677 [pdf, other]

Distributed Stochastic Optimization under a General Variance Condition

Authors: Kun Huang, Xiao Li, Shi Pu

Abstract: Distributed stochastic optimization has drawn great attention recently due to its effectiveness in solving large-scale machine learning problems. Though numerous algorithms have been proposed and successfully applied to general practical problems, their theoretical guarantees mainly rely on certain boundedness conditions on the stochastic gradients, varying from uniform boundedness to the relaxed… ▽ More Distributed stochastic optimization has drawn great attention recently due to its effectiveness in solving large-scale machine learning problems. Though numerous algorithms have been proposed and successfully applied to general practical problems, their theoretical guarantees mainly rely on certain boundedness conditions on the stochastic gradients, varying from uniform boundedness to the relaxed growth condition. In addition, how to characterize the data heterogeneity among the agents and its impacts on the algorithmic performance remains challenging. In light of such motivations, we revisit the classical Federated Averaging (FedAvg) algorithm (McMahan et al., 2017) as well as the more recent SCAFFOLD method (Karimireddy et al., 2020) for solving the distributed stochastic optimization problem and establish the convergence results under only a mild variance condition on the stochastic gradients for smooth nonconvex objective functions. Almost sure convergence to a stationary point is also established under the condition. Moreover, we discuss a more informative measurement for data heterogeneity as well as its implications. △ Less

Submitted 13 December, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: 16 pages, 2 figure

arXiv:2301.05872 [pdf, other]

CEDAS: A Compressed Decentralized Stochastic Gradient Method with Improved Convergence

Authors: Kun Huang, Shi Pu

Abstract: In this paper, we consider solving the distributed optimization problem over a multi-agent network under the communication restricted setting. We study a compressed decentralized stochastic gradient method, termed ``compressed exact diffusion with adaptive stepsizes (CEDAS)", and show the method asymptotically achieves comparable convergence rate as centralized { stochastic gradient descent (SGD)}… ▽ More In this paper, we consider solving the distributed optimization problem over a multi-agent network under the communication restricted setting. We study a compressed decentralized stochastic gradient method, termed ``compressed exact diffusion with adaptive stepsizes (CEDAS)", and show the method asymptotically achieves comparable convergence rate as centralized { stochastic gradient descent (SGD)} for both smooth strongly convex objective functions and smooth nonconvex objective functions under unbiased compression operators. In particular, to our knowledge, CEDAS enjoys so far the shortest transient time (with respect to the graph specifics) for achieving the convergence rate of centralized SGD, which behaves as $\mathcal{O}(n{C^3}/(1-λ_2)^{2})$ under smooth strongly convex objective functions, and $\mathcal{O}(n^3{C^6}/(1-λ_2)^4)$ under smooth nonconvex objective functions, where $(1-λ_2)$ denotes the spectral gap of the mixing matrix, and $C>0$ is the compression-related parameter. Numerical experiments further demonstrate the effectiveness of the proposed algorithm. △ Less

Submitted 25 February, 2024; v1 submitted 14 January, 2023; originally announced January 2023.

Comments: 16 pages, 8 figures

arXiv:2301.04796 [pdf, other]

1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection Track

Authors: Wei Zhao, Binbin Chen, Weijie Chen, Shicai Yang, Di Xie, Shiliang Pu, Yueting Zhuang

Abstract: OOD-CV challenge is an out-of-distribution generalization task. To solve this problem in object detection track, we propose a simple yet effective Generalize-then-Adapt (G&A) framework, which is composed of a two-stage domain generalization part and a one-stage domain adaptation part. The domain generalization part is implemented by a Supervised Model Pretraining stage using source data for model… ▽ More OOD-CV challenge is an out-of-distribution generalization task. To solve this problem in object detection track, we propose a simple yet effective Generalize-then-Adapt (G&A) framework, which is composed of a two-stage domain generalization part and a one-stage domain adaptation part. The domain generalization part is implemented by a Supervised Model Pretraining stage using source data for model warm-up and a Weakly Semi-Supervised Model Pretraining stage using both source data with box-level label and auxiliary data (ImageNet-1K) with image-level label for performance boosting. The domain adaptation part is implemented as a Source-Free Domain Adaptation paradigm, which only uses the pre-trained model and the unlabeled target data to further optimize in a self-supervised training manner. The proposed G&A framework help us achieve the first place on the object detection leaderboard of the OOD-CV challenge. Code will be released in https://github.com/hikvision-research/OOD-CV. △ Less

Submitted 11 January, 2023; originally announced January 2023.

Comments: Tech Report

arXiv:2301.04795 [pdf, other]

1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track

Authors: Yilu Guo, Xingyue Shi, Weijie Chen, Shicai Yang, Di Xie, Shiliang Pu, Yueting Zhuang

Abstract: OOD-CV challenge is an out-of-distribution generalization task. In this challenge, our core solution can be summarized as that Noisy Label Learning Is A Strong Test-Time Domain Adaptation Optimizer. Briefly speaking, our main pipeline can be divided into two stages, a pre-training stage for domain generalization and a test-time training stage for domain adaptation. We only exploit labeled source d… ▽ More OOD-CV challenge is an out-of-distribution generalization task. In this challenge, our core solution can be summarized as that Noisy Label Learning Is A Strong Test-Time Domain Adaptation Optimizer. Briefly speaking, our main pipeline can be divided into two stages, a pre-training stage for domain generalization and a test-time training stage for domain adaptation. We only exploit labeled source data in the pre-training stage and only exploit unlabeled target data in the test-time training stage. In the pre-training stage, we propose a simple yet effective Mask-Level Copy-Paste data augmentation strategy to enhance out-of-distribution generalization ability so as to resist shape, pose, context, texture, occlusion, and weather domain shifts in this challenge. In the test-time training stage, we use the pre-trained model to assign noisy label for the unlabeled target data, and propose a Label-Periodically-Updated DivideMix method for noisy label learning. After integrating Test-Time Augmentation and Model Ensemble strategies, our solution ranks the first place on the Image Classification Leaderboard of the OOD-CV Challenge. Code will be released in https://github.com/hikvision-research/OOD-CV. △ Less

Submitted 11 January, 2023; originally announced January 2023.

Comments: Tech Report

arXiv:2301.04169 [pdf, other]

doi 10.1103/PhysRevLett.130.176501

Signatures of Supersymmetry in the $ν{=}5/2$ Fractional Quantum Hall Effect

Authors: Songyang Pu, Ajit C. Balram, Mikael Fremling, Andrey Gromov, Zlatko Papić

Abstract: The Moore-Read state, one of the leading candidates for describing the fractional quantum Hall effect at filling factor $ν{=}5/2$, is a paradigmatic $p$-wave superconductor with non-Abelian topological order. Among its many exotic properties, the state hosts two collective modes: a bosonic density wave and a neutral fermion mode that arises from an unpaired electron in the condensate. It has recen… ▽ More The Moore-Read state, one of the leading candidates for describing the fractional quantum Hall effect at filling factor $ν{=}5/2$, is a paradigmatic $p$-wave superconductor with non-Abelian topological order. Among its many exotic properties, the state hosts two collective modes: a bosonic density wave and a neutral fermion mode that arises from an unpaired electron in the condensate. It has recently been proposed that the descriptions of the two modes can be unified by postulating supersymmetry (SUSY) that relates them in the long-wavelength limit. Here we extend the SUSY description to construct wave functions of the two modes on closed surfaces, such as the sphere and torus, and we test the resulting states in large-scale numerical simulations. We demonstrate the equivalence in the long-wavelength limit between SUSY wave functions and previous descriptions of collective modes based on the Girvin-MacDonald-Platzman ansatz, Jack polynomials, and bipartite composite fermions. Leveraging the first-quantized form of the SUSY wave functions, we study their energies using the Monte Carlo method and show that realistic $ν{=}5/2$ systems are close to the putative SUSY point, where the two collective modes become degenerate in energy. △ Less

Submitted 7 May, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: Main text 6 pages, 4 figures with attached supplementary information

Journal ref: Phys. Rev. Lett. 130, 176501 (2023)

arXiv:2301.00240 [pdf, other]

doi 10.1103/PhysRevB.107.195126

Torus geometry eigenfunctions of an interacting multi-Landau level Hamiltonian

Authors: Abhishek Anand, Songyang Pu, G J Sreejith

Abstract: A short-ranged, rotationally symmetric multi-Landau-level model Hamiltonian for strongly interacting electrons in a magnetic field was proposed [A. Anand et al, Phys. Rev. Lett. 126, 136601 (2021)] with the key feature that it allows exact many-body eigenfunctions on the disk not just for quasiholes but for all charged and neutral excitations of the entire Jain sequence filling fractions. We exten… ▽ More A short-ranged, rotationally symmetric multi-Landau-level model Hamiltonian for strongly interacting electrons in a magnetic field was proposed [A. Anand et al, Phys. Rev. Lett. 126, 136601 (2021)] with the key feature that it allows exact many-body eigenfunctions on the disk not just for quasiholes but for all charged and neutral excitations of the entire Jain sequence filling fractions. We extend this to geometries without full rotational symmetry, namely the torus and cylinder geometries, and present their spectra. Exact diagonalization of the interaction on the torus produces the low-energy spectra at filling fraction $ν=n/(2pn+1)$ that is identical, up to a topological $(2pn+1)$-fold multiplicity, to that of the integer quantum Hall spectra at $ν=n$, for the incompressible state as well as all excitations. While the ansatz eigenfunctions in the disk geometry cannot be generalized to closed geometries such as torus or sphere, we show how to extend them to cylinder geometry. Meanwhile, we show eigenfunctions for charged excitations at filling fractions between $\frac{1}{3}$ and $\frac{2}{5}$ can be written on the torus and the spherical geometries. △ Less

Submitted 27 April, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

arXiv:2212.14710 [pdf, other]

NeRF-Gaze: A Head-Eye Redirection Parametric Model for Gaze Estimation

Authors: Pengwei Yin, Jiawu Dai, **g**g Wang, Di Xie, Shiliang Pu

Abstract: Gaze estimation is the fundamental basis for many visual tasks. Yet, the high cost of acquiring gaze datasets with 3D annotations hinders the optimization and application of gaze estimation models. In this work, we propose a novel Head-Eye redirection parametric model based on Neural Radiance Field, which allows dense gaze data generation with view consistency and accurate gaze direction. Moreover… ▽ More Gaze estimation is the fundamental basis for many visual tasks. Yet, the high cost of acquiring gaze datasets with 3D annotations hinders the optimization and application of gaze estimation models. In this work, we propose a novel Head-Eye redirection parametric model based on Neural Radiance Field, which allows dense gaze data generation with view consistency and accurate gaze direction. Moreover, our head-eye redirection parametric model can decouple the face and eyes for separate neural rendering, so it can achieve the purpose of separately controlling the attributes of the face, identity, illumination, and eye gaze direction. Thus diverse 3D-aware gaze datasets could be obtained by manipulating the latent code belonging to different face attributions in an unsupervised manner. Extensive experiments on several benchmarks demonstrate the effectiveness of our method in domain generalization and domain adaptation for gaze estimation tasks. △ Less

Submitted 30 December, 2022; originally announced December 2022.

Comments: 10 pages, 8 figures, submitted to CVPR 2023

arXiv:2212.12661 [pdf, other]

Transmission Congestion Management with Generalized Generation Shift Distribution Factors

Authors: Shutong Pu, Guangchun Ruan, Xinfei Yan, Haiwang Zhong

Abstract: A major concern in modern power systems is that the popularity and fluctuating characteristics of renewable energy may cause more and more transmission congestion events. Traditional congestion management modeling involves AC or DC power flow equations, while the former equation always accompanies great amount of computation, and the latter cannot consider voltage amplitude and reactive power. The… ▽ More A major concern in modern power systems is that the popularity and fluctuating characteristics of renewable energy may cause more and more transmission congestion events. Traditional congestion management modeling involves AC or DC power flow equations, while the former equation always accompanies great amount of computation, and the latter cannot consider voltage amplitude and reactive power. Therefore, this paper proposes a congestion management approach incorporating a specially-designed generalized generator shift distribution factor (GSDF) to derive a computationally-efficient and accurate management strategies. This congestion management strategy involves multiple balancing generators for generation shift operation. The proposed model is superior in a low computational complexity (linear equation) and versatile modeling representation with full consideration of voltage amplitude and reactive power. △ Less

Submitted 24 December, 2022; originally announced December 2022.

Comments: 5 pages, 4 figures. Accepted by conference: ICPES 2022

arXiv:2212.01706 [pdf, other]

doi 10.1103/PhysRevLett.130.126201

Fractional quantum Hall effect with unconventional pairing in monolayer graphene

Authors: Anirban Sharma, Songyang Pu, Ajit C. Balram, Jainendra K. Jain

Abstract: Motivated by the observation of even denominator fractional quantum Hall effect in the $n=3$ Landau level of monolayer graphene [Y. Kim $\textit{et al.}$, Nature Physics $\textbf{15}$, 154 (2019)], we consider a Bardeen-Cooper-Schrieffer variational state for composite fermions and find that the composite-fermion Fermi sea in this Landau level is unstable to an $f$-wave pairing. Analogous calculat… ▽ More Motivated by the observation of even denominator fractional quantum Hall effect in the $n=3$ Landau level of monolayer graphene [Y. Kim $\textit{et al.}$, Nature Physics $\textbf{15}$, 154 (2019)], we consider a Bardeen-Cooper-Schrieffer variational state for composite fermions and find that the composite-fermion Fermi sea in this Landau level is unstable to an $f$-wave pairing. Analogous calculation suggests the possibility of a $p$-wave pairing of composite fermions at half filling in the $n=2$ graphene Landau level, whereas no pairing instability is found at half filling in the $n=0$ and $1$ graphene Landau levels. The relevance of these results to experiments is discussed. △ Less

Submitted 3 December, 2022; originally announced December 2022.

Comments: 13 pages, 7 figures

arXiv:2211.11286 [pdf, other]

doi 10.1103/PhysRevD.107.096010

Anomalous magnetohydrodynamics with temperature-dependent electric conductivity and application to the global polarization

Authors: Hao-Hao Peng, Sihao Wu, Ren-jie Wang, Duan She, Shi Pu

Abstract: We have derived the solutions of the relativistic anomalous magnetohydrodynamics with longitudinal Bjorken boost invariance and transverse electromagnetic fields in the presence of temperature or energy density dependent electric conductivity. We consider the equations of states in a high temperature limit or in a high chiral chemical potential limit. We obtain both perturbative analytic solutions… ▽ More We have derived the solutions of the relativistic anomalous magnetohydrodynamics with longitudinal Bjorken boost invariance and transverse electromagnetic fields in the presence of temperature or energy density dependent electric conductivity. We consider the equations of states in a high temperature limit or in a high chiral chemical potential limit. We obtain both perturbative analytic solutions up to the order of \hbar and numerical solutions in our configurations of initial electromagnetic fields and Bjorken flow velocity. Our results show that the temperature or energy density dependent electric conductivity plays an important role to the decaying of the energy density and electromagnetic fields. We also implement our results to the splitting of global polarization for Λand \barΛ hyperons induced by the magnetic fields. Our results for the splitting of global polarization disagree with the experimental data in low energy collisions, which implies that the contribution from gradient of chemical potential may dominate in the low energy collisions. △ Less

Submitted 21 November, 2022; originally announced November 2022.

arXiv:2211.08998 [pdf, other]

Data-pooling Reinforcement Learning for Personalized Healthcare Intervention

Authors: Xinyun Chen, Pengyi Shi, Shanwen Pu

Abstract: Motivated by the emerging needs of personalized preventative intervention in many healthcare applications, we consider a multi-stage, dynamic decision-making problem in the online setting with unknown model parameters. To deal with the pervasive issue of small sample size in personalized planning, we develop a novel data-pooling reinforcement learning (RL) algorithm based on a general perturbed va… ▽ More Motivated by the emerging needs of personalized preventative intervention in many healthcare applications, we consider a multi-stage, dynamic decision-making problem in the online setting with unknown model parameters. To deal with the pervasive issue of small sample size in personalized planning, we develop a novel data-pooling reinforcement learning (RL) algorithm based on a general perturbed value iteration framework. Our algorithm adaptively pools historical data, with three main innovations: (i) the weight of pooling ties directly to the performance of decision (measured by regret) as opposed to estimation accuracy in conventional methods; (ii) no parametric assumptions are needed between historical and current data; and (iii) requiring data-sharing only via aggregate statistics, as opposed to patient-level data. Our data-pooling algorithm framework applies to a variety of popular RL algorithms, and we establish a theoretical performance guarantee showing that our pooling version achieves a regret bound strictly smaller than that of the no-pooling counterpart. We substantiate the theoretical development with empirically better performance of our algorithm via a case study in the context of post-discharge intervention to prevent unplanned readmissions, generating practical insights for healthcare management. In particular, our algorithm alleviates privacy concerns about sharing health data, which (i) opens the door for individual organizations to levering public datasets or published studies to better manage their own patients; and (ii) provides the basis for public policy makers to encourage organizations to share aggregate data to improve population health outcomes for the broader community. △ Less

Submitted 16 November, 2022; originally announced November 2022.

arXiv:2210.05106 [pdf, other]

doi 10.1103/PhysRevD.107.054004

Photoproduction of $e^{+}e^{-}$ in peripheral isobar collisions

Authors: Shuo Lin, Ren-Jie Wang, JIan-Fei Wang, Hao-Jie Xu, Shi Pu, Qun Wang

Abstract: We investigate the photoproduction of di-electrons in peripheral collisions of $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at 200 GeV. With the charge and mass density distributions given by the calculation of the density functional theory, we calculate the spectra of transverse momentum, invariant mass and azimuthal angle for di-electrons at 40-… ▽ More We investigate the photoproduction of di-electrons in peripheral collisions of $_{44}^{96}\textrm{Ru}+_{44}^{96}\textrm{Ru}$ and $_{40}^{96}\textrm{Zr}+_{40}^{96}\textrm{Zr}$ at 200 GeV. With the charge and mass density distributions given by the calculation of the density functional theory, we calculate the spectra of transverse momentum, invariant mass and azimuthal angle for di-electrons at 40-80\% centrality. The ratios of these spectra in Ru+Ru collisions over to Zr+Zr collisions are shown to be smaller than $(44/40)^{4}$ (the ratio of $Z^{4}$ for Ru and Zr) at low transverse momentum. The deviation arises from the different mass and charge density distributions in Ru and Zr. So the photoproduction of di-leptons in isobar collisions may provide a new way to probe the nuclear structure. △ Less

Submitted 10 October, 2022; originally announced October 2022.

Comments: 17 pages, 6 figures

Showing 1–50 of 240 results for author: Pu, S