-
Stable vortex solitons sustained by a localized gain in the cubic medium
Authors:
Chunyan Li,
Yaroslav V. Kartashov
Abstract:
We propose a simple dissipative system with purely cubic defocusing nonlinearity and nonuniform linear gain that can support stable localized dissipative vortex solitons with high topological charges without the utilization of competing nonlinearities and nonlinear gain or losses. Localization of such solitons is achieved due to an intriguing mechanism when defocusing nonlinearity stimulates energ…
▽ More
We propose a simple dissipative system with purely cubic defocusing nonlinearity and nonuniform linear gain that can support stable localized dissipative vortex solitons with high topological charges without the utilization of competing nonlinearities and nonlinear gain or losses. Localization of such solitons is achieved due to an intriguing mechanism when defocusing nonlinearity stimulates energy flow from the ring-like region with linear gain to the periphery of the medium where energy is absorbed due to linear background losses. Vortex solitons bifurcate from linear gain-guided vortical modes with eigenvalues depending on topological charges that become purely real only at specific gain amplitudes. Increasing gain amplitude leads to transverse expansion of vortex solitons, but simultaneously it usually also leads to stability enhancement. Increasing background losses allows creation of stable vortex solitons with high topological charges that are usually prone to instabilities in conservative and dissipative systems. Propagation of the perturbed unstable vortex solitons in this system reveals unusual dynamical regimes, when instead of decay or breakup, the initial state transforms into stable vortex soliton with lower or sometimes even with higher topological charge. Our results suggest an efficient mechanism for the formation of nonlinear excited vortex-carrying states with suppressed destructive azimuthal modulational instabilities in a simple setting relevant to a wide class of systems, including polaritonic systems, structured microcavities, and lasers.
△ Less
Submitted 28 April, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
Invariant sample measures and sample statistical solutions for nonautonomous stochastic lattice Cahn-Hilliard equation with nonlinear noise
Authors:
**tao Wang,
Dongdong Zhu,
Chunqiu Li
Abstract:
We consider a stochastic lattice Cahn-Hilliard equation with nonautonomous nonlinear noise. First, we prove the existence of pullback random attractors in $\ell^2$ for the generated nonautonomous random dynamical system. Then, we construct the time-dependent invariant sample Borel probability measures based on the pullback random attractor. Moreover, we develop a general stochastic Liouville type…
▽ More
We consider a stochastic lattice Cahn-Hilliard equation with nonautonomous nonlinear noise. First, we prove the existence of pullback random attractors in $\ell^2$ for the generated nonautonomous random dynamical system. Then, we construct the time-dependent invariant sample Borel probability measures based on the pullback random attractor. Moreover, we develop a general stochastic Liouville type equation for nonautonomous random dynamical systems and show that the invariant sample measures obtained satisfy the stochastic Liouville type equation. At last, we define a new kind of statistical solution -- sample statistical solution corresponding to the invariant sample measures and show that each family of invariant sample measures is a sample statistical solution.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Towards Fast Setup and High Throughput of GPU Serverless Computing
Authors:
Han Zhao,
Weihao Cui,
Quan Chen,
Shulai Zhang,
Zijun Li,
**gwen Leng,
Chao Li,
Deze Zeng,
Minyi Guo
Abstract:
Integrating GPUs into serverless computing platforms is crucial for improving efficiency. However, existing solutions for GPU-enabled serverless computing platforms face two significant problems due to coarse-grained GPU management: long setup time and low function throughput.
To address these issues, we propose SAGE, a GPU serverless framework with fast setup and high throughput. First, based o…
▽ More
Integrating GPUs into serverless computing platforms is crucial for improving efficiency. However, existing solutions for GPU-enabled serverless computing platforms face two significant problems due to coarse-grained GPU management: long setup time and low function throughput.
To address these issues, we propose SAGE, a GPU serverless framework with fast setup and high throughput. First, based on the data knowability of GPU function ahead of actual execution, SAGE first devises the parallelized function setup mechanism, which parallelizes the data preparation and context creation. In this way, SAGE achieves fast setup of GPU function invocations.Second, SAGE further proposes the sharing-based memory management mechanism, which shares the read-only memory and context memory across multiple invocations of the same function. The memory sharing mechanism avoids repeated data preparation and then unnecessary data-loading contention. As a consequence, the function throughput could be improved. Our experimental results show that SAGE reduces function duration by 11.3X and improves function density by 1.22X compared to the state-of-the-art serverless platform.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
Authors:
Yuying Ge,
Sijie Zhao,
**guo Zhu,
Yixiao Ge,
Kun Yi,
Lin Song,
Chen Li,
Xiaohan Ding,
Ying Shan
Abstract:
The rapid evolution of multimodal foundation model has demonstrated significant progresses in vision-language understanding and generation, e.g., our previous work SEED-LLaMA. However, there remains a gap between its capability and the real-world applicability, primarily due to the model's limited capacity to effectively respond to various user instructions and interact with diverse visual data. I…
▽ More
The rapid evolution of multimodal foundation model has demonstrated significant progresses in vision-language understanding and generation, e.g., our previous work SEED-LLaMA. However, there remains a gap between its capability and the real-world applicability, primarily due to the model's limited capacity to effectively respond to various user instructions and interact with diverse visual data. In this work, we focus on bridging this gap through integrating two enhanced features: (1) comprehending images of arbitrary sizes and ratios, and (2) enabling multi-granularity image generation. We present a unified and versatile foundation model, namely, SEED-X, which is able to model multi-granularity visual semantics for comprehension and generation tasks. Besides the competitive results on public benchmarks, SEED-X demonstrates its effectiveness in handling real-world applications across various domains after instruction tuning. We hope that our work will inspire future research into what can be achieved by versatile multimodal foundation models in real-world applications. The models, codes, and datasets will be released in https://github.com/AILab-CVC/SEED-X.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Graphic Design with Large Multimodal Model
Authors:
Yutao Cheng,
Zhao Zhang,
Maoke Yang,
Hui Nie,
Chunyuan Li,
Xinglong Wu,
Jie Shao
Abstract:
In the field of graphic design, automating the integration of design elements into a cohesive multi-layered artwork not only boosts productivity but also paves the way for the democratization of graphic design. One existing practice is Graphic Layout Generation (GLG), which aims to layout sequential design elements. It has been constrained by the necessity for a predefined correct sequence of laye…
▽ More
In the field of graphic design, automating the integration of design elements into a cohesive multi-layered artwork not only boosts productivity but also paves the way for the democratization of graphic design. One existing practice is Graphic Layout Generation (GLG), which aims to layout sequential design elements. It has been constrained by the necessity for a predefined correct sequence of layers, thus limiting creative potential and increasing user workload. In this paper, we present Hierarchical Layout Generation (HLG) as a more flexible and pragmatic setup, which creates graphic composition from unordered sets of design elements. To tackle the HLG task, we introduce Graphist, the first layout generation model based on large multimodal models. Graphist efficiently reframes the HLG as a sequence generation problem, utilizing RGB-A images as input, outputs a JSON draft protocol, indicating the coordinates, size, and order of each element. We develop new evaluation metrics for HLG. Graphist outperforms prior arts and establishes a strong baseline for this field. Project homepage: https://github.com/graphic-design-ai/graphist
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
A General Continuous-Time Formulation of Stochastic ADMM and Its Variants
Authors:
Chris Junchi Li
Abstract:
Stochastic versions of the alternating direction method of multiplier (ADMM) and its variants play a key role in many modern large-scale machine learning problems. In this work, we introduce a unified algorithmic framework called generalized stochastic ADMM and investigate their continuous-time analysis. The generalized framework widely includes many stochastic ADMM variants such as standard, line…
▽ More
Stochastic versions of the alternating direction method of multiplier (ADMM) and its variants play a key role in many modern large-scale machine learning problems. In this work, we introduce a unified algorithmic framework called generalized stochastic ADMM and investigate their continuous-time analysis. The generalized framework widely includes many stochastic ADMM variants such as standard, linearized and gradient-based ADMM. Our continuous-time analysis provides us with new insights into stochastic ADMM and variants, and we rigorously prove that under some proper scaling, the trajectory of stochastic ADMM weakly converges to the solution of a stochastic differential equation with small noise. Our analysis also provides a theoretical explanation of why the relaxation parameter should be chosen between 0 and 2.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
New point of view about optical activity in helically-coiled fiber
Authors:
Chun-Fang Li,
Zhi-Juan Hu
Abstract:
The optical activity in a helically-coiled optical fiber is reexamined. It is proven that not only is there no circular birefringence in the fiber but the polarization relative to the laboratory reference frame is not rotated along the fiber. The reason for this is that in contrast with the polarization vector, the Jones vector does not give a complete description of the polarization. As a mathema…
▽ More
The optical activity in a helically-coiled optical fiber is reexamined. It is proven that not only is there no circular birefringence in the fiber but the polarization relative to the laboratory reference frame is not rotated along the fiber. The reason for this is that in contrast with the polarization vector, the Jones vector does not give a complete description of the polarization. As a mathematical entity in some local reference frame that depends on the instantaneous propagation direction, it can only describe the state of polarization relative to that reference frame. With the new implication of the Jones vector, the results of the experiment reported by Papp and Harms in 1977 are explained satisfactorily. In particular, it is shown that the state of polarization relative to the Tang frame remains unchanged along the fiber. The optical activity appears only relative to the Serret-Frenet frame.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit
Authors:
Boning Zhang,
Chengxi Li,
Kai Fan
Abstract:
Large language models (LLMs) have been explored in a variety of reasoning tasks including solving of mathematical problems. Each math dataset typically includes its own specially designed evaluation script, which, while suitable for its intended use, lacks generalizability across different datasets. Consequently, updates and adaptations to these evaluation tools tend to occur without being systema…
▽ More
Large language models (LLMs) have been explored in a variety of reasoning tasks including solving of mathematical problems. Each math dataset typically includes its own specially designed evaluation script, which, while suitable for its intended use, lacks generalizability across different datasets. Consequently, updates and adaptations to these evaluation tools tend to occur without being systematically reported, leading to inconsistencies and obstacles to fair comparison across studies. To bridge this gap, we introduce a comprehensive mathematical evaluation toolkit that not only utilizes a python computer algebra system (CAS) for its numerical accuracy, but also integrates an optional LLM, known for its considerable natural language processing capabilities. To validate the effectiveness of our toolkit, we manually annotated two distinct datasets. Our experiments demonstrate that the toolkit yields more robust evaluation results compared to prior works, even without an LLM. Furthermore, when an LLM is incorporated, there is a notable enhancement. The code for our method will be made available at \url{https://github.com/MARIO-Math-Reasoning/math_evaluation}.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Reconstructing Intrinsic Stellar Noise with Stellar Atmospheric Parameters and Chromospheric Activity
Authors:
**ghua Zhang,
Maosheng Xiang,
Jie Yu,
Jian Ge,
Ji-Wei Xie,
Hui Zhang,
Yaguang Li,
You Wu,
Chun-Qian Li,
Shaolan Bi,
Hong-Liang Yan,
Jian-Rong Shi
Abstract:
Accurately characterizing intrinsic stellar photometric noise induced by stellar astrophysics, such as stellar activity, granulation, and oscillations, is of crucial importance for detecting transiting exoplanets. In this study, we investigate the relation between the intrinsic stellar photometric noise, as quantified by the Kepler rrmsCDPP measurement, and the level of stellar chromospheric activ…
▽ More
Accurately characterizing intrinsic stellar photometric noise induced by stellar astrophysics, such as stellar activity, granulation, and oscillations, is of crucial importance for detecting transiting exoplanets. In this study, we investigate the relation between the intrinsic stellar photometric noise, as quantified by the Kepler rrmsCDPP measurement, and the level of stellar chromospheric activity, as indicated by the S-index of Ca II HK lines derived from the LAMOST spectra. Our results reveal a clear positive correlation between S-index and rrmsCDPP, and the correlation becomes more significant at higher activity levels and on longer timescales. We have therefore built an empirical relation between rrmsCDPP and S-index as well as Teff, logg, [Fe/H], and apparent magnitude with the XGBoost regression algorithm, using the LAMOST-Kepler common star sample as the training set. This method achieves a precision of ~20 ppm for inferring the intrinsic noise from the S-index and other stellar labels on a 6-hour integration duration. We have applied this empirical relation to the full LAMOST DR7 spectra database, and obtained the intrinsic noise predictions for 1,358,275 stars. The resultant catalog is publicly available and expected to be valuable for optimizing target selection for future exoplanet-hunting space missions, such as the Earth 2.0 mission.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be…
▽ More
Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be $0.38\pm0.20_\text{stat.}\pm0.01_\text{syst.}$ ($R< 0.83$ at 90\% confidence level). In addition, we measure the ratio of the average cross section of $e^+e^-\toωX(3872)$ to $e^+e^-\toωχ_{c1}(ωχ_{c2})$ to be $σ_{ωX(3872)}/σ_{ωχ_{c1}}~(σ_{ωX(3872)}/σ_{ωχ_{c2}})=5.2\pm1.0_\text{stat.}\pm1.9_\text{syst.}~ (5.5\pm1.1_\text{stat.}\pm2.4_\text{syst.})$. Finally, we search for the process of $e^+e^-\toγX(3872)$, and no obvious signal is observed. The upper limit on the ratio of the average cross section of $e^+e^-\toγX(3872)$ to $e^+e^-\toωX(3872)$ is set as $σ_{γX(3872)}/σ_{ωX(3872)}<0.23$ at 90\% confidence level.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
DesTest: A Decentralised Testing Architecture for Improving Data Accuracy of Blockchain Oracle
Authors:
Xueying Zeng,
Youquan Xian,
Chunpei Li,
Zhengdong Hu,
Peng Liu
Abstract:
Blockchain technology ensures secure and trustworthy data flow between multiple participants on the chain, but interoperability of on-chain and off-chain data has always been a difficult problem that needs to be solved. To solve the problem that blockchain systems cannot access off-chain data, oracle is introduced. however, existing research mainly focuses on the consistency and integrity of data,…
▽ More
Blockchain technology ensures secure and trustworthy data flow between multiple participants on the chain, but interoperability of on-chain and off-chain data has always been a difficult problem that needs to be solved. To solve the problem that blockchain systems cannot access off-chain data, oracle is introduced. however, existing research mainly focuses on the consistency and integrity of data, but ignores the problem that oracle nodes may be externally attacked or provide false data for selfish motives, resulting in the unresolved problem of data accuracy. In this paper, we introduce a new decentralized testing architecture (DesTest) that aims to improve data accuracy. A blockchain oracle random secret testing mechanism is first proposed to enhance the monitoring and verification of nodes by introducing a dynamic anonymized question-verification committee. Based on this, a comprehensive evaluation incentive mechanism is designed to incentivize honest work performance by evaluating nodes based on their reputation scores. The simulation results show that we successfully reduced the discrete entropy value of the acquired data and the real value of the data by 61.4%.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Uncovering Obscured Phonon Dynamics from Powder Inelastic Neutron Scattering using Machine Learning
Authors:
Yaokun Su,
Chen Li
Abstract:
The study of phonon dynamics is pivotal for understanding material properties, yet it faces challenges due to the irreversible information loss inherent in powder inelastic neutron scattering spectra and the limitations of traditional analysis methods. In this study, we present a machine learning framework designed to reveal obscured phonon dynamics from powder spectra. Using a variational autoenc…
▽ More
The study of phonon dynamics is pivotal for understanding material properties, yet it faces challenges due to the irreversible information loss inherent in powder inelastic neutron scattering spectra and the limitations of traditional analysis methods. In this study, we present a machine learning framework designed to reveal obscured phonon dynamics from powder spectra. Using a variational autoencoder, we obtain a disentangled latent representation of spectra and successfully extract force constants for reconstructing phonon dispersions. Notably, our model demonstrates effective applicability to experimental data even when trained exclusively on physics-based simulations. The fine-tuning with experimental spectra further mitigates issues arising from domain shift. Analysis of latent space underscores the model's versatility and generalizability, affirming its suitability for complex system applications. Furthermore, our framework's two-stage design is promising for develo** a universal pre-trained feature extractor. This approach has the potential to revolutionize neutron measurements of phonon dynamics, offering researchers a potent tool to decipher intricate spectra and gain valuable insights into the intrinsic physics of materials.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
3D characterization of kinematic fields and poroelastic swelling near the tip of a propagating crack in a hydrogel
Authors:
Chenzhuo Li,
Danila Zubko,
Damien Delespaul,
John M. Kolinski
Abstract:
In fracture mechanics, polyacrylamide hydrogels have been widely used as a model material for experiments, benefited from its optical transparency, fracture brittleness, and low Rayleigh wave velocity. To describe the brittle fracture in the hydrogels, linear elastic fracture mechanics comes as the first choice. However, in soft materials such as hydrogels, the crack opening can be extremely large…
▽ More
In fracture mechanics, polyacrylamide hydrogels have been widely used as a model material for experiments, benefited from its optical transparency, fracture brittleness, and low Rayleigh wave velocity. To describe the brittle fracture in the hydrogels, linear elastic fracture mechanics comes as the first choice. However, in soft materials such as hydrogels, the crack opening can be extremely large, leading to substantial geometric nonlinearity and material nonlinearity at the crack tip. Furthermore, poroelasticity may also modify the local mechanical state within the polymer network. Direct characterization of the kinematic fields and poroelastic effect at the crack tip is lacking. Here, based on a hybrid method of digital image correlation and particle tracking technique, we retrieved high-resolution 3D particle trajectories near the tip of a slowly propagating crack and measured the near-tip 3D kinematic fields, including the displacement fields, rotation fields, stretch fields, strain fields, and swelling fields. Results confirmed the complex multi-axial stretching near the crack tip and the substantial geometric nonlinearity, particularly on the two wakes of the crack where rotation exceeds $30^{\circ}$. Comparison between the measured and predicted displacement and strain fields, derived from linear elastic fracture mechanics, highlights a disagreement in the direct vicinity of the crack tip, particularly for displacement component $u_x$ and through-thickness strain component $\varepsilon_{zz}$. Significant swelling, due to the poroelastic solvent migration, is also observed, with a strong correlation to the local stretch. Our experimental method, without any assumption of the material properties, can be readily extended to study 3D crack tips in a huge varieties of materials, and our results can shed light on the fundamental fracture mechanics.
△ Less
Submitted 3 May, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Multi-feature Reconstruction Network using Crossed-mask Restoration for Unsupervised Anomaly Detection
Authors:
Junpu Wang,
Guili Xu,
Chunlei Li,
Guangshuai Gao,
Yuehua Cheng
Abstract:
Unsupervised anomaly detection using only normal samples is of great significance for quality inspection in industrial manufacturing. Although existing reconstruction-based methods have achieved promising results, they still face two problems: poor distinguishable information in image reconstruction and well abnormal regeneration caused by model over-generalization ability. To overcome the above i…
▽ More
Unsupervised anomaly detection using only normal samples is of great significance for quality inspection in industrial manufacturing. Although existing reconstruction-based methods have achieved promising results, they still face two problems: poor distinguishable information in image reconstruction and well abnormal regeneration caused by model over-generalization ability. To overcome the above issues, we convert the image reconstruction into a combination of parallel feature restorations and propose a multi-feature reconstruction network, MFRNet, using crossed-mask restoration in this paper. Specifically, a multi-scale feature aggregator is first developed to generate more discriminative hierarchical representations of the input images from a pre-trained model. Subsequently, a crossed-mask generator is adopted to randomly cover the extracted feature map, followed by a restoration network based on the transformer structure for high-quality repair of the missing regions. Finally, a hybrid loss is equipped to guide model training and anomaly estimation, which gives consideration to both the pixel and structural similarity. Extensive experiments show that our method is highly competitive with or significantly outperforms other state-of-the-arts on four public available datasets and one self-made dataset.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
FilterPrompt: Guiding Image Transfer in Diffusion Models
Authors:
Xi Wang,
Yichen Peng,
Heng Fang,
Haoran Xie,
Xi Yang,
Chuntao Li
Abstract:
In controllable generation tasks, flexibly manipulating the generated images to attain a desired appearance or structure based on a single input image cue remains a critical and longstanding challenge. Achieving this requires the effective decoupling of key attributes within the input image data, aiming to get representations accurately. Previous research has predominantly concentrated on disentan…
▽ More
In controllable generation tasks, flexibly manipulating the generated images to attain a desired appearance or structure based on a single input image cue remains a critical and longstanding challenge. Achieving this requires the effective decoupling of key attributes within the input image data, aiming to get representations accurately. Previous research has predominantly concentrated on disentangling image attributes within feature space. However, the complex distribution present in real-world data often makes the application of such decoupling algorithms to other datasets challenging. Moreover, the granularity of control over feature encoding frequently fails to meet specific task requirements. Upon scrutinizing the characteristics of various generative models, we have observed that the input sensitivity and dynamic evolution properties of the diffusion model can be effectively fused with the explicit decomposition operation in pixel space. This integration enables the image processing operations performed in pixel space for a specific feature distribution of the input image, and can achieve the desired control effect in the generated results. Therefore, we propose FilterPrompt, an approach to enhance the model control effect. It can be universally applied to any diffusion model, allowing users to adjust the representation of specific image features in accordance with task requirements, thereby facilitating more precise and controllable generation outcomes. In particular, our designed experiments demonstrate that the FilterPrompt optimizes feature correlation, mitigates content conflicts during the generation process, and enhances the model's control capability.
△ Less
Submitted 12 May, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Insights from the Gaussian Processes Method for the FRB-associated X-ray Burst of SGR 1935+2154
Authors:
Rui**g Tang,
Dahai Yan,
Haiyun Zhang,
Qingchang Zhao,
Lian Tao,
Chengkui Li,
Mingyu Ge,
Xiaobo Li,
Qianqing Yin,
Ce Cai
Abstract:
Gaussian processes method is employed to analyze the light curves of bursts detected by Insight-HXMT, NICER, and GECAM from SGR 1935+2154 between 2020 to 2022. It is found that a stochastically driven damped simple harmonic oscillator (SHO) is necessary to capture the characteristics of the X-ray bursts. Variability timescale of the X-ray bursts, corresponding to the broken frequencies in the SHO…
▽ More
Gaussian processes method is employed to analyze the light curves of bursts detected by Insight-HXMT, NICER, and GECAM from SGR 1935+2154 between 2020 to 2022. It is found that a stochastically driven damped simple harmonic oscillator (SHO) is necessary to capture the characteristics of the X-ray bursts. Variability timescale of the X-ray bursts, corresponding to the broken frequencies in the SHO power spectral densities (PSDs), are extracted. In particular, a high broken frequency of 35 Hz where the index of the SHO PSD changes from -4 to -2 is constrained by the HXMT-HE burst associated with FRB 200428. It is suggested that the corresponding timescale of 0.03 s could be the retarding timescale of the system driven by some energy release, and the production of the HE photon should be quasi-simultaneous with the response. The other special event is a NICER burst with a retarding timescale of 1/39 Hz (0.02 s). In the normal X-ray bursts, no retarding timescale is constrained; a long relax/equilibrium timescale (corresponding to a broken frequency of 1-10 Hz where the index of the SHO PSD changing from -4/-2 to 0 in the SHO PSD) is obtained. The results indicate that the FRB-associated HXMT-HE X-ray burst could be produced immediately when the system is responding to the energy disturbance, far before the equilibrium state.
△ Less
Submitted 19 June, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics
Authors:
Xiaofei Wang,
Xingxu Huang,
Stephen J. Price,
Chao Li
Abstract:
The recent advancement of spatial transcriptomics (ST) allows to characterize spatial gene expression within tissue for discovery research. However, current ST platforms suffer from low resolution, hindering in-depth understanding of spatial gene expression. Super-resolution approaches promise to enhance ST maps by integrating histology images with gene expressions of profiled tissue spots. Howeve…
▽ More
The recent advancement of spatial transcriptomics (ST) allows to characterize spatial gene expression within tissue for discovery research. However, current ST platforms suffer from low resolution, hindering in-depth understanding of spatial gene expression. Super-resolution approaches promise to enhance ST maps by integrating histology images with gene expressions of profiled tissue spots. However, current super-resolution methods are limited by restoration uncertainty and mode collapse. Although diffusion models have shown promise in capturing complex interactions between multi-modal conditions, it remains a challenge to integrate histology images and gene expression for super-resolved ST maps. This paper proposes a cross-modal conditional diffusion model for super-resolving ST maps with the guidance of histology images. Specifically, we design a multi-modal disentangling network with cross-modal adaptive modulation to utilize complementary information from histology images and spatial gene expression. Moreover, we propose a dynamic cross-attention modelling strategy to extract hierarchical cell-to-tissue information from histology images. Lastly, we propose a co-expression-based gene-correlation graph network to model the co-expression relationship of multiple genes. Experiments show that our method outperforms other state-of-the-art methods in ST super-resolution on three public datasets.
△ Less
Submitted 27 May, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Authors:
Yixiang Zhuang,
Bao** Cheng,
Yao Cheng,
Yuntao **,
Renshuai Liu,
Chengyang Li,
Xuan Cheng,
**g Liao,
Juncong Lin
Abstract:
Speech-driven facial animation methods usually contain two main classes, 3D and 2D talking face, both of which attract considerable research attention in recent years. However, to the best of our knowledge, the research on 3D talking face does not go deeper as 2D talking face, in the aspect of lip-synchronization (lip-sync) and speech perception. To mind the gap between the two sub-fields, we prop…
▽ More
Speech-driven facial animation methods usually contain two main classes, 3D and 2D talking face, both of which attract considerable research attention in recent years. However, to the best of our knowledge, the research on 3D talking face does not go deeper as 2D talking face, in the aspect of lip-synchronization (lip-sync) and speech perception. To mind the gap between the two sub-fields, we propose a learning framework named Learn2Talk, which can construct a better 3D talking face network by exploiting two expertise points from the field of 2D talking face. Firstly, inspired by the audio-video sync network, a 3D sync-lip expert model is devised for the pursuit of lip-sync between audio and 3D facial motion. Secondly, a teacher model selected from 2D talking face methods is used to guide the training of the audio-to-3D motions regression network to yield more 3D vertex accuracy. Extensive experiments show the advantages of the proposed framework in terms of lip-sync, vertex accuracy and speech perception, compared with state-of-the-arts. Finally, we show two applications of the proposed framework: audio-visual speech recognition and speech-driven 3D Gaussian Splatting based avatar animation.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Determination of the CKM angle $φ_{3}$ from a combination of Belle and Belle II results
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Aihara,
N. Akopov,
A. Aloisio,
S. Al Said,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
S. Bahinipati,
P. Bambade,
Sw. Banerjee,
S. Bansal,
M. Barrett,
J. Baudot,
A. Baur,
A. Beaubien
, et al. (377 additional authors not shown)
Abstract:
We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihoo…
▽ More
We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihood fit to obtain $φ_{3} = (78.6^{+7.2}_{-7.3})^{\circ}$. We also briefly discuss the interpretation of this result.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
On the Path to High-temperature Josephson Multi-junction Devices
Authors:
Xu Wang,
Fucong Chen,
Zefeng Lin,
Changhong Yuan,
Shibing Tian,
Chunguang Li,
Victor Kornev,
Nikolay Kolotinskiy
Abstract:
We report our progress in the high-temperature superconductor (HTS) Josephson junction fabrication process founded on using a focused helium ion beam damaging technique and discuss the expected device performance attainable with the HTS multi-junction device technology. Both the achievable high value of characteristic voltage $V_c=I_cR_N$ of Josephson junctions and the ability to design a large nu…
▽ More
We report our progress in the high-temperature superconductor (HTS) Josephson junction fabrication process founded on using a focused helium ion beam damaging technique and discuss the expected device performance attainable with the HTS multi-junction device technology. Both the achievable high value of characteristic voltage $V_c=I_cR_N$ of Josephson junctions and the ability to design a large number of arbitrary located Josephson junctions allow narrowing the existing gap in design abilities for LTS and HTS circuits even with using a single YBCO film layer. A one-layer topology of active electrically small antenna is suggested and its voltage response characteristics are considered.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
On the roles of stellar rotation and binarity in NGC 2423's main-sequence turnoff region
Authors:
Yutian Bu,
Chenyu He,
Li Wang,
Jiamao Lin,
Chengyuan Li
Abstract:
Research has shown that many young and intermediate-age clusters (younger than $\sim$2 Gyr) have extended main sequences and main-sequence turnoffs (eMSTOs), which cannot be adequately described by a single isochrone. The reason for the extended main sequences is now known, with the most probable cause being the fast rotation of stars. However, a significant fraction of slowly rotating stars form…
▽ More
Research has shown that many young and intermediate-age clusters (younger than $\sim$2 Gyr) have extended main sequences and main-sequence turnoffs (eMSTOs), which cannot be adequately described by a single isochrone. The reason for the extended main sequences is now known, with the most probable cause being the fast rotation of stars. However, a significant fraction of slowly rotating stars form a younger stellar population than their fast-rotating counterparts, leading to speculation that they have undergone thorough rotational mixing processes internally. One speculation is that a considerable number of slowly rotating stars reside in close binary systems, where tidal forces from companion stars are the cause of their rotational deceleration. In this work, we report a relatively old open star cluster in the Milky Way, NGC 2423 ($\sim$1 Gyrs old), which exhibits an apparent eMSTO. As anticipated, many characteristics of NGC 2423 indicate that its eMSTO is driven by stellar rotations. Our calculations indicate that if slowly rotating stars commonly have a close companion star, they should exhibit significant differences in radial velocities observationally, and binary systems that can be tidally locked within the age of NGC 2423 should have a mass ratio close to 1. However, none of these predictions align with our observations. Interestingly, among the only two equal-mass binary systems in the observed region for which spectroscopic data could be obtained, we discovered that one of them is a tidally locked binary system. This further suggests the validity of our numerical simulation results.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Relationship Discovery for Drug Recommendation
Authors:
Xiang Li,
Shunpan Liang,
Yu Lei,
Chen Li,
Yulei Hou,
Tengfei Ma
Abstract:
Medication recommendation systems are designed to deliver personalized drug suggestions that are closely aligned with individual patient needs. Previous studies have primarily concentrated on develo** medication embeddings, achieving significant progress. Nonetheless, these approaches often fall short in accurately reflecting individual patient profiles, mainly due to challenges in distinguishin…
▽ More
Medication recommendation systems are designed to deliver personalized drug suggestions that are closely aligned with individual patient needs. Previous studies have primarily concentrated on develo** medication embeddings, achieving significant progress. Nonetheless, these approaches often fall short in accurately reflecting individual patient profiles, mainly due to challenges in distinguishing between various patient conditions and the inability to establish precise correlations between specific conditions and appropriate medications. In response to these issues, we introduce DisMed, a model that focuses on patient conditions to enhance personalization. DisMed employs causal inference to discern clear, quantifiable causal links. It then examines patient conditions in depth, recognizing and adapting to the evolving nuances of these conditions, and map** them directly to corresponding medications. Additionally, DisMed leverages data from multiple patient visits to propose combinations of medications. Comprehensive testing on real-world datasets demonstrates that DisMed not only improves the customization of patient profiles but also surpasses leading models in both precision and safety.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Knowledge-Aware Multi-Intent Contrastive Learning for Multi-Behavior Recommendation
Authors:
Shunpan Liang,
Junjie Zhao,
Chen Li,
Yu Lei
Abstract:
Multi-behavioral recommendation optimizes user experiences by providing users with more accurate choices based on their diverse behaviors, such as view, add to cart, and purchase. Current studies on multi-behavioral recommendation mainly explore the connections and differences between multi-behaviors from an implicit perspective. Specifically, they directly model those relations using black-box ne…
▽ More
Multi-behavioral recommendation optimizes user experiences by providing users with more accurate choices based on their diverse behaviors, such as view, add to cart, and purchase. Current studies on multi-behavioral recommendation mainly explore the connections and differences between multi-behaviors from an implicit perspective. Specifically, they directly model those relations using black-box neural networks. In fact, users' interactions with items under different behaviors are driven by distinct intents. For instance, when users view products, they tend to pay greater attention to information such as ratings and brands. However, when it comes to the purchasing phase, users become more price-conscious. To tackle this challenge and data sparsity problem in the multi-behavioral recommendation, we propose a novel model: Knowledge-Aware Multi-Intent Contrastive Learning (KAMCL) model. This model uses relationships in the knowledge graph to construct intents, aiming to mine the connections between users' multi-behaviors from the perspective of intents to achieve more accurate recommendations. KAMCL is equipped with two contrastive learning schemes to alleviate the data scarcity problem and further enhance user representations. Extensive experiments on three real datasets demonstrate the superiority of our model.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Authors:
Tianyi Liang,
Jiangqi Liu,
Sicheng Song,
Shiqi Jiang,
Yifei Huang,
Changbo Wang,
Chenhui Li
Abstract:
Recent advancements in Text-to-image (T2I) generation have witnessed a shift from adapting text to fixed backgrounds to creating images around text. Traditional approaches are often limited to generate layouts within static images for effective text placement. Our proposed approach, TextCenGen, introduces a dynamic adaptation of the blank region for text-friendly image generation, emphasizing text…
▽ More
Recent advancements in Text-to-image (T2I) generation have witnessed a shift from adapting text to fixed backgrounds to creating images around text. Traditional approaches are often limited to generate layouts within static images for effective text placement. Our proposed approach, TextCenGen, introduces a dynamic adaptation of the blank region for text-friendly image generation, emphasizing text-centric design and visual harmony generation. Our method employs force-directed attention guidance in T2I models to generate images that strategically reserve whitespace for pre-defined text areas, even for text or icons at the golden ratio. Observing how cross-attention maps affect object placement, we detect and repel conflicting objects using a force-directed graph approach, combined with a Spatial Excluding Cross-Attention Constraint for smooth attention in whitespace areas. As a novel task in graphic design, experiments indicate that TextCenGen outperforms existing methods with more harmonious compositions. Furthermore, our method significantly enhances T2I model outcomes on our specially collected prompt datasets, catering to varied text positions. These results demonstrate the efficacy of TextCenGen in creating more harmonious and integrated text-image compositions.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
A class of maximum-based iteration methods for the generalized absolute value equation
Authors:
Shiliang Wu,
Deren Han,
Cuixia Li
Abstract:
In this paper, by using $|x|=2\max\{0,x\}-x$, a class of maximum-based iteration methods is established to solve the generalized absolute value equation $Ax-B|x|=b$. Some convergence conditions of the proposed method are presented. By some numerical experiments, the effectiveness and feasibility of the proposed method are confirmed.
In this paper, by using $|x|=2\max\{0,x\}-x$, a class of maximum-based iteration methods is established to solve the generalized absolute value equation $Ax-B|x|=b$. Some convergence conditions of the proposed method are presented. By some numerical experiments, the effectiveness and feasibility of the proposed method are confirmed.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
LLMTune: Accelerate Database Knob Tuning with Large Language Models
Authors:
Xinmei Huang,
Haoyang Li,
**g Zhang,
Xinxin Zhao,
Zhiming Yao,
Yiyan Li,
Zhuohao Yu,
Tieying Zhang,
Hong Chen,
Cui** Li
Abstract:
Database knob tuning is a critical challenge in the database community, aiming to optimize knob values to enhance database performance for specific workloads. DBMS often feature hundreds of tunable knobs, posing a significant challenge for DBAs to recommend optimal configurations. Consequently, many machine learning-based tuning methods have been developed to automate this process. Despite the int…
▽ More
Database knob tuning is a critical challenge in the database community, aiming to optimize knob values to enhance database performance for specific workloads. DBMS often feature hundreds of tunable knobs, posing a significant challenge for DBAs to recommend optimal configurations. Consequently, many machine learning-based tuning methods have been developed to automate this process. Despite the introduction of various optimizers, practical applications have unveiled a new problem: they typically require numerous workload runs to achieve satisfactory performance, a process that is both time-consuming and resource-intensive. This inefficiency largely stems from the optimal configuration often being substantially different from the default setting, necessitating multiple iterations during tuning. Recognizing this, we argue that an effective starting point could significantly reduce redundant exploration in less efficient areas, thereby potentially speeding up the tuning process for the optimizers. Based on this assumption, we introduce LLMTune, a large language model-based configuration generator designed to produce an initial, high-quality configuration for new workloads. These generated configurations can then serve as starting points for various base optimizers, accelerating their tuning processes. To obtain training data for LLMTune's supervised fine-tuning, we have devised a new automatic data generation framework capable of efficiently creating a large number of <workload, configuration> pairs. We have conducted thorough experiments to evaluate LLMTune's effectiveness with different workloads, such as TPC-H and JOB. In comparison to leading methods, LLMTune demonstrates a quicker ability to identify superior configurations. For instance, with the challenging TPC-H workload, our LLMTune achieves a significant 15.6x speed-up ratio in finding the best-performing configurations.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
Authors:
Buzhen Huang,
Chen Li,
Chongyang Xu,
Liang Pan,
Yangang Wang,
Gim Hee Lee
Abstract:
Existing multi-person human reconstruction approaches mainly focus on recovering accurate poses or avoiding penetration, but overlook the modeling of close interactions. In this work, we tackle the task of reconstructing closely interactive humans from a monocular video. The main challenge of this task comes from insufficient visual information caused by depth ambiguity and severe inter-person occ…
▽ More
Existing multi-person human reconstruction approaches mainly focus on recovering accurate poses or avoiding penetration, but overlook the modeling of close interactions. In this work, we tackle the task of reconstructing closely interactive humans from a monocular video. The main challenge of this task comes from insufficient visual information caused by depth ambiguity and severe inter-person occlusion. In view of this, we propose to leverage knowledge from proxemic behavior and physics to compensate the lack of visual information. This is based on the observation that human interaction has specific patterns following the social proxemics. Specifically, we first design a latent representation based on Vector Quantised-Variational AutoEncoder (VQ-VAE) to model human interaction. A proxemics and physics guided diffusion model is then introduced to denoise the initial distribution. We design the diffusion model as dual branch with each branch representing one individual such that the interaction can be modeled via cross attention. With the learned priors of VQ-VAE and physical constraint as the additional information, our proposed approach is capable of estimating accurate poses that are also proxemics and physics plausible. Experimental results on Hi4D, 3DPW, and CHI3D demonstrate that our method outperforms existing approaches. The code is available at \url{https://github.com/boycehbz/HumanInteraction}.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation
Authors:
Ye Bai,
Chenxing Li,
Hao Li,
Yuanyuan Zhao,
Xiaorui Wang
Abstract:
In short video and live broadcasts, speech, singing voice, and background music often overlap and obscure each other. This complexity creates difficulties in structuring and recognizing the audio content, which may impair subsequent ASR and music understanding applications. This paper proposes a multi-task audio source separation (MTASS) based ASR model called JRSV, which Jointly Recognizes Speech…
▽ More
In short video and live broadcasts, speech, singing voice, and background music often overlap and obscure each other. This complexity creates difficulties in structuring and recognizing the audio content, which may impair subsequent ASR and music understanding applications. This paper proposes a multi-task audio source separation (MTASS) based ASR model called JRSV, which Jointly Recognizes Speech and singing Voices. Specifically, the MTASS module separates the mixed audio into distinct speech and singing voice tracks while removing background music. The CTC/attention hybrid recognition module recognizes both tracks. Online distillation is proposed to improve the robustness of recognition further. To evaluate the proposed methods, a benchmark dataset is constructed and released. Experimental results demonstrate that JRSV can significantly improve recognition accuracy on each track of the mixed audio.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
The maximum number of cliques in graphs with given fractional matching number and minimum degree
Authors:
Chengli Li,
Yurui Tang
Abstract:
Recently, Ma, Qian and Shi determined the maximum size of an $n$-vertex graph with given fractional matching number $s$ and maximum degree at most $d$. Motivated by this result, we determine the maximum number of $\ell$-cliques in a graph with given fractional matching number and minimum degree, which generalizes Shi and Ma's result about the maximum size of a graph with given fractional matching…
▽ More
Recently, Ma, Qian and Shi determined the maximum size of an $n$-vertex graph with given fractional matching number $s$ and maximum degree at most $d$. Motivated by this result, we determine the maximum number of $\ell$-cliques in a graph with given fractional matching number and minimum degree, which generalizes Shi and Ma's result about the maximum size of a graph with given fractional matching number and minimum degree at least one. We also determine the maximum number of complete bipartite graphs in a graph with prescribed fractional matching number and minimum degree.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty
Authors:
Changbin Li,
Kangshuo Li,
Yuzhe Ou,
Lance M. Kaplan,
Audun Jøsang,
**-Hee Cho,
Dong Hyun Jeong,
Feng Chen
Abstract:
Deep neural networks (DNNs) have been shown to perform well on exclusive, multi-class classification tasks. However, when different classes have similar visual features, it becomes challenging for human annotators to differentiate them. This scenario necessitates the use of composite class labels. In this paper, we propose a novel framework called Hyper-Evidential Neural Network (HENN) that explic…
▽ More
Deep neural networks (DNNs) have been shown to perform well on exclusive, multi-class classification tasks. However, when different classes have similar visual features, it becomes challenging for human annotators to differentiate them. This scenario necessitates the use of composite class labels. In this paper, we propose a novel framework called Hyper-Evidential Neural Network (HENN) that explicitly models predictive uncertainty due to composite class labels in training data in the context of the belief theory called Subjective Logic (SL). By placing a grouped Dirichlet distribution on the class probabilities, we treat predictions of a neural network as parameters of hyper-subjective opinions and learn the network that collects both single and composite evidence leading to these hyper-opinions by a deterministic DNN from data. We introduce a new uncertainty type called vagueness originally designed for hyper-opinions in SL to quantify composite classification uncertainty for DNNs. Our results demonstrate that HENN outperforms its state-of-the-art counterparts based on four image datasets. The code and datasets are available at: https://github.com/Hugo101/HyperEvidentialNN.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Authors:
Sicheng Xu,
Guojun Chen,
Yu-Xiao Guo,
Jiaolong Yang,
Chong Li,
Zhenyu Zang,
Yizhong Zhang,
Xin Tong,
Baining Guo
Abstract:
We introduce VASA, a framework for generating lifelike talking faces with appealing visual affective skills (VAS) given a single static image and a speech audio clip. Our premiere model, VASA-1, is capable of not only producing lip movements that are exquisitely synchronized with the audio, but also capturing a large spectrum of facial nuances and natural head motions that contribute to the percep…
▽ More
We introduce VASA, a framework for generating lifelike talking faces with appealing visual affective skills (VAS) given a single static image and a speech audio clip. Our premiere model, VASA-1, is capable of not only producing lip movements that are exquisitely synchronized with the audio, but also capturing a large spectrum of facial nuances and natural head motions that contribute to the perception of authenticity and liveliness. The core innovations include a holistic facial dynamics and head movement generation model that works in a face latent space, and the development of such an expressive and disentangled face latent space using videos. Through extensive experiments including evaluation on a set of new metrics, we show that our method significantly outperforms previous methods along various dimensions comprehensively. Our method not only delivers high video quality with realistic facial and head dynamics but also supports the online generation of 512x512 videos at up to 40 FPS with negligible starting latency. It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior
Authors:
Yiqian Wu,
Hao Xu,
Xiangjun Tang,
Xien Chen,
Siyu Tang,
Zhebin Zhang,
Chen Li,
Xiaogang **
Abstract:
Existing neural rendering-based text-to-3D-portrait generation methods typically make use of human geometry prior and diffusion models to obtain guidance. However, relying solely on geometry information introduces issues such as the Janus problem, over-saturation, and over-smoothing. We present Portrait3D, a novel neural rendering-based framework with a novel joint geometry-appearance prior to ach…
▽ More
Existing neural rendering-based text-to-3D-portrait generation methods typically make use of human geometry prior and diffusion models to obtain guidance. However, relying solely on geometry information introduces issues such as the Janus problem, over-saturation, and over-smoothing. We present Portrait3D, a novel neural rendering-based framework with a novel joint geometry-appearance prior to achieve text-to-3D-portrait generation that overcomes the aforementioned issues. To accomplish this, we train a 3D portrait generator, 3DPortraitGAN-Pyramid, as a robust prior. This generator is capable of producing 360° canonical 3D portraits, serving as a starting point for the subsequent diffusion-based generation process. To mitigate the "grid-like" artifact caused by the high-frequency information in the feature-map-based 3D representation commonly used by most 3D-aware GANs, we integrate a novel pyramid tri-grid 3D representation into 3DPortraitGAN-Pyramid. To generate 3D portraits from text, we first project a randomly generated image aligned with the given prompt into the pre-trained 3DPortraitGAN-Pyramid's latent space. The resulting latent code is then used to synthesize a pyramid tri-grid. Beginning with the obtained pyramid tri-grid, we use score distillation sampling to distill the diffusion model's knowledge into the pyramid tri-grid. Following that, we utilize the diffusion model to refine the rendered images of the 3D portrait and then use these refined images as training data to further optimize the pyramid tri-grid, effectively eliminating issues with unrealistic color and unnatural artifacts. Our experimental results show that Portrait3D can produce realistic, high-quality, and canonical 3D portraits that align with the prompt.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Future Language Modeling from Temporal Document History
Authors:
Changmao Li,
Jeffrey Flanigan
Abstract:
Predicting the future is of great interest across many aspects of human activity. Businesses are interested in future trends, traders are interested in future stock prices, and companies are highly interested in future technological breakthroughs. While there are many automated systems for predicting future numerical data, such as weather, stock prices, and demand for products, there is relatively…
▽ More
Predicting the future is of great interest across many aspects of human activity. Businesses are interested in future trends, traders are interested in future stock prices, and companies are highly interested in future technological breakthroughs. While there are many automated systems for predicting future numerical data, such as weather, stock prices, and demand for products, there is relatively little work in automatically predicting textual data. Humans are interested in textual data predictions because it is a natural format for our consumption, and experts routinely make predictions in a textual format (Christensen et al., 2004; Tetlock & Gardner, 2015; Frick, 2015). However, there has been relatively little formalization of this general problem in the machine learning or natural language processing communities. To address this gap, we introduce the task of future language modeling: probabilistic modeling of texts in the future based on a temporal history of texts. To our knowledge, our work is the first work to formalize the task of predicting the future in this way. We show that it is indeed possible to build future language models that improve upon strong non-temporal language model baselines, opening the door to working on this important, and widely applicable problem.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Calibration of the Cryogenic Measurement System of a Resonant Haloscope Cavity
Authors:
Dong He,
Jie Fan,
Xin Gao,
Yu Gao,
Nick Houston,
Zhongqing Ji,
Yirong **,
Chuang Li,
**mian Li,
Tianjun Li,
Shi-hang Liu,
Jia-Shu Niu,
Zhihui Peng,
Liang Sun,
Zheng Sun,
Jia Wang,
Puxian Wei,
Lina Wu,
Zhongchen Xiang,
Qiaoli Yang,
Chi Zhang,
Wenxing Zhang,
Xin Zhang,
Dongning Zheng,
Ruifeng Zheng
, et al. (1 additional authors not shown)
Abstract:
Possible light bosonic dark matter interactions with the Standard Model photon have been searched by microwave resonant cavities. In this paper, we demonstrate the cryogenic readout system calibration of a 7.138 GHz copper cavity with a loaded quality factor $Q_l=10^4$, operated at 22 mK temperature based on a dilution refrigerator. Our readout system consists of High Electron Mobility Transistors…
▽ More
Possible light bosonic dark matter interactions with the Standard Model photon have been searched by microwave resonant cavities. In this paper, we demonstrate the cryogenic readout system calibration of a 7.138 GHz copper cavity with a loaded quality factor $Q_l=10^4$, operated at 22 mK temperature based on a dilution refrigerator. Our readout system consists of High Electron Mobility Transistors as cryogenic amplifiers at 4 K, plus room-temperature amplifiers and a spectrum analyzer for signal power detection. We test the system with a superconducting two-level system as a single-photon source in the microwave frequency regime and report an overall 95.6 dB system gain and -71.4 dB attenuation in the cavity's input channel. The effective noise temperature of the measurement system is 7.5 K.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development
Authors:
Xiaohui Duan,
Yuxuan Li,
Zhao Liu,
Bin Yang,
Juepeng Zheng,
Haohuan Fu,
Shaoqing Zhang,
Shiming Xu,
Yang Gao,
Wei Xue,
Di Wei,
Xiao**g Lv,
Lifeng Yan,
Haopeng Huang,
Haitian Lu,
Lingfeng Wan,
Haoran Lin,
Qixin Chang,
Chenlin Li,
Quanjie He,
Zeyu Song,
Xuantong Wang,
Yangyang Yu,
Xilong Fan,
Zhaopeng Qu
, et al. (16 additional authors not shown)
Abstract:
With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries t…
▽ More
With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries to minimizes manual code modifications, our project tries to achieve both improvement of performance and consistency of the model code. By using a hierarchical grid system and an OpenMP-based offloading toolkit, our porting and parallelization effort covers over 80% of the code, and achieves a simulation speed of 340 SDPD (simulated days per day) for 5-km atmosphere, 265 SDPD for 3-km ocean, and 222 SDPD for a coupled model, thus making multi-year or even multi-decadal experiments at such high resolution possible.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Picturing the Gap Between the Performance and US-DOE's Hydrogen Storage Target: A Data-Driven Model for MgH2 Dehydrogenation
Authors:
Chaoqun Li,
Weijie Yang,
Hao Liu,
Xinyuan Liu,
Xiu**g Xing,
Zhengyang Gao,
Shuai Dong,
Hao Li
Abstract:
Develo** solid-state hydrogen storage materials is as pressing as ever, which requires a comprehensive understanding of the dehydrogenation chemistry of a solid-state hydride. Transition state search and kinetics calculations are essential to understanding and designing high-performance solid-state hydrogen storage materials by filling in the knowledge gap that current experimental techniques ca…
▽ More
Develo** solid-state hydrogen storage materials is as pressing as ever, which requires a comprehensive understanding of the dehydrogenation chemistry of a solid-state hydride. Transition state search and kinetics calculations are essential to understanding and designing high-performance solid-state hydrogen storage materials by filling in the knowledge gap that current experimental techniques cannot measure. However, the ab initio analysis of these processes is computationally expensive and time-consuming. Searching for descriptors to accurately predict the energy barrier is urgently needed, to accelerate the prediction of hydrogen storage material properties and identify the opportunities and challenges in this field. Herein, we develop a data-driven model to describe and predict the dehydrogenation barriers of a typical solid-state hydrogen storage material, magnesium hydride (MgH2), based on the combination of the crystal Hamilton population orbital of Mg-H bond and the distance between atomic hydrogen. By deriving the distance energy ratio, this model elucidates the key chemistry of the reaction kinetics. All the parameters in this model can be directly calculated with significantly less computational cost than conventional transition state search, so that the dehydrogenation performance of hydrogen storage materials can be predicted efficiently. Finally, we found that this model leads to excellent agreement with typical experimental measurements reported to date and provides clear design guidelines on how to propel the performance of MgH2 closer to the target set by the United States Department of Energy (US-DOE).
△ Less
Submitted 29 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Authors:
Zelin Wu,
Gan Song,
Christopher Li,
Pat Rondon,
Zhong Meng,
Xavier Velez,
Weiran Wang,
Diamantino Caseiro,
Golan Pundak,
Tsendsuren Munkhdalai,
Angad Chandorkar,
Rohit Prabhavalkar
Abstract:
Contextual biasing enables speech recognizers to transcribe important phrases in the speaker's context, such as contact names, even if they are rare in, or absent from, the training data. Attention-based biasing is a leading approach which allows for full end-to-end cotraining of the recognizer and biasing system and requires no separate inference-time components. Such biasers typically consist of…
▽ More
Contextual biasing enables speech recognizers to transcribe important phrases in the speaker's context, such as contact names, even if they are rare in, or absent from, the training data. Attention-based biasing is a leading approach which allows for full end-to-end cotraining of the recognizer and biasing system and requires no separate inference-time components. Such biasers typically consist of a context encoder; followed by a context filter which narrows down the context to apply, improving per-step inference time; and, finally, context application via cross attention. Though much work has gone into optimizing per-frame performance, the context encoder is at least as important: recognition cannot begin before context encoding ends. Here, we show the lightweight phrase selection pass can be moved before context encoding, resulting in a speedup of up to 16.1 times and enabling biasing to scale to 20K phrases with a maximum pre-decoding delay under 33ms. With the addition of phrase- and wordpiece-level cross-entropy losses, our technique also achieves up to a 37.5% relative WER reduction over the baseline without the losses and lightweight phrase selection pass.
△ Less
Submitted 23 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
An Efficient Evolutionary Algorithm for Diversified Top-k (Weight) Clique Search Problems
Authors:
Jiongzhi Zheng,
**ghui Xue,
Kun He,
Chu-Min Li,
Yanli Liu
Abstract:
In many real-world problems and applications, finding only a single element, even though the best, among all possible candidates, cannot fully meet the requirements. We may wish to have a collection where each individual is not only outstanding but also distinctive. Diversified Top-k (DTk) problems are a kind of combinatorial optimization problem for finding such a promising collection of multiple…
▽ More
In many real-world problems and applications, finding only a single element, even though the best, among all possible candidates, cannot fully meet the requirements. We may wish to have a collection where each individual is not only outstanding but also distinctive. Diversified Top-k (DTk) problems are a kind of combinatorial optimization problem for finding such a promising collection of multiple sub-structures, such as subgraphs like cliques and social communities. In this paper, we address two representative and practical DTk problems, DTk Clique search (DTkC) and DTk Weight Clique search (DTkWC), and propose an efficient algorithm called Diversified Top-k Evolutionary AlgorithM (DiverTEAM) for these two problems. DiverTEAM consists of a local search algorithm, which focuses on generating high-quality and diverse individuals and sub-structures, and a genetic algorithm that makes individuals work as a team and converge to (near-)optima efficiently. Extensive experiments show that DiverTEAM exhibits an excellent and robust performance across various benchmarks of DTkC and DTkWC.
△ Less
Submitted 19 January, 2024;
originally announced April 2024.
-
First Search for Light Fermionic Dark Matter Absorption on Electrons Using Germanium Detector in CDEX-10 Experiment
Authors:
J. X. Liu,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
Y. H. Chen,
J. P. Cheng,
W. H. Dai,
Z. Deng,
C. H. Fang,
X. P. Geng,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
J. W. Hu,
H. X. Huang,
T. C. Huang,
L. Jiang,
S. Karmakar
, et al. (61 additional authors not shown)
Abstract:
We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present ne…
▽ More
We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present new constraints of cross section in the DM range of 0.1--10 keV/$c^2$ for vector and axial-vector interaction. The upper limit on the cross section is set to be $\rm 5.5\times10^{-46}~cm^2$ for vector interaction, and $\rm 1.8\times10^{-46}~cm^2$ for axial-vector interaction at DM mass of 5 keV/$c^2$.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
State Space Model for New-Generation Network Alternative to Transformers: A Survey
Authors:
Xiao Wang,
Shiao Wang,
Yuhe Ding,
Yuehang Li,
Wentao Wu,
Yao Rong,
Weizhe Kong,
Ju Huang,
Shihao Li,
Haoxiang Yang,
Ziwen Wang,
Bo Jiang,
Chenglong Li,
Yaowei Wang,
Yonghong Tian,
** Tang
Abstract:
In the post-deep learning era, the Transformer architecture has demonstrated its powerful performance across pre-trained big models and various downstream tasks. However, the enormous computational demands of this architecture have deterred many researchers. To further reduce the complexity of attention models, numerous efforts have been made to design more efficient methods. Among them, the State…
▽ More
In the post-deep learning era, the Transformer architecture has demonstrated its powerful performance across pre-trained big models and various downstream tasks. However, the enormous computational demands of this architecture have deterred many researchers. To further reduce the complexity of attention models, numerous efforts have been made to design more efficient methods. Among them, the State Space Model (SSM), as a possible replacement for the self-attention based Transformer model, has drawn more and more attention in recent years. In this paper, we give the first comprehensive review of these works and also provide experimental comparisons and analysis to better demonstrate the features and advantages of SSM. Specifically, we first give a detailed description of principles to help the readers quickly capture the key ideas of SSM. After that, we dive into the reviews of existing SSMs and their various applications, including natural language processing, computer vision, graph, multi-modal and multi-media, point cloud/event stream, time series data, and other domains. In addition, we give statistical comparisons and analysis of these models and hope it helps the readers to understand the effectiveness of different structures on various tasks. Then, we propose possible research points in this direction to better promote the development of the theoretical model and application of SSM. More related works will be continuously updated on the following GitHub: https://github.com/Event-AHU/Mamba_State_Space_Model_Paper_List.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Observation of $D \to a_{0}(980)π$ in the decays $D^{0} \rightarrow π^{+}π^{-}η$ and $D^{+} \rightarrow π^{+}π^{0}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
We report the first amplitude analysis of the decays $D^{0} \to π^{+} π^{-} η$ and $D^{+} \rightarrow π^{+}π^{0}η$ using a data sample taken with the BESIII detector at the center-of-mass energy of 3.773 GeV, corresponding to an integrated luminosity of 7.9 ${\rm fb}^{-1}$. The contribution from the process $D^{0(+)} \to a_{0}(980)^{+} π^{-(0)}$ is significantly larger than the…
▽ More
We report the first amplitude analysis of the decays $D^{0} \to π^{+} π^{-} η$ and $D^{+} \rightarrow π^{+}π^{0}η$ using a data sample taken with the BESIII detector at the center-of-mass energy of 3.773 GeV, corresponding to an integrated luminosity of 7.9 ${\rm fb}^{-1}$. The contribution from the process $D^{0(+)} \to a_{0}(980)^{+} π^{-(0)}$ is significantly larger than the $D^{0(+)} \to a_{0}(980)^{-(0)} π^{+}$ contribution. The ratios $\mathcal{B}(D^{0} \rightarrow a_{0}(980)^{+}π^{-})/\mathcal{B}(D^{0} \rightarrow a_{0}(980)^{-}π^{+})$ and $\mathcal{B}(D^{+} \rightarrow a_{0}(980)^{+}π^{0})/\mathcal{B}(D^{+} \rightarrow a_{0}(980)^{0}π^{+})$ are measured to be $7.5^{+2.5}_{-0.8\,\mathrm{stat.}}\pm1.7_{\mathrm{syst.}}$ and $2.6\pm0.6_{\mathrm{stat.}}\pm0.3_{\mathrm{syst.}}$, respectively. The measured $D^{0}$ ratio disagrees with the theoretical predictions by orders of magnitudes, thus implying a substantial contribution from final-state interactions.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
Flat Band Josephson Junctions with Quantum Metric
Authors:
Zhong C. F. Li,
Yuxuan Deng,
Shuai A. Chen,
Dmitri K. Efetov,
K. T. Law
Abstract:
In this work, we consider superconductor/flat band material/superconductor (S/FB/S) Josephson junctions (JJs) where the flat band material possesses isolated flat bands with exactly zero Fermi velocity. Contrary to conventional S/N/S JJs where the critical Josephson current vanishes when the Fermi velocity goes to zero, we show in this work that the critical current in the S/FB/S junction is contr…
▽ More
In this work, we consider superconductor/flat band material/superconductor (S/FB/S) Josephson junctions (JJs) where the flat band material possesses isolated flat bands with exactly zero Fermi velocity. Contrary to conventional S/N/S JJs where the critical Josephson current vanishes when the Fermi velocity goes to zero, we show in this work that the critical current in the S/FB/S junction is controlled by the quantum metric length $ξ_\mathrm{QM}$ of the flat bands. Microscopically, when $ξ_\mathrm{QM}$ of the flat band is long enough, the interface bound states originally localized at the two S/FB, FB/S interfaces can penetrate deeply into the flat band material and hybridize to form Andreev bound states (ABSs). These ABSs are able to carry long range and sizable supercurrents. Importantly, $ξ_\mathrm{QM}$ also controls how far the proximity effect can penetrate into the flat band material. This stands in sharp contrast to the de Gennes' theory for S/N junctions which predicts that the proximity effect is expected to be zero when the Fermi velocity of the normal metal is zero. We further suggest that the S/FB/S junctions would give rise to a new type of resonant Josephson transistors which can carry sizable and highly gate-tunable supercurrent.
△ Less
Submitted 13 June, 2024; v1 submitted 14 April, 2024;
originally announced April 2024.
-
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Authors:
Binghua Li,
Jie Mao,
Zhe Sun,
Chao Li,
Qibin Zhao,
Toshihisa Tanaka
Abstract:
Automated diagnosis with artificial intelligence has emerged as a promising area in the realm of medical imaging, while the interpretability of the introduced deep neural networks still remains an urgent concern. Although contemporary works, such as XProtoNet and MProtoNet, has sought to design interpretable prediction models for the issue, the localization precision of their resulting attribution…
▽ More
Automated diagnosis with artificial intelligence has emerged as a promising area in the realm of medical imaging, while the interpretability of the introduced deep neural networks still remains an urgent concern. Although contemporary works, such as XProtoNet and MProtoNet, has sought to design interpretable prediction models for the issue, the localization precision of their resulting attribution maps can be further improved. To this end, we propose a Multi-scale Attentive Prototypical part Network, termed MAProtoNet, to provide more precise maps for attribution. Specifically, we introduce a concise multi-scale module to merge attentive features from quadruplet attention layers, and produces attribution maps. The proposed quadruplet attention layers can enhance the existing online class activation map** loss via capturing interactions between the spatial and channel dimension, while the multi-scale module then fuses both fine-grained and coarse-grained information for precise maps generation. We also apply a novel multi-scale map** loss for supervision on the proposed multi-scale module. Compared to existing interpretable prototypical part networks in medical imaging, MAProtoNet can achieve state-of-the-art performance in localization on brain tumor segmentation (BraTS) datasets, resulting in approximately 4% overall improvement on activation precision score (with a best score of 85.8%), without using additional annotated labels of segmentation. Our code will be released in https://github.com/TUAT-Novice/maprotonet.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
SpectralMamba: Efficient Mamba for Hyperspectral Image Classification
Authors:
**g Yao,
Danfeng Hong,
Chenyu Li,
Jocelyn Chanussot
Abstract:
Recurrent neural networks and Transformers have recently dominated most applications in hyperspectral (HS) imaging, owing to their capability to capture long-range dependencies from spectrum sequences. However, despite the success of these sequential architectures, the non-ignorable inefficiency caused by either difficulty in parallelization or computationally prohibitive attention still hinders t…
▽ More
Recurrent neural networks and Transformers have recently dominated most applications in hyperspectral (HS) imaging, owing to their capability to capture long-range dependencies from spectrum sequences. However, despite the success of these sequential architectures, the non-ignorable inefficiency caused by either difficulty in parallelization or computationally prohibitive attention still hinders their practicality, especially for large-scale observation in remote sensing scenarios. To address this issue, we herein propose SpectralMamba -- a novel state space model incorporated efficient deep learning framework for HS image classification. SpectralMamba features the simplified but adequate modeling of HS data dynamics at two levels. First, in spatial-spectral space, a dynamical mask is learned by efficient convolutions to simultaneously encode spatial regularity and spectral peculiarity, thus attenuating the spectral variability and confusion in discriminative representation learning. Second, the merged spectrum can then be efficiently operated in the hidden state space with all parameters learned input-dependent, yielding selectively focused responses without reliance on redundant attention or imparallelizable recurrence. To explore the room for further computational downsizing, a piece-wise scanning mechanism is employed in-between, transferring approximately continuous spectrum into sequences with squeezed length while maintaining short- and long-term contextual profiles among hundreds of bands. Through extensive experiments on four benchmark HS datasets acquired by satellite-, aircraft-, and UAV-borne imagers, SpectralMamba surprisingly creates promising win-wins from both performance and efficiency perspectives.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Unconventional superconducting diode effects via antisymmetry and antisymmetry breaking
Authors:
Chong Li,
Yang-Yang Lyu,
Wen-Cheng Yue,
Peiyuan Huang,
Haojie Li,
Tianyu Li,
Chen-Guang Wang,
Zixiong Yuan,
Ying Dong,
Xiaoyu Ma,
Xuecou Tu,
Tao Tao,
Sining Dong,
Liang He,
Xiaoqing Jia,
Guozhu Sun,
Lin Kang,
Huabing Wang,
Francois M. Peeters,
Milorad V. Milošević,
Peiheng Wu,
Yong-Lei Wang
Abstract:
Symmetry-breaking plays a pivotal role in unlocking intriguing properties and functionalities in material systems. For example, the breaking of spatial and temporal symmetries leads to a fascinating phenomenon of superconducting diode effect. However, generating and precisely controlling the superconducting diode effect poses significant challenges. Here, we take a novel route with deliberate mani…
▽ More
Symmetry-breaking plays a pivotal role in unlocking intriguing properties and functionalities in material systems. For example, the breaking of spatial and temporal symmetries leads to a fascinating phenomenon of superconducting diode effect. However, generating and precisely controlling the superconducting diode effect poses significant challenges. Here, we take a novel route with deliberate manipulation of magnetic charge potentials to realize unconventional superconducting flux-quantum diode effects. We achieve this through suitably tailored nanoengineered arrays of nanobar magnets on top of a superconducting thin film. We demonstrate the vital roles of inversion antisymmetry and its breaking in evoking unconventional superconducting effects-a magnetically symmetric diode effect and an odd-parity magnetotransport effect. These effects are non-volatilely controllable through in-situ magnetization switching of the nanobar magnets. Our findings promote the use of antisymmetry (breaking) for initiating unconventional superconducting properties, paving the way for exciting prospects and innovative functionalities in superconducting electronics.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework
Authors:
Junyi Mei,
Shixuan Sun,
Chao Li,
Cheng Xu,
Cheng Chen,
Yibo Liu,
**g Wang,
Cheng Zhao,
Xiaofeng Hou,
Minyi Guo,
Bingsheng He,
Xiaoliang Cong
Abstract:
Dynamic graph random walk (DGRW) emerges as a practical tool for capturing structural relations within a graph. Effectively executing DGRW on GPU presents certain challenges. First, existing sampling methods demand a pre-processing buffer, causing substantial space complexity. Moreover, the power-law distribution of graph vertex degrees introduces workload imbalance issues, rendering DGRW embarras…
▽ More
Dynamic graph random walk (DGRW) emerges as a practical tool for capturing structural relations within a graph. Effectively executing DGRW on GPU presents certain challenges. First, existing sampling methods demand a pre-processing buffer, causing substantial space complexity. Moreover, the power-law distribution of graph vertex degrees introduces workload imbalance issues, rendering DGRW embarrassed to parallelize. In this paper, we propose FlowWalker, a GPU-based dynamic graph random walk framework. FlowWalker implements an efficient parallel sampling method to fully exploit the GPU parallelism and reduce space complexity. Moreover, it employs a sampler-centric paradigm alongside a dynamic scheduling strategy to handle the huge amounts of walking queries. FlowWalker stands as a memory-efficient framework that requires no auxiliary data structures in GPU global memory. We examine the performance of FlowWalker extensively on ten datasets, and experiment results show that FlowWalker achieves up to 752.2x, 72.1x, and 16.4x speedup compared with existing CPU, GPU, and FPGA random walk frameworks, respectively. Case study shows that FlowWalker diminishes random walk time from 35% to 3% in a pipeline of ByteDance friend recommendation GNN training.
△ Less
Submitted 26 April, 2024; v1 submitted 12 April, 2024;
originally announced April 2024.
-
The Origin of Young Stellar Populations in NGC 1783: Accretion of External Stars
Authors:
Li Wang,
Licai Deng,
Xiaoying Pang,
Long Wang,
Richard de Grijs,
Antonino P. Milone,
Chengyuan Li
Abstract:
The presence of young stellar populations in the Large Magellanic Cloud cluster NGC 1783 has caught significant attention, with suggestions ranging from it being a genuine secondary stellar generation to a population of blue straggler stars or simply contamination from background stars. Thanks to multi-epoch observations with the Hubble Space Telescope, proper motions for stars within the field of…
▽ More
The presence of young stellar populations in the Large Magellanic Cloud cluster NGC 1783 has caught significant attention, with suggestions ranging from it being a genuine secondary stellar generation to a population of blue straggler stars or simply contamination from background stars. Thanks to multi-epoch observations with the Hubble Space Telescope, proper motions for stars within the field of NGC 1783 have been derived, thus allowing accurate cluster membership determination. Here, we report that the younger stars within NGC 1783 indeed belong to the cluster, and their spatial distribution is more extended compared to the bulk of the older stellar population, consistent with previous studies. Through N-body simulations, we demonstrate that the observed characteristics of the younger stars cannot be explained solely by blue straggler stars in the context of the isolated dynamical evolution of NGC 1783. Instead, accretion of the external, low-mass stellar system can better account for both the inverse spatial concentration and the radial velocity isotropy of the younger stars. We propose that NGC 1783 may have accreted external stars from low-mass stellar systems, resulting in a mixture of external younger stars and blue straggler stars from the older bulk population, thereby accounting for the characteristics of the younger sequence.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock Forecasting
Authors:
Chufeng Li,
Jianyong Chen
Abstract:
As a branch of time series forecasting, stock movement forecasting is one of the challenging problems for investors and researchers. Since Transformer was introduced to analyze financial data, many researchers have dedicated themselves to forecasting stock movement using Transformer or attention mechanisms. However, existing research mostly focuses on individual stock information but ignores stock…
▽ More
As a branch of time series forecasting, stock movement forecasting is one of the challenging problems for investors and researchers. Since Transformer was introduced to analyze financial data, many researchers have dedicated themselves to forecasting stock movement using Transformer or attention mechanisms. However, existing research mostly focuses on individual stock information but ignores stock market information and high noise in stock data. In this paper, we propose a novel method using the attention mechanism in which both stock market information and individual stock information are considered. Meanwhile, we propose a novel EMD-based algorithm for reducing short-term noise in stock data. Two randomly selected exchange-traded funds (ETFs) spanning over ten years from US stock markets are used to demonstrate the superior performance of the proposed attention-based method. The experimental analysis demonstrates that the proposed attention-based method significantly outperforms other state-of-the-art baselines. Code is available at https://github.com/DurandalLee/ACEFormer.
△ Less
Submitted 25 March, 2024;
originally announced April 2024.
-
Global regularity of 2D Rayleigh-Bénard equations with logarithmic supercritical dissipation
Authors:
Baoquan Yuan,
Xinyuan Xu,
Changhao Li
Abstract:
In this paper, we study the global regularity problem for the 2D Rayleigh-Bénard equations with logarithmic supercritical dissipation. By exploiting a combined quantity of the system, the technique of Littlewood-Paley decomposition and Besov spaces, and some commutator estimates, we establish the global regularity of a strong solution to this equations in the Sobolev space $H^{s}(\mathbb{R}^{2})$…
▽ More
In this paper, we study the global regularity problem for the 2D Rayleigh-Bénard equations with logarithmic supercritical dissipation. By exploiting a combined quantity of the system, the technique of Littlewood-Paley decomposition and Besov spaces, and some commutator estimates, we establish the global regularity of a strong solution to this equations in the Sobolev space $H^{s}(\mathbb{R}^{2})$ for $s \ge2$.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Chaos in Motion: Unveiling Robustness in Remote Heart Rate Measurement through Brain-Inspired Skin Tracking
Authors:
Jie Wang,
**g Lian,
Minjie Ma,
Junqiang Lei,
Chunbiao Li,
Bin Li,
Jizhao Liu
Abstract:
Heart rate is an important physiological indicator of human health status. Existing remote heart rate measurement methods typically involve facial detection followed by signal extraction from the region of interest (ROI). These SOTA methods have three serious problems: (a) inaccuracies even failures in detection caused by environmental influences or subject movement; (b) failures for special patie…
▽ More
Heart rate is an important physiological indicator of human health status. Existing remote heart rate measurement methods typically involve facial detection followed by signal extraction from the region of interest (ROI). These SOTA methods have three serious problems: (a) inaccuracies even failures in detection caused by environmental influences or subject movement; (b) failures for special patients such as infants and burn victims; (c) privacy leakage issues resulting from collecting face video. To address these issues, we regard the remote heart rate measurement as the process of analyzing the spatiotemporal characteristics of the optical flow signal in the video. We apply chaos theory to computer vision tasks for the first time, thus designing a brain-inspired framework. Firstly, using an artificial primary visual cortex model to extract the skin in the videos, and then calculate heart rate by time-frequency analysis on all pixels. Our method achieves Robust Skin Tracking for Heart Rate measurement, called HR-RST. The experimental results show that HR-RST overcomes the difficulty of environmental influences and effectively tracks the subject movement. Moreover, the method could extend to other body parts. Consequently, the method can be applied to special patients and effectively protect individual privacy, offering an innovative solution.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.