-
Insulator-to-Metal Transition and Isotropic Gigantic Magnetoresistance in Layered Magnetic Semiconductors
Authors:
Gokul Acharya,
Bimal Neupane,
Chia-Hsiu Hsu,
Xian P. Yang,
David Graf,
Eun Sang Choi,
Krishna Pandey,
Md Rafique Un Nabi,
Santosh Karki Chhetri,
Rabindra Basnet,
Sumaya Rahman,
Jian Wang,
Zhengxin Hu,
Bo Da,
Hugh Churchill,
Guoqing Chang,
M. Zahid Hasan,
Yuanxi Wang,
** Hu
Abstract:
Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology ap…
▽ More
Magnetotransport, the response of electrical conduction to external magnetic field, acts as an important tool to reveal fundamental concepts behind exotic phenomena and plays a key role in enabling spintronic applications. Magnetotransport is generally sensitive to magnetic field orientations. In contrast, efficient and isotropic modulation of electronic transport, which is useful in technology applications such as omnidirectional sensing, is rarely seen, especially for pristine crystals. Here we propose a strategy to realize extremely strong modulation of electron conduction by magnetic field which is independent of field direction. GdPS, a layered antiferromagnetic semiconductor with resistivity anisotropies, supports a field-driven insulator-to-metal transition with a paradoxically isotropic gigantic negative magnetoresistance insensitive to magnetic field orientations. This isotropic magnetoresistance originates from the combined effects of a near-zero spin-orbit coupling of Gd3+-based half-filling f-electron system and the strong on-site f-d exchange coupling in Gd atoms. Our results not only provide a novel material system with extraordinary magnetotransport that offers a missing block for antiferromagnet-based ultrafast and efficient spintronic devices, but also demonstrate the key ingredients for designing magnetic materials with desired transport properties for advanced functionalities.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Vision-driven Automated Mobile GUI Testing via Multimodal Large Language Model
Authors:
Zhe Liu,
Cheng Li,
Chunyang Chen,
Junjie Wang,
Boyu Wu,
Yawen Wang,
Jun Hu,
Qing Wang
Abstract:
With the advancement of software rendering techniques, GUI pages in mobile apps now encompass a wealth of visual information, where the visual semantics of each page contribute to the overall app logic, presenting new challenges to software testing. Despite the progress in automated Graphical User Interface (GUI) testing, the absence of testing oracles has constrained its efficacy to identify only…
▽ More
With the advancement of software rendering techniques, GUI pages in mobile apps now encompass a wealth of visual information, where the visual semantics of each page contribute to the overall app logic, presenting new challenges to software testing. Despite the progress in automated Graphical User Interface (GUI) testing, the absence of testing oracles has constrained its efficacy to identify only crash bugs with evident abnormal signals. Nonetheless, there are still a considerable number of non-crash bugs, ranging from unexpected behaviors to misalignments, often evading detection by existing techniques. While these bugs can exhibit visual cues that serve as potential testing oracles, they often entail a sequence of screenshots, and detecting them necessitates an understanding of the operational logic among GUI page transitions, which is challenging traditional techniques. Considering the remarkable performance of Multimodal Large Language Models (MLLM) in visual and language understanding, this paper proposes a vision-driven automated GUI testing approach VisionDroid to detect non-crash functional bugs with MLLM. It begins by extracting GUI text information and aligning it with screenshots to form a vision prompt, enabling MLLM to understand GUI context. The function-aware explorer then employs MLLM for deeper and function-oriented GUI page exploration, while the logic-aware bug detector segments the entire exploration history into logically cohesive parts and prompts the MLLM for bug detection. We evaluate VisionDroid on three datasets and compare it with 10 baselines, demonstrating its excellent performance. The ablation study further proves the contribution of each module. Moreover, VisionDroid identifies 29 new bugs on Google Play, of which 19 have been confirmed and fixed.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
A high precision measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$ is performed using $(10 087 \pm 44) \times 10^6$ $J/ψ$ events recorded by the {BESIII} detector at the {BEPCII} storage ring. The branching fractions of the two decays $J/ψ\to p \bar{p} η(η\to γγ)$ and $J/ψ\to p \bar{p} η(η\to π^+ π^- π^0)$ are measured individually to be…
▽ More
A high precision measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$ is performed using $(10 087 \pm 44) \times 10^6$ $J/ψ$ events recorded by the {BESIII} detector at the {BEPCII} storage ring. The branching fractions of the two decays $J/ψ\to p \bar{p} η(η\to γγ)$ and $J/ψ\to p \bar{p} η(η\to π^+ π^- π^0)$ are measured individually to be $\mathcal{B}(J/ψ\to p \bar{p} η(η\to γγ)) = (1.480 \pm 0.001 \pm 0.024)\times\,10^{-3}$ and $\mathcal{B}(J/ψ\to p \bar{p} η(η\to π^+ π^- π^0)) = (1.557 \pm 0.003 \pm 0.038)\times\,10^{-3}$, where the first uncertainties are statistical and the second systematic. Both results are compatible within their uncorrelated systematic uncertainties. The combined result is $\mathcal{B}(J/ψ\to p \bar{p} η)=(1.495 \pm 0.001 \pm 0.023)\times\,10^{-3}$ where the first uncertainty is the combined statistical uncertainty and the second one the combined systematic uncertainty of both analyses, incorporating correlations between them. In addition, the $p \bar{p}$ threshold region is investigated for a potential threshold enhancement, and no evidence for one is observed.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Authors:
Ruiqing Yan,
Xingbo Du,
Haoyu Deng,
Linghan Zheng,
Qiuzhuang Sun,
Jifang Hu,
Yuhang Shao,
Penghao Jiang,
**rong Jiang,
Lian Zhao
Abstract:
With the advent of large models based on the Transformer architecture, researchers have observed an anomalous phenomenon in the Attention mechanism--there is a very high attention on the first element, which is prevalent across Transformer-based models. It is crucial to understand it for the development of techniques focusing on attention distribution, such as Key-Value (KV) Cache compression and…
▽ More
With the advent of large models based on the Transformer architecture, researchers have observed an anomalous phenomenon in the Attention mechanism--there is a very high attention on the first element, which is prevalent across Transformer-based models. It is crucial to understand it for the development of techniques focusing on attention distribution, such as Key-Value (KV) Cache compression and infinite extrapolation; however, the latent cause leaves to be unknown. In this paper, we analyze such a phenomenon from the perspective of waiver phenomenon, which involves reducing the internal values of certain elements in the sequence, allowing them to absorb excess attention without affecting their contribution to information. In specific models, due to differences in positional encoding and attention patterns, we have found that the selection of waiver elements by the model can be categorized into two methods: positional-encoding-based and feature-distribution-within-elements-based.
△ Less
Submitted 3 July, 2024; v1 submitted 26 June, 2024;
originally announced July 2024.
-
Centerline Boundary Dice Loss for Vascular Segmentation
Authors:
Pengcheng Shi,
Jiesi Hu,
Yanwu Yang,
Zilve Gao,
Wei Liu,
Ting Ma
Abstract:
Vascular segmentation in medical imaging plays a crucial role in analysing morphological and functional assessments. Traditional methods, like the centerline Dice (clDice) loss, ensure topology preservation but falter in capturing geometric details, especially under translation and deformation. The combination of clDice with traditional Dice loss can lead to diameter imbalance, favoring larger ves…
▽ More
Vascular segmentation in medical imaging plays a crucial role in analysing morphological and functional assessments. Traditional methods, like the centerline Dice (clDice) loss, ensure topology preservation but falter in capturing geometric details, especially under translation and deformation. The combination of clDice with traditional Dice loss can lead to diameter imbalance, favoring larger vessels. Addressing these challenges, we introduce the centerline boundary Dice (cbDice) loss function, which harmonizes topological integrity and geometric nuances, ensuring consistent segmentation across various vessel sizes. cbDice enriches the clDice approach by including boundary-aware aspects, thereby improving geometric detail recognition. It matches the performance of the boundary difference over union (B-DoU) loss through a mask-distance-based approach, enhancing traslation sensitivity. Crucially, cbDice incorporates radius information from vascular skeletons, enabling uniform adaptation to vascular diameter changes and maintaining balance in branch growth and fracture impacts. Furthermore, we conducted a theoretical analysis of clDice variants (cl-X-Dice). We validated cbDice's efficacy on three diverse vascular segmentation datasets, encompassing both 2D and 3D, and binary and multi-class segmentation. Particularly, the method integrated with cbDice demonstrated outstanding performance on the MICCAI 2023 TopCoW Challenge dataset. Our code is made publicly available at: https://github.com/PengchengShi1220/cbDice.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Learning Unsigned Distance Fields from Local Shape Functions for 3D Surface Reconstruction
Authors:
Jiangbei Hu,
Yanggeng Li,
Fei Hou,
Junhui Hou,
Zhebin Zhang,
Shengfa Wang,
Na Lei,
Ying He
Abstract:
Unsigned distance fields (UDFs) provide a versatile framework for representing a diverse array of 3D shapes, encompassing both watertight and non-watertight geometries. Traditional UDF learning methods typically require extensive training on large datasets of 3D shapes, which is costly and often necessitates hyperparameter adjustments for new datasets. This paper presents a novel neural framework,…
▽ More
Unsigned distance fields (UDFs) provide a versatile framework for representing a diverse array of 3D shapes, encompassing both watertight and non-watertight geometries. Traditional UDF learning methods typically require extensive training on large datasets of 3D shapes, which is costly and often necessitates hyperparameter adjustments for new datasets. This paper presents a novel neural framework, LoSF-UDF, for reconstructing surfaces from 3D point clouds by leveraging local shape functions to learn UDFs. We observe that 3D shapes manifest simple patterns within localized areas, prompting us to create a training dataset of point cloud patches characterized by mathematical functions that represent a continuum from smooth surfaces to sharp edges and corners. Our approach learns features within a specific radius around each query point and utilizes an attention mechanism to focus on the crucial features for UDF estimation. This method enables efficient and robust surface reconstruction from point clouds without the need for shape-specific training. Additionally, our method exhibits enhanced resilience to noise and outliers in point clouds compared to existing methods. We present comprehensive experiments and comparisons across various datasets, including synthetic and real-scanned point clouds, to validate our method's efficacy.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs)
Authors:
Jerry Yao-Chieh Hu,
Weimin Wu,
Zhuoru Li,
Zhao Song,
Han Liu
Abstract:
We investigate the statistical and computational limits of latent \textbf{Di}ffusion \textbf{T}ransformers (\textbf{DiT}s) under the low-dimensional linear latent space assumption. Statistically, we study the universal approximation and sample complexity of the DiTs score function, as well as the distribution recovery property of the initial data. Specifically, under mild data assumptions, we deri…
▽ More
We investigate the statistical and computational limits of latent \textbf{Di}ffusion \textbf{T}ransformers (\textbf{DiT}s) under the low-dimensional linear latent space assumption. Statistically, we study the universal approximation and sample complexity of the DiTs score function, as well as the distribution recovery property of the initial data. Specifically, under mild data assumptions, we derive an approximation error bound for the score network of latent DiTs, which is sub-linear in the latent space dimension. Additionally, we derive the corresponding sample complexity bound and show that the data distribution generated from the estimated score function converges toward a proximate area of the original one. Computationally, we characterize the hardness of both forward inference and backward computation of latent DiTs, assuming the Strong Exponential Time Hypothesis (SETH). For forward inference, we identify efficient criteria for all possible latent DiTs inference algorithms and showcase our theory by pushing the efficiency toward almost-linear time inference. For backward computation, we leverage the low-rank structure within the gradient computation of DiTs training for possible algorithmic speedup. Specifically, we show that such speedup achieves almost-linear time latent DiTs training by casting the DiTs gradient as a series of chained low-rank approximations with bounded error. Under the low-dimensional assumption, we show that the convergence rate and the computational efficiency are both dominated by the dimension of the subspace, suggesting that latent DiTs have the potential to bypass the challenges associated with the high dimensionality of initial data.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Generative prediction of flow field based on the diffusion model
Authors:
Jiajun Hu,
Zhen Lu,
Yue Yang
Abstract:
We propose a geometry-to-flow diffusion model that utilizes the input of obstacle shape to predict a flow field past the obstacle. The model is based on a learnable Markov transition kernel to recover the data distribution from the Gaussian distribution. The Markov process is conditioned on the obstacle geometry, estimating the noise to be removed at each step, implemented via a U-Net. A cross-att…
▽ More
We propose a geometry-to-flow diffusion model that utilizes the input of obstacle shape to predict a flow field past the obstacle. The model is based on a learnable Markov transition kernel to recover the data distribution from the Gaussian distribution. The Markov process is conditioned on the obstacle geometry, estimating the noise to be removed at each step, implemented via a U-Net. A cross-attention mechanism incorporates the geometry as a prompt. We train the geometry-to-flow diffusion model using a dataset of flows past simple obstacles, including the circle, ellipse, rectangle, and triangle. For comparison, the CNN model is trained using the same dataset. Tests are carried out on flows past obstacles with simple and complex geometries, representing interpolation and extrapolation on the geometry condition, respectively. In the test set, challenging scenarios include a cross and characters `PKU'. Generated flow fields show that the geometry-to-flow diffusion model is superior to the CNN model in predicting instantaneous flow fields and handling complex geometries. Quantitative analysis of the model accuracy and divergence in the fields demonstrate the high robustness of the diffusion model, indicating that the diffusion model learns physical laws implicitly.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
CSPBench: a benchmark and critical evaluation of Crystal Structure Prediction
Authors:
Lai Wei,
Sadman Sadeed Omee,
Rongzhi Dong,
Nihang Fu,
Yuqi Song,
Edirisuriya M. D. Siriwardane,
Meiling Xu,
Chris Wolverton,
Jianjun Hu
Abstract:
Crystal structure prediction (CSP) is now increasingly used in discovering novel materials with applications in diverse industries. However, despite decades of developments and significant progress in this area, there lacks a set of well-defined benchmark dataset, quantitative performance metrics, and studies that evaluate the status of the field. We aim to fill this gap by introducing a CSP bench…
▽ More
Crystal structure prediction (CSP) is now increasingly used in discovering novel materials with applications in diverse industries. However, despite decades of developments and significant progress in this area, there lacks a set of well-defined benchmark dataset, quantitative performance metrics, and studies that evaluate the status of the field. We aim to fill this gap by introducing a CSP benchmark suite with 180 test structures along with our recently implemented CSP performance metric set. We benchmark a collection of 13 state-of-the-art (SOTA) CSP algorithms including template-based CSP algorithms, conventional CSP algorithms based on DFT calculations and global search such as CALYPSO, CSP algorithms based on machine learning (ML) potentials and global search, and distance matrix based CSP algorithms. Our results demonstrate that the performance of the current CSP algorithms is far from being satisfactory. Most algorithms cannot even identify the structures with the correct space groups except for the template-based algorithms when applied to test structures with similar templates. We also find that the ML potential based CSP algorithms are now able to achieve competitive performances compared to the DFT-based algorithms. These CSP algorithms' performance is strongly determined by the quality of the neural potentials as well as the global optimization algorithms. Our benchmark suite comes with a comprehensive open-source codebase and 180 well-selected benchmark crystal structures, making it convenient to evaluate the advantages and disadvantages of CSP algorithms from future studies. All the code and benchmark data are available at https://github.com/usccolumbia/cspbenchmark
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation
Authors:
Yuchuan Tian,
Jianhong Han,
Hanting Chen,
Yuanyuan Xi,
Guoyang Zhang,
Jie Hu,
Chao Xu,
Yunhe Wang
Abstract:
Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transform…
▽ More
Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transformer that could effectively address manifold image restoration tasks with large inter-task gaps, such as denoising, deblurring, deraining, dehazing, and desnowing. Rather than popular feature adaptation methods, we propose weight modulation that adapts weights to specific tasks. Firstly, we figure out task-sensitive weights via a toy experiment and introduce task-specific biases on top of them. Secondly, we conduct rank analysis for a good compression strategy and perform low-rank decomposition on the biases. Thirdly, we propose synchronous training that updates the task-general backbone model and the task-specific biases simultaneously. In this way, the model is instructed to learn general and task-specific knowledge. Via our simple yet effective method that instructs the IPT to be task experts, Instruct-IPT could better cooperate between tasks with distinct characteristics at humble costs. Further, we propose to maneuver Instruct-IPT with text instructions for better user interfaces. We have conducted experiments on Instruct-IPT to demonstrate the effectiveness of our method on manifold tasks, and we have effectively extended our method to diffusion denoisers as well. The code is available at https://github.com/huawei-noah/Pretrained-IPT.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Fully discrete energy-dissipative and conservative discrete gradient particle methods for a class of continuity equations
Authors:
**gwei Hu,
Samuel Q. Van Fleet,
Andy T. S. Wan
Abstract:
Structure-preserving particle methods have recently been proposed for a class of nonlinear continuity equations, including aggregation-diffusion equation in [J. Carrillo, K. Craig, F. Patacchini, Calc. Var., 58 (2019), pp. 53] and the Landau equation in [J. Carrillo, J. Hu., L. Wang, J. Wu, J. Comput. Phys. X, 7 (2020), pp. 100066]. One common feature to these equations is that they both admit som…
▽ More
Structure-preserving particle methods have recently been proposed for a class of nonlinear continuity equations, including aggregation-diffusion equation in [J. Carrillo, K. Craig, F. Patacchini, Calc. Var., 58 (2019), pp. 53] and the Landau equation in [J. Carrillo, J. Hu., L. Wang, J. Wu, J. Comput. Phys. X, 7 (2020), pp. 100066]. One common feature to these equations is that they both admit some variational formulation, which upon proper regularization, leads to particle approximations dissipating the energy and conserving some quantities simultaneously at the semi-discrete level. In this paper, we formulate continuity equations with a density dependent bilinear form associated with the variational derivative of the energy functional and prove that appropriate particle methods satisfy a compatibility condition with its regularized energy. This enables us to utilize discrete gradient time integrators and show that the energy can be dissipated and the mass conserved simultaneously at the fully discrete level. In the case of the Landau equation, we prove that our approach also conserves the momentum and kinetic energy at the fully discrete level. Several numerical examples are presented to demonstrate the dissipative and conservative properties of our proposed method.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Three-dimensional non-reciprocal transport in photonic topological heterostructure of arbitrary shape
Authors:
Mudi Wang,
Ruo-Yang Zhang,
Chenyu Zhang,
Haoran Xue,
Hongwei Jia,
**g Hu,
Dongyang Wang,
Tianshu Jiang,
C. T. Chan
Abstract:
Electromagnetic wave propagation in three-dimensional space typically suffers omnidirectional scattering when encountering obstacles. In this study, we employed Chern vectors to construct a topological heterostructure, where large-volume non-reciprocal topological transport in three-dimension is achieved. The shape of the cross-section in the heterostructure can be arbitrary designed, and we exper…
▽ More
Electromagnetic wave propagation in three-dimensional space typically suffers omnidirectional scattering when encountering obstacles. In this study, we employed Chern vectors to construct a topological heterostructure, where large-volume non-reciprocal topological transport in three-dimension is achieved. The shape of the cross-section in the heterostructure can be arbitrary designed, and we experimentally observed the distinctive cross-shaped field pattern transport, non-reciprocal energy harvesting, and most importantly, the remarkable ability of electromagnetic wave to traverse obstacles and abrupt structure changes without encountering reflections in 3D space.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Observation of the Electromagnetic Dalitz Transition $h_c \rightarrow e^+e^-η_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
S. Ahmed,
M. Albrecht,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
X. H. Bai,
Y. Bai,
O. Bakina,
R. Baldini Ferroli,
I. Balossino,
Y. Ban,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (495 additional authors not shown)
Abstract:
Using $(27.12\pm 0.14)\times10^8$ $ψ(3686)$ decays and data samples of $e^+e^-$ collisions with $\sqrt{s}$ from 4.130 to 4.780~GeV collected with the BESIII detector, we report the first observation of the electromagnetic Dalitz transition $h_c\to e^+e^-η_c$ with a statistical significance of $5.4σ$. We measure the ratio of the branching fractions…
▽ More
Using $(27.12\pm 0.14)\times10^8$ $ψ(3686)$ decays and data samples of $e^+e^-$ collisions with $\sqrt{s}$ from 4.130 to 4.780~GeV collected with the BESIII detector, we report the first observation of the electromagnetic Dalitz transition $h_c\to e^+e^-η_c$ with a statistical significance of $5.4σ$. We measure the ratio of the branching fractions $\frac{\mathcal{B}(h_c\rightarrow e^+e^-η_c)}{\mathcal{B}(h_c\rightarrow γη_c)}$ separately for the $h_c$ samples produced via $ψ(3686)\toπ^0h_c$ and $e^+e^-\toπ^+π^-h_c$. The average ratio is determined to be $(0.59\pm0.10(\text{stat.})\pm0.04(\text{syst.}))\%$, where the uncertainty includes both statistical and systematic components.
△ Less
Submitted 2 July, 2024; v1 submitted 28 June, 2024;
originally announced July 2024.
-
Microheater hotspot engineering for repeatable multi-level switching in foundry-processed phase change silicon photonics
Authors:
Hongyi Sun,
Chuanyu Lian,
Francis Vásquez-Aza,
Sadra Rahimi Kari,
Yi-Siou Huang,
Alessandro Restelli,
Steven A. Vitale,
Ichiro Takeuchi,
Juejun Hu,
Nathan Youngblood,
Georges Pavlidis,
Carlos A. Ríos Ocampo
Abstract:
Nonvolatile photonic integrated circuits employing phase change materials have relied either on optical switching mechanisms with precise multi-level control but poor scalability or electrical switching with seamless integration and scalability but mostly limited to a binary response. Recent works have demonstrated electrical multi-level switching; however, they relied on the stochastic nucleation…
▽ More
Nonvolatile photonic integrated circuits employing phase change materials have relied either on optical switching mechanisms with precise multi-level control but poor scalability or electrical switching with seamless integration and scalability but mostly limited to a binary response. Recent works have demonstrated electrical multi-level switching; however, they relied on the stochastic nucleation process to achieve partial crystallization with low demonstrated repeatability and cyclability. Here, we re-engineer waveguide-integrated microheaters to achieve precise spatial control of the temperature profile (i.e., hotspot) and, thus, switch deterministic areas of an embedded phase change material cell. We experimentally demonstrate this concept using a variety of foundry-processed doped-silicon microheaters on a silicon-on-insulator platform to trigger multi-step amorphization and reversible switching of Sb$_{2}$Se$_{3}$ and Ge$_{2}$Sb$_{2}$Se$_{4}$Te alloys. We further characterize the response of our microheaters using Transient Thermoreflectance Imaging. Our approach combines the deterministic control resulting from a spatially resolved glassy-crystalline distribution with the scalability of electro-thermal switching devices, thus paving the way to reliable multi-level switching towards robust reprogrammable phase-change photonic devices for analog processing and computing.
△ Less
Submitted 15 June, 2024;
originally announced July 2024.
-
Forecast of cosmological constraints with superluminous supernovae from the Chinese Space Station Telescope
Authors:
Xuan-Dong Jia,
Jian-** Hu,
Fa-Yin Wang,
Zi-Gao Dai
Abstract:
Superluminous supernovae (SLSNe) are a class of intense celestial events that can be standardized for measuring cosmological parameters, bridging the gap between type Ia supernovae and the cosmic microwave background. In this work, we discuss the cosmological applications of SLSNe from the Chinese Space Station Telescope (CSST). Our estimation suggests that SLSNe rate is biased tracing the cosmic…
▽ More
Superluminous supernovae (SLSNe) are a class of intense celestial events that can be standardized for measuring cosmological parameters, bridging the gap between type Ia supernovae and the cosmic microwave background. In this work, we discuss the cosmological applications of SLSNe from the Chinese Space Station Telescope (CSST). Our estimation suggests that SLSNe rate is biased tracing the cosmic star formation rate, exhibiting a factor of $(1+z)^{1.2}$. We futher predict that CSST is poised to observe $\sim 360$ SLSNe in the 10 square degrees ultra-deep field survey within a span of 2.5 years. A stringent constraint on cosmological parameters can be derived from their peak-color relationship. CSST is anticipated to uncover a substantial number of SLSNe, contributing to a deeper understanding of their central engines and shedding light on the nature of dark energy at high redshifts.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Lightweight Predictive 3D Gaussian Splats
Authors:
Junli Cao,
Vidit Goel,
Chaoyang Wang,
Anil Kag,
Ju Hu,
Sergei Korolev,
Chenfanfu Jiang,
Sergey Tulyakov,
Jian Ren
Abstract:
Recent approaches representing 3D objects and scenes using Gaussian splats show increased rendering speed across a variety of platforms and devices. While rendering such representations is indeed extremely efficient, storing and transmitting them is often prohibitively expensive. To represent large-scale scenes, one often needs to store millions of 3D Gaussians, occupying gigabytes of disk space.…
▽ More
Recent approaches representing 3D objects and scenes using Gaussian splats show increased rendering speed across a variety of platforms and devices. While rendering such representations is indeed extremely efficient, storing and transmitting them is often prohibitively expensive. To represent large-scale scenes, one often needs to store millions of 3D Gaussians, occupying gigabytes of disk space. This poses a very practical limitation, prohibiting widespread adoption.Several solutions have been proposed to strike a balance between disk size and rendering quality, noticeably reducing the visual quality. In this work, we propose a new representation that dramatically reduces the hard drive footprint while featuring similar or improved quality when compared to the standard 3D Gaussian splats. When compared to other compact solutions, ours offers higher quality renderings with significantly reduced storage, being able to efficiently run on a mobile device in real-time. Our key observation is that nearby points in the scene can share similar representations. Hence, only a small ratio of 3D points needs to be stored. We introduce an approach to identify such points which are called parent points. The discarded points called children points along with attributes can be efficiently predicted by tiny MLPs.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Integrated Triply Resonant Electro-Optic Frequency Comb in Lithium Tantalate
Authors:
Junyin Zhang,
Chengli Wang,
Connor Denney,
Grigory Lihachev,
Jianqi Hu,
Wil Kao,
Terence Blésin,
Nikolai Kuznetsov,
Zihan Li,
Mikhail Churaev,
Xin Ou,
Johann Riemensberger,
Gabriel Santamaria-Botello,
Tobias J. Kippenberg
Abstract:
Integrated frequency comb generators based on Kerr parametric oscillation have led to chip-scale, gigahertz-spaced combs with new applications spanning hyperscale telecommunications, low-noise microwave synthesis, LiDAR, and astrophysical spectrometer calibration. Recent progress in lithium niobate (LN) photonic integrated circuits (PICs) has resulted in chip-scale electro-optic (EO) frequency com…
▽ More
Integrated frequency comb generators based on Kerr parametric oscillation have led to chip-scale, gigahertz-spaced combs with new applications spanning hyperscale telecommunications, low-noise microwave synthesis, LiDAR, and astrophysical spectrometer calibration. Recent progress in lithium niobate (LN) photonic integrated circuits (PICs) has resulted in chip-scale electro-optic (EO) frequency combs, offering precise comb-line positioning and simple operation without relying on the formation of dissipative Kerr solitons. However, current integrated EO combs face limited spectral coverage due to the large microwave power required to drive the non-resonant capacitive electrodes and the strong intrinsic birefringence of Lithium Niobate. Here, we overcome both challenges with an integrated triply resonant architecture, combining monolithic microwave integrated circuits (MMICs) with PICs based on the recently emerged thin-film lithium tantalate. With resonantly enhanced EO interaction and reduced birefringence in Lithium Tantalate, we achieve a four-fold comb span extension and a 16-fold power reduction compared to the conventional non-resonant microwave design. Driven by a hybrid-integrated laser diode, the comb spans over 450nm (60THz) with >2000 lines, and the generator fits within a compact 1cm^2 footprint. We additionally observe that the strong EO coupling leads to an increased comb existence range approaching the full free spectral range of the optical microresonator. The ultra-broadband comb generator, combined with detuning-agnostic operation, could advance chip-scale spectrometry and ultra-low-noise millimeter wave synthesis and unlock octave-spanning EO combs. The methodology of co-designing microwave and optical resonators can be extended to a wide range of integrated electro-optics applications.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Improved measurement of the semileptonic decay $D^+_{s}\to K^0 e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (643 additional authors not shown)
Abstract:
Analyzing $e^+e^-$ collision data corresponding to an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we measure the branching fraction of the semileptonic decay $D^+_{s}\to K^0 e^+ν_e$ to be $(2.98\pm0.23\pm0.12)\times10^{-3}$. The $D_s^+\to K^0$ hadronic form factor is determined from the differential dec…
▽ More
Analyzing $e^+e^-$ collision data corresponding to an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we measure the branching fraction of the semileptonic decay $D^+_{s}\to K^0 e^+ν_e$ to be $(2.98\pm0.23\pm0.12)\times10^{-3}$. The $D_s^+\to K^0$ hadronic form factor is determined from the differential decay rate of $D^+_s\to K^0 e^+ν_e$ to be $f^{K^0}_+(0)=0.636\pm0.049\pm0.013$. For both measurements, the first uncertainty is statistical and the second systematic. The branching fraction and form factor measurements are factors of 1.6 and 1.7 more precise than the previous world averages, respectively.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views
Authors:
Zongyu Li,
Yixuan Jia,
Xiaojian Xu,
Jason Hu,
Jeffrey A. Fessler,
Yuni K. Dewaraja
Abstract:
Purpose: This study addresses the challenge of extended SPECT imaging duration under low-count conditions, as encountered in Lu-177 SPECT imaging, by develo** a self-supervised learning approach to synthesize skipped SPECT projection views, thus shortening scan times in clinical settings. Methods: We employed a self-supervised coordinate-based learning technique, adapting the neural radiance fie…
▽ More
Purpose: This study addresses the challenge of extended SPECT imaging duration under low-count conditions, as encountered in Lu-177 SPECT imaging, by develo** a self-supervised learning approach to synthesize skipped SPECT projection views, thus shortening scan times in clinical settings. Methods: We employed a self-supervised coordinate-based learning technique, adapting the neural radiance field (NeRF) concept in computer vision to synthesize under-sampled SPECT projection views. For each single scan, we used self-supervised coordinate learning to estimate skipped SPECT projection views. The method was tested with various down-sampling factors (DFs=2, 4, 8) on both Lu-177 phantom SPECT/CT measurements and clinical SPECT/CT datasets, from 11 patients undergoing Lu-177 DOTATATE and 6 patients undergoing Lu-177 PSMA-617 radiopharmaceutical therapy. Results: For SPECT reconstructions, our method outperformed the use of linearly interpolated projections and partial projection views in relative contrast-to-noise-ratios (RCNR) averaged across different downsampling factors: 1) DOTATATE: 83% vs. 65% vs. 67% for lesions and 86% vs. 70% vs. 67% for kidney, 2) PSMA: 76% vs. 69% vs. 68% for lesions and 75% vs. 55% vs. 66% for organs, including kidneys, lacrimal glands, parotid glands, and submandibular glands. Conclusion: The proposed method enables reduction in acquisition time (by factors of 2, 4, or 8) while maintaining quantitative accuracy in clinical SPECT protocols by allowing for the collection of fewer projections. Importantly, the self-supervised nature of this NeRF-based approach eliminates the need for extensive training data, instead learning from each patient's projection data alone. The reduction in acquisition time is particularly relevant for imaging under low-count conditions and for protocols that require multiple-bed positions such as whole-body imaging.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Assisting Tibetan Students in Learning Quantum Mechanics via Mathematica
Authors:
Guangtian Zhu,
**g Hu,
Chun Du
Abstract:
Undergraduate students of physics in Tibet have great difficulty learning quantum mechanics (QM). We attempt to use PER-based methods to help Tibetan students learn QM. In this preliminary study, we incorporate Mathematica in a QM course at Tibet University and record students' learning experiences. Tibetan students tend to have subjective feelings of learning Mathematica, whereas Han students (ma…
▽ More
Undergraduate students of physics in Tibet have great difficulty learning quantum mechanics (QM). We attempt to use PER-based methods to help Tibetan students learn QM. In this preliminary study, we incorporate Mathematica in a QM course at Tibet University and record students' learning experiences. Tibetan students tend to have subjective feelings of learning Mathematica, whereas Han students (majority) are more focused on the operational techniques of Mathematica. The results also suggest that both Tibetan students and Han students show limited improvement in time-independent Schrodinger equations after learning QM with Mathematica. Further effort is needed to improve the academic literacy skills of physics students in Tibet.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Relative Measurement and Extrapolation of the Scintillation Quenching Factor of $α$-Particles in Liquid Argon using DEAP-3600 Data
Authors:
The DEAP Collaboration,
P. Adhikari,
M. Alpízar-Venegas,
P. -A. Amaudruz,
J. Anstey,
D. J. Auty,
M. Batygov,
B. Beltran,
C. E. Bina,
W. Bonivento,
M. G. Boulay,
J. F. Bueno,
B. Cai,
M. Cárdenas-Montes,
S. Choudhary,
B. T. Cleveland,
R. Crampton,
S. Daugherty,
P. DelGobbo,
P. Di Stefano,
G. Dolganov,
L. Doria,
F. A. Duncan,
M. Dunford,
E. Ellingwood
, et al. (73 additional authors not shown)
Abstract:
The knowledge of scintillation quenching of $α$-particles plays a paramount role in understanding $α$-induced backgrounds and improving the sensitivity of liquid argon-based direct detection of dark matter experiments. We performed a relative measurement of scintillation quenching in the MeV energy region using radioactive isotopes ($^{222}$Rn, $^{218}$Po and $^{214}$Po isotopes) present in trace…
▽ More
The knowledge of scintillation quenching of $α$-particles plays a paramount role in understanding $α$-induced backgrounds and improving the sensitivity of liquid argon-based direct detection of dark matter experiments. We performed a relative measurement of scintillation quenching in the MeV energy region using radioactive isotopes ($^{222}$Rn, $^{218}$Po and $^{214}$Po isotopes) present in trace amounts in the DEAP-3600 detector and quantified the uncertainty of extrapolating the quenching factor to the low-energy region.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
GS-Octree: Octree-based 3D Gaussian Splatting for Robust Object-level 3D Reconstruction Under Strong Lighting
Authors:
Jiaze Li,
Zhengyu Wen,
Luo Zhang,
Jiangbei Hu,
Fei Hou,
Zhebin Zhang,
Ying He
Abstract:
The 3D Gaussian Splatting technique has significantly advanced the construction of radiance fields from multi-view images, enabling real-time rendering. While point-based rasterization effectively reduces computational demands for rendering, it often struggles to accurately reconstruct the geometry of the target object, especially under strong lighting. To address this challenge, we introduce a no…
▽ More
The 3D Gaussian Splatting technique has significantly advanced the construction of radiance fields from multi-view images, enabling real-time rendering. While point-based rasterization effectively reduces computational demands for rendering, it often struggles to accurately reconstruct the geometry of the target object, especially under strong lighting. To address this challenge, we introduce a novel approach that combines octree-based implicit surface representations with Gaussian splatting. Our method consists of four stages. Initially, it reconstructs a signed distance field (SDF) and a radiance field through volume rendering, encoding them in a low-resolution octree. The initial SDF represents the coarse geometry of the target object. Subsequently, it introduces 3D Gaussians as additional degrees of freedom, which are guided by the SDF. In the third stage, the optimized Gaussians further improve the accuracy of the SDF, allowing it to recover finer geometric details compared to the initial SDF obtained in the first stage. Finally, it adopts the refined SDF to further optimize the 3D Gaussians via splatting, eliminating those that contribute little to visual appearance. Experimental results show that our method, which leverages the distribution of 3D Gaussians with SDFs, reconstructs more accurate geometry, particularly in images with specular highlights caused by strong lighting.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Measurement of the cross sections of $e^+e^-\to K^{-}\barΞ^{+}Λ/Σ^{0}$ at center-of-mass energies between 3.510 and 4.914 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 3.510 and 4.914GeV, corresponding to an integrated luminosity of 25 fb$^{-1}$, we measure the Born cross sections for the process $e^+e^-\to K^-\barΞ^+Λ/Σ^{0}$ at thirty-five energy points with a partial-reconstruction strategy. By fitting the dressed cross sections of…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at center-of-mass energies between 3.510 and 4.914GeV, corresponding to an integrated luminosity of 25 fb$^{-1}$, we measure the Born cross sections for the process $e^+e^-\to K^-\barΞ^+Λ/Σ^{0}$ at thirty-five energy points with a partial-reconstruction strategy. By fitting the dressed cross sections of $e^+e^-\to K^-\barΞ^+Λ/Σ^{0}$, evidence for $ψ(4160) \to K^{-}\barΞ^{+}Λ$ is found for the first time with a significance of 4.4$σ$, including systematic uncertainties. No evidence for other possible resonances is found. In addition, the products of electronic partial width and branching fraction for all assumed resonances decaying into $K^{-}\barΞ^{+}Λ/Σ^{0}$ are determined.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Measurements of $K_S^0$-$K_L^0$ asymmetries in the decays $Λ_c^+ \to pK_{L,S}^0$, $pK_{L,S}^0π^+π^-$ and $pK_{L,S}^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (643 additional authors not shown)
Abstract:
Using $e^+e^-$ annihilation data sets corresponding to an integrated luminosity of 4.5 $\text{fb}^{-1}$, collected with the BESIII detector at center-of-mass energies between 4.600 and 4.699 GeV, we report the first measurements of the absolute branching fractions $\mathcal{B}(Λ_c^+\to pK_{L}^{0})=(1.67 \pm 0.06 \pm 0. 04)\%$, $\mathcal{B}(Λ_c^+\to pK_{L}^{0}π^+π^-)=(1.69 \pm 0.10 \pm 0.05)\%$, an…
▽ More
Using $e^+e^-$ annihilation data sets corresponding to an integrated luminosity of 4.5 $\text{fb}^{-1}$, collected with the BESIII detector at center-of-mass energies between 4.600 and 4.699 GeV, we report the first measurements of the absolute branching fractions $\mathcal{B}(Λ_c^+\to pK_{L}^{0})=(1.67 \pm 0.06 \pm 0. 04)\%$, $\mathcal{B}(Λ_c^+\to pK_{L}^{0}π^+π^-)=(1.69 \pm 0.10 \pm 0.05)\%$, and $\mathcal{B}(Λ_c^+\to pK_{L}^{0}π^0)=(2.02 \pm 0.13 \pm 0.05)\%$, where the first uncertainties are statistical and the second systematic. Combining with the known branching fractions of $Λ_c^+ \to pK_{S}^{0}$, $Λ_c^+ \to pK_{S}^{0}π^+π^-$, and $Λ_c^+ \to pK_{S}^{0}π^0$, we present the first measurements of the $K_{S}^{0}$-$K_{L}^{0}$ asymmetries $R(Λ_c^+, K_{S,L}^0X) = \frac{\mathcal{B}(Λ_c^+ \to K_{S}^{0} X) - \mathcal{B}(Λ_c^+ \to K_{L}^{0} X)}{\mathcal{B}(Λ_c^+ \to K_{S}^{0} X) + \mathcal{B}(Λ_c^+ \to K_{L}^{0} X)}$ in charmed baryon decays: $R(Λ_c^+, pK_{S,L}^0) = -0.025 \pm 0.031$, $R(Λ_c^+, pK_{S,L}^0π^+π^-) = -0.027 \pm 0.048$, and $R(Λ_c^+, pK_{S,L}^0π^0) =-0.015 \pm 0.046$. No significant asymmetries within the uncertainties are observed.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation
Authors:
Qilai Zhang,
Jiawen Li,
Peiran Liao,
Jiali Hu,
Tian Guan,
Anjia Han,
Yonghong He
Abstract:
The two primary types of Hematoxylin and Eosin (H&E) slides in histopathology are Formalin-Fixed Paraffin-Embedded (FFPE) and Fresh Frozen (FF). FFPE slides offer high quality histopathological images but require a labor-intensive acquisition process. In contrast, FF slides can be prepared quickly, but the image quality is relatively poor. Our task is to translate FF images into FFPE style, thereb…
▽ More
The two primary types of Hematoxylin and Eosin (H&E) slides in histopathology are Formalin-Fixed Paraffin-Embedded (FFPE) and Fresh Frozen (FF). FFPE slides offer high quality histopathological images but require a labor-intensive acquisition process. In contrast, FF slides can be prepared quickly, but the image quality is relatively poor. Our task is to translate FF images into FFPE style, thereby improving the image quality for diagnostic purposes. In this paper, we propose Diffusion-FFPE, a method for FF-to-FFPE histopathological image translation using a pre-trained diffusion model. Specifically, we employ a one-step diffusion model as the generator and fine-tune it with LoRA adapters using adversarial learning objectives. To ensure that the model effectively captures both global structural information and local details, we propose a multi-scale feature fusion (MFF) module. This module utilizes two VAE encoders to extract features of varying image sizes and performs feature fusion before feeding them into the UNet. Furthermore, we utilize a pre-trained vision-language model for histopathology as the backbone for the discriminator to further improve performance We conducted FF-to-FFPE translation experiments on the TCGA-NSCLC datasets, and our method achieved better performance compared to other methods. The code and models are released at https://github.com/QilaiZhang/Diffusion-FFPE.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool
Authors:
Cunchen Hu,
Heyang Huang,
Junhao Hu,
Jiang Xu,
Xusheng Chen,
Tao Xie,
Chenxi Wang,
Sa Wang,
Yungang Bao,
Ninghui Sun,
Yizhou Shan
Abstract:
Large language model (LLM) serving has transformed from stateless to stateful systems, utilizing techniques like context caching and disaggregated inference. These optimizations extend the lifespan and domain of the KV cache, necessitating a new architectural approach. We present MemServe, a unified system that integrates both inter-request and intra-request optimizations. MemServe introduces MemP…
▽ More
Large language model (LLM) serving has transformed from stateless to stateful systems, utilizing techniques like context caching and disaggregated inference. These optimizations extend the lifespan and domain of the KV cache, necessitating a new architectural approach. We present MemServe, a unified system that integrates both inter-request and intra-request optimizations. MemServe introduces MemPool, an elastic memory pool managing distributed memory and KV caches across serving instances. Using MemPool APIs, MemServe combines context caching with disaggregated inference for the first time, supported by a global scheduler that enhances cache reuse through a global prompt tree-based locality-aware policy. Tests show that MemServe significantly improves job completion time and time-to-first-time.
△ Less
Submitted 26 June, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Study of the $f_{0}(980)$ through the decay $D_{s}^{+}\rightarrow π^{+}π^{+}π^{-}π^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (649 additional authors not shown)
Abstract:
We perform the first amplitude analysis of $D^+_s \to π^+π^+π^-π^0$ decays, based on data samples of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV, corresponding to an integrated luminosity of 7.33~fb$^{-1}$. We report the observation of $D_{s}^{+} \to f_0(980)ρ(770)^{+}$ with a statistical significance greater than 10$σ$ and…
▽ More
We perform the first amplitude analysis of $D^+_s \to π^+π^+π^-π^0$ decays, based on data samples of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV, corresponding to an integrated luminosity of 7.33~fb$^{-1}$. We report the observation of $D_{s}^{+} \to f_0(980)ρ(770)^{+}$ with a statistical significance greater than 10$σ$ and determine the branching fractions $\mathcal{B}(D_s^+\toπ^+π^+π^-π^0|_{{\rm non}-η})=(2.04\pm0.08_{\rm stat.}\pm0.05_{\rm syst.})\%$ and $\mathcal{B}(D_s^+\toηπ^+)=(1.56\pm0.09_{\rm stat.}\pm0.04_{\rm syst.})\%$. Moreover, we measure the relative branching fraction between $φ\toπ^+π^-π^0$ and $φ\to K^+K^-$ to be $\frac{\mathcal{B}(φ(1020) \to π^+π^-π^0)}{\mathcal{B}(φ(1020) \to K^+K^-)}=0.230 \pm 0.014_{\rm stat.} \pm 0.010_{\rm syst.}$, which deviates from the world average value by more than $4σ$.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Depth-Guided Semi-Supervised Instance Segmentation
Authors:
Xin Chen,
Jie Hu,
Xiawu Zheng,
Jianghang Lin,
Liujuan Cao,
Rongrong Ji
Abstract:
Semi-Supervised Instance Segmentation (SSIS) aims to leverage an amount of unlabeled data during training. Previous frameworks primarily utilized the RGB information of unlabeled images to generate pseudo-labels. However, such a mechanism often introduces unstable noise, as a single instance can display multiple RGB values. To overcome this limitation, we introduce a Depth-Guided (DG) SSIS framewo…
▽ More
Semi-Supervised Instance Segmentation (SSIS) aims to leverage an amount of unlabeled data during training. Previous frameworks primarily utilized the RGB information of unlabeled images to generate pseudo-labels. However, such a mechanism often introduces unstable noise, as a single instance can display multiple RGB values. To overcome this limitation, we introduce a Depth-Guided (DG) SSIS framework. This framework uses depth maps extracted from input images, which represent individual instances with closely associated distance values, offering precise contours for distinct instances. Unlike RGB data, depth maps provide a unique perspective, making their integration into the SSIS process complex. To this end, we propose Depth Feature Fusion, which integrates features extracted from depth estimation. This integration allows the model to understand depth information better and ensure its effective utilization. Additionally, to manage the variability of depth images during training, we introduce the Depth Controller. This component enables adaptive adjustments of the depth map, enhancing convergence speed and dynamically balancing the loss weights between RGB and depth maps. Extensive experiments conducted on the COCO and Cityscapes datasets validate the efficacy of our proposed method. Our approach establishes a new benchmark for SSIS, outperforming previous methods. Specifically, our DG achieves 22.29%, 31.47%, and 35.14% mAP for 1%, 5%, and 10% labeled data on the COCO dataset, respectively.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Quantum gravitomagnetic interaction
Authors:
Di Hao,
Jiawei Hu,
Hongwei Yu
Abstract:
In the framework of linearized quantum gravity, we study the quantum gravitational interaction between two nonpointlike objects induced by fluctuating gravitomagnetic fields in vacuum. We find that, in addition to the quantum gravitational interaction induced by fluctuating gravitoelectric fields previously studied, there exists a quantum gravitomagnetic interaction. This interaction originates fr…
▽ More
In the framework of linearized quantum gravity, we study the quantum gravitational interaction between two nonpointlike objects induced by fluctuating gravitomagnetic fields in vacuum. We find that, in addition to the quantum gravitational interaction induced by fluctuating gravitoelectric fields previously studied, there exists a quantum gravitomagnetic interaction. This interaction originates from the interaction between the instantaneous localized mass currents in nonpointlike objects induced by the fluctuating gravitomagnetic fields. Using fourth-order perturbation theory, we derive the explicit form of the quantum gravitomagnetic interaction energy, which shows an $r^{-10}$ dependence in the near regime and an $r^{-11}$ dependence in the far regime, where $r$ is the distance between the two objects. This interaction energy is expected to be significant when the gravitomagnetic polarizability of the objects is large.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks
Authors:
Yun-Shiuan Chuang,
Zach Studdiford,
Krirk Nirunwiroj,
Agam Goyal,
Vincent V. Frigo,
Sijia Yang,
Dhavan Shah,
Junjie Hu,
Timothy T. Rogers
Abstract:
Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estima…
▽ More
Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estimated a belief network encompassing 18 topics loading on two non-overlap** latent factors. We then seeded LLM-based agents with an opinion on one topic, and assessed the alignment of its expressed opinions on remaining test topics with corresponding human data. Role-playing based on demographic information alone did not align LLM and human opinions, but seeding the agent with a single belief greatly improved alignment for topics related in the belief network, and not for topics outside the network. These results suggest a novel path for human-LLM belief alignment in work seeking to simulate and understand patterns of belief distributions in society.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Probing the nature of the $χ_{c1}(3872)$ state using radiative decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1094 additional authors not shown)
Abstract:
The radiative decays $χ_{c1}(3872)\rightarrowψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the~nature of the~$χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an~integrated luminosity of~9fb$^{-1}$. Using the~$B^+\rightarrow χ_{c1}(3872)K^+$decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and…
▽ More
The radiative decays $χ_{c1}(3872)\rightarrowψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the~nature of the~$χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an~integrated luminosity of~9fb$^{-1}$. Using the~$B^+\rightarrow χ_{c1}(3872)K^+$decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and the ratio of its partial width to that of the $χ_{c1}(3872)\rightarrow J/ψγ$ decay is measured to be $$ \frac{Γ_{χ_{c1}(3872)\rightarrow ψ(2S)γ}}
{Γ_{χ_{c1}(3872)\rightarrow J/ψγ}} = 1.67 \pm 0.21 \pm 0.12 \pm0.04 , $$ where the first uncertainty is statistical, the second systematic and the third is due to the uncertainties on the branching fractions of the $ψ(2S)$ and $J/ψ$ mesons. The measured ratio makes the interpretation of the $χ_{c1}(3872)$ state as a~pure $D^0\bar{D}^{*0}+\bar{D}^0D^{*0}$ molecule questionable and strongly indicates a sizeable compact charmonium or tetraquark component within the $χ_{c1}(3872)$ state.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Research on Feature Extraction Data Processing System For MRI of Brain Diseases Based on Computer Deep Learning
Authors:
Lingxi Xiao,
**xin Hu,
Yutian Yang,
Yinqiu Feng,
Zichao Li,
Zexi Chen
Abstract:
Most of the existing wavelet image processing techniques are carried out in the form of single-scale reconstruction and multiple iterations. However, processing high-quality fMRI data presents problems such as mixed noise and excessive computation time. This project proposes the use of matrix operations by combining mixed noise elimination methods with wavelet analysis to replace traditional itera…
▽ More
Most of the existing wavelet image processing techniques are carried out in the form of single-scale reconstruction and multiple iterations. However, processing high-quality fMRI data presents problems such as mixed noise and excessive computation time. This project proposes the use of matrix operations by combining mixed noise elimination methods with wavelet analysis to replace traditional iterative algorithms. Functional magnetic resonance imaging (fMRI) of the auditory cortex of a single subject is analyzed and compared to the wavelet domain signal processing technology based on repeated times and the world's most influential SPM8. Experiments show that this algorithm is the fastest in computing time, and its detection effect is comparable to the traditional iterative algorithm. However, this has a higher practical value for the processing of FMRI data. In addition, the wavelet analysis method proposed signal processing to speed up the calculation rate.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Authors:
Yirui Chen,
Xudong Huang,
Quan Zhang,
Wei Li,
Mingjian Zhu,
Qiangyu Yan,
Simiao Li,
Hanting Chen,
Hailin Hu,
Jie Yang,
Wei Liu,
Jie Hu
Abstract:
The extraordinary ability of generative models emerges as a new trend in image editing and generating realistic images, posing a serious threat to the trustworthiness of multimedia data and driving the research of image manipulation detection and location(IMDL). However, the lack of a large-scale data foundation makes IMDL task unattainable. In this paper, a local manipulation pipeline is designed…
▽ More
The extraordinary ability of generative models emerges as a new trend in image editing and generating realistic images, posing a serious threat to the trustworthiness of multimedia data and driving the research of image manipulation detection and location(IMDL). However, the lack of a large-scale data foundation makes IMDL task unattainable. In this paper, a local manipulation pipeline is designed, incorporating the powerful SAM, ChatGPT and generative models. Upon this basis, We propose the GIM dataset, which has the following advantages: 1) Large scale, including over one million pairs of AI-manipulated images and real images. 2) Rich Image Content, encompassing a broad range of image classes 3) Diverse Generative Manipulation, manipulated images with state-of-the-art generators and various manipulation tasks. The aforementioned advantages allow for a more comprehensive evaluation of IMDL methods, extending their applicability to diverse images. We introduce two benchmark settings to evaluate the generalization capability and comprehensive performance of baseline methods. In addition, we propose a novel IMDL framework, termed GIMFormer, which consists of a ShadowTracer, Frequency-Spatial Block (FSB), and a Multi-window Anomalous Modelling (MWAM) Module. Extensive experiments on the GIM demonstrate that GIMFormer surpasses previous state-of-the-art works significantly on two different benchmarks.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Bounding-Box Inference for Error-Aware Model-Based Reinforcement Learning
Authors:
Erin J. Talvitie,
Zilei Shao,
Huiying Li,
**ghan Hu,
Jacob Boerma,
Rory Zhao,
Xintong Wang
Abstract:
In model-based reinforcement learning, simulated experiences from the learned model are often treated as equivalent to experience from the real environment. However, when the model is inaccurate, it can catastrophically interfere with policy learning. Alternatively, the agent might learn about the model's accuracy and selectively use it only when it can provide reliable predictions. We empirically…
▽ More
In model-based reinforcement learning, simulated experiences from the learned model are often treated as equivalent to experience from the real environment. However, when the model is inaccurate, it can catastrophically interfere with policy learning. Alternatively, the agent might learn about the model's accuracy and selectively use it only when it can provide reliable predictions. We empirically explore model uncertainty measures for selective planning and show that best results require distribution insensitive inference to estimate the uncertainty over model-based updates. To that end, we propose and evaluate bounding-box inference, which operates on bounding-boxes around sets of possible states and other quantities. We find that bounding-box inference can reliably support effective selective planning.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Pairwise-Independent Contention Resolution
Authors:
Anupam Gupta,
**qiao Hu,
Gregory Kehne,
Roie Levin
Abstract:
We study online contention resolution schemes (OCRSs) and prophet inequalities for non-product distributions. Specifically, when the active set is sampled according to a pairwise-independent (PI) distribution, we show a $(1-o_k(1))$-selectable OCRS for uniform matroids of rank $k$, and $Ω(1)$-selectable OCRSs for laminar, graphic, cographic, transversal, and regular matroids. These imply prophet i…
▽ More
We study online contention resolution schemes (OCRSs) and prophet inequalities for non-product distributions. Specifically, when the active set is sampled according to a pairwise-independent (PI) distribution, we show a $(1-o_k(1))$-selectable OCRS for uniform matroids of rank $k$, and $Ω(1)$-selectable OCRSs for laminar, graphic, cographic, transversal, and regular matroids. These imply prophet inequalities with the same ratios when the set of values is drawn according to a PI distribution. Our results complement recent work of Dughmi, Kalayci, and Patel (STOC '24) showing that no $ω(1/k)$-selectable OCRS exists in the PI setting for general matroids of rank $k$.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Breaking Secure Aggregation: Label Leakage from Aggregated Gradients in Federated Learning
Authors:
Zhibo Wang,
Zhiwei Chang,
Jiahui Hu,
Xiaoyi Pang,
Jiacheng Du,
Yongle Chen,
Kui Ren
Abstract:
Federated Learning (FL) exhibits privacy vulnerabilities under gradient inversion attacks (GIAs), which can extract private information from individual gradients. To enhance privacy, FL incorporates Secure Aggregation (SA) to prevent the server from obtaining individual gradients, thus effectively resisting GIAs. In this paper, we propose a stealthy label inference attack to bypass SA and recover…
▽ More
Federated Learning (FL) exhibits privacy vulnerabilities under gradient inversion attacks (GIAs), which can extract private information from individual gradients. To enhance privacy, FL incorporates Secure Aggregation (SA) to prevent the server from obtaining individual gradients, thus effectively resisting GIAs. In this paper, we propose a stealthy label inference attack to bypass SA and recover individual clients' private labels. Specifically, we conduct a theoretical analysis of label inference from the aggregated gradients that are exclusively obtained after implementing SA. The analysis results reveal that the inputs (embeddings) and outputs (logits) of the final fully connected layer (FCL) contribute to gradient disaggregation and label restoration. To preset the embeddings and logits of FCL, we craft a fishing model by solely modifying the parameters of a single batch normalization (BN) layer in the original model. Distributing client-specific fishing models, the server can derive the individual gradients regarding the bias of FCL by resolving a linear system with expected embeddings and the aggregated gradients as coefficients. Then the labels of each client can be precisely computed based on preset logits and gradients of FCL's bias. Extensive experiments show that our attack achieves large-scale label recovery with 100\% accuracy on various datasets and model architectures.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Sublattice Dichotomy in Monolayer FeSe Superconductor
Authors:
Cui Ding,
Zhipeng Xu,
Xiaotong Jiao,
Qiyin Hu,
Wenxuan Zhao,
Lexian Yang,
Kun Jiang,
**-Feng Jia,
Lili Wang,
Jiang** Hu,
Qi-Kun Xue
Abstract:
The pairing mechanism behind the monolayer FeSe is one essential question for iron-based superconductors. In this work, we show the sublattice degree of freedoms of monolayer FeSe plays a special role in its pairing properties, namely the sublattice dichotomy. The high-quality monolayer FeSe samples with atomic flat $1\times1$ topography on the SrTiO$_3$(001) substrates are grown by molecular beam…
▽ More
The pairing mechanism behind the monolayer FeSe is one essential question for iron-based superconductors. In this work, we show the sublattice degree of freedoms of monolayer FeSe plays a special role in its pairing properties, namely the sublattice dichotomy. The high-quality monolayer FeSe samples with atomic flat $1\times1$ topography on the SrTiO$_3$(001) substrates are grown by molecular beam epitaxy. By comparing the tunneling spectra at $α$ and $β$ Fe sublattices, we find the coherence peak of $α$-Fe at the inner gap $+V_i$ is higher than $β$-Fe while the coherence peak of $β$-Fe at $-V_i$ is higher than $α$-Fe with a similar amount. We also observed a reversed effect at the outer gap $\pm V_o$. We propose the $η$-pairing mechanism between $k$ and $-k+Q$ is the key mechanism for this unconventional sublattice dichotomy effect.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Search for the $e^+e^- \to φχ_{c1}(3872)$ process at BESIII
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Based on 368.5 pb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies 4.914 and 4.946 GeV by the BESIII detector, the $e^+e^- \to φχ_{c1}(3872)$ process is searched for the first time. No significant signal is observed and the upper limits at the 90\% confidence level on the product of the Born cross section $σ(e^+e^- \to φχ_{c1}(3872))$ and the branching fraction…
▽ More
Based on 368.5 pb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies 4.914 and 4.946 GeV by the BESIII detector, the $e^+e^- \to φχ_{c1}(3872)$ process is searched for the first time. No significant signal is observed and the upper limits at the 90\% confidence level on the product of the Born cross section $σ(e^+e^- \to φχ_{c1}(3872))$ and the branching fraction $\mathcal{B}[χ_{c1}(3872)\toπ^+π^- J/ψ]$ at 4.914 and 4.946 GeV are set to be 0.85 and 0.96 pb, respectively. These measurements provide useful information for the production of the $χ_{c1}(3872)$ at $e^+e^-$ collider and deepen our understanding about the nature of this particle.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Testing cosmic anisotropy with Pade approximation and Pantheon+ sample
Authors:
Jian** Hu,
Jian Hu,
Xuandong Jia,
Baoquan Gao,
Fayin Wang
Abstract:
Cosmography can be used to constrain the kinematics of universe in a model-independent way. In this work, we attempted to combine the Pad$\rm \acute{e}$ approximations with the latest Pantheon+ sample for testing cosmological principle. Based on the Pad$\rm \acute{e}$ approximations, we first gave the cosmographic constraints on the different order polynomials including third-order (Pad…
▽ More
Cosmography can be used to constrain the kinematics of universe in a model-independent way. In this work, we attempted to combine the Pad$\rm \acute{e}$ approximations with the latest Pantheon+ sample for testing cosmological principle. Based on the Pad$\rm \acute{e}$ approximations, we first gave the cosmographic constraints on the different order polynomials including third-order (Pad$\rm \acute{e}$$_{(2,1)}$), fourth-order (Pad$\rm \acute{e}$$_{(2,2)}$) and fifth-order (Pad$\rm \acute{e}$$_{(3,2)}$). Based on the Pad$\rm \acute{e}$$_{(2,1)}$ ($j_{0}$ = 1) polynomial and hemisphere comparison (HC) method, we tested the cosmological principle and found the preferred directions of cosmic anisotropy, such as (l, b) = (304.6$^{\circ}$$_{-37.4}^{+51.4}$, $-$18.7$^{\circ}$$_{-20.3}^{+14.7}$) and (311.1$^{\circ}$$_{-8.4}^{+17.4}$, $-$17.53$^{\circ}$$_{-7.7}^{+7.8}$) for $q_{0}$ and $H_{0}$, respectively. These two directions are consistent with each other in $1σ$ confidence level, but the corresponding results of statistical isotropy analyses including Isotropy and Isotropy with real positions (RP) are quite different. The statistical significance of $H_{0}$ are stronger than that of $q_{0}$, i.e., 4.75$σ$ and 4.39$σ$ for the Isotropy and Isotropy with RP respectively. Reanalysis with fixed $q_{0} = -0.55$ (corresponds to $Ω_{m}$ = 0.30) gives similar results. Overall, our model-independent results provide clear indications for a possible cosmic anisotropy, which must be taken seriously. Further test is needed to better understand this signal.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Wide Field of View Large Aperture Meta-Doublet Eyepiece
Authors:
Anna Wirth-Singh,
Johannes E. Fröch,
Fan Yang,
Louis Martin,
Hualiang Zhang,
Quentin T. Tanguy,
Zhihao Zhou,
Luocheng Huang,
Demis D. John,
Biljana Stamenic,
Juejun Hu,
Tian Gu,
Arka Majumdar
Abstract:
Wide field of view and light weight optics are critical for advanced eyewear, with applications in augmented/virtual reality and night vision. Conventional refractive lenses are often stacked to correct aberrations at wide field of view, leading to limited performance and increased size and weight. In particular, simultaneously achieving wide field of view and large aperture for light collection i…
▽ More
Wide field of view and light weight optics are critical for advanced eyewear, with applications in augmented/virtual reality and night vision. Conventional refractive lenses are often stacked to correct aberrations at wide field of view, leading to limited performance and increased size and weight. In particular, simultaneously achieving wide field of view and large aperture for light collection is desirable but challenging to realize in a compact form-factor. Here, we demonstrate a wide field of view (greater than 60$^\circ$) meta-optic doublet eyepiece with an entrance aperture of 2.1 cm. At the design wavelength of 633 nm, the meta-optic doublet achieves comparable performance to a refractive lens-based eyepiece system. This meta-doublet eyepiece illustrates the potential for meta-optics to play an important role in the development of high-quality monochrome near-eye display and night vision systems.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Perspective+ Unet: Enhancing Segmentation with Bi-Path Fusion and Efficient Non-Local Attention for Superior Receptive Fields
Authors:
**tong Hu,
Siyan Chen,
Zhiyi Pan,
Sen Zeng,
Wenming Yang
Abstract:
Precise segmentation of medical images is fundamental for extracting critical clinical information, which plays a pivotal role in enhancing the accuracy of diagnoses, formulating effective treatment plans, and improving patient outcomes. Although Convolutional Neural Networks (CNNs) and non-local attention methods have achieved notable success in medical image segmentation, they either struggle to…
▽ More
Precise segmentation of medical images is fundamental for extracting critical clinical information, which plays a pivotal role in enhancing the accuracy of diagnoses, formulating effective treatment plans, and improving patient outcomes. Although Convolutional Neural Networks (CNNs) and non-local attention methods have achieved notable success in medical image segmentation, they either struggle to capture long-range spatial dependencies due to their reliance on local features, or face significant computational and feature integration challenges when attempting to address this issue with global attention mechanisms. To overcome existing limitations in medical image segmentation, we propose a novel architecture, Perspective+ Unet. This framework is characterized by three major innovations: (i) It introduces a dual-pathway strategy at the encoder stage that combines the outcomes of traditional and dilated convolutions. This not only maintains the local receptive field but also significantly expands it, enabling better comprehension of the global structure of images while retaining detail sensitivity. (ii) The framework incorporates an efficient non-local transformer block, named ENLTB, which utilizes kernel function approximation for effective long-range dependency capture with linear computational and spatial complexity. (iii) A Spatial Cross-Scale Integrator strategy is employed to merge global dependencies and local contextual cues across model stages, meticulously refining features from various levels to harmonize global and local information. Experimental results on the ACDC and Synapse datasets demonstrate the effectiveness of our proposed Perspective+ Unet. The code is available in the supplementary material.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Model structure arising from one hereditary cotorsion pair on extriangulated categories
Authors:
Jiangsheng Hu,
Dongdong Zhang,
Panyue Zhou
Abstract:
Let $\mathcal{C}$ be a weakly idempotent complete extriangulated category. In contrast with the Hovey correspondence of admissible model structures on weakly idempotent complete exact categories from two complete cotorsion pairs, we give a construction of model structures on $\mathcal{C}$ from only one complete cotorsion pair. Our main result not only generalizes the work by Beligiannis-Reiten and…
▽ More
Let $\mathcal{C}$ be a weakly idempotent complete extriangulated category. In contrast with the Hovey correspondence of admissible model structures on weakly idempotent complete exact categories from two complete cotorsion pairs, we give a construction of model structures on $\mathcal{C}$ from only one complete cotorsion pair. Our main result not only generalizes the work by Beligiannis-Reiten and Cui-Lu-Zhang, but also provides methods to construct model structures from silting objects of $\mathcal{C}$ and co-$t$-structures in triangulated categories.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Authors:
Bofei Gao,
Zefan Cai,
Runxin Xu,
Peiyi Wang,
Ce Zheng,
Runji Lin,
Keming Lu,
Junyang Lin,
Chang Zhou,
Wen Xiao,
Junjie Hu,
Tianyu Liu,
Baobao Chang
Abstract:
Mathematical verfier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la…
▽ More
Mathematical verfier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale labels (i.e., the correctness of the current step and the explanations). In this paper, we propose \textbf{Math-Minos}, a natural language feedback enhanced verifier by constructing automatically-generated training data and a two-stage training paradigm for effective training and efficient inference. Our experiments reveal that a small set (30k) of natural language feedbacks can significantly boost the performance of the verifier by the accuracy of 1.6\% (86.6\% $\rightarrow$ 88.2\%) on GSM8K and 0.8\% (37.8\% $\rightarrow$ 38.6\%) on MATH. We have released our code and data for further exploration.
△ Less
Submitted 30 June, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Formation of super-thin galaxies in Illustris-TNG
Authors:
Jianhong Hu,
Dandan Xu,
Cheng Li
Abstract:
Superthin galaxies are observed to have stellar disks with extremely small minor-to-major axis ratios. In this work, we investigate the formation of superthin galaxies in the TNG100 simulation. We trace the merger history and investigate the evolution of galaxy properties of a selected sample of superthin galaxies and a control sample of galaxies that share the same joint probability distribution…
▽ More
Superthin galaxies are observed to have stellar disks with extremely small minor-to-major axis ratios. In this work, we investigate the formation of superthin galaxies in the TNG100 simulation. We trace the merger history and investigate the evolution of galaxy properties of a selected sample of superthin galaxies and a control sample of galaxies that share the same joint probability distribution in the stellar-mass and color diagram. Through making comparisons between the two galaxy samples, we find that present-day superthin galaxies had similar morphologies as the control sample counterparts at higher redshifts, but have developed extended flat `superthin' morphologies since $z \sim 1$. During this latter evolution stage, superthin galaxies undergo overwhelmingly higher frequency of prograde mergers (with orbit-spin angle $θ_{\rm orb} \leqslant 40^\circ$). Accordingly the spins of their dark matter halos have grown significantly and become noticeably higher than that of their normal disk counterparts. This further results in the buildup of their stellar disks at larger distances much beyond the regimes of normal disk galaxies. We also discuss the formation scenario of those superthin galaxies that live in larger dark matter halos as satellite galaxies therein.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Dynamical phase-field model of cavity electromagnonic systems
Authors:
Shihao Zhuang,
Yujie Zhu,
Changchun Zhong,
Liang Jiang,
Xufeng Zhang,
Jia-Mian Hu
Abstract:
Cavity electromagnonic system, which simultaneously consists of cavities for photons, magnons (quanta of spin waves), and acoustic phonons, provides an exciting platform to achieve coherent energy transduction among different physical systems down to single quantum level. Here we report a dynamical phase-field model that allows simulating the coupled dynamics of the electromagnetic waves, magnetiz…
▽ More
Cavity electromagnonic system, which simultaneously consists of cavities for photons, magnons (quanta of spin waves), and acoustic phonons, provides an exciting platform to achieve coherent energy transduction among different physical systems down to single quantum level. Here we report a dynamical phase-field model that allows simulating the coupled dynamics of the electromagnetic waves, magnetization, and strain in 3D multiphase systems. As examples of application, we computationally demonstrate the excitation of hybrid magnon-photon modes (magnon polaritons), Floquet-induced magnonic Aulter-Townes splitting, dynamical energy exchange (Rabi oscillation) and relative phase control (Ramsey interference) between the two magnon polariton modes. The simulation results are consistent with analytical calculations based on Floquet Hamiltonian theory. Simulations are also performed to design a cavity electro-magno-mechanical system that enables the triple phonon-magnon-photon resonance, where the resonant excitation of a chiral, fundamental (n=1) transverse acoustic phonon mode by magnon polaritons is demonstrated. With the capability to predict coupling strength, dissipation rates, and temporal evolution of photon/magnon/phonon mode profiles using fundamental materials parameters as the inputs, the present dynamical phase-field model represents a valuable computational tool to guide the fabrication of the cavity electromagnonic system and the design of operating conditions for applications in quantum sensing, transduction, and communication.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Precision measurement of the $Ξ^-_b$ baryon lifetime
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1064 additional authors not shown)
Abstract:
A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.5 fb$^{-1}$ and collected by the LHCb experiment during Run 2, is used to measure the ratio of the lifetime of the $Ξ^-_b$ baryon to that of the $Λ^0_b$ baryon, $r_τ\equivτ_{Ξ^-_b}/τ_{Λ^0_b}$. The value ${r_τ^{\rm Run\,2}=1.076\pm0.013\pm0.006}$ is obtained, where the first uncertainty is statistical and the second sys…
▽ More
A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.5 fb$^{-1}$ and collected by the LHCb experiment during Run 2, is used to measure the ratio of the lifetime of the $Ξ^-_b$ baryon to that of the $Λ^0_b$ baryon, $r_τ\equivτ_{Ξ^-_b}/τ_{Λ^0_b}$. The value ${r_τ^{\rm Run\,2}=1.076\pm0.013\pm0.006}$ is obtained, where the first uncertainty is statistical and the second systematic. This value is averaged with the corresponding value from Run 1 to obtain ${r_τ^{\rm Run\,1,2} = 1.078\pm0.012\pm0.007}$. Multiplying by the world-average value of the $Λ^0_b$ lifetime yields $τ_{Ξ^-_b}^{\rm Run~1,2} = 1.578\pm0.018\pm0.010\pm0.011$ ps, where the uncertainties are statistical, systematic, and due to the limited knowledge of the $Λ^0_b$ lifetime. This measurement improves the precision of the current world average of the $Ξ^-_b$ lifetime by about a factor of two, and is in good agreement with the most recent theoretical predictions.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Experimental verification of the optimal fingerprint method for detecting climate change
Authors:
**bo Hu,
Hong Yuan,
Letian Chen,
Nan Zhao,
C. P. Sun
Abstract:
The optimal fingerprint method serves as a potent approach for detecting and attributing climate change. However, its experimental validation encounters challenges due to the intricate nature of climate systems. Here, we experimentally examine the optimal fingerprint method simulated by a precisely controlled magnetic resonance system of spins. The spin dynamic under an applied deterministic drivi…
▽ More
The optimal fingerprint method serves as a potent approach for detecting and attributing climate change. However, its experimental validation encounters challenges due to the intricate nature of climate systems. Here, we experimentally examine the optimal fingerprint method simulated by a precisely controlled magnetic resonance system of spins. The spin dynamic under an applied deterministic driving field and a noise field is utilized to emulate the complex climate system with external forcing and internal variability. Our experimental results affirm the theoretical prediction regarding the existence of an optimal detection direction which maximizes the signal-to-noise ratio, thereby validating the optimal fingerprint method. This work offers direct empirical verification of the optimal fingerprint method, crucial for comprehending climate change and its societal impacts.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Wake dynamics of wind turbines in unsteady streamwise flow conditions
Authors:
Nathaniel J. Wei,
Adnan El Makdah,
JiaCheng Hu,
Frieder Kaiser,
David E. Rival,
John O. Dabiri
Abstract:
The unsteady flow physics of wind-turbine wakes under dynamic forcing conditions are critical to the modeling and control of wind farms for optimal power density. Unsteady forcing in the streamwise direction may be generated by unsteady inflow conditions in the atmospheric boundary layer, dynamic induction control of the turbine, or streamwise surge motions of a floating offshore wind turbine due…
▽ More
The unsteady flow physics of wind-turbine wakes under dynamic forcing conditions are critical to the modeling and control of wind farms for optimal power density. Unsteady forcing in the streamwise direction may be generated by unsteady inflow conditions in the atmospheric boundary layer, dynamic induction control of the turbine, or streamwise surge motions of a floating offshore wind turbine due to floating-platform oscillations. This study seeks to identify the dominant flow mechanisms in unsteady wakes forced by a periodic upstream inflow condition. A theoretical framework for the problem is derived, which describes traveling-wave undulations in the wake radius and streamwise velocity. These dynamics encourage the aggregation of tip vortices into large structures that are advected along in the wake. Flow measurements in the wake of a periodically surging turbine were obtained in an optically accessible towing-tank facility, with an average diameter-based Reynolds number of 300,000 and with surge-velocity amplitudes of up to 40% of the mean inflow velocity. Qualitative agreement between trends in the measurements and model predictions is observed, supporting the validity of the theoretical analyses. The experiments also demonstrate large enhancements in the recovery of the wake relative to the steady-flow case, with wake-length reductions of up to 46.5% and improvements in the available power at 10 diameters downstream of up to 15.7%. These results provide fundamental insights into the dynamics of unsteady wakes and serve as additional evidence that unsteady fluid mechanics can be leveraged to increase the power density of wind farms.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
GUICourse: From General Vision Language Models to Versatile GUI Agents
Authors:
Wentong Chen,
Junbo Cui,
**yi Hu,
Yujia Qin,
Junjie Fang,
Yue Zhao,
Chongyi Wang,
Jun Liu,
Guirong Chen,
Yupeng Huo,
Yuan Yao,
Yankai Lin,
Zhiyuan Liu,
Maosong Sun
Abstract:
Utilizing Graphic User Interface (GUI) for human-computer interaction is essential for accessing a wide range of digital tools. Recent advancements in Vision Language Models (VLMs) highlight the compelling potential to develop versatile agents to help humans finish GUI navigation tasks. However, current VLMs are challenged in terms of fundamental abilities (OCR and grounding) and GUI knowledge (th…
▽ More
Utilizing Graphic User Interface (GUI) for human-computer interaction is essential for accessing a wide range of digital tools. Recent advancements in Vision Language Models (VLMs) highlight the compelling potential to develop versatile agents to help humans finish GUI navigation tasks. However, current VLMs are challenged in terms of fundamental abilities (OCR and grounding) and GUI knowledge (the functions and control methods of GUI elements), preventing them from becoming practical GUI agents. To solve these challenges, we contribute GUICourse, a suite of datasets to train visual-based GUI agents from general VLMs. First, we introduce the GUIEnv dataset to strengthen the OCR and grounding capabilities of VLMs. Then, we introduce the GUIAct and GUIChat datasets to enrich their knowledge of GUI components and interactions. Experiments demonstrate that our GUI agents have better performance on common GUI tasks than their baseline VLMs. Even the small-size GUI agent (with 3.1B parameters) can still work well on single-step and multi-step GUI tasks. Finally, we analyze the different varieties in the training stage of this agent by ablation study. Our source codes and datasets are released at https://github.com/yiye3/GUICourse.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Asymptotic properties of a multicolored random reinforced urn model with an application to multi-armed bandits
Authors:
Li Yang,
Jiang Hu,
Jianghao Li,
Zhidong Bai
Abstract:
The random self-reinforcement mechanism, characterized by the principle of ``the rich get richer'', has demonstrated significant utility across various domains. One prominent model embodying this mechanism is the random reinforcement urn model. This paper investigates a multicolored, multiple-drawing variant of the random reinforced urn model. We establish the limiting behavior of the normalized u…
▽ More
The random self-reinforcement mechanism, characterized by the principle of ``the rich get richer'', has demonstrated significant utility across various domains. One prominent model embodying this mechanism is the random reinforcement urn model. This paper investigates a multicolored, multiple-drawing variant of the random reinforced urn model. We establish the limiting behavior of the normalized urn composition and demonstrate strong convergence upon scaling the counts of each color. Additionally, we derive strong convergence estimators for the reinforcement means, i.e., for the expectations of the replacement matrix's diagonal elements, and prove their joint asymptotic normality. It is noteworthy that the estimators of the largest reinforcement mean are asymptotically independent of the estimators of the other smaller reinforcement means. Additionally, if a reinforcement mean is not the largest, the estimators of these smaller reinforcement means will also demonstrate asymptotic independence among themselves. Furthermore, we explore the parallels between the reinforced mechanisms in random reinforced urn models and multi-armed bandits, addressing hypothesis testing for expected payoffs in the latter context.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.