-
Hierarchical Reinforcement Learning Empowered Task Offloading in V2I Networks
Authors:
Xinyu You,
Haojie Yan,
Yuedong Xu,
Lifeng Wang,
Liangui Dai
Abstract:
Edge computing plays an essential role in the vehicle-to-infrastructure (V2I) networks, where vehicles offload their intensive computation tasks to the road-side units for saving energy and reduce the latency. This paper designs the optimal task offloading policy to address the concerns involving processing delay, energy consumption and edge computing cost. Each computation task consisting of some…
▽ More
Edge computing plays an essential role in the vehicle-to-infrastructure (V2I) networks, where vehicles offload their intensive computation tasks to the road-side units for saving energy and reduce the latency. This paper designs the optimal task offloading policy to address the concerns involving processing delay, energy consumption and edge computing cost. Each computation task consisting of some interdependent sub-tasks is characterized as a directed acyclic graph (DAG). In such dynamic networks, a novel hierarchical Offloading scheme is proposed by leveraging deep reinforcement learning (DRL). The inter-dependencies among the DAGs of the computation tasks are extracted using a graph neural network with attention mechanism. A parameterized DRL algorithm is developed to deal with the hierarchical action space containing both discrete and continuous actions. Simulation results with a real-world car speed dataset demonstrate that the proposed scheme can effectively reduce the system overhead.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing
Authors:
Chengqing Yu,
Fei Wang,
Zezhi Shao,
Tangwen Qian,
Zhao Zhang,
Wei Wei,
Yongjun Xu
Abstract:
Multivariate time series forecasting (MTSF) is crucial for decision-making to precisely forecast the future values/trends, based on the complex relationships identified from historical observations of multiple sequences. Recently, Spatial-Temporal Graph Neural Networks (STGNNs) have gradually become the theme of MTSF model as their powerful capability in mining spatial-temporal dependencies, but a…
▽ More
Multivariate time series forecasting (MTSF) is crucial for decision-making to precisely forecast the future values/trends, based on the complex relationships identified from historical observations of multiple sequences. Recently, Spatial-Temporal Graph Neural Networks (STGNNs) have gradually become the theme of MTSF model as their powerful capability in mining spatial-temporal dependencies, but almost of them heavily rely on the assumption of historical data integrity. In reality, due to factors such as data collector failures and time-consuming repairment, it is extremely challenging to collect the whole historical observations without missing any variable. In this case, STGNNs can only utilize a subset of normal variables and easily suffer from the incorrect spatial-temporal dependency modeling issue, resulting in the degradation of their forecasting performance. To address the problem, in this paper, we propose a novel Graph Interpolation Attention Recursive Network (named GinAR) to precisely model the spatial-temporal dependencies over the limited collected data for forecasting. In GinAR, it consists of two key components, that is, interpolation attention and adaptive graph convolution to take place of the fully connected layer of simple recursive units, and thus are capable of recovering all missing variables and reconstructing the correct spatial-temporal dependencies for recursively modeling of multivariate time series data, respectively. Extensive experiments conducted on five real-world datasets demonstrate that GinAR outperforms 11 SOTA baselines, and even when 90% of variables are missing, it can still accurately predict the future values of all variables.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
SimAD: A Simple Dissimilarity-based Approach for Time Series Anomaly Detection
Authors:
Zhijie Zhong,
Zhiwen Yu,
Xing Xi,
Yue Xu,
Jiahui Chen,
Kaixiang Yang
Abstract:
Despite the prevalence of reconstruction-based deep learning methods, time series anomaly detection remains challenging. Existing approaches often struggle with limited temporal contexts, inadequate representation of normal patterns, and flawed evaluation metrics, hindering their effectiveness in identifying aberrant behavior. To address these issues, we introduce $\textbf{SimAD}$, a…
▽ More
Despite the prevalence of reconstruction-based deep learning methods, time series anomaly detection remains challenging. Existing approaches often struggle with limited temporal contexts, inadequate representation of normal patterns, and flawed evaluation metrics, hindering their effectiveness in identifying aberrant behavior. To address these issues, we introduce $\textbf{SimAD}$, a $\textbf{Sim}$ple dissimilarity-based approach for time series $\textbf{A}$nomaly $\textbf{D}$etection. SimAD incorporates an advanced feature extractor adept at processing extended temporal windows, utilizes the EmbedPatch encoder to integrate normal behavioral patterns comprehensively, and introduces an innovative ContrastFusion module designed to accentuate distributional divergences between normal and abnormal data, thereby enhancing the robustness of anomaly discrimination. Additionally, we propose two robust evaluation metrics, UAff and NAff, addressing the limitations of existing metrics and demonstrating their reliability through theoretical and experimental analyses. Experiments across $\textbf{seven}$ diverse time series datasets demonstrate SimAD's superior performance compared to state-of-the-art methods, achieving relative improvements of $\textbf{19.85%}$ on F1, $\textbf{4.44%}$ on Aff-F1, $\textbf{77.79%}$ on NAff-F1, and $\textbf{9.69%}$ on AUC on six multivariate datasets. Code and pre-trained models are available at https://github.com/EmorZz1G/SimAD.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Linear canonical space-time transform and convolution theorems
Authors:
Yi-Qiao Xu,
Bing-Zhao Li
Abstract:
Following the idea of the fractional space-time Fourier transform, a linear canonical space-time transform for 16-dimensional space-time $C\ell_{3,1}$-valued signals is investigated in this paper. First, the definition of the proposed linear canonical space-time transform is given, and some related properties of this transform are obtained. Second, the convolution operator and the corresponding co…
▽ More
Following the idea of the fractional space-time Fourier transform, a linear canonical space-time transform for 16-dimensional space-time $C\ell_{3,1}$-valued signals is investigated in this paper. First, the definition of the proposed linear canonical space-time transform is given, and some related properties of this transform are obtained. Second, the convolution operator and the corresponding convolution theorem are proposed. Third, the convolution theorem associated with the two-sided linear canonical space-time transform is derived.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Manifold-based Incomplete Multi-view Clustering via Bi-Consistency Guidance
Authors:
Huibing Wang,
Mingze Yao,
Yawei Chen,
Yunqiu Xu,
Haipeng Liu,
Wei Jia,
** Fu,
Yang Wang
Abstract:
Incomplete multi-view clustering primarily focuses on dividing unlabeled data into corresponding categories with missing instances, and has received intensive attention due to its superiority in real applications. Considering the influence of incomplete data, the existing methods mostly attempt to recover data by adding extra terms. However, for the unsupervised methods, a simple recovery strategy…
▽ More
Incomplete multi-view clustering primarily focuses on dividing unlabeled data into corresponding categories with missing instances, and has received intensive attention due to its superiority in real applications. Considering the influence of incomplete data, the existing methods mostly attempt to recover data by adding extra terms. However, for the unsupervised methods, a simple recovery strategy will cause errors and outlying value accumulations, which will affect the performance of the methods. Broadly, the previous methods have not taken the effectiveness of recovered instances into consideration, or cannot flexibly balance the discrepancies between recovered data and original data. To address these problems, we propose a novel method termed Manifold-based Incomplete Multi-view clustering via Bi-consistency guidance (MIMB), which flexibly recovers incomplete data among various views, and attempts to achieve biconsistency guidance via reverse regularization. In particular, MIMB adds reconstruction terms to representation learning by recovering missing instances, which dynamically examines the latent consensus representation. Moreover, to preserve the consistency information among multiple views, MIMB implements a biconsistency guidance strategy with reverse regularization of the consensus representation and proposes a manifold embedding measure for exploring the hidden structure of the recovered data. Notably, MIMB aims to balance the importance of different views, and introduces an adaptive weight term for each view. Finally, an optimization algorithm with an alternating iteration optimization strategy is designed for final clustering. Extensive experimental results on 6 benchmark datasets are provided to confirm that MIMB can significantly obtain superior results as compared with several state-of-the-art baselines.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers
Authors:
Kaiyu Huang,
Fengran Mo,
Hongliang Li,
You Li,
Yuanchi Zhang,
Weijian Yi,
Yulong Mao,
**chen Liu,
Yuzhuang Xu,
**an Xu,
Jian-Yun Nie,
Yang Liu
Abstract:
The rapid development of Large Language Models (LLMs) demonstrates remarkable multilingual capabilities in natural language processing, attracting global attention in both academia and industry. To mitigate potential discrimination and enhance the overall usability and accessibility for diverse language user groups, it is important for the development of language-fair technology. Despite the break…
▽ More
The rapid development of Large Language Models (LLMs) demonstrates remarkable multilingual capabilities in natural language processing, attracting global attention in both academia and industry. To mitigate potential discrimination and enhance the overall usability and accessibility for diverse language user groups, it is important for the development of language-fair technology. Despite the breakthroughs of LLMs, the investigation into the multilingual scenario remains insufficient, where a comprehensive survey to summarize recent approaches, developments, limitations, and potential solutions is desirable. To this end, we provide a survey with multiple perspectives on the utilization of LLMs in the multilingual scenario. We first rethink the transitions between previous and current research on pre-trained language models. Then we introduce several perspectives on the multilingualism of LLMs, including training and inference methods, model security, multi-domain with language culture, and usage of datasets. We also discuss the major challenges that arise in these aspects, along with possible solutions. Besides, we highlight future research directions that aim at further enhancing LLMs with multilingualism. The survey aims to help the research community address multilingual problems and provide a comprehensive understanding of the core concepts, key techniques, and latest developments in multilingual natural language processing based on LLMs.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Steady-State Convergence of the Continuous-Time JSQ System with General Distributions in Heavy Traffic
Authors:
J. G. Dai,
** Guang,
Yaosheng Xu
Abstract:
This paper studies the continuous-time join-the-shortest-queue (JSQ) system with general interarrival and service distributions. Under a much weaker assumption than the one in the literature, we prove that each station's scaled steady-state queue length weakly converges to an identical exponential random variable in heavy traffic. Specifically, we establish our results by only assuming $2+δ_0$ mom…
▽ More
This paper studies the continuous-time join-the-shortest-queue (JSQ) system with general interarrival and service distributions. Under a much weaker assumption than the one in the literature, we prove that each station's scaled steady-state queue length weakly converges to an identical exponential random variable in heavy traffic. Specifically, we establish our results by only assuming $2+δ_0$ moment on the arrival and service distributions for some $δ_0>0$. Our proof exploits the Palm version of the basic adjoint relationship (BAR) approach as a key technique.
△ Less
Submitted 3 June, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Fast transport and splitting of spin-orbit-coupled spin-1 Bose-Einstein Condensates
Authors:
Yaning Xu,
Yuanyuan Chen,
Xi Chen
Abstract:
In this study, we investigate the dynamics of tunable spin-orbit-coupled spin-1 Bose-Einstein condensates confined within a harmonic trap, focusing on rapid transport, spin manipulation, and splitting dynamics. Using shortcuts to adiabaticity, we design time-dependent trap trajectories and spin-orbit-coupling strength to facilitate fast transport with simultaneous spin flip. Additionally, we showc…
▽ More
In this study, we investigate the dynamics of tunable spin-orbit-coupled spin-1 Bose-Einstein condensates confined within a harmonic trap, focusing on rapid transport, spin manipulation, and splitting dynamics. Using shortcuts to adiabaticity, we design time-dependent trap trajectories and spin-orbit-coupling strength to facilitate fast transport with simultaneous spin flip. Additionally, we showcase the creation of spin-dependent coherent states via engineering the spin-orbit-coupling strength. To deepen our understanding, we elucidate non-adiabatic transport and associated spin dynamics, contrasting them with simple scenarios characterized by constant spin-orbit coupling and trap velocity. Furthermore, we discuss the transverse Zeeman potential and nonlinear effect induced by interatomic interactions using the Gross-Pitaevskii equation, highlighting the stability and feasibility of the proposed protocols for the state-of-the-art experiments with cold atoms.
△ Less
Submitted 20 May, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Optimization-Aided Construction of Multivariate Chebyshev Polynomials
Authors:
Mareike Dressler,
Simon Foucart,
Etienne de Klerk,
Mioara Joldes,
Jean Bernard Lasserre,
Yuan Xu
Abstract:
This article is concerned with an extension of univariate Chebyshev polynomials of the first kind to the multivariate setting, where one chases best approximants to specific monomials by polynomials of lower degree relative to the uniform norm. Exploiting the Moment-SOS hierarchy, we devise a versatile semidefinite-programming-based procedure to compute such best approximants, as well as associate…
▽ More
This article is concerned with an extension of univariate Chebyshev polynomials of the first kind to the multivariate setting, where one chases best approximants to specific monomials by polynomials of lower degree relative to the uniform norm. Exploiting the Moment-SOS hierarchy, we devise a versatile semidefinite-programming-based procedure to compute such best approximants, as well as associated signatures. Applying this procedure in three variables leads to the values of best approximation errors for all mononials up to degree six on the euclidean ball, the simplex, and the cross-polytope. Furthermore, inspired by numerical experiments, we obtain explicit expressions for Chebyshev polynomials in two cases unresolved before, namely for the monomial $x_1^2 x_2^2 x_3$ on the euclidean ball and for the monomial $x_1^2 x_2 x_3$ on the simplex.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Diffusive Limit of the Vlasov-Maxwell-Boltzmann System without Angular Cutoff
Authors:
Yuan Xu,
Fujun Zhou,
Weihua Gong,
Weijun Wu
Abstract:
Diffusive limit of the non-cutoff Vlasov-Maxwell-Boltzmann system in perturbation framework still remains open. By employing a new weight function and making full use of the anisotropic dissipation property of the non-cutoff linearized Boltzmann operator, we solve this problem with some novel treatments for non-cutoff potentials $γ> \max\{-3, -\frac{3}{2}-2s\}$, including both strong angular singu…
▽ More
Diffusive limit of the non-cutoff Vlasov-Maxwell-Boltzmann system in perturbation framework still remains open. By employing a new weight function and making full use of the anisotropic dissipation property of the non-cutoff linearized Boltzmann operator, we solve this problem with some novel treatments for non-cutoff potentials $γ> \max\{-3, -\frac{3}{2}-2s\}$, including both strong angular singularity $\frac{1}{2} \leq s <1$ and weak angular singularity $0 < s < \frac{1}{2}$. Uniform estimate with respect to the Knudsen number $\varepsilon\in (0,1]$ is established globally in time, which eventually leads to the global existence of solutions to the non-cutoff Vlasov-Maxwell-Boltzmann system as well as hydrodynamic limit to the two-fluid incompressible Navier-Stokes-Fourier-Maxwell system with Ohm's law. The indicators $γ> \max\{-3, -\frac{3}{2}-2s\}$ and $0 < s <1$ in this paper cover all ranges that can be achieved by the previously established global solutions to the non-cutoff Vlasov-Maxwell-Boltzmann system in perturbation framework.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Libra: Building Decoupled Vision System on Large Language Models
Authors:
Yifan Xu,
Xiaoshan Yang,
Yaguang Song,
Changsheng Xu
Abstract:
In this work, we introduce Libra, a prototype model with a decoupled vision system on a large language model (LLM). The decoupled vision system decouples inner-modal modeling and cross-modal interaction, yielding unique visual information modeling and effective cross-modal comprehension. Libra is trained through discrete auto-regressive modeling on both vision and language inputs. Specifically, we…
▽ More
In this work, we introduce Libra, a prototype model with a decoupled vision system on a large language model (LLM). The decoupled vision system decouples inner-modal modeling and cross-modal interaction, yielding unique visual information modeling and effective cross-modal comprehension. Libra is trained through discrete auto-regressive modeling on both vision and language inputs. Specifically, we incorporate a routed visual expert with a cross-modal bridge module into a pretrained LLM to route the vision and language flows during attention computing to enable different attention patterns in inner-modal modeling and cross-modal interaction scenarios. Experimental results demonstrate that the dedicated design of Libra achieves a strong MLLM baseline that rivals existing works in the image-to-text scenario with merely 50 million training data, providing a new perspective for future multimodal foundation models. Code is available at https://github.com/YifanXu74/Libra.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection
Authors:
Siliang Ma,
Yong Xu
Abstract:
Bounding box regression is one of the important steps of object detection. However, rotation detectors often involve a more complicated loss based on SkewIoU which is unfriendly to gradient-based training. Most of the existing loss functions for rotated object detection calculate the difference between two bounding boxes only focus on the deviation of area or each points distance (e.g.,…
▽ More
Bounding box regression is one of the important steps of object detection. However, rotation detectors often involve a more complicated loss based on SkewIoU which is unfriendly to gradient-based training. Most of the existing loss functions for rotated object detection calculate the difference between two bounding boxes only focus on the deviation of area or each points distance (e.g., $\mathcal{L}_{Smooth-\ell 1}$, $\mathcal{L}_{RotatedIoU}$ and $\mathcal{L}_{PIoU}$). The calculation process of some loss functions is extremely complex (e.g. $\mathcal{L}_{KFIoU}$). In order to improve the efficiency and accuracy of bounding box regression for rotated object detection, we proposed a novel metric for arbitrary shapes comparison based on minimum points distance, which takes most of the factors from existing loss functions for rotated object detection into account, i.e., the overlap or nonoverlap** area, the central points distance and the rotation angle. We also proposed a loss function called $\mathcal{L}_{FPDIoU}$ based on four points distance for accurate bounding box regression focusing on faster and high quality anchor boxes. In the experiments, $FPDIoU$ loss has been applied to state-of-the-art rotated object detection (e.g., RTMDET, H2RBox) models training with three popular benchmarks of rotated object detection including DOTA, DIOR, HRSC2016 and two benchmarks of arbitrary orientation scene text detection including ICDAR 2017 RRC-MLT and ICDAR 2019 RRC-MLT, which achieves better performance than existing loss functions.
△ Less
Submitted 19 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Ferroelectricity Driven by Orbital Resonance of Protons in CH$_3$NH$_3$Cl and CH$_3$NH$_3$Br
Authors:
Chu Xin Peng,
Lei Meng,
Yi Yang Xu,
Tian Tian Xing,
Miao Miao Zhao,
Peng Ren,
Fei Yen
Abstract:
The $β$ and $γ$ phases of methylammonium chloride CH$_3$NH$_3$Cl and methylammonium bromide CH$_3$NH$_3$Br are identified to be ferroelectric $via$ pyroelectric current and dielectric constant measurements. The magnetic susceptibility also exhibits pronounced discontinuities at the Curie temperatures. We attribute the origin of spontaneous polarization to the emergence of two groups of proton orbi…
▽ More
The $β$ and $γ$ phases of methylammonium chloride CH$_3$NH$_3$Cl and methylammonium bromide CH$_3$NH$_3$Br are identified to be ferroelectric $via$ pyroelectric current and dielectric constant measurements. The magnetic susceptibility also exhibits pronounced discontinuities at the Curie temperatures. We attribute the origin of spontaneous polarization to the emergence of two groups of proton orbital magnetic moments from the uncorrelated motion of the CH$_3$ and NH$_3$ groups in the $β$ and $γ$ phases. The two inequivalent frameworks of intermolecular orbital resonances interact with each other to distort the lattice in a non-centrosymmetric fashion. Our findings indicate that the structural instabilities in molecular frameworks are magnetic in origin as well as provide a new pathway toward uncovering new organic ferroelectrics.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Probing Reheating with Gravitational Waves from Graviton Bremsstrahlung
Authors:
Basabendu Barman,
Nicolás Bernal,
Simon Cléry,
Yann Mambrini,
Yong Xu,
Óscar Zapata
Abstract:
In this talk, based on arXiv:2301.11345, arXiv:2305.16388, arXiv:2311.12694, we discuss the production of primordial gravitational waves (GW) sourced by graviton bremsstrahlung during inflationary reheating. For reheating, we consider inflaton decays and annihilations into pairs of bosons or fermions, assuming an inflaton $φ$ that oscillates around a generic monomial potential $V(φ) \propto φ^n$.…
▽ More
In this talk, based on arXiv:2301.11345, arXiv:2305.16388, arXiv:2311.12694, we discuss the production of primordial gravitational waves (GW) sourced by graviton bremsstrahlung during inflationary reheating. For reheating, we consider inflaton decays and annihilations into pairs of bosons or fermions, assuming an inflaton $φ$ that oscillates around a generic monomial potential $V(φ) \propto φ^n$. The GW spectrum exhibits distinct features depending on the underlying reheating dynamics, which is controlled by the inflaton potential and the type of coupling between the inflaton and the matter fields. We show that the produced stochastic GW background could be probed in next-generation GW detectors, especially at high frequencies. We further highlight the potential of bremsstrahlung-induced GW to probe the underlying dynamics of reheating.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies (AGNSTRONG). I. Sample and Strategy
Authors:
Huynh Anh N. Le,
Chen Qin,
Yongquan Xue,
Shifu Zhu,
Kim Ngan N. Nguyen,
Ruisong Xia,
Xiaozhi Lin
Abstract:
We introduce our project, AGNSTRONG (Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies). Our research goals encompass investigating the kinematic properties of ionized and molecular gas outflows, understanding the impact of AGN feedback, and exploring the coevolution dynamics between AGN strength activity and star formation activity. We aim to conduct a thorough analysis to determine wh…
▽ More
We introduce our project, AGNSTRONG (Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies). Our research goals encompass investigating the kinematic properties of ionized and molecular gas outflows, understanding the impact of AGN feedback, and exploring the coevolution dynamics between AGN strength activity and star formation activity. We aim to conduct a thorough analysis to determine whether there is an increase or suppression in SFRs among targets with and without powerful relativistic jets. Our sample consists of 35 nearby AGNs with and without powerful relativistic jet detections. Utilizing sub-millimeter (sub-mm) continuum observations at 450 μm and 850 μm from SCUBA-2 at the James Clerk Maxwell Telescope, we determine star-formation rates (SFRs) for our sources using spectral energy distribution (SED) fitting models. Additionally, we employ high-quality, spatially resolved spectra from UV-optical to near-infrared bands obtained with the Double Spectrograph and Triple Spectrograph mounted on the 200-inch Hale telescope at Palomar Observatory to study their multiphase gas outflow properties. This paper presents an overview of our sample selection methodology, research strategy, and initial results of our project. We find that the SFRs determined without including the sub-mm data in the SED fitting are overestimated by approximately 0.08 dex compared to those estimated with the inclusion of sub-mm data. Additionally, we compare the estimated SFRs in our work with those traced by the 4000Å break, as provided by the MPA-JHU catalog. We find that our determined SFRs are systematically higher than those traced by the 4000Å break. Finally, we outline our future research plans.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Magnetic interactions based on proton orbital motion in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$
Authors:
Lei Meng,
Miao Miao Zhao,
Yi Yang Xu,
Chu Xin Peng,
Yang Yang,
Tian Tian Xing,
Peng Ren,
Fei Yen
Abstract:
The microscopic origin of the remarkable optoelectronic properties of one of the most studied contemporary materials remains unclear. Here, we identify the existence of magnetic interactions between intermolecular proton orbitals in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$. In particular, a unique sharp drop and a pronounced step-up discontinuity in the magnetic susceptibility at the tetragona…
▽ More
The microscopic origin of the remarkable optoelectronic properties of one of the most studied contemporary materials remains unclear. Here, we identify the existence of magnetic interactions between intermolecular proton orbitals in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$. In particular, a unique sharp drop and a pronounced step-up discontinuity in the magnetic susceptibility at the tetragonal-to-cubic phase transitions are identified in CH$_3$NH$_3$PbI$_3$ and CH$_3$NH$_3$PbBr$_3$, respectively. The magnetic interactions in the orthorhombic and tetragonal phases are dependent on thermal history and lattice orientation while nearly independent of the applied external magnetic field. In CH$_3$NH$_3$PbBr$_3$, the CH$_3$ and NH$_3$$^+$ components reorient in an uncorrelated fashion resulting the cubic phase to also exhibit magnetic anisotropy. Our findings provide a potential link connecting the highly light-absorbing CH$_3$NH$_3$$^+$ and the exceptional properties of the charge carriers of the inorganic framework in hybrid perovskite solar cells.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
M. Albrecht,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
R. Baldini Ferroli,
I. Balossino,
Y. Ban,
V. Batozskaya,
D. Becker,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko
, et al. (559 additional authors not shown)
Abstract:
We present the first search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ by analyzing a data sample of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.178 and 4.226 GeV, corresponding to an integrated luminosity of 6.32~fb$^{-1}$. No significant signal is observed. The upper limits on the branching fractions for…
▽ More
We present the first search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ by analyzing a data sample of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.178 and 4.226 GeV, corresponding to an integrated luminosity of 6.32~fb$^{-1}$. No significant signal is observed. The upper limits on the branching fractions for $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ are set to be $1.1 \times 10^{-5}$ and $4.3 \times 10^{-6}$ at 90\% confidence level, respectively.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Dynamic Surfactants Drive Anisotropic Colloidal Assembly
Authors:
Yaxin Xu,
Prabhat Jandhyala,
Sho C. Takatori
Abstract:
Colloidal building blocks with re-configurable shapes and dynamic interactions can exhibit unusual self-assembly behaviors and pathways. In this work, we consider the phase behavior of colloids coated with surface-mobile polymer brushes that behave as "dynamic surfactants." Unlike traditional polymer-grafted colloids, we show that colloids coated with dynamic surfactants can acquire anisotropic ma…
▽ More
Colloidal building blocks with re-configurable shapes and dynamic interactions can exhibit unusual self-assembly behaviors and pathways. In this work, we consider the phase behavior of colloids coated with surface-mobile polymer brushes that behave as "dynamic surfactants." Unlike traditional polymer-grafted colloids, we show that colloids coated with dynamic surfactants can acquire anisotropic macroscopic assemblies, even for spherical colloids with isotropic attractive interactions. We use Brownian Dynamics simulations and dynamic density functional theory (DDFT) to demonstrate that time-dependent reorganization of the dynamic surfactants leads to phase diagrams with anisotropic assemblies. We observed that the microscopic polymer distributions impose unique geometric constraints between colloids that control their packing into lamellar, string, and vesicle phases. Our work may help discover versatile building blocks and provide extensive design freedom for assembly out of thermodynamic equilibrium.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
Authors:
Ziquan Liu,
Yufei Cui,
Yan Yan,
Yi Xu,
Xiangyang Ji,
Xue Liu,
Antoni B. Chan
Abstract:
In safety-critical applications such as medical imaging and autonomous driving, where decisions have profound implications for patient health and road safety, it is imperative to maintain both high adversarial robustness to protect against potential adversarial attacks and reliable uncertainty quantification in decision-making. With extensive research focused on enhancing adversarial robustness th…
▽ More
In safety-critical applications such as medical imaging and autonomous driving, where decisions have profound implications for patient health and road safety, it is imperative to maintain both high adversarial robustness to protect against potential adversarial attacks and reliable uncertainty quantification in decision-making. With extensive research focused on enhancing adversarial robustness through various forms of adversarial training (AT), a notable knowledge gap remains concerning the uncertainty inherent in adversarially trained models. To address this gap, this study investigates the uncertainty of deep learning models by examining the performance of conformal prediction (CP) in the context of standard adversarial attacks within the adversarial defense community. It is first unveiled that existing CP methods do not produce informative prediction sets under the commonly used $l_{\infty}$-norm bounded attack if the model is not adversarially trained, which underpins the importance of adversarial training for CP. Our paper next demonstrates that the prediction set size (PSS) of CP using adversarially trained models with AT variants is often worse than using standard AT, inspiring us to research into CP-efficient AT for improved PSS. We propose to optimize a Beta-weighting loss with an entropy minimization regularizer during AT to improve CP-efficiency, where the Beta-weighting loss is shown to be an upper bound of PSS at the population level by our theoretical analysis. Moreover, our empirical study on four image classification datasets across three popular AT baselines validates the effectiveness of the proposed Uncertainty-Reducing AT (AT-UR).
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Authors:
Lingdong Kong,
Shaoyuan Xie,
Hanjiang Hu,
Yaru Niu,
Wei Tsang Ooi,
Benoit R. Cottereau,
Lai Xing Ng,
Yuexin Ma,
Wenwei Zhang,
Liang Pan,
Kai Chen,
Ziwei Liu,
Weichao Qiu,
Wei Zhang,
Xu Cao,
Hao Lu,
Ying-Cong Chen,
Caixin Kang,
Xinning Zhou,
Chengyang Ying,
Wentao Shang,
Xingxing Wei,
Yinpeng Dong,
Bo Yang,
Shengyin Jiang
, et al. (66 additional authors not shown)
Abstract:
In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c…
▽ More
In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that can withstand and adapt to these real-world variabilities. Focusing on four pivotal tasks -- BEV detection, map segmentation, semantic occupancy prediction, and multi-view depth estimation -- the competition laid down a gauntlet to innovate and enhance system resilience against typical and atypical disturbances. This year's challenge consisted of five distinct tracks and attracted 140 registered teams from 93 institutes across 11 countries, resulting in nearly one thousand submissions evaluated through our servers. The competition culminated in 15 top-performing solutions, which introduced a range of innovative approaches including advanced data augmentation, multi-sensor fusion, self-supervised learning for error correction, and new algorithmic strategies to enhance sensor robustness. These contributions significantly advanced the state of the art, particularly in handling sensor inconsistencies and environmental variability. Participants, through collaborative efforts, pushed the boundaries of current technologies, showcasing their potential in real-world scenarios. Extensive evaluations and analyses provided insights into the effectiveness of these solutions, highlighting key trends and successful strategies for improving the resilience of driving perception systems. This challenge has set a new benchmark in the field, providing a rich repository of techniques expected to guide future research in this field.
△ Less
Submitted 29 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Nonreciprocal quantum phase transition in a spinning microwave magnonic system
Authors:
Ye-jun Xu,
Long-hua Zhai,
Peng Fu,
Shou-**g Cheng,
Guo-Qiang Zhang
Abstract:
We propose how to achieve nonreciprocal quantum phase transition in a spinning microwave magnonic system composed of a spinning microwave resonator coupled with an yttrium iron garnet sphere with magnon Kerr effect. Sagnac-Fizeau shift caused by the spinning of the resonator brings about a significant modification in the critical driving strengths for second- and one-order quantum phase transition…
▽ More
We propose how to achieve nonreciprocal quantum phase transition in a spinning microwave magnonic system composed of a spinning microwave resonator coupled with an yttrium iron garnet sphere with magnon Kerr effect. Sagnac-Fizeau shift caused by the spinning of the resonator brings about a significant modification in the critical driving strengths for second- and one-order quantum phase transitions, which means that the highly controllable quantum phase can be realized by the spinning speed of the resonator. More importantly, based on the difference in the detunings of the counterclockwise and clockwise modes induced by spinning direction of the resonator, the phase transition in this system is nonreciprocal, that is, the quantum phase transition occurs when the system is driven in one direction but not the other. Our work offers an alternative path to engineer and design nonreciprocal magnonic devices.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (635 additional authors not shown)
Abstract:
Using 9.0 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies from 4.178 to 4.278 GeV with the BESIII detector at the BEPCII collider, we perform the first search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$. No $χ_{c1}(3872)\toγψ_2(3823)$ signal is observed. The upper limit on the ratio of branching fractions…
▽ More
Using 9.0 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies from 4.178 to 4.278 GeV with the BESIII detector at the BEPCII collider, we perform the first search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$. No $χ_{c1}(3872)\toγψ_2(3823)$ signal is observed. The upper limit on the ratio of branching fractions $\mathcal{B}(χ_{c1}(3872)\toγψ_2(3823), ψ_2(3823)\toγχ_{c1})/\mathcal{B}(χ_{c1}(3872)\toπ^+π^- J/ψ)$ is set as 0.075 at the 90\% confidence level. Our result contradicts theoretical predictions under the assumption that the $χ_{c1}(3872)$ is the pure charmonium state $χ_{c1}(2P)$.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
DoLLM: How Large Language Models Understanding Network Flow Data to Detect Carpet Bombing DDoS
Authors:
Qingyang Li,
Yihang Zhang,
Zhidong Jia,
Yannan Hu,
Lei Zhang,
Jianrong Zhang,
Yongming Xu,
Yong Cui,
Zongming Guo,
Xinggong Zhang
Abstract:
It is an interesting question Can and How Large Language Models (LLMs) understand non-language network data, and help us detect unknown malicious flows. This paper takes Carpet Bombing as a case study and shows how to exploit LLMs' powerful capability in the networking area. Carpet Bombing is a new DDoS attack that has dramatically increased in recent years, significantly threatening network infra…
▽ More
It is an interesting question Can and How Large Language Models (LLMs) understand non-language network data, and help us detect unknown malicious flows. This paper takes Carpet Bombing as a case study and shows how to exploit LLMs' powerful capability in the networking area. Carpet Bombing is a new DDoS attack that has dramatically increased in recent years, significantly threatening network infrastructures. It targets multiple victim IPs within subnets, causing congestion on access links and disrupting network services for a vast number of users. Characterized by low-rates, multi-vectors, these attacks challenge traditional DDoS defenses. We propose DoLLM, a DDoS detection model utilizes open-source LLMs as backbone. By reorganizing non-contextual network flows into Flow-Sequences and projecting them into LLMs semantic space as token embeddings, DoLLM leverages LLMs' contextual understanding to extract flow representations in overall network context. The representations are used to improve the DDoS detection performance. We evaluate DoLLM with public datasets CIC-DDoS2019 and real NetFlow trace from Top-3 countrywide ISP. The tests have proven that DoLLM possesses strong detection capabilities. Its F1 score increased by up to 33.3% in zero-shot scenarios and by at least 20.6% in real ISP traces.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Existence of non-Abelian vortices in a coupled 4D-2D quantum field theory
Authors:
Yilu Xu,
Shouxin Chen
Abstract:
Vortices produce locally concentrated field configurations and are solutions to the nonlinear partial differential equations systems of complicated structures. In this paper, we establish the existence and uniqueness for solutions of the gauged non-Abelian vortices in a coupled 4D-2D quantum field theory by researching the nonlinear elliptic equations systems with exponential terms in…
▽ More
Vortices produce locally concentrated field configurations and are solutions to the nonlinear partial differential equations systems of complicated structures. In this paper, we establish the existence and uniqueness for solutions of the gauged non-Abelian vortices in a coupled 4D-2D quantum field theory by researching the nonlinear elliptic equations systems with exponential terms in $\mathbb{R}^{2}$ using the calculus of variations. In addition, we obtain the asymptotic behavior of the solutions at infinity and the quantized integrals in $\mathbb{R}^{2}$.
△ Less
Submitted 30 May, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
A systematic study of the ultra-fast outflow responses to luminosity variations in active galactic nuclei
Authors:
Yerong Xu,
Ciro Pinto,
Daniele Rogantini,
Didier Barret,
Stefano Bianchi,
Matteo Guainazzi,
Jacobo Ebrero,
William Alston,
Erin Kara,
Giancarlo Cusumano
Abstract:
The extreme velocities and high ionization states of ultra-fast outflows (UFOs) make them a promising candidate for AGN feedback on the evolution of the host galaxy. However, their exact underlying driving mechanism is not yet fully understood. Given that the variability of UFOs may be used to distinguish among different launching mechanisms, we aim to search for and characterize the responses of…
▽ More
The extreme velocities and high ionization states of ultra-fast outflows (UFOs) make them a promising candidate for AGN feedback on the evolution of the host galaxy. However, their exact underlying driving mechanism is not yet fully understood. Given that the variability of UFOs may be used to distinguish among different launching mechanisms, we aim to search for and characterize the responses of the UFO properties to the variable irradiating luminosity. We performed a high-resolution spectroscopy of archival XMM-Newton observations on six highly-accreting NLS1 galaxies. The state-of-the-art methods of the blind Gaussian line scan and photoionization model scan are used to identify UFO solutions. We search for ionized winds and investigate the structure of ionized winds and their responses to the luminosity variations. The powerful photoionization model scan reveals three previously unreported UFOs in RE J1034+396, PG 1244+026 and I ZW 1, and two new WAs in RE J1034+396. The entrained UFOs are discovered in 4 (66%) AGN, supporting the shocked outflow interpretation for AGN ionized winds. 2 out of 7 (28%) UFOs seem to respond to the continuum and 3 (43%) UFOs hint at a radiatively accelerated nature. Combined with published works, we do not find any correlations between UFO responses and AGN properties except for a tentative ($\sim1.8σ$) anti-correlation between the UFO acceleration and the Eddington ratio, to be confirmed by further observations and an enlarged sample. The kinetic energy of UFOs, mostly detected in soft X-rays, is found to have a large uncertainty. We, therefore, cannot conclude whether soft X-ray UFOs have sufficient energy to drive the AGN feedback, although they are very promising based on some reasonable assumptions.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment
Authors:
L. T. Yang,
S. K. Liu,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
Y. H. Chen,
J. P. Cheng,
W. H. Dai,
Z. Deng,
C. H. Fang,
X. P. Geng,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
J. R. He,
J. W. Hu,
H. X. Huang,
T. C. Huang,
L. Jiang,
S. Karmakar
, et al. (61 additional authors not shown)
Abstract:
We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China **** Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio…
▽ More
We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China **** Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axions with mass up to 100 eV/$c^2$. Within the hadronic model of KSVZ, our results exclude axion mass $>5.3~\rm{eV}/c^2$ at 95\% C.L.
△ Less
Submitted 12 May, 2024;
originally announced May 2024.
-
Robust Semi-supervised Learning by Wisely Leveraging Open-set Data
Authors:
Yang Yang,
Nan Jiang,
Yi Xu,
De-Chuan Zhan
Abstract:
Open-set Semi-supervised Learning (OSSL) holds a realistic setting that unlabeled data may come from classes unseen in the labeled set, i.e., out-of-distribution (OOD) data, which could cause performance degradation in conventional SSL models. To handle this issue, except for the traditional in-distribution (ID) classifier, some existing OSSL approaches employ an extra OOD detection module to avoi…
▽ More
Open-set Semi-supervised Learning (OSSL) holds a realistic setting that unlabeled data may come from classes unseen in the labeled set, i.e., out-of-distribution (OOD) data, which could cause performance degradation in conventional SSL models. To handle this issue, except for the traditional in-distribution (ID) classifier, some existing OSSL approaches employ an extra OOD detection module to avoid the potential negative impact of the OOD data. Nevertheless, these approaches typically employ the entire set of open-set data during their training process, which may contain data unfriendly to the OSSL task that can negatively influence the model performance. This inspires us to develop a robust open-set data selection strategy for OSSL. Through a theoretical understanding from the perspective of learning theory, we propose Wise Open-set Semi-supervised Learning (WiseOpen), a generic OSSL framework that selectively leverages the open-set data for training the model. By applying a gradient-variance-based selection mechanism, WiseOpen exploits a friendly subset instead of the whole open-set dataset to enhance the model's capability of ID classification. Moreover, to reduce the computational expense, we also propose two practical variants of WiseOpen by adopting low-frequency update and loss-based selection respectively. Extensive experiments demonstrate the effectiveness of WiseOpen in comparison with the state-of-the-art.
△ Less
Submitted 20 May, 2024; v1 submitted 11 May, 2024;
originally announced May 2024.
-
Van der Waals Magnetic Electrode Transfer for Two-Dimensional Spintronic Devices
Authors:
Zhongzhong Luo,
Zhihao Yu,
Xiangqian Lu,
Wei Niu,
Yao Yu,
Yu Yao,
Fuguo Tian,
Chee Leong Tan,
Huabin Sun,
Li Gao,
Wei Qin,
Yong Xu,
Qiang Zhao,
Xiang-Xiang Song
Abstract:
Two-dimensional (2D) materials are promising candidates for spintronic applications. Maintaining their atomically smooth interfaces during integration of ferromagnetic (FM) electrodes is crucial since conventional metal deposition tends to induce defects at the interfaces. Meanwhile, the difficulties in picking up FM metals with strong adhesion and in achieving conductance match between FM electro…
▽ More
Two-dimensional (2D) materials are promising candidates for spintronic applications. Maintaining their atomically smooth interfaces during integration of ferromagnetic (FM) electrodes is crucial since conventional metal deposition tends to induce defects at the interfaces. Meanwhile, the difficulties in picking up FM metals with strong adhesion and in achieving conductance match between FM electrodes and spin transport channels make it challenging to fabricate high-quality 2D spintronic devices using metal transfer techniques. Here, we report a solvent-free magnetic electrode transfer technique that employs a graphene layer to assist in the transfer of FM metals. It also serves as part of the FM electrode after transfer for optimizing spin injection, which enables the realization of spin valves with excellent performance based on various 2D materials. In addition to two-terminal devices, we demonstrate that the technique is applicable for four-terminal spin valves with nonlocal geometry. Our results provide a promising future of realizing 2D spintronic applications using the developed magnetic electrode transfer technique.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
On-the-fly Learning to Transfer Motion Style with Diffusion Models: A Semantic Guidance Approach
Authors:
Lei Hu,
Zihao Zhang,
Yong**g Ye,
Yiwen Xu,
Shihong Xia
Abstract:
In recent years, the emergence of generative models has spurred development of human motion generation, among which the generation of stylized human motion has consistently been a focal point of research. The conventional approach for stylized human motion generation involves transferring the style from given style examples to new motions. Despite decades of research in human motion style transfer…
▽ More
In recent years, the emergence of generative models has spurred development of human motion generation, among which the generation of stylized human motion has consistently been a focal point of research. The conventional approach for stylized human motion generation involves transferring the style from given style examples to new motions. Despite decades of research in human motion style transfer, it still faces three main challenges: 1) difficulties in decoupling the motion content and style; 2) generalization to unseen motion style. 3) requirements of dedicated motion style dataset; To address these issues, we propose an on-the-fly human motion style transfer learning method based on the diffusion model, which can learn a style transfer model in a few minutes of fine-tuning to transfer an unseen style to diverse content motions. The key idea of our method is to consider the denoising process of the diffusion model as a motion translation process that learns the difference between the style-neutral motion pair, thereby avoiding the challenge of style and content decoupling. Specifically, given an unseen style example, we first generate the corresponding neutral motion through the proposed Style-Neutral Motion Pair Generation module. We then add noise to the generated neutral motion and denoise it to be close to the style example to fine-tune the style transfer diffusion model. We only need one style example and a text-to-motion dataset with predominantly neutral motion (e.g. HumanML3D). The qualitative and quantitative evaluations demonstrate that our method can achieve state-of-the-art performance and has practical applications.
△ Less
Submitted 20 March, 2024;
originally announced May 2024.
-
Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data
Authors:
Yonghao Xu,
Pedram Ghamisi,
Yannis Avrithis
Abstract:
Multi-target unsupervised domain adaptation (UDA) aims to learn a unified model to address the domain shift between multiple target domains. Due to the difficulty of obtaining annotations for dense predictions, it has recently been introduced into cross-domain semantic segmentation. However, most existing solutions require labeled data from the source domain and unlabeled data from multiple target…
▽ More
Multi-target unsupervised domain adaptation (UDA) aims to learn a unified model to address the domain shift between multiple target domains. Due to the difficulty of obtaining annotations for dense predictions, it has recently been introduced into cross-domain semantic segmentation. However, most existing solutions require labeled data from the source domain and unlabeled data from multiple target domains concurrently during training. Collectively, we refer to this data as "external". When faced with new unlabeled data from an unseen target domain, these solutions either do not generalize well or require retraining from scratch on all data. To address these challenges, we introduce a new strategy called "multi-target UDA without external data" for semantic segmentation. Specifically, the segmentation model is initially trained on the external data. Then, it is adapted to a new unseen target domain without accessing any external data. This approach is thus more scalable than existing solutions and remains applicable when external data is inaccessible. We demonstrate this strategy using a simple method that incorporates self-distillation and adversarial learning, where knowledge acquired from the external data is preserved during adaptation through "one-way" adversarial learning. Extensive experiments in several synthetic-to-real and real-to-real adaptation settings on four benchmark urban driving datasets show that our method significantly outperforms current state-of-the-art solutions, even in the absence of external data. Our source code is available online (https://github.com/YonghaoXu/UT-KD).
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Measurement of the ${e}^{+}{e}^{-}\to p \bar{p}π^{0}$ cross section at $\sqrt{s}=2.1000-3.0800$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
The process $e^{+}e^{-}\to p\bar{p}π^{0}$ is studied at 20 center-of-mass energies ranging from 2.1000 to 3.0800 GeV using 636.8 pb$^{-1}$ of data collected with the BESIII detector operating at the BEPCII collider. The Born cross sections for $e^{+}e^{-}\to p\bar{p}π^{0}$ are measured with high precision. Since the lowest center-of-mass energy, 2.1000 GeV, is less than 90 MeV above the…
▽ More
The process $e^{+}e^{-}\to p\bar{p}π^{0}$ is studied at 20 center-of-mass energies ranging from 2.1000 to 3.0800 GeV using 636.8 pb$^{-1}$ of data collected with the BESIII detector operating at the BEPCII collider. The Born cross sections for $e^{+}e^{-}\to p\bar{p}π^{0}$ are measured with high precision. Since the lowest center-of-mass energy, 2.1000 GeV, is less than 90 MeV above the $p\bar{p}π^0$ energy threshold, we can probe the threshold behavior for this reaction. However, no anomalous threshold enhancement is found in the cross sections for $e^{+}e^{-}\to p\bar{p}π^{0}$.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Map** dissolved carbon in space and time: An experimental technique for the measurement of pH and total carbon concentration in density driven convection of CO$_2$ dissolved in water
Authors:
Hilmar Yngvi Birggison,
Yao Xu,
Marcel Moura,
Eirik Grude Flekkøy,
Knut Jørgen Måløy
Abstract:
We present an experimental technique for determining the pH and the total carbon concentration when \ch{CO2} diffuses and flows in water. The technique employs three different pH indicators, which, when combined with an image analysis technique, provides a dynamic range in pH from 4.0 to 9.5. In contrast to usual techniques in which a single pH indicator is used, the methodology presented allows n…
▽ More
We present an experimental technique for determining the pH and the total carbon concentration when \ch{CO2} diffuses and flows in water. The technique employs three different pH indicators, which, when combined with an image analysis technique, provides a dynamic range in pH from 4.0 to 9.5. In contrast to usual techniques in which a single pH indicator is used, the methodology presented allows not only to produce a binary classification (pH larger or smaller than a given threshold) but to access a much more complete continuous spatial distribution of pH and concentration levels in the system. We calibrate the method against benchmark solutions and further demonstrate its potential by measuring the pH and total carbon concentration in a density driven convection (DDC) of carbon-enriched water. The motivation for testing the method in this particular experiment comes from the fact that DDC plays a pivotal role in the efficiency of engineered carbon storage processes. The application of the technique presented here provided a direct window for the analysis of the spatial distribution of captured carbon in the DDC flow.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Simulating Spin Dynamics of Supersolid States in a Quantum Ising Magnet
Authors:
Yi Xu,
Juraj Hasik,
Boris Ponsioen,
Andriy H. Nevidomskyy
Abstract:
Motivated by the recent experimental study on a quantum Ising magnet $\text{K}_2\text{Co}(\text{SeO}_3)_2$ where spectroscopic evidence of zero-field supersolidity is presented [arXiv: 2402.15869], we simulate the excitation spectrum of the corresponding microscopic $XXZ$ model for the compound, using the recently developed excitation ansatz of infinite projected entangled-pair states (iPEPS). We…
▽ More
Motivated by the recent experimental study on a quantum Ising magnet $\text{K}_2\text{Co}(\text{SeO}_3)_2$ where spectroscopic evidence of zero-field supersolidity is presented [arXiv: 2402.15869], we simulate the excitation spectrum of the corresponding microscopic $XXZ$ model for the compound, using the recently developed excitation ansatz of infinite projected entangled-pair states (iPEPS). We map out the ground state phase diagram and compute the dynamical spin structure factors across a range of magnetic field strengths, focusing especially on the two supersolid phases found near zero and saturation fields. Our simulated excitation spectra for the zero-field supersolid "Y" phase are in excellent agreement with the experimental data -- recovering the low-energy branches and integer quantized excited energy levels $ω_n=nJ_{zz}$. Furthermore, we demonstrate the nonlocal multi-spin-flip features for modes at $ω_2$, indicative of their multi-magnon nature. Additionally, we identify characteristics of the high-field supersolid "V" phase in the simulated spectra, to be compared with future experimental results.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results
Authors:
Yaqi Wu,
Zhihao Fan,
Xiaofeng Chu,
Jimmy S. Ren,
Xiaoming Li,
Zongsheng Yue,
Chongyi Li,
Shangcheng Zhou,
Ruicheng Feng,
Yuekun Dai,
Peiqing Yang,
Chen Change Loy,
Senyan Xu,
Zhi**g Sun,
Jiaying Zhu,
Yurui Zhu,
Xueyang Fu,
Zheng-Jun Zha,
Jun Cao,
Cheng Li,
Shu Chen,
Liang Ma,
Shiyang Zhou,
Hai** Zeng,
Kai Feng
, et al. (24 additional authors not shown)
Abstract:
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…
▽ More
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging (MIPI). Building on the achievements of the previous MIPI Workshops held at ECCV 2022 and CVPR 2023, we introduce our third MIPI challenge including three tracks focusing on novel image sensors and imaging algorithms. In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2024. In total, 170 participants were successfully registered, and 14 teams submitted results in the final testing phase. The developed solutions in this challenge achieved state-of-the-art performance on Nighttime Flare Removal. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2024/.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Dynamic Workforce Scheduling and Relocation in Hyperconnected Parcel Logistic Hubs
Authors:
Yujia Xu,
Yiguo Liu,
Benoit Montreuil
Abstract:
With the development of e-commerce during the Covid-19 pandemic, one of the major challenges for many parcel logistics companies is to design reliable and flexible scheduling algorithms to meet uncertainties of parcel arrivals as well as manpower supplies in logistic hubs, especially for those depending on workforce greatly. Currently, most labor scheduling is periodic and limited to single facili…
▽ More
With the development of e-commerce during the Covid-19 pandemic, one of the major challenges for many parcel logistics companies is to design reliable and flexible scheduling algorithms to meet uncertainties of parcel arrivals as well as manpower supplies in logistic hubs, especially for those depending on workforce greatly. Currently, most labor scheduling is periodic and limited to single facility, thus the number of required workers in each hub is constrained to meet the peak demand with high variance. We approach this challenge, recognizing that not only workforce schedules but also working locations could be dynamically optimized by develo** a dynamic workforce scheduling and relocation system, fed from updated data with sensors and dynamically updated hub arrival demand predictions. In this paper, we propose novel reactive scheduling heuristics to dynamically match predicted arrivals with shifts at hyperconnected parcel logistics hubs. Dynamic scheduling and allocation mechanisms are carried out dynamically during delivery periods to spatiotemporally adjust the available workforce. We also include penalty costs to keep parcels sorted in time and scheduling adjustments are made in advance to allow sufficient time for crew planning. To assess the proposed methods, we conduct comprehensive case studies based on real-world parcel logistic networks of a logistic company in China. The results show that our proposed approach can significantly outperform traditional workforce scheduling strategies in hubs with limited computation time.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Transverse Cooper-Pair Rectifier
Authors:
Pei-Hao Fu,
Jun-Feng Liu,
Yong Xu,
Ching Hua Lee,
Yee Sin Ang
Abstract:
Non-reciprocal devices are key components in modern electronics covering broad applications ranging from transistors to logic circuits thanks to the output rectified signal in the direction parallel to the input. In this work, we propose a transverse Cooper-pair rectifier in which a non-reciprocal current is perpendicular to the driving field, when inversion, time reversal, and mirror symmetries a…
▽ More
Non-reciprocal devices are key components in modern electronics covering broad applications ranging from transistors to logic circuits thanks to the output rectified signal in the direction parallel to the input. In this work, we propose a transverse Cooper-pair rectifier in which a non-reciprocal current is perpendicular to the driving field, when inversion, time reversal, and mirror symmetries are broken simultaneously. The Blonder-Tinkham-Klapwijk formalism is developed to describe the transverse current-voltage relation in a normal-metal/superconductor tunneling junction, where symmetry constraints are achieved by an effective built-in supercurrent manifesting in an asymmetric and anisotropic Andreev reflection. The asymmetry in the Andreev reflection is induced when inversion and time reversal symmetry are broken by the supercurrent component parallel to the junction while the anisotropy occurs when the mirror symmetry with respect to the normal of the junction interface is broken by the perpendicular supercurrent component to the junction. Compared to the conventional longitudinal one, the transverse rectifier supports fully polarized diode efficiency and colossal nonreciprocal conductance rectification, completely decoupling the path of the input excitation from the output rectified signal. This work provides a formalism for realizing transverse non-reciprocity in superconducting junctions, which is expected to be achieved by modifying current experimental setups and may pave the way for future low-dissipation superconducting electronics.
△ Less
Submitted 25 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Authors:
DeepSeek-AI,
Aixin Liu,
Bei Feng,
Bin Wang,
Bingxuan Wang,
Bo Liu,
Chenggang Zhao,
Chengqi Dengr,
Chong Ruan,
Damai Dai,
Daya Guo,
Dejian Yang,
Deli Chen,
Dongjie Ji,
Erhang Li,
Fangyun Lin,
Fuli Luo,
Guangbo Hao,
Guanting Chen,
Guowei Li,
H. Zhang,
Hanwei Xu,
Hao Yang,
Haowei Zhang,
Honghui Ding
, et al. (132 additional authors not shown)
Abstract:
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…
▽ More
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference through significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation. Compared with DeepSeek 67B, DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times. We pretrain DeepSeek-V2 on a high-quality and multi-source corpus consisting of 8.1T tokens, and further perform Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to fully unlock its potential. Evaluation results show that, even with only 21B activated parameters, DeepSeek-V2 and its chat versions still achieve top-tier performance among open-source models.
△ Less
Submitted 19 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs
Authors:
Jiahao Nick Li,
Yan Xu,
Tovi Grossman,
Stephanie Santosa,
Michelle Li
Abstract:
The progression to "Pervasive Augmented Reality" envisions easy access to multimodal information continuously. However, in many everyday scenarios, users are occupied physically, cognitively or socially. This may increase the friction to act upon the multimodal information that users encounter in the world. To reduce such friction, future interactive interfaces should intelligently provide quick a…
▽ More
The progression to "Pervasive Augmented Reality" envisions easy access to multimodal information continuously. However, in many everyday scenarios, users are occupied physically, cognitively or socially. This may increase the friction to act upon the multimodal information that users encounter in the world. To reduce such friction, future interactive interfaces should intelligently provide quick access to digital actions based on users' context. To explore the range of possible digital actions, we conducted a diary study that required participants to capture and share the media that they intended to perform actions on (e.g., images or audio), along with their desired actions and other contextual information. Using this data, we generated a holistic design space of digital follow-up actions that could be performed in response to different types of multimodal sensory inputs. We then designed OmniActions, a pipeline powered by large language models (LLMs) that processes multimodal sensory inputs and predicts follow-up actions on the target information grounded in the derived design space. Using the empirical data collected in the diary study, we performed quantitative evaluations on three variations of LLM techniques (intent classification, in-context learning and finetuning) and identified the most effective technique for our task. Additionally, as an instantiation of the pipeline, we developed an interactive prototype and reported preliminary user feedback about how people perceive and react to the action predictions and its errors.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
ID-centric Pre-training for Recommendation
Authors:
Yiqing Wu,
Ruobing Xie,
Zhao Zhang,
Fuzhen Zhuang,
Xu Zhang,
Leyu Lin,
Zhanhui Kang,
Yongjun Xu
Abstract:
Classical sequential recommendation models generally adopt ID embeddings to store knowledge learned from user historical behaviors and represent items. However, these unique IDs are challenging to be transferred to new domains. With the thriving of pre-trained language model (PLM), some pioneer works adopt PLM for pre-trained recommendation, where modality information (e.g., text) is considered un…
▽ More
Classical sequential recommendation models generally adopt ID embeddings to store knowledge learned from user historical behaviors and represent items. However, these unique IDs are challenging to be transferred to new domains. With the thriving of pre-trained language model (PLM), some pioneer works adopt PLM for pre-trained recommendation, where modality information (e.g., text) is considered universal across domains via PLM. Unfortunately, the behavioral information in ID embeddings is still verified to be dominating in PLM-based recommendation models compared to modality information and thus limits these models' performance. In this work, we propose a novel ID-centric recommendation pre-training paradigm (IDP), which directly transfers informative ID embeddings learned in pre-training domains to item representations in new domains. Specifically, in pre-training stage, besides the ID-based sequential model for recommendation, we also build a Cross-domain ID-matcher (CDIM) learned by both behavioral and modality information. In the tuning stage, modality information of new domain items is regarded as a cross-domain bridge built by CDIM. We first leverage the textual information of downstream domain items to retrieve behaviorally and semantically similar items from pre-training domains using CDIM. Next, these retrieved pre-trained ID embeddings, rather than certain textual embeddings, are directly adopted to generate downstream new items' embeddings. Through extensive experiments on real-world datasets, both in cold and warm settings, we demonstrate that our proposed model significantly outperforms all baselines. Codes will be released upon acceptance.
△ Less
Submitted 7 May, 2024; v1 submitted 6 May, 2024;
originally announced May 2024.
-
FairRelay: Fair and Cost-Efficient Peer-to-Peer Content Delivery through Payment Channel Networks
Authors:
**gyu Liu,
Yingjie Xue,
Zifan Peng,
Chao Lin,
Xinyi Huang
Abstract:
Peer-to-Peer (P2P) content delivery, known for scalability and resilience, offers a decentralized alternative to traditional centralized Content Delivery Networks (CDNs). A significant challenge in P2P content delivery remains: the fair compensation of relayers for their bandwidth contributions. Existing solutions employ blockchains for payment settlements, however, they are not practical due to h…
▽ More
Peer-to-Peer (P2P) content delivery, known for scalability and resilience, offers a decentralized alternative to traditional centralized Content Delivery Networks (CDNs). A significant challenge in P2P content delivery remains: the fair compensation of relayers for their bandwidth contributions. Existing solutions employ blockchains for payment settlements, however, they are not practical due to high on-chain costs and over-simplified network assumptions. In this paper, we introduce FairRelay, a fair and cost-efficient protocol that ensures all participants get fair payoff in complex content delivery network settings. We introduce a novel primitive, Enforceable Accumulative Hashed TimeLock Contract (Enforceable A-HTLC), designed to guarantee payment atomicity - ensuring all participants receive their payments upon successful content delivery.
The fairness of FairRelay is proved using the Universal Composability (UC) framework. Our evaluation demonstrates that, in optimistic scenarios, FairRelay employs zero on-chain costs. In pessimistic scenarios, the on-chain dispute costs for relayers and customers are constant, irrespective of the network complexity. Specifically, empirical results indicate that the on-chain dispute costs for relayers and customers are 24,902 gas (equivalent to 0.01 USD on Optimism L2) and 290,797 gas (0.07 USD), respectively. In a 10-hop relay path, FairRelay introduces less than 1.5% additional overhead compared to pure data transmission, showcasing the efficiency of FairRelay.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Solar Flare Ribbon Fronts. II. Evolution of heating rates in individual flare footpoints
Authors:
Graham S. Kerr,
Vanessa Polito,
Yan Xu,
Joel C. Allred
Abstract:
Solar flare ribbon fronts appear ahead of the bright structures that normally characterise solar flares, and can persist for an extended period of time in spatially localised patches before transitioning to `regular' bright ribbons. They likely represent the initial onset of flare energy deposition into the chromosphere. Chromospheric spectra (e.g. He I 10830A and the Mg II near-UV lines) from rib…
▽ More
Solar flare ribbon fronts appear ahead of the bright structures that normally characterise solar flares, and can persist for an extended period of time in spatially localised patches before transitioning to `regular' bright ribbons. They likely represent the initial onset of flare energy deposition into the chromosphere. Chromospheric spectra (e.g. He I 10830A and the Mg II near-UV lines) from ribbon fronts exhibit properties rather different to typical flare behaviour. In prior numerical modelling efforts we were unable to reproduce the long lifetime of ribbon fronts. Here we present a series of numerical experiments that are rather simple but which have important implications. We inject a very low flux of nonthermal electrons ($F = 5\times10^{8}$ erg s$^{-1}$ cm$^{-2}$) into the chromosphere for 100 s before ram** up to standard flare energy fluxes $(F = 10^{10-11}$ erg s$^{-1}$ cm$^{-2}$). Synthetic spectra not only sustained their ribbon front-like properties for significantly longer, in the case of harder nonthermal electron spectra the ribbon front behaviour persisted for the entirety of this weak-heating phase. Lengthening or shortening the duration of the weak-heating phase commensurately lengthened or shortened the ribbon front lifetimes. Ribbon fronts transitioned to regular bright ribbons when the upper chromosphere became sufficiently hot and dense, which happened faster for softer nonthermal electron spectra. Thus, the lifetime of flare ribbon fronts are a direct measure of the duration over which a relatively low flux of high energy electrons precipitates to the chromosphere prior to the bombardment of a much larger energy flux.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property Generation
Authors:
Ye Liu,
Yue Xue,
Daoyuan Wu,
Yuqiang Sun,
Yi Li,
Miaolei Shi,
Yang Liu
Abstract:
With recent advances in large language models (LLMs), this paper explores the potential of leveraging state-of-the-art LLMs, such as GPT-4, to transfer existing human-written properties (e.g., those from Certora auditing reports) and automatically generate customized properties for unknown code. To this end, we embed existing properties into a vector database and retrieve a reference property for…
▽ More
With recent advances in large language models (LLMs), this paper explores the potential of leveraging state-of-the-art LLMs, such as GPT-4, to transfer existing human-written properties (e.g., those from Certora auditing reports) and automatically generate customized properties for unknown code. To this end, we embed existing properties into a vector database and retrieve a reference property for LLM-based in-context learning to generate a new prop- erty for a given code. While this basic process is relatively straight- forward, ensuring that the generated properties are (i) compilable, (ii) appropriate, and (iii) runtime-verifiable presents challenges. To address (i), we use the compilation and static analysis feedback as an external oracle to guide LLMs in iteratively revising the generated properties. For (ii), we consider multiple dimensions of similarity to rank the properties and employ a weighted algorithm to identify the top-K properties as the final result. For (iii), we design a dedicated prover to formally verify the correctness of the generated prop- erties. We have implemented these strategies into a novel system called PropertyGPT, with 623 human-written properties collected from 23 Certora projects. Our experiments show that PropertyGPT can generate comprehensive and high-quality properties, achieving an 80% recall compared to the ground truth. It successfully detected 26 CVEs/attack incidents out of 37 tested and also uncovered 12 zero-day vulnerabilities, resulting in $8,256 bug bounty rewards.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Strong He I Emission Lines in High N/O Galaxies at $z \sim 6$ Identified in JWST Spectra: High He/H Abundance Ratios or High Electron Densities?
Authors:
Hiroto Yanagisawa,
Masami Ouchi,
Kuria Watanabe,
Akinori Matsumoto,
Kimihiko Nakajima,
Hidenobu Yajima,
Kentaro Nagamine,
Koh Takahashi,
Minami Nakane,
Nozomu Tominaga,
Hiroya Umeda,
Hajime Fukushima,
Yuichi Harikane,
Yuki Isobe,
Yoshiaki Ono,
Yi Xu,
Yechi Zhang
Abstract:
We present HeI/H$β$-flux and He/H-abundance ratios in three JWST galaxies with significant constraints on N/O-abundance ratios, GS-NDG-9422, RXCJ2248-ID, and GLASS150008 at $z\sim 6$ mostly with the spectroscopic coverage from HeI$λ$4471 and HeII$λ$4686 to HeI$λ$7065, comparing with 68 local-dwarf galaxies. We find that these high-$z$ galaxies present strong HeI emission with HeI/H$β$ flux ratios…
▽ More
We present HeI/H$β$-flux and He/H-abundance ratios in three JWST galaxies with significant constraints on N/O-abundance ratios, GS-NDG-9422, RXCJ2248-ID, and GLASS150008 at $z\sim 6$ mostly with the spectroscopic coverage from HeI$λ$4471 and HeII$λ$4686 to HeI$λ$7065, comparing with 68 local-dwarf galaxies. We find that these high-$z$ galaxies present strong HeI emission with HeI/H$β$ flux ratios generally larger than those of local-dwarf galaxies. We derive He/H with all of the detected HeI, HeII, and $2-3$ hydrogen Balmer lines in the same manner as the local He/H determination conducted for cosmology studies. These high-$z$ galaxies show He overabundance He/H$\gtrsim 0.10$ or high electron density $n_\mathrm{e}\sim 10^{3-4}$ cm$^{-3}$ much larger than local values at low O/H, $12+\log \mathrm{(O/H)}=7-8$. In contrast, we obtain low He/H and $n_\mathrm{e}$ values for our local-dwarf galaxies by the same technique with the same helium and hydrogen lines, and confirm that the difference between the high-$z$ and local-dwarf galaxies are not mimicked by systematics. While two scenarios of 1) He overabundance and 2) high electron density are not clearly concluded, we find that there is a positive correlation on the He/H-N/O or $n_\mathrm{e}$-N/O plane by the comparison of the high-$z$ and local-dwarf galaxies. The scenario 1) suggests that the overabundant helium and nitrogen are not explained by the standard chemical enrichment of core-collapse supernovae, but the CNO-cycle products and equilibrium ratios, respectively. The scenario 2) indicates that the strong helium lines are originated from the central dense clouds of the high-$z$ galaxies by excessive collisional excitation.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Authors:
Guo** Xu,
Xiaxia Wang,
Xinglong Wu,
Xuesong Leng,
Yongchao Xu
Abstract:
Deep learning has made significant progress in computer vision, specifically in image classification, object detection, and semantic segmentation. The skip connection has played an essential role in the architecture of deep neural networks,enabling easier optimization through residual learning during the training stage and improving accuracy during testing. Many neural networks have inherited the…
▽ More
Deep learning has made significant progress in computer vision, specifically in image classification, object detection, and semantic segmentation. The skip connection has played an essential role in the architecture of deep neural networks,enabling easier optimization through residual learning during the training stage and improving accuracy during testing. Many neural networks have inherited the idea of residual learning with skip connections for various tasks, and it has been the standard choice for designing neural networks. This survey provides a comprehensive summary and outlook on the development of skip connections in deep neural networks. The short history of skip connections is outlined, and the development of residual learning in deep neural networks is surveyed. The effectiveness of skip connections in the training and testing stages is summarized, and future directions for using skip connections in residual learning are discussed. Finally, we summarize seminal papers, source code, models, and datasets that utilize skip connections in computer vision, including image classification, object detection, semantic segmentation, and image reconstruction. We hope this survey could inspire peer researchers in the community to develop further skip connections in various forms and tasks and the theory of residual learning in deep neural networks. The project page can be found at https://github.com/apple1986/Residual_Learning_For_Images
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Dynamics for a diffusive epidemic model with a free boundary: spreading-vanishing dichotomy
Authors:
Xue** Li,
Lei Li,
Ying Xu,
DanDan Zhu
Abstract:
This paper involves a diffusive epidemic model whose domain has one free boundary with the Stefan boundary condition, and one fixed boundary subject to the usual homogeneous Dirichlet or Neumann condition. By using the standard upper and lower solutions method and the regularity theory, we first study some related steady state problems which help us obtain the exact longtime behaviors of solution…
▽ More
This paper involves a diffusive epidemic model whose domain has one free boundary with the Stefan boundary condition, and one fixed boundary subject to the usual homogeneous Dirichlet or Neumann condition. By using the standard upper and lower solutions method and the regularity theory, we first study some related steady state problems which help us obtain the exact longtime behaviors of solution component $(u,v)$. Then we prove there exists the unique classical solution whose longtime behaviors are governed by a spreading-vanishing dichotomy. Lastly, the criteria determining when spreading or vanishing happens are given with respect to the basic reproduction number $\mathcal{R}_0$, the initial habitat $[0,h_0]$, the expanding rates $μ_1$ and $μ_2$ as well as the initial function $(u_0,v_0)$. The criteria reveal the effect of the cooperative behaviors of agents and humans on spreading and vanishing.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Differentiable Particles for General-Purpose Deformable Object Manipulation
Authors:
Siwei Chen,
Yiqing Xu,
Cunjun Yu,
Linfeng Li,
David Hsu
Abstract:
Deformable object manipulation is a long-standing challenge in robotics. While existing approaches often focus narrowly on a specific type of object, we seek a general-purpose algorithm, capable of manipulating many different types of objects: beans, rope, cloth, liquid, . . . . One key difficulty is a suitable representation, rich enough to capture object shape, dynamics for manipulation and yet…
▽ More
Deformable object manipulation is a long-standing challenge in robotics. While existing approaches often focus narrowly on a specific type of object, we seek a general-purpose algorithm, capable of manipulating many different types of objects: beans, rope, cloth, liquid, . . . . One key difficulty is a suitable representation, rich enough to capture object shape, dynamics for manipulation and yet simple enough to be acquired effectively from sensor data. Specifically, we propose Differentiable Particles (DiPac), a new algorithm for deformable object manipulation. DiPac represents a deformable object as a set of particles and uses a differentiable particle dynamics simulator to reason about robot manipulation. To find the best manipulation action, DiPac combines learning, planning, and trajectory optimization through differentiable trajectory tree optimization. Differentiable dynamics provides significant benefits and enable DiPac to (i) estimate the dynamics parameters efficiently, thereby narrowing the sim-to-real gap, and (ii) choose the best action by backpropagating the gradient along sampled trajectories. Both simulation and real-robot experiments show promising results. DiPac handles a variety of object types. By combining planning and learning, DiPac outperforms both pure model-based planning methods and pure data-driven learning methods. In addition, DiPac is robust and adapts to changes in dynamics, thereby enabling the transfer of an expert policy from one object to another with different physical properties, e.g., from a rigid rod to a deformable rope.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Hundness and band renormalization in the kagome antiferromagnets Mn$_3X$
Authors:
Yingying Cao,
Yuanji Xu,
Yi-feng Yang
Abstract:
The interplay of topological band structures and electronic correlations may lead to novel exotic quantum phenomena with potential applications. First-principles calculations are critical for guiding the experimental discoveries and interpretations, but often fail if electronic correlations cannot be properly treated. Here we show that this issue occurs also in the antiferromagnetic kagome lattice…
▽ More
The interplay of topological band structures and electronic correlations may lead to novel exotic quantum phenomena with potential applications. First-principles calculations are critical for guiding the experimental discoveries and interpretations, but often fail if electronic correlations cannot be properly treated. Here we show that this issue occurs also in the antiferromagnetic kagome lattice Mn$_3X$ ($X=$ Sn, Ge), which exhibit a large anomalous Hall effect due to topological band structures with Weyl nodes near the Fermi energy. Our systematic investigations reveal a crucial role of the Hund's rule coupling on three key aspects of their magnetic, electronic, and topological properties: (1) the establishment of noncollinear antiferromagnetic orders, (2) the weakly renormalized bands in excellent agreement with ARPES, and (3) a sensitive tuning of the Weyl nodes beyond previous expectations. Our work provides a basis for understanding the topological properties of Mn$_3X$ and challenges previous experimental interpretations based on incorrect band structures.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Active Cell Balancing for Extended Operational Time of Lithium-Ion Battery Systems in Energy Storage Applications
Authors:
Yiming Xu,
Xiaohua Ge,
Ruohan Guo,
Weixiang Shen
Abstract:
Cell inconsistency within a lithium-ion battery system poses a significant challenge in maximizing the system operational time. This study presents an optimization-driven active balancing method to minimize the effects of cell inconsistency on the system operational time while simultaneously satisfying the system output power demand and prolonging the system operational time in energy storage appl…
▽ More
Cell inconsistency within a lithium-ion battery system poses a significant challenge in maximizing the system operational time. This study presents an optimization-driven active balancing method to minimize the effects of cell inconsistency on the system operational time while simultaneously satisfying the system output power demand and prolonging the system operational time in energy storage applications. The proposed method utilizes a fractional order model to forecast the terminal voltage dynamics of each cell within a battery system, enhanced with a particle-swarm-optimisation-genetic algorithm for precise parameter identification. It is implemented under two distinct cell-level balancing topologies: independent cell balancing and differential cell balancing. Subsequently, the current distribution for each topology is determined by resolving two optimization control problems constrained by the battery's operational specifications and power demands. The effectiveness of the proposed method is validated by extensive experiments based on the two balancing topologies. The results demonstrate that the proposed method increases the operational time by 3.2%.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Conformal Risk Control for Ordinal Classification
Authors:
Yunpeng Xu,
Wenge Guo,
Zhi Wei
Abstract:
As a natural extension to the standard conformal prediction method, several conformal risk control methods have been recently developed and applied to various learning problems. In this work, we seek to control the conformal risk in expectation for ordinal classification tasks, which have broad applications to many real problems. For this purpose, we firstly formulated the ordinal classification t…
▽ More
As a natural extension to the standard conformal prediction method, several conformal risk control methods have been recently developed and applied to various learning problems. In this work, we seek to control the conformal risk in expectation for ordinal classification tasks, which have broad applications to many real problems. For this purpose, we firstly formulated the ordinal classification task in the conformal risk control framework, and provided theoretic risk bounds of the risk control method. Then we proposed two types of loss functions specially designed for ordinal classification tasks, and developed corresponding algorithms to determine the prediction set for each case to control their risks at a desired level. We demonstrated the effectiveness of our proposed methods, and analyzed the difference between the two types of risks on three different datasets, including a simulated dataset, the UTKFace dataset and the diabetic retinopathy detection dataset.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images
Authors:
Shadab Ahamed,
Yixi Xu,
Arman Rahmim
Abstract:
Minimizing the need for pixel-level annotated data for training PET anomaly segmentation networks is crucial, particularly due to time and cost constraints related to expert annotations. Current un-/weakly-supervised anomaly detection methods rely on autoencoder or generative adversarial networks trained only on healthy data, although these are more challenging to train. In this work, we present a…
▽ More
Minimizing the need for pixel-level annotated data for training PET anomaly segmentation networks is crucial, particularly due to time and cost constraints related to expert annotations. Current un-/weakly-supervised anomaly detection methods rely on autoencoder or generative adversarial networks trained only on healthy data, although these are more challenging to train. In this work, we present a weakly supervised and Implicitly guided COuNterfactual diffusion model for Detecting Anomalies in PET images, branded as IgCONDA-PET. The training is conditioned on image class labels (healthy vs. unhealthy) along with implicit guidance to generate counterfactuals for an unhealthy image with anomalies. The counterfactual generation process synthesizes the healthy counterpart for a given unhealthy image, and the difference between the two facilitates the identification of anomaly locations. The code is available at: https://github.com/igcondapet/IgCONDA-PET.git
△ Less
Submitted 30 April, 2024;
originally announced May 2024.